{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,12,31]],"date-time":"2024-12-31T06:40:16Z","timestamp":1735627216485,"version":"3.32.0"},"reference-count":22,"publisher":"Association for Computing Machinery (ACM)","issue":"12","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["Proc. VLDB Endow."],"published-print":{"date-parts":[[2024,8]]},"abstract":"<jats:p>We demonstrate MedHunter, a system for assisting the early stage of drug development. MedHunter builds a biomedical knowledge graph DDKG by integrating data from eleven biochemical libraries and data banks, and aligning entities from different data sources by means of heterogeneous entity resolution. It identifies drug-disease associations and protein-protein interactions in DDKG by employing graph association rules (GARs). GARs use graph patterns to extract relevant entities and embed ML models as predicates. MedHunter discovers GARs from DDKG and incrementally enriches DDKG with external data; it cleans DDKG with a special form of GARs. We demonstrate MedHunter for its (a) interfaces, (b) data enrichment\/cleaning, and (c) applications in target identification, drug-drug interaction and protein-protein interaction.<\/jats:p>","DOI":"10.14778\/3685800.3685858","type":"journal-article","created":{"date-parts":[[2024,11,8]],"date-time":"2024-11-08T17:25:21Z","timestamp":1731086721000},"page":"4293-4296","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["Graph Association Analyses for Early Drug Discovery"],"prefix":"10.14778","volume":"17","author":[{"given":"Wenfei","family":"Fan","sequence":"first","affiliation":[{"name":"Shenzhen Institute of Computing Sciences, China and University of Edinburgh, UK and Beihang University, China"}]},{"given":"Daji","family":"Li","sequence":"additional","affiliation":[{"name":"Shenzhen Institute of Computing Sciences, China"}]},{"given":"Peiyu","family":"Liang","sequence":"additional","affiliation":[{"name":"Shenzhen Institute of Computing Sciences, China"}]},{"given":"Shuhao","family":"Liu","sequence":"additional","affiliation":[{"name":"Shenzhen Institute of Computing Sciences, China"}]},{"given":"Yaoshu","family":"Wang","sequence":"additional","affiliation":[{"name":"Shenzhen Institute of Computing Sciences, China"}]},{"given":"Yiming","family":"Wang","sequence":"additional","affiliation":[{"name":"Shenzhen Institute of Computing Sciences, China"}]},{"given":"Min","family":"Xie","sequence":"additional","affiliation":[{"name":"Shenzhen Institute of Computing Sciences, China"}]},{"given":"Runjie","family":"Zhang","sequence":"additional","affiliation":[{"name":"Shenzhen Institute of Computing Sciences, China"}]}],"member":"320","published-online":{"date-parts":[[2024,11,8]]},"reference":[{"unstructured":"2024. AI-based Drug Development Solution. https:\/\/www.grandhoo.com\/en\/fishtown\/product-solution\/qubing\/.","key":"e_1_2_1_1_1"},{"unstructured":"2024. BioGRID. https:\/\/thebiogrid.org\/.","key":"e_1_2_1_2_1"},{"unstructured":"2024. Comparative Toxicogenomics Database (CTD). https:\/\/ctdbase.org\/.","key":"e_1_2_1_3_1"},{"unstructured":"2024. PharmGKB dataset. https:\/\/www.pharmgkb.org\/.","key":"e_1_2_1_4_1"},{"unstructured":"2024. UniProt. https:\/\/www.uniprot.org\/.","key":"e_1_2_1_5_1"},{"doi-asserted-by":"publisher","key":"e_1_2_1_6_1","DOI":"10.1007\/BF01305232"},{"doi-asserted-by":"publisher","key":"e_1_2_1_7_1","DOI":"10.1145\/3588940"},{"doi-asserted-by":"publisher","key":"e_1_2_1_8_1","DOI":"10.14778\/3523210.3523224"},{"volume-title":"Linking Entities across Relations and Graphs","author":"Fan Wenfei","unstructured":"Wenfei Fan, Ling Ge, Ruochun Jin, Ping Lu, and Wenyuan Yu. 2022. Linking Entities across Relations and Graphs. In ICDE. IEEE, 634--647.","key":"e_1_2_1_9_1"},{"doi-asserted-by":"publisher","key":"e_1_2_1_10_1","DOI":"10.14778\/3407790.3407795"},{"key":"e_1_2_1_11_1","volume-title":"Catching More Associations by Referencing Knowledge Graphs. PVLDB","author":"Fan Wenfei","year":"2024","unstructured":"Wenfei Fan, Muyang Liu, Shuhao Liu, and Chao Tian. 2024. Catching More Associations by Referencing Knowledge Graphs. PVLDB (2024)."},{"doi-asserted-by":"publisher","key":"e_1_2_1_12_1","DOI":"10.14778\/3317315.3317318"},{"key":"e_1_2_1_13_1","volume-title":"Alexopoulos","author":"Fotis Chris","year":"2017","unstructured":"Chris Fotis, Asier Antoranz, Dimitris Hatziavramidis, Theodore Sakellaropoulos, and Leonidas G. Alexopoulos. 2017. Pathway-based technologies for early drug discovery. Drug Discovery Today (2017)."},{"doi-asserted-by":"crossref","unstructured":"Martin Grohe. 2020. word2vec node2vec graph2vec X2vec: Towards a Theory of Vector Embeddings of Structured Data. In PODS. ACM 1--16.","key":"e_1_2_1_14_1","DOI":"10.1145\/3375395.3387641"},{"key":"e_1_2_1_15_1","volume-title":"Two-Dimensional Weisfeiler-Lehman Graph Neural Networks for Link Prediction. CoRR abs\/2206.09567","author":"Hu Yang","year":"2022","unstructured":"Yang Hu, Xiyuan Wang, Zhouchen Lin, Pan Li, and Muhan Zhang. 2022. Two-Dimensional Weisfeiler-Lehman Graph Neural Networks for Link Prediction. CoRR abs\/2206.09567 (2022)."},{"doi-asserted-by":"publisher","key":"e_1_2_1_16_1","DOI":"10.1016\/0304-3975(90)90192-K"},{"doi-asserted-by":"publisher","key":"e_1_2_1_17_1","DOI":"10.1038\/s41586-022-04543-1"},{"key":"e_1_2_1_18_1","volume-title":"PGCN: Disease gene prioritization by disease and gene embedding through graph convolutional neural networks. biorxiv","author":"Li Yu","year":"2019","unstructured":"Yu Li, Hiroyuki Kuwahara, Peng Yang, Le Song, and Xin Gao. 2019. PGCN: Disease gene prioritization by disease and gene embedding through graph convolutional neural networks. biorxiv (2019), 532226."},{"key":"e_1_2_1_19_1","volume-title":"Ivan Titov, and Max Welling.","author":"Schlichtkrull Michael","year":"2018","unstructured":"Michael Schlichtkrull, Thomas N Kipf, Peter Bloem, Rianne Van Den Berg, Ivan Titov, and Max Welling. 2018. Modeling relational data with graph convolutional networks. In The Semantic Web (ESWC). Springer, 593--607."},{"doi-asserted-by":"publisher","key":"e_1_2_1_20_1","DOI":"10.1093\/bioinformatics\/btab310"},{"doi-asserted-by":"publisher","key":"e_1_2_1_21_1","DOI":"10.1109\/BIBM47256.2019.8983134"},{"key":"e_1_2_1_22_1","volume-title":"Toward better drug discovery with knowledge graph. Current opinion in structural biology 72","author":"Zeng Xiangxiang","year":"2022","unstructured":"Xiangxiang Zeng, Xinqi Tu, Yuansheng Liu, Xiangzheng Fu, and Yansen Su. 2022. Toward better drug discovery with knowledge graph. Current opinion in structural biology 72 (2022), 114--126."}],"container-title":["Proceedings of the VLDB Endowment"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.14778\/3685800.3685858","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,12,31]],"date-time":"2024-12-31T05:28:31Z","timestamp":1735622911000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.14778\/3685800.3685858"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,8]]},"references-count":22,"journal-issue":{"issue":"12","published-print":{"date-parts":[[2024,8]]}},"alternative-id":["10.14778\/3685800.3685858"],"URL":"https:\/\/doi.org\/10.14778\/3685800.3685858","relation":{},"ISSN":["2150-8097"],"issn-type":[{"type":"print","value":"2150-8097"}],"subject":[],"published":{"date-parts":[[2024,8]]},"assertion":[{"value":"2024-11-08","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}