{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,21]],"date-time":"2026-05-21T10:10:45Z","timestamp":1779358245040,"version":"3.51.4"},"reference-count":55,"publisher":"Oxford University Press (OUP)","issue":"3","license":[{"start":{"date-parts":[[2024,4,12]],"date-time":"2024-04-12T00:00:00Z","timestamp":1712880000000},"content-version":"vor","delay-in-days":16,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100012166","name":"National Key Research and Development Program","doi-asserted-by":"publisher","award":["2023YFC3502604"],"award-info":[{"award-number":["2023YFC3502604"]}],"id":[{"id":"10.13039\/501100012166","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100012166","name":"National Key Research and Development Program","doi-asserted-by":"publisher","award":["2021YFC1712901"],"award-info":[{"award-number":["2021YFC1712901"]}],"id":[{"id":"10.13039\/501100012166","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["82174533"],"award-info":[{"award-number":["82174533"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["82374302"],"award-info":[{"award-number":["82374302"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["82204941"],"award-info":[{"award-number":["82204941"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["U23B2062"],"award-info":[{"award-number":["U23B2062"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100004826","name":"Natural Science Foundation of Beijing","doi-asserted-by":"publisher","award":["L232033"],"award-info":[{"award-number":["L232033"]}],"id":[{"id":"10.13039\/501100004826","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Key R&D Program Project of Ningxia Hui Autonomous Region","award":["2022BEG02036"],"award-info":[{"award-number":["2022BEG02036"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2024,3,27]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>The accurate identification of disease-associated genes is crucial for understanding the molecular mechanisms underlying various diseases. Most current methods focus on constructing biological networks and utilizing machine learning, particularly deep learning, to identify disease genes. However, these methods overlook complex relations among entities in biological knowledge graphs. Such information has been successfully applied in other areas of life science research, demonstrating their effectiveness. Knowledge graph embedding methods can learn the semantic information of different relations within the knowledge graphs. Nonetheless, the performance of existing representation learning techniques, when applied to domain-specific biological data, remains suboptimal. To solve these problems, we construct a biological knowledge graph centered on diseases and genes, and develop an end-to-end knowledge graph completion framework for disease gene prediction using interactional tensor decomposition named KDGene. KDGene incorporates an interaction module that bridges entity and relation embeddings within tensor decomposition, aiming to improve the representation of semantically similar concepts in specific domains and enhance the ability to accurately predict disease genes. Experimental results show that KDGene significantly outperforms state-of-the-art algorithms, whether existing disease gene prediction methods or knowledge graph embedding methods for general domains. Moreover, the comprehensive biological analysis of the predicted results further validates KDGene\u2019s capability to accurately identify new candidate genes. This work proposes a scalable knowledge graph completion framework to identify disease candidate genes, from which the results are promising to provide valuable references for further wet experiments. Data and source codes are available at https:\/\/github.com\/2020MEAI\/KDGene.<\/jats:p>","DOI":"10.1093\/bib\/bbae161","type":"journal-article","created":{"date-parts":[[2024,4,12]],"date-time":"2024-04-12T05:17:25Z","timestamp":1712899045000},"source":"Crossref","is-referenced-by-count":15,"title":["KDGene: knowledge graph completion for disease gene prediction using interactional tensor decomposition"],"prefix":"10.1093","volume":"25","author":[{"given":"Xinyan","family":"Wang","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Kuo","family":"Yang","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ting","family":"Jia","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Fanghui","family":"Gu","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Chongyu","family":"Wang","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Kuan","family":"Xu","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Zixin","family":"Shu","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jianan","family":"Xia","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Qiang","family":"Zhu","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4713-3594","authenticated-orcid":false,"given":"Xuezhong","family":"Zhou","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2024,4,11]]},"reference":[{"issue":"9","key":"2024041205171593500_ref1","doi-asserted-by":"crossref","first-page":"507","DOI":"10.1038\/nrg.2016.86","article-title":"Towards precision medicine","volume":"17","author":"Ashley","year":"2016","journal-title":"Nat Rev Genet"},{"issue":"3","key":"2024041205171593500_ref2","doi-asserted-by":"crossref","first-page":"229","DOI":"10.1016\/j.cmpb.2006.12.003","article-title":"A partially supervised classification approach to dominant and recessive human disease gene prediction","volume":"85","author":"Calvo","year":"2007","journal-title":"Comput Methods Programs Biomed"},{"issue":"7333","key":"2024041205171593500_ref3","doi-asserted-by":"crossref","first-page":"187","DOI":"10.1038\/nature09792","article-title":"Initial impact of the sequencing of the human genome","volume":"470","author":"Lander","year":"2011","journal-title":"Nature"},{"issue":"18","key":"2024041205171593500_ref4","doi-asserted-by":"crossref","first-page":"2831","DOI":"10.1093\/bioinformatics\/btw358","article-title":"A knowledge-based approach for predicting gene\u2013disease associations","volume":"32","author":"Zhou","year":"2016","journal-title":"Bioinformatics"},{"issue":"2","key":"2024041205171593500_ref5","doi-asserted-by":"crossref","first-page":"e1383","DOI":"10.1002\/widm.1383","article-title":"Predicting disease-associated genes: computational methods, databases, and evaluations","volume":"11","author":"Luo","year":"2021","journal-title":"Wiley Interdiscip Rev Data Mining Knowl Discov"},{"issue":"1","key":"2024041205171593500_ref6","doi-asserted-by":"crossref","first-page":"e1000641","DOI":"10.1371\/journal.pcbi.1000641","article-title":"Associating genes and protein complexes with disease via network propagation","volume":"6","author":"Vanunu","year":"2010","journal-title":"PLoS Comput Biol"},{"issue":"1","key":"2024041205171593500_ref7","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/1756-0381-4-19","article-title":"Da da: degree-aware algorithms for network-based disease gene prioritization","volume":"4","author":"Erten","year":"2011","journal-title":"BioData Mining"},{"issue":"1","key":"2024041205171593500_ref8","doi-asserted-by":"crossref","first-page":"189","DOI":"10.1038\/msb.2008.27","article-title":"Network-based global inference of human disease genes","volume":"4","author":"Xuebing","year":"2008","journal-title":"Mol Syst Biol"},{"key":"2024041205171593500_ref9","article-title":"Disease gene prioritization using network topological analysis from a sequence based human functional linkage network","author":"Jalilvand","year":"2019"},{"issue":"6","key":"2024041205171593500_ref10","doi-asserted-by":"crossref","first-page":"897","DOI":"10.1007\/s00439-020-02253-0","article-title":"A network-based machine-learning framework to identify both functional modules and disease genes","volume":"140","author":"Yang","year":"2021","journal-title":"Hum Genet"},{"issue":"11","key":"2024041205171593500_ref11","doi-asserted-by":"crossref","first-page":"1452","DOI":"10.1093\/jamia\/ocy117","article-title":"Heterogeneous network embedding for identifying symptom candidate genes","volume":"25","author":"Yang","year":"2018","journal-title":"J Am Med Inform Assoc"},{"issue":"4","key":"2024041205171593500_ref12","doi-asserted-by":"crossref","first-page":"1805","DOI":"10.1109\/JBHI.2018.2870728","article-title":"HerGePred: heterogeneous network embedding representation for disease gene prediction","volume":"23","author":"Yang","year":"2018","journal-title":"IEEE J Biomed Health Inform"},{"key":"2024041205171593500_ref13","doi-asserted-by":"crossref","DOI":"10.1093\/bib\/bbac170","article-title":"Decoding multilevel relationships with the human tissue-cell-molecule network","volume":"23","author":"Hou","year":"2022","journal-title":"Brief Bioinform"},{"key":"2024041205171593500_ref14","doi-asserted-by":"crossref","DOI":"10.1007\/978-3-030-17083-7_17","article-title":"RENET: a deep learning approach for extracting gene-disease associations from literature","volume-title":"Research in Computational Molecular Biology. Lecture Notes in Computer Science","author":"Wu"},{"key":"2024041205171593500_ref15","article-title":"PDGNet: predicting disease genes using a deep neural network with multi-view features","volume":"19","author":"Yang","year":"2020","journal-title":"IEEE\/ACM Trans Comput Biol Bioinform"},{"issue":"2","key":"2024041205171593500_ref16","doi-asserted-by":"crossref","DOI":"10.1371\/journal.pcbi.1008686","article-title":"SaveRUNNER: a network-based algorithm for drug repurposing and its application to covid-19","volume":"17","author":"Fiscon","year":"2021","journal-title":"PLoS Comput Biol"},{"key":"2024041205171593500_ref17","doi-asserted-by":"crossref","DOI":"10.1093\/bib\/bbac390","article-title":"Predicting multiple types of miRNA\u2013disease associations using adaptive weighted nonnegative tensor factorization with self-paced learning and hypergraph regularization","volume":"23","author":"Ouyang","year":"2022","journal-title":"Brief Bioinform"},{"issue":"7","key":"2024041205171593500_ref18","doi-asserted-by":"crossref","first-page":"3539","DOI":"10.1109\/JBHI.2022.3152619","article-title":"DeepMNE: deep multi-network embedding for lncRNA-disease association prediction","volume":"26","author":"Ma","year":"2022","journal-title":"IEEE J Biomed Health Inform"},{"issue":"2","key":"2024041205171593500_ref19","doi-asserted-by":"crossref","first-page":"435","DOI":"10.1093\/bioinformatics\/btab652","article-title":"Hypergraph-based logistic matrix factorization for metabolite\u2013disease interaction prediction","volume":"38","author":"Ma","year":"2022","journal-title":"Bioinformatics"},{"issue":"1","key":"2024041205171593500_ref20","doi-asserted-by":"crossref","first-page":"bbac604","DOI":"10.1093\/bib\/bbac604","article-title":"Logistic tensor decomposition with sparse subspace learning for prediction of multiple disease types of human\u2013virus protein\u2013protein interactions","volume":"24","author":"Ma","year":"2023","journal-title":"Brief Bioinform"},{"issue":"1","key":"2024041205171593500_ref21","doi-asserted-by":"crossref","first-page":"6775","DOI":"10.1038\/s41467-021-27137-3","article-title":"A unified drug\u2013target interaction prediction framework based on knowledge graph and recommendation system","volume":"12","author":"Ye","year":"2021","journal-title":"Nat Commun"},{"issue":"2","key":"2024041205171593500_ref22","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3424672","article-title":"Knowledge graph embedding for link prediction: a comparative analysis","volume":"15","author":"Rossi","year":"2021","journal-title":"ACM Trans Knowl Discov Data"},{"issue":"12","key":"2024041205171593500_ref23","doi-asserted-by":"crossref","first-page":"2724","DOI":"10.1109\/TKDE.2017.2754499","article-title":"Knowledge graph embedding: a survey of approaches and applications","volume":"29","author":"Wang","year":"2017","journal-title":"IEEE Trans Knowl Data Eng"},{"issue":"1\u20134","key":"2024041205171593500_ref24","doi-asserted-by":"crossref","first-page":"164","DOI":"10.1002\/sapm192761164","article-title":"The expression of a tensor or a polyadic as a sum of products","volume":"6","author":"Hitchcock","year":"1927","journal-title":"J Math Phys"},{"key":"2024041205171593500_ref25","first-page":"2863","article-title":"Canonical tensor decomposition for knowledge base completion","volume-title":"International Conference on Machine Learning","author":"Lacroix","year":"2018"},{"key":"2024041205171593500_ref26","doi-asserted-by":"crossref","DOI":"10.1145\/2939672.2939754","article-title":"node2vec: Scalable feature learning for networks","volume-title":"Proceedings of the 22nd ACM SIGKDD international conference on Knowledge discovery and data mining","author":"Grover"},{"key":"2024041205171593500_ref27","first-page":"p. 468\u2013476","article-title":"A knowledge graph-based disease-gene prediction system using multi-relational graph convolution networks","volume-title":"AMIA Annual Symposium Proceedings","author":"Gao","year":"2022"},{"issue":"10","key":"2024041205171593500_ref28","doi-asserted-by":"crossref","first-page":"e0258626","DOI":"10.1371\/journal.pone.0258626","article-title":"Identifying disease-gene associations using a convolutional neural network-based model by embedding a biological knowledge graph with entity descriptions","volume":"16","author":"Choi","year":"2021","journal-title":"PloS One"},{"key":"2024041205171593500_ref29","article-title":"Predicting gene-disease associations with knowledge graph embeddings over multiple ontologies","author":"Nunes","year":"2021"},{"key":"2024041205171593500_ref30","doi-asserted-by":"crossref","first-page":"e13181","DOI":"10.1111\/exsy.13181","article-title":"Biomedical knowledge graph embeddings for personalized medicine: predicting disease-gene associations","volume":"40","author":"Vilela","year":"2022","journal-title":"Exp Syst"},{"key":"2024041205171593500_ref31","doi-asserted-by":"crossref","first-page":"179373","DOI":"10.1109\/ACCESS.2019.2957812","article-title":"Inference of biomedical relations among chemicals, genes, diseases, and symptoms using knowledge representation learning","volume":"7","author":"Choi","year":"2019","journal-title":"IEEE Access"},{"issue":"8","key":"2024041205171593500_ref32","doi-asserted-by":"crossref","first-page":"2235","DOI":"10.1093\/bioinformatics\/btac085","article-title":"Multimodal reasoning based on knowledge graph embedding for specific diseases","volume":"38","author":"Zhu","year":"2022","journal-title":"Bioinformatics"},{"issue":"D1","key":"2024041205171593500_ref33","doi-asserted-by":"crossref","first-page":"D1110","DOI":"10.1093\/nar\/gky1021","article-title":"SymMap: an integrative database of traditional chinese medicine enhanced by symptom mapping","volume":"47","author":"Yang","year":"2019","journal-title":"Nucleic Acids Res"},{"issue":"D1","key":"2024041205171593500_ref34","doi-asserted-by":"crossref","first-page":"D1075","DOI":"10.1093\/nar\/gkv1075","article-title":"The SIDER database of drugs and side effects","volume":"44","author":"Kuhn","year":"2016","journal-title":"Nucleic Acids Res"},{"issue":"suppl_1","key":"2024041205171593500_ref35","first-page":"D433","article-title":"STRING: known and predicted protein\u2013protein associations, integrated and transferred across organisms","volume":"33","author":"Von Mering","year":"2005","journal-title":"Nucleic Acids Res"},{"issue":"D1","key":"2024041205171593500_ref36","doi-asserted-by":"crossref","first-page":"D380","DOI":"10.1093\/nar\/gkv1277","article-title":"STITCH5: augmenting protein\u2013chemical interaction networks with tissue and affinity data","volume":"44","author":"Szklarczyk","year":"2016","journal-title":"Nucleic Acids Res"},{"issue":"D1","key":"2024041205171593500_ref37","doi-asserted-by":"crossref","first-page":"D325","DOI":"10.1093\/nar\/gkaa1113","article-title":"Enriching a gold mine","volume":"49","author":"The Gene Ontology Resource","year":"2021","journal-title":"Nucleic Acids Res"},{"issue":"1","key":"2024041205171593500_ref38","doi-asserted-by":"crossref","first-page":"27","DOI":"10.1093\/nar\/28.1.27","article-title":"KEGG: Kyoto Encyclopedia of Genes and Genomes","volume":"28","author":"Kanehisa","year":"2000","journal-title":"Nucleic Acids Res"},{"issue":"8","key":"2024041205171593500_ref39","doi-asserted-by":"crossref","first-page":"1735","DOI":"10.1162\/neco.1997.9.8.1735","article-title":"Long short-term memory","volume":"9","author":"Hochreiter","year":"1997","journal-title":"Neural Comput"},{"issue":"7","key":"2024041205171593500_ref40","doi-asserted-by":"crossref","first-page":"1235","DOI":"10.1162\/neco_a_01199","article-title":"A review of recurrent neural networks: LSTM cells and network architectures","volume":"31","author":"Yong","year":"2019","journal-title":"Neural Comput"},{"issue":"10","key":"2024041205171593500_ref41","doi-asserted-by":"crossref","first-page":"2451","DOI":"10.1162\/089976600300015015","article-title":"Learning to forget: continual prediction with LSTM","volume":"12","author":"Gers","year":"2000","journal-title":"Neural Comput"},{"key":"2024041205171593500_ref42","article-title":"Translating embeddings for modeling multi-relational data","volume":"26","author":"Bordes","year":"2013","journal-title":"Adv Neural Inf Process Syst"},{"key":"2024041205171593500_ref43","doi-asserted-by":"crossref","DOI":"10.1609\/aaai.v32i1.11573","article-title":"Convolutional 2D knowledge graph embeddings","volume-title":"Proceedings of the AAAI Conference on Artificial Intelligence","author":"Dettmers"},{"key":"2024041205171593500_ref44","doi-asserted-by":"crossref","first-page":"D833","DOI":"10.1093\/nar\/gkw943","article-title":"DisGeNET: a comprehensive platform integrating information on human disease-associated genes and variants","volume":"45","author":"Pi\u00f1ero","year":"2016","journal-title":"Nucleic Acids Res"},{"key":"2024041205171593500_ref45","doi-asserted-by":"crossref","article-title":"Exploiting protein-protein interaction networks for genome-wide disease-gene prioritization","author":"Guney","DOI":"10.1371\/journal.pone.0043557"},{"issue":"9","key":"2024041205171593500_ref46","doi-asserted-by":"crossref","first-page":"1219","DOI":"10.1093\/bioinformatics\/btq108","article-title":"Genome-wide inferring gene\u2013phenotype relationship by walking on the heterogeneous network","volume":"26","author":"Li","year":"2010","journal-title":"Bioinformatics"},{"key":"2024041205171593500_ref47","article-title":"RotatE: knowledge graph embedding by relational rotation in complex space","volume-title":"Proceedings of the International Conference on Learning Representations (ICLR)","author":"Sun"},{"key":"2024041205171593500_ref48","article-title":"Embedding entities and relations for learning and inference in knowledge bases","volume-title":"Proceedings of the International Conference on Learning Representations (ICLR)","author":"Yang"},{"key":"2024041205171593500_ref49","article-title":"Complex embeddings for simple link prediction","volume-title":"Proceedings of The 33rd International Conference on Machine Learning","author":"Trouillon"},{"key":"2024041205171593500_ref50","article-title":"TuckER: tensor factorization for knowledge graph completion","author":"Bala\u017eevi\u0107","year":"2019"},{"key":"2024041205171593500_ref51","doi-asserted-by":"crossref","first-page":"2847","DOI":"10.1109\/BIBM49941.2020.9313152","article-title":"Network-based gene prediction for TCM symptoms","volume-title":"In 2020 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","author":"Wang","year":"2020"},{"issue":"1","key":"2024041205171593500_ref52","doi-asserted-by":"crossref","first-page":"16199","DOI":"10.1038\/s41598-017-16270-z","article-title":"Functional diversity of topological modules in human protein-protein interaction networks","volume":"7","author":"Liu","year":"2017","journal-title":"Sci Rep"},{"issue":"5","key":"2024041205171593500_ref53","doi-asserted-by":"crossref","first-page":"500","DOI":"10.1002\/dmrr.2621","article-title":"Association of polymorphisms in il6 gene promoter region with type 1 diabetes and increased albumin-to-creatinine ratio","volume":"31","author":"Ururahy","year":"2015","journal-title":"Diabetes Metab Res Rev"},{"issue":"37","key":"2024041205171593500_ref54","doi-asserted-by":"crossref","first-page":"4738","DOI":"10.3748\/wjg.v16.i37.4738","article-title":"Mn-SOD and CuZn-SOD polymorphisms and interactions with risk factors in gastric cancer","volume":"16","author":"Yi","year":"2010","journal-title":"World J Gastroenterol"},{"issue":"1","key":"2024041205171593500_ref55","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s13568-022-01456-0","article-title":"Relationship between mucosal TNF-$\\alpha $ expression and Th1, Th17, Th22 and Treg responses in helicobacter pylori infection","volume":"12","author":"Rahimian","year":"2022","journal-title":"AMB Express"}],"container-title":["Briefings in Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/25\/3\/bbae161\/57215907\/bbae161.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/25\/3\/bbae161\/57215907\/bbae161.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,4,12]],"date-time":"2024-04-12T05:17:57Z","timestamp":1712899077000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bib\/article\/doi\/10.1093\/bib\/bbae161\/7644136"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,3,27]]},"references-count":55,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2024,3,27]]}},"URL":"https:\/\/doi.org\/10.1093\/bib\/bbae161","relation":{},"ISSN":["1467-5463","1477-4054"],"issn-type":[{"value":"1467-5463","type":"print"},{"value":"1477-4054","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2024,5]]},"published":{"date-parts":[[2024,3,27]]},"article-number":"bbae161"}}