{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,9,24]],"date-time":"2025-09-24T09:35:37Z","timestamp":1758706537158,"version":"3.41.2"},"reference-count":59,"publisher":"Frontiers Media SA","license":[{"start":{"date-parts":[[2025,3,5]],"date-time":"2025-03-05T00:00:00Z","timestamp":1741132800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["frontiersin.org"],"crossmark-restriction":true},"short-container-title":["Front. Res. Metr. Anal."],"abstract":"<jats:sec><jats:title>Objective<\/jats:title><jats:p>Literature-based Discovery (LBD) identifies new knowledge by leveraging existing literature. It exploits interconnecting implicit relationships to build bridges between isolated sets of non-interacting literatures. It has been used to facilitate drug repurposing, new drug discovery, and study adverse event reactions. Within the last decade, LBD systems have transitioned from using statistical methods to exploring deep learning (DL) to analyze semantic spaces between non-interacting literatures. Recent works explore knowledge graphs (KG) to represent explicit relationships. These works envision LBD as a knowledge graph completion (KGC) task and use DL to generate implicit relationships. However, these systems require the researcher to have domain-expert knowledge when submitting relevant queries for novel hypothesis discovery.<\/jats:p><\/jats:sec><jats:sec><jats:title>Methods<\/jats:title><jats:p>Our method explores a novel approach to identify all implicit hypotheses given the researcher's search query and expedites the knowledge discovery process. We revise the KGC task as the task of predicting interconnecting vertex embeddings within the graph. We train our model using a similarity learning objective and compare our model's predictions against all known vertices within the graph to determine the likelihood of an implicit relationship (i.e., connecting edge). We also explore three approaches to represent edge connections between vertices within the KG: average, concatenation, and Hadamard. Lastly, we explore an approach to induce inductive biases and expedite model convergence (i.e., input representation scaling).<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>We evaluate our method by replicating five known discoveries within the Hallmark of Cancer (HOC) datasets and compare our method to two existing works. Our results show no significant difference in reported ranks and model convergence rate when comparing scaling our input representations and not using this method. Comparing our method to previous works, we found our method achieves optimal performance on two of five datasets and achieves comparable performance on the remaining datasets. We further analyze our results using statistical significance testing to demonstrate the efficacy of our method.<\/jats:p><\/jats:sec><jats:sec><jats:title>Conclusion<\/jats:title><jats:p>We found our similarity-based learning objective predicts linking vertex embeddings for single relationship closed discovery replication. Our method also provides a ranked list of linking vertices between a set of inputs. This approach reduces researcher burden and allows further exploration of generated hypotheses.<\/jats:p><\/jats:sec>","DOI":"10.3389\/frma.2025.1509502","type":"journal-article","created":{"date-parts":[[2025,3,5]],"date-time":"2025-03-05T07:12:26Z","timestamp":1741158746000},"update-policy":"https:\/\/doi.org\/10.3389\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["Predicting implicit concept embeddings for singular relationship discovery replication of closed literature-based discovery"],"prefix":"10.3389","volume":"10","author":[{"given":"Clint","family":"Cuffy","sequence":"first","affiliation":[]},{"given":"Bridget T.","family":"McInnes","sequence":"additional","affiliation":[]}],"member":"1965","published-online":{"date-parts":[[2025,3,5]]},"reference":[{"key":"B1","doi-asserted-by":"crossref","first-page":"1","DOI":"10.3115\/v1\/E14-3001","article-title":"\u201cLiterature-based discovery for oceanographic climate science,\u201d","volume-title":"Proceedings of the Student Research Workshop at the 14th Conference of the European Chapter of the Association for Computational Linguistics","author":"Aamot","year":"2014"},{"key":"B2","unstructured":"Abadi\n              M.\n            \n            \n              Agarwal\n              A.\n            \n            \n              Barham\n              P.\n            \n            \n              Brevdo\n              E.\n            \n            \n              Chen\n              Z.\n            \n            \n              Citro\n              C.\n            \n          \n          TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems\n          \n          2015"},{"key":"B3","doi-asserted-by":"publisher","first-page":"D267","DOI":"10.1093\/nar\/gkh061","article-title":"The Unified Medical Language System (UMLS): integrating biomedical terminology","volume":"32","author":"Bodenreider","year":"2004","journal-title":"Nucleic Acids Res"},{"key":"B4","doi-asserted-by":"publisher","first-page":"614","DOI":"10.1093\/jamia\/ocw142","article-title":"MeSHDD: literature-based drug-drug similarity for drug repositioning","volume":"24","author":"Brown","year":"2016","journal-title":"J. Am. Med. Inform. Assoc"},{"key":"B5","doi-asserted-by":"crossref","first-page":"891","DOI":"10.1145\/2806416.2806512","article-title":"\u201cGraRep: learning graph representations with global structural information,\u201d","volume-title":"Proceedings of the 24th ACM International on Conference on Information and Knowledge Management","author":"Cao","year":"2015"},{"key":"B6","doi-asserted-by":"publisher","first-page":"e232891","DOI":"10.1371\/journal.pone.0232891","article-title":"Neural networks for open and closed literature-based discovery","volume":"15","author":"Crichton","year":"2020","journal-title":"PLoS ONE"},{"key":"B7","doi-asserted-by":"publisher","first-page":"104362","DOI":"10.1016\/j.jbi.2023.104362","article-title":"Exploring a deep learning neural architecture for closed literature-based discovery","volume":"143","author":"Cuffy","year":"2023","journal-title":"J. Biomed. Inform"},{"key":"B8","doi-asserted-by":"crossref","first-page":"24","DOI":"10.1007\/978-3-031-09342-5_3","article-title":"\u201cA knowledge graph completion method applied to literature-based discovery for predicting missing links targeting cancer drug repurposing,\u201d","volume-title":"Artificial Intelligence in Medicine","author":"Daowd","year":"2022"},{"key":"B9","doi-asserted-by":"publisher","first-page":"D344","DOI":"10.1093\/nar\/gkm791","article-title":"ChEBI: a database and ontology for chemical entities of biological interest","volume":"36","author":"Degtyarenko","year":"2007","journal-title":"Nucleic Acids Res"},{"key":"B10","doi-asserted-by":"publisher","first-page":"1","DOI":"10.5555\/1248547.1248548","article-title":"Statistical comparisons of classifiers over multiple data sets","volume":"7","author":"Dem\u0161ar","year":"2006","journal-title":"J. Mach. Learn. Res"},{"key":"B11","doi-asserted-by":"publisher","first-page":"106","DOI":"10.1038\/nature10189","article-title":"Oncogene-induced Nrf2 transcription promotes ROS detoxification and tumorigenesis","volume":"475","author":"DeNicola","year":"2011","journal-title":"Nature"},{"key":"B12","doi-asserted-by":"crossref","first-page":"53","DOI":"10.1109\/ICHI52183.2021.00021","article-title":"\u201cExploring self-supervised graph learning in literature-based discovery,\u201d","volume-title":"2021 IEEE 9th International Conference on Healthcare Informatics (ICHI)","author":"Ding","year":"2021"},{"key":"B13","doi-asserted-by":"publisher","first-page":"D136","DOI":"10.1093\/nar\/gkr1178","article-title":"The NCBI taxonomy database","volume":"40","author":"Federhen","year":"2012","journal-title":"Nucleic Acids Res"},{"key":"B14","doi-asserted-by":"publisher","first-page":"86","DOI":"10.1214\/aoms\/1177731944","article-title":"A comparison of alternative tests of significance for the problem of m rankings","volume":"11","author":"Friedman","year":"1940","journal-title":"Ann. Mathem. Statist"},{"key":"B15","doi-asserted-by":"publisher","DOI":"10.1126\/scisignal.aaa8398","article-title":"Integrating p38\u03b1 MAPK immune signals in nonimmune cells","author":"Gaffen","year":"2015","journal-title":"Sci. Signal"},{"key":"B16","doi-asserted-by":"crossref","first-page":"23","DOI":"10.1007\/978-3-540-68690-3_3","article-title":"\u201cThe tip of the iceberg: the quest for innovation at the base of the pyramid,\u201d","volume-title":"Literature-based Discovery","author":"Gordon","year":"2008"},{"key":"B17","doi-asserted-by":"publisher","DOI":"10.1145\/2939672.2939754","article-title":"node2vec: scalable feature learning for networks","author":"Grover","year":"2016","journal-title":"arXiv"},{"volume-title":"Exploiting Literature-Based Discovery to Study Effects of Bullying","year":"2019","author":"Hasan","key":"B18"},{"key":"B19","doi-asserted-by":"publisher","first-page":"979","DOI":"10.1038\/ncb3397","article-title":"NOTCH1 mediates a switch between two distinct secretomes during senescence","volume":"18","author":"Hoare","year":"2016","journal-title":"Nat. Cell Biol"},{"key":"B20","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1007\/s10916-016-0544-z","article-title":"Using literature-based discovery to explain adverse drug effects","volume":"40","author":"Hristovski","year":"2016","journal-title":"J. Med. Syst"},{"key":"B21","doi-asserted-by":"publisher","first-page":"289","DOI":"10.1016\/j.ijmedinf.2004.04.024","article-title":"Using literature-based discovery to identify disease candidate genes","volume":"74","author":"Hristovski","year":"2005","journal-title":"Int. J. Med. Inform"},{"key":"B22","doi-asserted-by":"publisher","first-page":"207","DOI":"10.5555\/1688157.1688164","article-title":"Mining hidden connections among biomedical concepts from disjoint biomedical literature sets through semantic-based association rule","volume":"25","author":"Hu","year":"2010","journal-title":"Int. J. Intellig. Syst"},{"key":"B23","doi-asserted-by":"publisher","first-page":"3158","DOI":"10.1093\/bioinformatics\/bts591","article-title":"SemMedDB: a PubMed-scale repository of biomedical semantic predications","volume":"28","author":"Kilicoglu","year":"2012","journal-title":"Bioinformatics"},{"key":"B24","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/ncomms15208","article-title":"Senescent tumor cells lead the collective invasion in thyroid cancer","volume":"8","author":"Kim","year":"2017","journal-title":"Nat. Commun"},{"key":"B25","doi-asserted-by":"publisher","first-page":"256","DOI":"10.1016\/j.techfore.2007.11.009","article-title":"Literature-related discovery (lrd): water purification","volume":"75","author":"Kostoff","year":"2008","journal-title":"Technol. Forecast. Soc. Change"},{"key":"B26","first-page":"265","article-title":"Medical subject headings (MeSH)","volume":"88","author":"Lipscomb","year":"2000","journal-title":"Bull. Med. Libr. Assoc"},{"key":"B27","doi-asserted-by":"crossref","first-page":"1","DOI":"10.18653\/v1\/W18-1701","article-title":"\u201cScientific discovery as link prediction in influence and citation graphs,\u201d","volume-title":"Proceedings of the Twelfth Workshop on Graph-Based Methods for Natural Language Processing (TextGraphs-12)","author":"Luo","year":"2018"},{"key":"B28","doi-asserted-by":"publisher","first-page":"D54","DOI":"10.1093\/nar\/gki031","article-title":"Entrez gene: gene-centered information at NCBI","volume":"33","author":"Maglott","year":"2005","journal-title":"Nucleic Acids Res"},{"key":"B29","doi-asserted-by":"publisher","first-page":"794","DOI":"10.3390\/pharmaceutics13060794","article-title":"Biomedical text link prediction for drug discovery: a case study with covid-19","volume":"13","author":"McCoy","year":"2021","journal-title":"Pharmaceutics"},{"key":"B30","doi-asserted-by":"publisher","first-page":"652285","DOI":"10.3389\/frma.2021.652285","article-title":"Exploration of shared themes between food security and internet of things research through literature-based discovery","volume":"6","author":"Mej\u00eda","year":"2021","journal-title":"Front. Res. Metrics Analyt"},{"key":"B31","first-page":"807","article-title":"\u201cRectified linear units improve restricted boltzmann machines,\u201d","volume-title":"Proceedings of the 27th International Conference on Machine Learning (ICML-10)","author":"Nair","year":"2010"},{"key":"B32","unstructured":"Nemenyi\n              P. B.\n            \n          \n          Post-hoc test f\u00fcr Friedman-test\n          \n          1963"},{"key":"B33","first-page":"809","article-title":"\u201cA three-way model for collective learning on multi-relational data,\u201d","volume-title":"Proceedings of the 28th International Conference on International Conference on Machine Learning","author":"Nickel","year":"2011"},{"key":"B34","doi-asserted-by":"crossref","first-page":"701","DOI":"10.1145\/2623330.2623732","article-title":"\u201cDeepwalk: online learning of social representations,\u201d","author":"Perozzi","year":"2014","journal-title":"Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery And Data Mining"},{"key":"B35","first-page":"34","article-title":"\u201cHiDE: a tool for unrestricted literature based discovery,\u201d","volume-title":"Proceedings of the 27th International Conference on Computational Linguistics: System Demonstrations","author":"Preiss","year":"2018"},{"key":"B36","unstructured":"1996"},{"key":"B37","doi-asserted-by":"publisher","first-page":"1553","DOI":"10.1093\/bioinformatics\/bty845","article-title":"LION LBD: a literature-based discovery system for cancer biology","volume":"35","author":"Pyysalo","year":"2018","journal-title":"Bioinformatics"},{"first-page":"669","year":"2015","author":"Rastegar-Mojarad","key":"B38"},{"key":"B39","first-page":"289","article-title":"\u201cPrioritizing adverse drug reaction and drug repositioning candidates generated by literature-based discovery,\u201d","volume-title":"Proceedings of the 7th ACM International Conference on Bioinformatics, Computational Biology, and Health Informatics, BCB '16","author":"Rastegar-Mojarad","year":"2016"},{"key":"B40","doi-asserted-by":"publisher","first-page":"25","DOI":"10.1186\/s13326-018-0192-y","article-title":"Investigating the role of interleukin-1 beta and glutamate in inflammatory bowel disease and epilepsy using discovery browsing","volume":"9","author":"Rindflesch","year":"2018","journal-title":"J. Biomed. Semantics"},{"key":"B41","doi-asserted-by":"publisher","first-page":"2323","DOI":"10.1126\/science.290.5500.2323","article-title":"Nonlinear dimensionality reduction by locally linear embedding","volume":"290","author":"Roweis","year":"2000","journal-title":"Science"},{"key":"B42","doi-asserted-by":"publisher","first-page":"8404","DOI":"10.1109\/ACCESS.2018.2886311","article-title":"Gredel: A knowledge graph embedding based method for drug discovery from biomedical literatures","volume":"7","author":"Sang","year":"2019","journal-title":"IEEE Access"},{"key":"B43","first-page":"751","article-title":"\u201cA knowledge graph based bidirectional recurrent neural network method for literature-based discovery,\u201d","volume-title":"2018 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","author":"Sang","year":""},{"key":"B44","doi-asserted-by":"publisher","first-page":"193","DOI":"10.1186\/s12859-018-2167-5","article-title":"Sematyp: A knowledge graph based literature mining method for drug discovery","volume":"19","author":"Sang","year":"","journal-title":"BMC Bioinformatics"},{"key":"B45","doi-asserted-by":"publisher","first-page":"293","DOI":"10.1016\/j.jbi.2014.07.011","article-title":"Identifying plausible adverse drug reactions using knowledge extracted from the literature","volume":"52","author":"Shang","year":"2014","journal-title":"J. Biomed. Inform"},{"key":"B46","doi-asserted-by":"publisher","first-page":"644614","DOI":"10.3389\/frma.2021.644614","article-title":"Pubmed-scale chemical concept embeddings reconstruct physical protein interaction networks","volume":"6","author":"\u0160krlj","year":"2021","journal-title":"Front. Res. Metrics Analyt"},{"key":"B47","doi-asserted-by":"publisher","first-page":"149","DOI":"10.1016\/S0169-2607(98)00033-9","article-title":"Using ARROWSMITH: a computer-assisted approach to formulating and assessing scientific hypotheses","volume":"57","author":"Smalheiser","year":"1998","journal-title":"Comput. Methods Prog. Biomed"},{"key":"B48","doi-asserted-by":"publisher","first-page":"190","DOI":"10.1016\/j.cmpb.2008.12.006","article-title":"Arrowsmith two-node search interface: A tutorial on finding meaningful links between two disparate sets of articles in MEDLINE","volume":"94","author":"Smalheiser","year":"2009","journal-title":"Comput. Methods Prog. Biomed"},{"key":"B49","doi-asserted-by":"publisher","first-page":"526","DOI":"10.1353\/pbm.1988.0009","article-title":"Migraine and magnesium: eleven neglected connections","volume":"31","author":"Swanson","year":"1988","journal-title":"Perspect. Biol. Med"},{"key":"B50","doi-asserted-by":"publisher","first-page":"7","DOI":"10.1353\/pbm.1986.0087","article-title":"Fish oil, raynaud's syndrome, and undiscovered public knowledge","volume":"30","author":"Swanson","year":"1986","journal-title":"Perspect. Biol. Med"},{"key":"B51","first-page":"1067","article-title":"\u201cLINE: large-scale information network embedding,\u201d","volume-title":"Proceedings of the 24th International Conference on World Wide Web, WWW '15","author":"Tang","year":"2015"},{"key":"B52","doi-asserted-by":"crossref","first-page":"45","DOI":"10.18653\/v1\/P18-3007","article-title":"\u201cAutomatic detection of cross-disciplinary knowledge associations,\u201d","volume-title":"Proceedings of ACL 2018, Student Research Workshop","author":"Thilakaratne","year":"2018"},{"key":"B53","doi-asserted-by":"publisher","first-page":"215","DOI":"10.3233\/FAIA210488","article-title":"Towards drug repurposing for covid-19 treatment using literature-based discovery","volume":"343","author":"Tropmann-Frick","year":"2022","journal-title":"Front. Artif. Intell. Appl"},{"key":"B54","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/ncomms10916","article-title":"Bcl-2 is a critical mediator of intestinal transformation","volume":"7","author":"Van Der Heijden","year":"2016","journal-title":"Nat. Commun"},{"key":"B55","doi-asserted-by":"crossref","first-page":"1225","DOI":"10.1145\/2939672.2939753","article-title":"\u201cStructural deep network embedding,\u201d","volume-title":"Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining","author":"Wang","year":"2016"},{"key":"B56","doi-asserted-by":"publisher","first-page":"W518","DOI":"10.1093\/nar\/gkt441","article-title":"PubTator: a web-based text mining tool for assisting biocuration","volume":"41","author":"Wei","year":"2013","journal-title":"Nucleic Acids Res"},{"key":"B57","doi-asserted-by":"publisher","first-page":"488","DOI":"10.1093\/bib\/bbw030","article-title":"Literature-based discovery of new candidates for drug repurposing","volume":"18","author":"Yang","year":"2016","journal-title":"Brief. Bioinform"},{"key":"B58","doi-asserted-by":"publisher","first-page":"103696","DOI":"10.1016\/j.jbi.2021.103696","article-title":"Drug repurposing for covid-19 via knowledge graph completion","volume":"115","author":"Zhang","year":"2021","journal-title":"J. Biomed. Inform"},{"key":"B59","doi-asserted-by":"publisher","first-page":"59","DOI":"10.1186\/s12911-019-0764-5","article-title":"Relation path feature embedding based convolutional neural network method for drug discovery","volume":"19","author":"Zhao","year":"2019","journal-title":"BMC Med. Inform. Decis. Mak"}],"container-title":["Frontiers in Research Metrics and Analytics"],"original-title":[],"link":[{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/frma.2025.1509502\/full","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,3,5]],"date-time":"2025-03-05T07:12:38Z","timestamp":1741158758000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/frma.2025.1509502\/full"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,3,5]]},"references-count":59,"alternative-id":["10.3389\/frma.2025.1509502"],"URL":"https:\/\/doi.org\/10.3389\/frma.2025.1509502","relation":{},"ISSN":["2504-0537"],"issn-type":[{"type":"electronic","value":"2504-0537"}],"subject":[],"published":{"date-parts":[[2025,3,5]]},"article-number":"1509502"}}