{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,19]],"date-time":"2026-02-19T02:13:32Z","timestamp":1771467212093,"version":"3.50.1"},"reference-count":16,"publisher":"Oxford University Press (OUP)","issue":"2","funder":[{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"publisher","award":["R01HG009979"],"award-info":[{"award-number":["R01HG009979"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"publisher","award":["U19AI135990"],"award-info":[{"award-number":["U19AI135990"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"publisher","award":["U54CA209891"],"award-info":[{"award-number":["U54CA209891"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"publisher","award":["U24CA184427"],"award-info":[{"award-number":["U24CA184427"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2026,2,3]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Motivation<\/jats:title>\n                    <jats:p>Knowledge graphs (KGs) are powerful tools for structuring and analyzing biological information due to their ability to represent data and improve queries across heterogeneous datasets. However, constructing KGs from unstructured literature remains challenging due to the cost and expertise required for manual curation. Prior works have explored text-mining techniques to automate this process, but have limitations that impact their ability to capture complex relationships fully. Traditional text-mining methods struggle with understanding context across sentences. Additionally, these methods lack expert-level background knowledge, making it difficult to infer relationships that require awareness of concepts indirectly described in the text. Large Language Models (LLMs) present an opportunity to overcome these challenges. LLMs are trained on diverse literature, equipping them with contextual knowledge that enables more accurate information extraction.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>We present textToKnowledgeGraph, an artificial intelligence tool using LLMs to extract interactions from individual publications directly in Biological Expression Language (BEL). BEL was chosen for its compact, detailed representation of biological relationships, enabling structured, computationally accessible encoding. This work makes several contributions. (i) Development of the open-source Python textToKnowledgeGraph package (pypi.org\/project\/texttoknowledgegraph) for BEL extraction from scientific articles, usable from the command line and within other projects, (ii) an interactive application within Cytoscape Web to simplify extraction and exploration, (iii) a dataset of extractions that have been both computationally and manually reviewed to support future fine-tuning efforts.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Availability and implementation<\/jats:title>\n                    <jats:p>https:\/\/github.com\/ndexbio\/llm-text-to-knowledge-graph.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btag031","type":"journal-article","created":{"date-parts":[[2026,1,15]],"date-time":"2026-01-15T12:48:50Z","timestamp":1768481330000},"source":"Crossref","is-referenced-by-count":0,"title":["textToKnowledgeGraph: generation of molecular interaction knowledge graphs using large language models for exploration in Cytoscape"],"prefix":"10.1093","volume":"42","author":[{"ORCID":"https:\/\/orcid.org\/0009-0005-9299-2442","authenticated-orcid":false,"given":"Favour","family":"James","sequence":"first","affiliation":[{"name":"Department of Electronic and Electrical Engineering, Obafemi Awolowo University , Ife-Ife, Osun, 220103,","place":["Nigeria"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1471-9513","authenticated-orcid":false,"given":"Dexter","family":"Pratt","sequence":"additional","affiliation":[{"name":"Department of Medicine, University of California, San Diego , La Jolla, CA, 92093,","place":["United States"]}]},{"given":"Christopher","family":"Churas","sequence":"additional","affiliation":[{"name":"Department of Medicine, University of California, San Diego , La Jolla, CA, 92093,","place":["United States"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5709-371X","authenticated-orcid":false,"given":"Augustin","family":"Luna","sequence":"additional","affiliation":[{"name":"Computational Biology Branch, National Library of Medicine and Developmental Therapeutics Branch, National Cancer Institute , 9000 Rockville Pike , Bethesda, MD, 20892,","place":["United States"]}]}],"member":"286","published-online":{"date-parts":[[2026,1,19]]},"reference":[{"key":"2026021820511222400_btag031-B1","doi-asserted-by":"crossref","DOI":"10.15252\/msb.202211325","article-title":"Automated assembly of molecular mechanisms at scale from text mining and curated databases","volume-title":"Mol Sys Biol","author":"Bachman","year":"2023"},{"key":"2026021820511222400_btag031-B2","doi-asserted-by":"crossref","DOI":"10.1093\/gigascience\/giae082","article-title":"Knowledge graph\u2013based thought: a knowledge graph\u2013enhanced LLM framework for pan-cancer question answering","volume":"14","author":"Feng","year":"2025","journal-title":"Gigascience"},{"key":"2026021820511222400_btag031-B3","doi-asserted-by":"crossref","first-page":"vbac034","DOI":"10.1093\/bioadv\/vbac034","article-title":"Gilda: biomedical entity text normalization with machine-learned disambiguation as a service","volume":"2","author":"Gyori","year":"2022","journal-title":"Bioinform Adv"},{"key":"2026021820511222400_btag031-B4","doi-asserted-by":"crossref","first-page":"703","DOI":"10.1093\/bioinformatics\/btx660","article-title":"PyBEL: a computational framework for biological expression language","volume":"34","author":"Hoyt","year":"2018","journal-title":"Bioinformatics"},{"key":"2026021820511222400_btag031-B5","doi-asserted-by":"crossref","first-page":"2703","DOI":"10.1091\/mbc.10.8.2703","article-title":"Molecular interaction map of the mammalian cell cycle control and DNA repair systems","volume":"10","author":"Kohn","year":"1999","journal-title":"Mol Biol Cell"},{"key":"2026021820511222400_btag031-B6","author":"Liu","year":"2019"},{"key":"2026021820511222400_btag031-B7","doi-asserted-by":"crossref","first-page":"6","DOI":"10.1186\/2041-9414-4-6","article-title":"SIRT1\/PARP1 crosstalk: connecting DNA damage and metabolism","volume":"4","author":"Luna","year":"2013","journal-title":"Genome Integr"},{"key":"2026021820511222400_btag031-B8","doi-asserted-by":"crossref","DOI":"10.1093\/bioinformatics\/btad080","article-title":"The scalable precision medicine open knowledge engine (SPOKE): a massive knowledge graph of biomedical information","volume":"39","author":"Morris","year":"2023","journal-title":"Bioinformatics"},{"key":"2026021820511222400_btag031-B9","doi-asserted-by":"crossref","first-page":"735","DOI":"10.1038\/nbt.1558","article-title":"The systems biology graphical notation","volume":"27","author":"Nov\u00e8re","year":"2009","journal-title":"Nat Biotechnol"},{"key":"2026021820511222400_btag031-B10","doi-asserted-by":"crossref","first-page":"W203","DOI":"10.1093\/nar\/gkaf365","article-title":"Cytoscape web: bringing network biology to the browser","volume":"53","author":"Ono","year":"2025","journal-title":"Nucleic Acids Res"},{"key":"2026021820511222400_btag031-B2486807","doi-asserted-by":"publisher","first-page":"302","DOI":"10.1016\/j.cels.2015.10.001","article-title":"NDEx, the network data exchange","volume":"1","author":"Pratt","year":"2015","journal-title":"Cell Syst"},{"key":"2026021820511222400_btag031-B77315582","doi-asserted-by":"publisher","first-page":"2498","DOI":"10.1101\/gr.1239303","article-title":"Cytoscape: A software environment for integrated models of biomolecular interaction networks","volume":"13","author":"Shannon","year":"2003","journal-title":"Genome Res"},{"key":"2026021820511222400_btag031-B11","doi-asserted-by":"crossref","first-page":"193","DOI":"10.1016\/j.drudis.2013.12.011","article-title":"Recent advances in modeling languages for pathway maps and computable biological networks","volume":"19","author":"Slater","year":"2014","journal-title":"Drug Discov Today"},{"key":"2026021820511222400_btag031-B12","author":"Wang","year":"2024"},{"key":"2026021820511222400_btag031-B13","author":"Wang","year":"2024"},{"key":"2026021820511222400_btag031-B14","doi-asserted-by":"crossref","first-page":"W540","DOI":"10.1093\/nar\/gkae235","article-title":"PubTator 3.0: an AI-powered literature resource for unlocking biomedical knowledge","volume":"52","author":"Wei","year":"2024","journal-title":"Nucleic Acids Res"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btag031\/66464644\/btag031.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/42\/2\/btag031\/66464644\/btag031.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/42\/2\/btag031\/66464644\/btag031.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,2,19]],"date-time":"2026-02-19T01:51:21Z","timestamp":1771465881000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btag031\/8430289"}},"subtitle":[],"editor":[{"given":"Jonathan","family":"Wren","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2026,1,19]]},"references-count":16,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2026,2,3]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btag031","relation":{},"ISSN":["1367-4811"],"issn-type":[{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2026,2]]},"published":{"date-parts":[[2026,1,19]]},"article-number":"btag031"}}