{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,27]],"date-time":"2026-02-27T03:44:37Z","timestamp":1772163877412,"version":"3.50.1"},"reference-count":14,"publisher":"World Scientific Pub Co Pte Ltd","issue":"02","funder":[{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"publisher","award":["R01-LM013392"],"award-info":[{"award-number":["R01-LM013392"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Int. J. Artif. Intell. Robot. Res."],"published-print":{"date-parts":[[2024,6]]},"abstract":"<jats:p>ChatGPT has demonstrated its potential as a surrogate knowledge graph. Trained on extensive data sources, including open-access publications, peer-reviewed research articles, and biomedical websites, ChatGPT extracted information on gene relationships and biological pathways so that it can be used to predict them. However, a major challenge is model hallucination, that is, high false positive rates. To assess and address this challenge, we systematically evaluated ChatGPT\u2019s capacity for predicting gene relationships using GPT-3.5-turbo, GPT-4, and GPT-4o. Benchmarking against the KEGG Pathway Database as the ground truth, we experimented with diverse prompting strategies, targeting gene relationships of activation, inhibition, and phosphorylation. We introduced an innovative iterative prompt refinement technique. By assessing prompt efficacy using metrics such as F-1 score, precision, and recall, GPT-4 suggested improved prompts. A refined prompt, which combines a specialized role with explanatory text, significantly enhanced the performance. Going beyond pairwise gene relationships, we also deciphered complex gene interplays, such as gene interaction chains and pathways pertinent to diseases such as non-small cell lung cancer. Direct prompts showed limited success, but \u201cleast-to-most\u201d prompting exhibited significant potentials for such network constructions. The methods in this study may be used for other bioinformatics prediction problems.<\/jats:p>","DOI":"10.1142\/s2972335324500054","type":"journal-article","created":{"date-parts":[[2024,6,17]],"date-time":"2024-06-17T03:34:31Z","timestamp":1718595271000},"source":"Crossref","is-referenced-by-count":4,"title":["Iterative Prompt Refinement for Mining Gene Relationships from ChatGPT"],"prefix":"10.1142","volume":"01","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-3969-2679","authenticated-orcid":false,"given":"Yibo","family":"Chen","sequence":"first","affiliation":[{"name":"Institute for Data Science and Informatics, Bond Life Sciences Center, University of Missouri, Columbia, MO 65211, USA"}]},{"ORCID":"https:\/\/orcid.org\/0009-0001-4878-3978","authenticated-orcid":false,"given":"Jeffrey","family":"Gao","sequence":"additional","affiliation":[{"name":"Marriotts Ridge High School, Marriottsville, MD 21104, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6022-2620","authenticated-orcid":false,"given":"Marius","family":"Petruc","sequence":"additional","affiliation":[{"name":"Institute for Data Science and Informatics, University of Missouri, Columbia, MO 65211, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7173-9414","authenticated-orcid":false,"given":"Richard D.","family":"Hammer","sequence":"additional","affiliation":[{"name":"Department of Pathology and Anatomical Sciences, University of Missouri, Columbia, MO 65211, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6145-8096","authenticated-orcid":false,"given":"Mihail","family":"Popescu","sequence":"additional","affiliation":[{"name":"Department of Biomedical Informatics, Biostatistics and Medical Epidemiology, University of Missouri, Columbia, MO 65211, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4809-0514","authenticated-orcid":false,"given":"Dong","family":"Xu","sequence":"additional","affiliation":[{"name":"Department of Electrical Engineering and Computer Science, Bond Life Sciences Center, Institute for Data Science and Informatics, University of Missouri, Columbia, MO 65211, USA"}]}],"member":"219","published-online":{"date-parts":[[2024,7,13]]},"reference":[{"key":"S2972335324500054BIB001","doi-asserted-by":"publisher","DOI":"10.1016\/j.tibtech.2006.10.002"},{"key":"S2972335324500054BIB002","doi-asserted-by":"publisher","DOI":"10.1016\/j.jbi.2003.10.001"},{"key":"S2972335324500054BIB003","doi-asserted-by":"publisher","DOI":"10.1093\/bib\/bbm029"},{"key":"S2972335324500054BIB004","first-page":"392","volume":"4","author":"Salamonsen W.","year":"1999","journal-title":"Pac. Symp. Biocomput."},{"key":"S2972335324500054BIB006","first-page":"77","volume-title":"Proc. Int. Conf. Intell. Syst. Mol. Biol.","author":"Craven M.","year":"1999"},{"key":"S2972335324500054BIB007","doi-asserted-by":"publisher","DOI":"10.1109\/BIBM52615.2021.9669391"},{"key":"S2972335324500054BIB008","first-page":"3791","volume-title":"2022 IEEE Int. Conf. Bioinformatics and Biomedicine (BIBM)","author":"Chen Y."},{"key":"S2972335324500054BIB011","doi-asserted-by":"publisher","DOI":"10.1038\/s41746-023-00856-1"},{"issue":"1","key":"S2972335324500054BIB012","volume":"6","author":"Wornow M.","year":"2023","journal-title":"npj Dig. Med."},{"key":"S2972335324500054BIB013","doi-asserted-by":"publisher","DOI":"10.1093\/nar\/gkac963"},{"key":"S2972335324500054BIB021","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.acl-long.295"},{"key":"S2972335324500054BIB029","doi-asserted-by":"publisher","DOI":"10.1007\/s10462-022-10228-y"},{"key":"S2972335324500054BIB032","doi-asserted-by":"publisher","DOI":"10.1093\/nar\/gkac1000"},{"key":"S2972335324500054BIB033","doi-asserted-by":"publisher","DOI":"10.3390\/pharmaceutics15082090"}],"container-title":["International Journal of Artificial Intelligence and Robotics Research"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S2972335324500054","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,9,8]],"date-time":"2024-09-08T23:23:07Z","timestamp":1725837787000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/10.1142\/S2972335324500054"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,6]]},"references-count":14,"journal-issue":{"issue":"02","published-print":{"date-parts":[[2024,6]]}},"alternative-id":["10.1142\/S2972335324500054"],"URL":"https:\/\/doi.org\/10.1142\/s2972335324500054","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2023.12.23.573201","asserted-by":"object"}]},"ISSN":["2972-3353","2972-3361"],"issn-type":[{"value":"2972-3353","type":"print"},{"value":"2972-3361","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,6]]},"article-number":"2450005"}}