{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,1]],"date-time":"2025-10-01T19:10:19Z","timestamp":1759345819382,"version":"build-2065373602"},"reference-count":56,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2025,10,1]],"date-time":"2025-10-01T00:00:00Z","timestamp":1759276800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2025,10,1]],"date-time":"2025-10-01T00:00:00Z","timestamp":1759276800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/100000051","name":"National Human Genome Research Institute","doi-asserted-by":"publisher","award":["HG010860"],"award-info":[{"award-number":["HG010860"]}],"id":[{"id":"10.13039\/100000051","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000015","name":"U.S. Department of Energy","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100000015","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100013060","name":"European Molecular Biology Laboratory","doi-asserted-by":"crossref","id":[{"id":"10.13039\/100013060","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/501100001711","name":"Swiss National Science Foundation","doi-asserted-by":"crossref","award":["215906"],"award-info":[{"award-number":["215906"]}],"id":[{"id":"10.13039\/501100001711","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/501100001711","name":"Schweizerischer Nationalfonds zur F\u00f6rderung der Wissenschaftlichen Forschung","doi-asserted-by":"publisher","award":["10002786"],"award-info":[{"award-number":["10002786"]}],"id":[{"id":"10.13039\/501100001711","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Cheminform"],"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:p>Accurately classifying chemical structures is essential for cheminformatics and bioinformatics, including tasks such as identifying bioactive compounds of interest, screening molecules for toxicity to humans, finding non-organic compounds with desirable material properties, or organizing large chemical libraries for drug discovery or environmental monitoring. However, manual classification is labor-intensive and difficult to scale to large chemical databases. Existing automated approaches either rely on manually constructed classification rules, or are deep learning methods that lack explainability. This work presents an approach that uses generative artificial intelligence to automatically write <jats:italic>chemical classifier programs<\/jats:italic> for classes in the Chemical Entities of Biological Interest (ChEBI) database. These programs can be used for efficient deterministic run-time classification of SMILES structures, with natural language explanations. The programs themselves constitute an explainable computable ontological model of chemical class nomenclature, which we call the ChEBI Chemical Class Program Ontology (C3PO). We validated our approach against the ChEBI database, and compared our results against deep learning models and a naive SMARTS pattern based classifier. C3PO outperforms the naive classifier, but does not reach the performance of state of the art deep learning methods. However, C3PO has a number of strengths that complement deep learning methods, including explainability and reduced data dependence. C3PO can be used alongside deep learning classifiers to provide an explanation of the classification, where both methods agree. The programs can be used as part of the ontology development process, and iteratively refined by expert human curators. <\/jats:p>","DOI":"10.1186\/s13321-025-01092-3","type":"journal-article","created":{"date-parts":[[2025,10,1]],"date-time":"2025-10-01T18:32:27Z","timestamp":1759343547000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["Chemical classification program synthesis using generative artificial intelligence"],"prefix":"10.1186","volume":"17","author":[{"given":"Christopher J.","family":"Mungall","sequence":"first","affiliation":[]},{"given":"Adnan","family":"Malik","sequence":"additional","affiliation":[]},{"given":"Daniel R.","family":"Korn","sequence":"additional","affiliation":[]},{"given":"Justin T.","family":"Reese","sequence":"additional","affiliation":[]},{"given":"Noel M.","family":"O\u2019Boyle","sequence":"additional","affiliation":[]},{"given":"Janna","family":"Hastings","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2025,10,1]]},"reference":[{"key":"1092_CR1","doi-asserted-by":"publisher","DOI":"10.1093\/nar\/gkae1059","author":"S Kim","year":"2024","unstructured":"Kim S, Chen J, Cheng T, Gindulyte A, He J, He S et al (2024) PubChem 2025 update. Nucleic Acids Res. https:\/\/doi.org\/10.1093\/nar\/gkae1059","journal-title":"Nucleic Acids Res"},{"key":"1092_CR2","doi-asserted-by":"publisher","first-page":"D1180","DOI":"10.1093\/nar\/gkad1004","volume":"52","author":"B Zdrazil","year":"2024","unstructured":"Zdrazil B, Felix E, Hunter F, Manners EJ, Blackshaw J, Corbett S et al (2024) The ChEMBL database in 2023: a drug discovery platform spanning multiple bioactivity data types and time periods. Nucleic Acids Res 52:D1180\u2013D1192","journal-title":"Nucleic Acids Res"},{"key":"1092_CR3","doi-asserted-by":"publisher","first-page":"3","DOI":"10.1002\/(SICI)1098-1128(199601)16:1<3::AID-MED1>3.0.CO;2-6","volume":"16","author":"RS Bohacek","year":"1996","unstructured":"Bohacek RS, McMartin C, Guida WC (1996) The art and practice of structure-based drug design: a molecular modeling perspective. Med Res Rev 16:3\u201350","journal-title":"Med Res Rev"},{"key":"1092_CR4","doi-asserted-by":"publisher","first-page":"8","DOI":"10.1186\/1758-2946-4-8","volume":"4","author":"J Hastings","year":"2012","unstructured":"Hastings J, Magka D, Batchelor C, Duan L, Stevens R, Ennis M et al (2012) Structure-based classification and ontology in chemistry. J Cheminform 4:8","journal-title":"J Cheminform"},{"key":"1092_CR5","doi-asserted-by":"publisher","first-page":"D1214","DOI":"10.1093\/nar\/gkv1031","volume":"44","author":"J Hastings","year":"2016","unstructured":"Hastings J, Owen G, Dekker A, Ennis M, Kale N, Muthukrishnan V et al (2016) ChEBI in 2016: Improved services and an expanding collection of metabolites. Nucleic Acids Res 44:D1214\u2013D1219","journal-title":"Nucleic Acids Res"},{"key":"1092_CR6","doi-asserted-by":"publisher","first-page":"513","DOI":"10.1186\/1471-2164-14-513","volume":"14","author":"DP Hill","year":"2013","unstructured":"Hill DP, Adams N, Bada M, Batchelor C, Berardini TZ, Dietze H et al (2013) Dovetailing biology and chemistry: integrating the Gene Ontology with the ChEBI chemical ontology. BMC Genomics 14:513","journal-title":"BMC Genomics"},{"key":"1092_CR7","doi-asserted-by":"publisher","first-page":"1429","DOI":"10.1038\/s41588-019-0500-1","volume":"51","author":"PD Thomas","year":"2019","unstructured":"Thomas PD, Hill DP, Mi H, Osumi-Sutherland D, Van Auken K, Carbon S et al (2019) Gene ontology causal activity modeling (GO-CAM) moves beyond GO annotations to structured descriptions of biological functions and systems. Nat Genet 51:1429\u20131433","journal-title":"Nat Genet"},{"key":"1092_CR8","doi-asserted-by":"publisher","first-page":"D640","DOI":"10.1093\/nar\/gkad1045","volume":"52","author":"O Yurekten","year":"2024","unstructured":"Yurekten O, Payne T, Tejera N, Amaladoss FX, Martin C, Williams M et al (2024) MetaboLights: open data repository for metabolomics. Nucleic Acids Res 52:D640\u2013D646","journal-title":"Nucleic Acids Res"},{"key":"1092_CR9","doi-asserted-by":"publisher","first-page":"31","DOI":"10.1021\/ci00057a005","volume":"28","author":"D Weininger","year":"1988","unstructured":"Weininger D (1988) SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules. J Chem Inf Comput Sci 28:31\u201336","journal-title":"J Chem Inf Comput Sci"},{"key":"1092_CR10","doi-asserted-by":"publisher","first-page":"22","DOI":"10.1186\/1758-2946-4-22","volume":"4","author":"NM O\u2019Boyle","year":"2012","unstructured":"O\u2019Boyle NM (2012) Towards a universal SMILES representation\u2014a standard method to generate canonical SMILES based on the InChI. J Cheminform 4:22","journal-title":"J Cheminform"},{"key":"1092_CR11","doi-asserted-by":"publisher","first-page":"7","DOI":"10.1186\/1758-2946-5-7","volume":"5","author":"S Heller","year":"2013","unstructured":"Heller S, McNaught A, Stein S, Tchekhovskoi D, Pletnev I (2013) InChI\u2014the worldwide chemical structure identifier standard. J Cheminform 5:7","journal-title":"J Cheminform"},{"key":"1092_CR12","first-page":"2665","volume-title":"Terpenes: chemistry, biological role, and therapeutic applications. Natural Products","author":"PP Brahmkshatriya","year":"2013","unstructured":"Brahmkshatriya PP, Brahmkshatriya PS (2013) Terpenes: chemistry, biological role, and therapeutic applications. Natural Products. Springer, Berlin Heidelberg, Berlin, Heidelberg, pp 2665\u20132691"},{"key":"1092_CR13","doi-asserted-by":"publisher","first-page":"232","DOI":"10.1007\/978-3-642-02326-2_24","volume-title":"True path rule hierarchical ensembles. Multiple Classifier Systems","author":"G Valentini","year":"2009","unstructured":"Valentini G (2009) True path rule hierarchical ensembles. Multiple Classifier Systems. Springer, Berlin Heidelberg, Berlin, Heidelberg, pp 232\u2013241"},{"key":"1092_CR14","volume-title":"The Manchester OWL Syntax","author":"M Horridge","year":"2006","unstructured":"Horridge M, Drummond N, Goodwin J, Rector A, Stevens R, Wan H (2006) The Manchester OWL Syntax. Experience and Directions 2006, OWL"},{"key":"1092_CR15","doi-asserted-by":"publisher","first-page":"67","DOI":"10.1007\/978-3-540-24750-0_4","volume-title":"Web ontology language: OWL. Handbook on Ontologies","author":"G Antoniou","year":"2004","unstructured":"Antoniou G, van Harmelen F (2004) Web ontology language: OWL. Handbook on Ontologies. Springer, Berlin Heidelberg, Berlin, Heidelberg, pp 67\u201392"},{"key":"1092_CR16","doi-asserted-by":"publisher","unstructured":"Mungall CJ, Dietze H, Osumi-Sutherland D (2014) Use of OWL within the Gene Ontology. bioRxiv. pp 010090. https:\/\/doi.org\/10.1101\/010090","DOI":"10.1101\/010090"},{"key":"1092_CR17","doi-asserted-by":"publisher","first-page":"R5","DOI":"10.1186\/gb-2012-13-1-r5","volume":"13","author":"CJ Mungall","year":"2012","unstructured":"Mungall CJ, Torniai C, Gkoutos GV, Lewis SE, Haendel MA (2012) Uberon, an integrative multi-species anatomy ontology. Genome Biol 13:R5","journal-title":"Genome Biol"},{"key":"1092_CR18","doi-asserted-by":"publisher","unstructured":"Caron AR, Puig-Barbe A, Quardokus EM, Balhoff JP, Belfiore J, Chipampe N-J, et al (2024) A general strategy for generating expert-guided, simplified views of ontologies. bioRxiv. pp 2024.12.13.628309. https:\/\/doi.org\/10.1101\/2024.12.13.628309","DOI":"10.1101\/2024.12.13.628309"},{"key":"1092_CR19","doi-asserted-by":"publisher","unstructured":"Matentzoglu N, Bello SM, Stefancsik R, Alghamdi SM, Anagnostopoulos AV, Balhoff JP, et al (2024) The Unified Phenotype Ontology (uPheno): a framework for cross-species integrative phenomics. bioRxivorg. https:\/\/doi.org\/10.1101\/2024.09.18.613276","DOI":"10.1101\/2024.09.18.613276"},{"key":"1092_CR20","doi-asserted-by":"publisher","DOI":"10.1093\/database\/baac087","author":"N Matentzoglu","year":"2022","unstructured":"Matentzoglu N, Goutte-Gattat D, Tan SZK, Balhoff JP, Carbon S, Caron AR et al (2022) Ontology development kit: a toolkit for building, maintaining and standardizing biomedical ontologies. Database. https:\/\/doi.org\/10.1093\/database\/baac087","journal-title":"Database"},{"key":"1092_CR21","unstructured":"Villanueva-Rosales N, Dumontier M (2007) Describing chemical functional groups in OWL-DL for the classification of chemical compounds. OWLED. 258. Available: https:\/\/citeseerx.ist.psu.edu\/document?repid=rep1&type=pdf&doi=4df753563290f668a5c0bf39d20b235f090119a9"},{"key":"1092_CR22","doi-asserted-by":"publisher","unstructured":"Magka D, Motik B, Horrocks I (2011) Chemical knowledge representation with description graphs and logic programming Proceedings of the 4th International Workshop on Semantic Web Applications and Tools for the Life Sciences. New York. ACM. https:\/\/doi.org\/10.1145\/2166896.2166916","DOI":"10.1145\/2166896.2166916"},{"key":"1092_CR23","unstructured":"Hastings J, Dumontier M, Hull D, Horridge M, Steinbeck C, Sattler U, et al (2010) Representing chemicals using OWL, description graphs and rules. Available: https:\/\/scholar.google.com\/citations?view_op=view_citation&hl=en&citation_for_view=cz-hhPUAAAAJ:9yKSN-GCB0IC"},{"key":"1092_CR24","unstructured":"Fl\u00fcgel S, Glauer M, Neuhaus F, Hastings J (2022) When one Logic is Not Enough: Integrating First-order Annotations in OWL Ontologies. arXiv [cs.AI]. Available: https:\/\/semantic-web-journal.net\/system\/files\/swj3440.pdf"},{"key":"1092_CR25","doi-asserted-by":"publisher","first-page":"257","DOI":"10.1007\/BF01536398","volume":"15","author":"JS Schlipf","year":"1995","unstructured":"Schlipf JS (1995) Complexity and undecidability results for logic programming. Ann Math Artif Intell 15:257\u2013288","journal-title":"Ann Math Artif Intell"},{"key":"1092_CR26","unstructured":"Daylight Theory: SMARTS\u2014a Language for Describing Molecular Patterns. [cited 29 Dec 2024]. Available: https:\/\/www.daylight.com\/dayhtml\/doc\/theory\/theory.smarts.html."},{"key":"1092_CR27","doi-asserted-by":"publisher","first-page":"2560","DOI":"10.1021\/acs.jcim.9b00250","volume":"59","author":"R Schmidt","year":"2019","unstructured":"Schmidt R, Ehmki ESR, Ohm F, Ehrlich H-C, Mashychev A, Rarey M (2019) Comparing molecular patterns using the example of SMARTS: theory and algorithms. J Chem Inf Model 59:2560\u20132571","journal-title":"J Chem Inf Model"},{"key":"1092_CR28","doi-asserted-by":"publisher","first-page":"61","DOI":"10.1186\/s13321-016-0174-y","volume":"8","author":"Y Djoumbou Feunang","year":"2016","unstructured":"Djoumbou Feunang Y, Eisner R, Knox C, Chepelev L, Hastings J, Owen G et al (2016) Classyfire: automated chemical classification with a comprehensive, computable taxonomy. J Cheminform 8:61","journal-title":"J Cheminform"},{"key":"1092_CR29","doi-asserted-by":"publisher","first-page":"23","DOI":"10.1186\/s13321-021-00500-8","volume":"13","author":"J Hastings","year":"2021","unstructured":"Hastings J, Glauer M, Memariani A, Neuhaus F, Mossakowski T (2021) Learning chemistry: exploring the suitability of machine learning for the task of structure-based chemical ontology classification. J Cheminform 13:23","journal-title":"J Cheminform"},{"key":"1092_CR30","doi-asserted-by":"publisher","DOI":"10.1126\/sciadv.aap7885","volume":"4","author":"M Popova","year":"2018","unstructured":"Popova M, Isayev O, Tropsha A (2018) Deep reinforcement learning for de novo drug design. Sci Adv 4:eaap7885","journal-title":"Sci Adv"},{"key":"1092_CR31","doi-asserted-by":"publisher","first-page":"896","DOI":"10.1039\/D3DD00238A","volume":"3","author":"M Glauer","year":"2024","unstructured":"Glauer M, Neuhaus F, Fl\u00fcgel S, Wosny M, Mossakowski T, Memariani A et al (2024) Chebifier: automating semantic classification in ChEBI to accelerate data-driven discovery. Digital Discov 3:896\u2013907","journal-title":"Digital Discov"},{"key":"1092_CR32","first-page":"937","volume":"15","author":"M Glauer","year":"2024","unstructured":"Glauer M, Memariani A, Neuhaus F, Mossakowski T, Hastings J (2024) Interpretable ontology extension in chemistry. Semantic Web 15:937\u2013958","journal-title":"Semantic Web"},{"key":"1092_CR33","unstructured":"Naveed H, Khan AU, Qiu S, Saqib M, Anwar S, Usman M, et al (2023) A comprehensive overview of large Language Models. arXiv [cs.CL]. Available: http:\/\/arxiv.org\/abs\/2307.06435"},{"key":"1092_CR34","unstructured":"Wei J, Tay Y, Bommasani R, Raffel C, Zoph B, Borgeaud S, et al (2022) Emergent Abilities of Large Language Models. Transactions on Machine Learning Research. Available: https:\/\/openreview.net\/pdf?id=yzkSU5zdwD"},{"key":"1092_CR35","unstructured":"Huang Y, Zhang R, He X, Zhi X, Wang H, Li X, et al (2024) ChemEval: a comprehensive multi-level chemical evaluation for large language models. arXiv [cs.CL]. Available: http:\/\/arxiv.org\/abs\/2409.13989"},{"key":"1092_CR36","doi-asserted-by":"publisher","DOI":"10.1186\/s13326-024-00320-3","volume":"15","author":"S Toro","year":"2024","unstructured":"Toro S, Anagnostopoulos AV, Bello SM, Blumberg K, Cameron R, Carmody L et al (2024) Dynamic Retrieval Augmented Generation of ontologies using artificial intelligence (DRAGON-AI). J Biomed Semantics 15:19","journal-title":"J Biomed Semantics"},{"key":"1092_CR37","unstructured":"Landrum G (2013) RDKit: a software suite for cheminformatics, computational chemistry, and predictive modeling. Available: https:\/\/www.rdkit.org\/RDKit_Overview.pdf"},{"key":"1092_CR38","doi-asserted-by":"publisher","first-page":"33","DOI":"10.1186\/1758-2946-3-33","volume":"3","author":"NM O\u2019Boyle","year":"2011","unstructured":"O\u2019Boyle NM, Banck M, James CA, Morley C, Vandermeersch T, Hutchison GR (2011) Open babel: an open chemical toolbox. J Cheminform 3:33","journal-title":"J Cheminform"},{"key":"1092_CR39","doi-asserted-by":"publisher","first-page":"51","DOI":"10.1186\/s13321-020-00456-1","volume":"12","author":"AP Bento","year":"2020","unstructured":"Bento AP, Hersey A, F\u00e9lix E, Landrum G, Gaulton A, Atkinson F et al (2020) An open source chemical structure curation pipeline using RDKit. J Cheminform 12:51","journal-title":"J Cheminform"},{"key":"1092_CR40","unstructured":"Austin J, Odena A, Nye M, Bosma M, Michalewski H, Dohan D, et al (2021) Program synthesis with large language models. arXiv [cs.PL]. Available: http:\/\/arxiv.org\/abs\/2108.07732"},{"key":"1092_CR41","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/3606367","volume":"56","author":"P Christen","year":"2024","unstructured":"Christen P, Hand DJ, Kirielle N (2024) A review of the F-measure: its history, properties, criticism, and alternatives. ACM Comput Surv 56:1\u201324","journal-title":"ACM Comput Surv"},{"key":"1092_CR42","volume-title":"Information Retrieval","author":"CJ Van Rijsbergen","year":"1979","unstructured":"Van Rijsbergen CJ (1979) Information Retrieval, 2nd edn. Butterworth-Heinemann, Oxford, England","edition":"2"},{"key":"1092_CR43","unstructured":"Wei J, Wang X, Schuurmans D, Bosma M, Ichter B, Xia F, et al (2022) Chain-of-Thought Prompting Elicits Reasoning in Large Language Models. arXiv [cs.CL]. Available: http:\/\/arxiv.org\/abs\/2201.11903"},{"key":"1092_CR44","unstructured":"chemrof: Schema for chemistry ontology classes. Github; Available: https:\/\/github.com\/chemkg\/chemrof"},{"key":"1092_CR45","first-page":"148","volume":"3073","author":"S Moxon","year":"2021","unstructured":"Moxon S, Solbrig H, Unni D, Jiao D, Bruskiewich R, Balhoff J et al (2021) The Linked Data Modeling Language (LinkML): a general-purpose data modeling framework grounded in machine-readable semantics. CEUR Worksh Proc 3073:148\u2013151","journal-title":"CEUR Worksh Proc"},{"key":"1092_CR46","doi-asserted-by":"publisher","unstructured":"Hoyt CT, Mungall C, Vasilevsky N, Domingo-Fern\u00e1ndez D, Healy M, Colluru V (2020) Extension of Roles in the ChEBI Ontology. ChemRxiv. https:\/\/doi.org\/10.26434\/chemrxiv.12591221.v1","DOI":"10.26434\/chemrxiv.12591221.v1"},{"key":"1092_CR47","doi-asserted-by":"publisher","unstructured":"MonarchInitiative (2025) C3PO Dataset. C3PO Dataset. Hugging Face. https:\/\/doi.org\/10.57967\/hf\/4033","DOI":"10.57967\/hf\/4033"},{"key":"1092_CR48","doi-asserted-by":"publisher","DOI":"10.1093\/database\/baac035","author":"N Matentzoglu","year":"2022","unstructured":"Matentzoglu N, Balhoff JP, Bello SM, Bizon C, Brush M, Callahan TJ et al (2022) A simple standard for sharing ontological mappings (SSSOM). Database. https:\/\/doi.org\/10.1093\/database\/baac035","journal-title":"Database"},{"key":"1092_CR49","doi-asserted-by":"publisher","first-page":"27","DOI":"10.1093\/nar\/28.1.27","volume":"28","author":"M Kanehisa","year":"2000","unstructured":"Kanehisa M, Goto S (2000) KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res 28:27\u201330","journal-title":"Nucleic Acids Res"},{"key":"1092_CR50","unstructured":"Kalai AT, Vempala SS (2023) Calibrated language models must hallucinate. arXiv [cs.CL]. Available: http:\/\/arxiv.org\/abs\/2311.14648"},{"key":"1092_CR51","unstructured":"Agarwal V, Pei Y, Alamir S, Liu X (2024) CodeMirage: hallucinations in code generated by large Language Models. arXiv [cs.SE]. Available: http:\/\/arxiv.org\/abs\/2408.08333"},{"key":"1092_CR52","doi-asserted-by":"publisher","DOI":"10.5281\/ZENODO.14298221","author":"C Mungall","year":"2024","unstructured":"Mungall C (2024) The need for a simpler collaboratively maintained CHEBI hierarchy. Zenodo. https:\/\/doi.org\/10.5281\/ZENODO.14298221"},{"key":"1092_CR53","doi-asserted-by":"publisher","first-page":"471","DOI":"10.5650\/jos.53.471","volume":"53","author":"ER Gunawan","year":"2004","unstructured":"Gunawan ER, Basri M, Rahman MBA, Salleh AB, Rahman RNZA (2004) Lipase-catalyzed synthesis of palm-based wax esters. J Oleo Sci 53:471\u2013477","journal-title":"J Oleo Sci"},{"key":"1092_CR54","doi-asserted-by":"publisher","unstructured":"Matentzoglu N, Reese J, Mungall C (2025) cmungall\/odk-ai: v0.2.1. Zenodo. https:\/\/doi.org\/10.5281\/ZENODO.15300288","DOI":"10.5281\/ZENODO.15300288"},{"key":"1092_CR55","unstructured":"Fl\u00fcgel S, Glauer M, Hastings J, Mossakowski T, Neuhaus F (2025) Chebifier 2: an ensemble for chemistry."},{"key":"1092_CR56","doi-asserted-by":"publisher","first-page":"1896","DOI":"10.1093\/bioinformatics\/btz817","volume":"36","author":"A Morgat","year":"2020","unstructured":"Morgat A, Lombardot T, Coudert E, Axelsen K, Neto TB, Gehant S et al (2020) Enzyme annotation in UniProtKB using Rhea. Bioinformatics 36:1896\u20131901","journal-title":"Bioinformatics"}],"container-title":["Journal of Cheminformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-025-01092-3.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s13321-025-01092-3\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-025-01092-3.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,1]],"date-time":"2025-10-01T18:32:32Z","timestamp":1759343552000},"score":1,"resource":{"primary":{"URL":"https:\/\/jcheminf.biomedcentral.com\/articles\/10.1186\/s13321-025-01092-3"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,10,1]]},"references-count":56,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2025,12]]}},"alternative-id":["1092"],"URL":"https:\/\/doi.org\/10.1186\/s13321-025-01092-3","relation":{"references":[{"id-type":"doi","id":"10.5281\/ZENODO.14298221","asserted-by":"subject"}]},"ISSN":["1758-2946"],"issn-type":[{"value":"1758-2946","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,10,1]]},"assertion":[{"value":"24 May 2025","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"30 August 2025","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"1 October 2025","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare no competing interests.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"152"}}