{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,7]],"date-time":"2026-04-07T17:01:43Z","timestamp":1775581303074,"version":"3.50.1"},"reference-count":71,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2025,8,18]],"date-time":"2025-08-18T00:00:00Z","timestamp":1755475200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2025,8,18]],"date-time":"2025-08-18T00:00:00Z","timestamp":1755475200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/100019180","name":"HORIZON EUROPE European Research Council","doi-asserted-by":"publisher","award":["101084642"],"award-info":[{"award-number":["101084642"]}],"id":[{"id":"10.13039\/100019180","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100014013","name":"UK Research and Innovation","doi-asserted-by":"publisher","award":["101084642"],"award-info":[{"award-number":["101084642"]}],"id":[{"id":"10.13039\/100014013","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100000265","name":"Medical Research Council","doi-asserted-by":"publisher","award":["MR\/V012452\/1"],"award-info":[{"award-number":["MR\/V012452\/1"]}],"id":[{"id":"10.13039\/501100000265","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Cheminform"],"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Purpose<\/jats:title>\n                    <jats:p>Drug-induced liver injury (DILI) is a significant concern in drug development, often leading to the discontinuation of clinical trials and the withdrawal of drugs from the market. This study explores the application of graph neural networks (GNNs) for DILI prediction, using molecular graph representations as the primary input.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Methods<\/jats:title>\n                    <jats:p>We evaluated several GNN architectures, including Graph Convolutional Networks (GCNs), Graph Attention Networks (GATs), Graph Sample and Aggregation (GraphSAGE), and Graph Isomorphism Networks (GINs), using the latest FDA DILI dataset and other molecular property prediction datasets. We introduce a novel approach that creates a custom graph dataset, driven by molecular optimisation, that incorporates detailed and realistic chemical features such as bond lengths and partial charges as input into the GNN models. We have named our model approach DILIGeNN.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>DILIGeNN achieved an AUC of 0.897 on the DILI dataset, surpassing the current state-of-the-art model in the DILI prediction task. Furthermore, DILIGeNN outperformed the state-of-the-art in other graph-based molecular prediction tasks, achieving an AUC of 0.918 on the Clintox dataset, 0.993 on the BBBP dataset, and 0.953 on the BACE dataset, indicating strong generalisation and performance across different datasets.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Conclusion<\/jats:title>\n                    <jats:p>DILIGeNN, utilising a single graph representation as input, outperforms the state-of-the-art methods in DILI prediction that incorporate both molecular fingerprint and graph-structured data. These findings highlight the effectiveness of our molecular graph generation and the GNN training approach as a powerful tool for early-stage drug development and drug repurposing pipeline.<\/jats:p>\n                    <jats:p>Scientific Contribution: DILIGeNN is a GNN framework that extracts graph features from 3D optimised molecular structures as is done in target-based drug discovery and molecular docking simulation. Our method is the first to encode spatial and electrostatic information into a single graph representation, as opposed to other work that require multiple graphs or additional chemical descriptors for feature representation. Our approach, using warm starts following repeated early stopping during training, outperforms the current state-of-the-art methods in liver toxicity (DILI), permeability (BBBP) and activity (BACE) prediction tasks.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Graphic Abstract<\/jats:title>\n                  <\/jats:sec>","DOI":"10.1186\/s13321-025-01068-3","type":"journal-article","created":{"date-parts":[[2025,8,18]],"date-time":"2025-08-18T07:58:40Z","timestamp":1755503920000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":4,"title":["Improving drug-induced liver injury prediction using graph neural networks with augmented graph features from molecular optimisation"],"prefix":"10.1186","volume":"17","author":[{"given":"Taeyeub","family":"Lee","sequence":"first","affiliation":[]},{"given":"Joram M.","family":"Posma","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2025,8,18]]},"reference":[{"key":"1068_CR1","first-page":"73","volume":"6","author":"S David","year":"2010","unstructured":"David S, Hamilton JP (2010) Drug-induced liver injury. US Gastroenterol Hepatol Rev 6:73","journal-title":"US Gastroenterol Hepatol Rev"},{"issue":"1","key":"1068_CR2","doi-asserted-by":"publisher","first-page":"58","DOI":"10.1038\/s41572-019-0105-0","volume":"5","author":"RJ Andrade","year":"2019","unstructured":"Andrade RJ, Chalasani N, Bj\u00f6rnsson ES, Suzuki A, Kullak-Ublick GA, Watkins PB, Devarbhavi H, Merz M, Lucena MI, Kaplowitz N et al (2019) Drug-induced liver injury. Nat Rev Dis Primers 5(1):58","journal-title":"Nat Rev Dis Primers"},{"issue":"1","key":"1068_CR3","doi-asserted-by":"publisher","first-page":"31","DOI":"10.1080\/17474124.2018.1383154","volume":"12","author":"S Thakkar","year":"2018","unstructured":"Thakkar S, Chen M, Fang H, Liu Z, Roberts R, Tong W (2018) The liver toxicity knowledge base (lktb) and drug-induced liver injury (dili) classification for assessment of human liver injury. Expert Rev Gastroenterol Hepatol 12(1):31\u201338","journal-title":"Expert Rev Gastroenterol Hepatol"},{"issue":"2","key":"1068_CR4","doi-asserted-by":"publisher","first-page":"550","DOI":"10.1021\/acs.chemrestox.0c00374","volume":"34","author":"T Li","year":"2020","unstructured":"Li T, Tong W, Roberts R, Liu Z, Thakkar S (2020) Deepdili: deep learning-powered drug-induced liver injury prediction using model-level representation. Chem Res Toxicol 34(2):550\u2013565","journal-title":"Chem Res Toxicol"},{"issue":"7","key":"1068_CR5","doi-asserted-by":"publisher","first-page":"2246","DOI":"10.1053\/j.gastro.2010.04.001","volume":"138","author":"N Chalasani","year":"2010","unstructured":"Chalasani N, Bj\u00f6rnsson E (2010) Risk factors for idiosyncratic drug-induced liver injury. Gastroenterology 138(7):2246\u20132259","journal-title":"Gastroenterology"},{"issue":"22","key":"1068_CR6","doi-asserted-by":"publisher","first-page":"4967","DOI":"10.1002\/cncr.33153","volume":"126","author":"SP Mondaca","year":"2020","unstructured":"Mondaca SP, Liu D, Flynn JR, Badson S, Hamaway S, Gounder MM, Khalil DN, Drilon AE, Li BT, Jhaveri KL et al (2020) Clinical implications of drug-induced liver injury in early-phase oncology clinical trials. Cancer 126(22):4967\u20134974","journal-title":"Cancer"},{"issue":"2","key":"1068_CR7","doi-asserted-by":"publisher","first-page":"655","DOI":"10.1093\/toxsci\/kfy058","volume":"163","author":"SU Vorrink","year":"2018","unstructured":"Vorrink SU, Zhou Y, Ingelman-Sundberg M, Lauschke VM (2018) Prediction of drug-induced hepatotoxicity using long-term stable primary hepatic 3d spheroid cultures in chemically defined conditions. Toxicol Sci 163(2):655\u2013665","journal-title":"Toxicol Sci"},{"issue":"6","key":"1068_CR8","doi-asserted-by":"publisher","first-page":"489","DOI":"10.1038\/nrd1750","volume":"4","author":"N Kaplowitz","year":"2005","unstructured":"Kaplowitz N (2005) Idiosyncratic drug hepatotoxicity. Nat Rev Drug Discov 4(6):489\u2013499","journal-title":"Nat Rev Drug Discov"},{"issue":"15\u201316","key":"1068_CR9","doi-asserted-by":"publisher","first-page":"697","DOI":"10.1016\/j.drudis.2011.05.007","volume":"16","author":"M Chen","year":"2011","unstructured":"Chen M, Vijay V, Shi Q, Liu Z, Fang H, Tong W (2011) Fda-approved drug labeling for the study of drug-induced liver injury. Drug Discov Today 16(15\u201316):697\u2013703","journal-title":"Drug Discov Today"},{"issue":"4","key":"1068_CR10","doi-asserted-by":"publisher","first-page":"648","DOI":"10.1016\/j.drudis.2016.02.015","volume":"21","author":"M Chen","year":"2016","unstructured":"Chen M, Suzuki A, Thakkar S, Yu K, Hu C, Tong W (2016) Dilirank: the largest reference drug list ranked by the risk for developing drug-induced liver injury in humans. Drug Discov Today 21(4):648\u2013653","journal-title":"Drug Discov Today"},{"issue":"1","key":"1068_CR11","doi-asserted-by":"publisher","first-page":"201","DOI":"10.1016\/j.drudis.2019.09.022","volume":"25","author":"S Thakkar","year":"2020","unstructured":"Thakkar S, Li T, Liu Z, Wu L, Roberts R, Tong W (2020) Drug-induced liver injury severity and toxicity (dilist): binary classification of 1279 drugs by human hepatotoxicity. Drug Discov Today 25(1):201\u2013208","journal-title":"Drug Discov Today"},{"key":"1068_CR12","doi-asserted-by":"publisher","first-page":"393","DOI":"10.1007\/978-1-0716-1960-5_15","volume-title":"In silico methods for predicting drug toxicity","author":"J Liu","year":"2022","unstructured":"Liu J, Guo W, Sakkiah S, Ji Z, Yavas G, Zou W, Chen M, Tong W, Patterson TA, Hong H (2022) Machine learning models for predicting liver toxicity. In: Benfenati E (ed) In silico methods for predicting drug toxicity. Humana, New York, pp 393\u2013415"},{"key":"1068_CR13","doi-asserted-by":"publisher","first-page":"25","DOI":"10.1186\/s12859-017-1638-4","volume":"18","author":"E Kim","year":"2017","unstructured":"Kim E, Nam H (2017) Prediction models for drug-induced hepatotoxicity by using weighted molecular fingerprints. BMC Bioinform 18:25\u201334","journal-title":"BMC Bioinform"},{"issue":"7","key":"1068_CR14","doi-asserted-by":"publisher","first-page":"559","DOI":"10.1080\/1062936X.2016.1201142","volume":"27","author":"X-W Zhu","year":"2016","unstructured":"Zhu X-W, Xin Y-J, Chen Q-H (2016) Chemical and in vitro biological information to predict mouse liver toxicity using recursive random forests. SAR QSAR Environ Res 27(7):559\u2013572","journal-title":"SAR QSAR Environ Res"},{"issue":"4","key":"1068_CR15","doi-asserted-by":"publisher","first-page":"738","DOI":"10.1021\/tx500501h","volume":"28","author":"J Liu","year":"2015","unstructured":"Liu J, Mansouri K, Judson RS, Martin MT, Hong H, Chen M, Xu X, Thomas RS, Shah I (2015) Predicting hepatotoxicity using toxcast in vitro bioactivity and chemical structure. Chem Res Toxicol 28(4):738\u2013751","journal-title":"Chem Res Toxicol"},{"key":"1068_CR16","doi-asserted-by":"publisher","first-page":"725","DOI":"10.1016\/B978-0-12-387817-5.00040-6","volume-title":"Drug-induced liver disease","author":"JH Hoofnagle","year":"2013","unstructured":"Hoofnagle JH (2013) Livertox: a website on drug-induced liver injury. In: Kaplowitz N, DeLeve LD (eds) Drug-induced liver disease. Elsevier, Amsterdam, pp 725\u2013732"},{"issue":"1","key":"1068_CR17","doi-asserted-by":"publisher","first-page":"105677","DOI":"10.1016\/j.isci.2022.105677","volume":"26","author":"S Lim","year":"2023","unstructured":"Lim S, Kim Y, Gu J, Lee S, Shin W, Kim S (2023) Supervised chemical graph mining improves drug-induced liver injury prediction. iScience 26(1):105677","journal-title":"iScience"},{"key":"1068_CR18","doi-asserted-by":"publisher","first-page":"1290","DOI":"10.1021\/acs.chemrestox.4c00015","volume":"37","author":"S Seal","year":"2024","unstructured":"Seal S, Williams D, Hosseini-Gerami L, Mahale M, Carpenter AE, Spjuth O, Bender A (2024) Improved detection of drug-induced liver injury by integrating predicted in vivo and in vitro data. Chem Res Toxicol 37:1290","journal-title":"Chem Res Toxicol"},{"key":"1068_CR19","doi-asserted-by":"publisher","first-page":"153736","DOI":"10.1016\/j.tox.2024.153736","volume":"502","author":"Q Yang","year":"2024","unstructured":"Yang Q, Zhang S, Li Y (2024) Deep learning algorithm based on molecular fingerprint for prediction of drug-induced liver injury. Toxicology 502:153736","journal-title":"Toxicology"},{"key":"1068_CR20","doi-asserted-by":"publisher","first-page":"154108","DOI":"10.1016\/j.tox.2025.154108","volume":"514","author":"AS Wibowo","year":"2025","unstructured":"Wibowo AS, Chong KT, Tayara H (2025) Enhancing dili toxicity prediction through integrated graph attention (gatnn) and dense neural networks (dnn). Toxicology 514:154108","journal-title":"Toxicology"},{"issue":"6","key":"1068_CR21","doi-asserted-by":"publisher","first-page":"1273","DOI":"10.1021\/ci010132r","volume":"42","author":"JL Durant","year":"2002","unstructured":"Durant JL, Leland BA, Henry DR, Nourse JG (2002) Reoptimization of mdl keys for use in drug discovery. J Chem Inf Comput Sci 42(6):1273\u20131280","journal-title":"J Chem Inf Comput Sci"},{"issue":"D1","key":"1068_CR22","doi-asserted-by":"publisher","first-page":"1202","DOI":"10.1093\/nar\/gkv951","volume":"44","author":"S Kim","year":"2016","unstructured":"Kim S, Thiessen PA, Bolton EE, Chen J, Fu G, Gindulyte A, Han L, He J, He S, Shoemaker BA et al (2016) Pubchem substance and compound databases. Nucleic Acids Res 44(D1):1202\u20131213","journal-title":"Nucleic Acids Res"},{"key":"1068_CR23","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s13321-020-00456-1","volume":"12","author":"AP Bento","year":"2020","unstructured":"Bento AP, Hersey A, F\u00e9lix E, Landrum G, Gaulton A, Atkinson F, Bellis LJ, De Veij M, Leach AR (2020) An open source chemical structure curation pipeline using rdkit. J Cheminform 12:1\u201316","journal-title":"J Cheminform"},{"issue":"6","key":"1068_CR24","doi-asserted-by":"publisher","first-page":"1039","DOI":"10.1021\/ci00028a014","volume":"35","author":"LH Hall","year":"1995","unstructured":"Hall LH, Kier LB (1995) Electrotopological state indices for atom types: a novel combination of electronic, topological, and valence state information. J Chem Inf Comput Sci 35(6):1039\u20131045","journal-title":"J Chem Inf Comput Sci"},{"issue":"5","key":"1068_CR25","doi-asserted-by":"publisher","first-page":"742","DOI":"10.1021\/ci100050t","volume":"50","author":"D Rogers","year":"2010","unstructured":"Rogers D, Hahn M (2010) Extended-connectivity fingerprints. J Chem Inf Model 50(5):742\u2013754","journal-title":"J Chem Inf Model"},{"key":"1068_CR26","unstructured":"Kipf TN, Welling M (2016) Semi-supervised classification with graph convolutional networks. arXiv preprint arXiv:1609.02907"},{"issue":"14","key":"1068_CR27","doi-asserted-by":"publisher","first-page":"3389","DOI":"10.3390\/ijms20143389","volume":"20","author":"K Liu","year":"2019","unstructured":"Liu K, Sun X, Jia L, Ma J, Xing H, Wu J, Gao H, Sun Y, Boulnois F, Fan J (2019) Chemi-net: a molecular graph convolutional network for accurate drug property prediction. Int J Mol Sci 20(14):3389","journal-title":"Int J Mol Sci"},{"key":"1068_CR28","unstructured":"Veli\u010dkovi\u0107 P, Cucurull G, Casanova A, Romero A, Lio P, Bengio Y (2017) Graph attention networks. arXiv preprint arXiv:1710.10903"},{"key":"1068_CR29","unstructured":"Zhou J, Li S, Huang L, Xiong H, Wang F, Xu T, Xiong H, Dou D (2020) Distance-aware molecule graph attention network for drug-target binding affinity prediction. arXiv preprint arXiv:2012.09624"},{"key":"1068_CR30","unstructured":"Hamilton W, Ying Z, Leskovec J (2017) Inductive representation learning on large graphs. Advances in neural information processing systems 30"},{"key":"1068_CR31","doi-asserted-by":"publisher","first-page":"106524","DOI":"10.1016\/j.compbiomed.2022.106524","volume":"153","author":"J Liu","year":"2023","unstructured":"Liu J, Lei X, Zhang Y, Pan Y (2023) The prediction of molecular toxicity based on bigru and graphsage. Comput Biol Med 153:106524","journal-title":"Comput Biol Med"},{"key":"1068_CR32","unstructured":"Xu K, Hu W, Leskovec J, Jegelka S (2018) How powerful are graph neural networks? arXiv preprint arXiv:1810.00826"},{"key":"1068_CR33","doi-asserted-by":"crossref","unstructured":"Grover A, Leskovec J (2016) node2vec: Scalable feature learning for networks. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 855\u2013864","DOI":"10.1145\/2939672.2939754"},{"issue":"18","key":"1068_CR34","doi-asserted-by":"publisher","first-page":"3990","DOI":"10.3390\/math11183990","volume":"11","author":"S Wang","year":"2023","unstructured":"Wang S, Su X, Zhao B, Hu P, Bai T, Hu L (2023) An improved graph isomorphism network for accurate prediction of drug-drug interactions. Mathematics 11(18):3990","journal-title":"Mathematics"},{"key":"1068_CR35","doi-asserted-by":"crossref","unstructured":"Zhang J, Xiao X, Huang L-K, Rong Y, Bian Y (2022) Fine-tuning graph neural networks via graph topology induced optimal transport. arXiv preprint arXiv:2203.10453","DOI":"10.24963\/ijcai.2022\/518"},{"issue":"8","key":"1068_CR36","doi-asserted-by":"publisher","first-page":"3370","DOI":"10.1021\/acs.jcim.9b00237","volume":"59","author":"K Yang","year":"2019","unstructured":"Yang K, Swanson K, Jin W, Coley C, Eiden P, Gao H, Guzman-Perez A, Hopper T, Kelley B, Mathea M, Palmer A, Settels V, Jaakkola T, Jensen K, Barzilay R (2019) Analyzing learned molecular representations for property prediction. J Chem Inf Model 59(8):3370\u20133388. https:\/\/doi.org\/10.1021\/acs.jcim.9b00237","journal-title":"J Chem Inf Model"},{"issue":"2","key":"1068_CR37","doi-asserted-by":"publisher","first-page":"127","DOI":"10.1038\/s42256-021-00438-4","volume":"4","author":"X Fang","year":"2022","unstructured":"Fang X, Liu L, Lei J, He D, Zhang S, Zhou J, Wang F, Wu H, Wang H (2022) Geometry-enhanced molecular representation learning for property prediction. Nat Mach Intell 4(2):127\u2013134. https:\/\/doi.org\/10.1038\/s42256-021-00438-4","journal-title":"Nat Mach Intell"},{"issue":"6","key":"1068_CR38","doi-asserted-by":"publisher","first-page":"474","DOI":"10.1093\/bib\/bbae474","volume":"25","author":"X Jiang","year":"2024","unstructured":"Jiang X, Tan L, Zou Q (2024) Dgcl: dual-graph neural networks contrastive learning for molecular property prediction. Brief Bioinform 25(6):474","journal-title":"Brief Bioinform"},{"issue":"3","key":"1068_CR39","doi-asserted-by":"publisher","first-page":"279","DOI":"10.1038\/s42256-022-00447-x","volume":"4","author":"Y Wang","year":"2022","unstructured":"Wang Y, Wang J, Cao Z, Barati Farimani A (2022) Molecular contrastive learning of representations via graph neural networks. Nat Mach Intell 4(3):279\u2013287","journal-title":"Nat Mach Intell"},{"issue":"12","key":"1068_CR40","doi-asserted-by":"publisher","first-page":"1256","DOI":"10.1038\/s42256-022-00580-7","volume":"4","author":"J Ross","year":"2022","unstructured":"Ross J, Belgodere B, Chenthamarakshan V, Padhi I, Mroueh Y, Das P (2022) Large-scale chemical language representations capture molecular structure and properties. Nat Mach Intell 4(12):1256\u20131264","journal-title":"Nat Mach Intell"},{"issue":"1","key":"1068_CR41","doi-asserted-by":"publisher","first-page":"29025","DOI":"10.1038\/s41598-024-80009-w","volume":"14","author":"D Jones","year":"2024","unstructured":"Jones D, Zhang X, Bennion BJ, Pinge S, Xu W, Kang J, Khaleghi B, Moshiri N, Allen JE, Rosing TS (2024) Hdbind: encoding of molecular structure with hyperdimensional binary representations. Sci Rep 14(1):29025","journal-title":"Sci Rep"},{"issue":"2","key":"1068_CR42","doi-asserted-by":"publisher","first-page":"590","DOI":"10.1002\/hep.28323","volume":"63","author":"ES Bj\u00f6rnsson","year":"2016","unstructured":"Bj\u00f6rnsson ES, Hoofnagle JH (2016) Categorization of drugs implicated in causing liver injury: critical assessment based on published case reports. Hepatology 63(2):590\u2013603","journal-title":"Hepatology"},{"key":"1068_CR43","doi-asserted-by":"publisher","first-page":"503","DOI":"10.2165\/11535340-000000000-00000","volume":"33","author":"A Suzuki","year":"2010","unstructured":"Suzuki A, Andrade RJ, Bjornsson E, Lucena MI, Lee WM, Yuen NA, Hunt CM, Freston JW (2010) Drugs associated with hepatotoxicity and their reporting frequency of liver adverse events in vigibase$$^{{\\rm TM}}$$$$: unified list based on international collaborative work$$. Drug Saf 33:503\u2013522","journal-title":"Drug Saf"},{"issue":"7","key":"1068_CR44","doi-asserted-by":"publisher","first-page":"1215","DOI":"10.1021\/tx1000865","volume":"23","author":"N Greene","year":"2010","unstructured":"Greene N, Fisk L, Naven RT, Note RR, Patel ML, Pelletier DJ (2010) Developing structure- activity relationships for the prediction of hepatotoxicity. Chem Res Toxicol 23(7):1215\u20131222","journal-title":"Chem Res Toxicol"},{"key":"1068_CR45","doi-asserted-by":"publisher","first-page":"62","DOI":"10.1016\/j.tox.2014.03.009","volume":"321","author":"X Zhu","year":"2014","unstructured":"Zhu X, Kruhlak NL (2014) Construction and analysis of a human hepatotoxicity database suitable for qsar modeling using post-market safety data. Toxicology 321:62\u201372","journal-title":"Toxicology"},{"issue":"2","key":"1068_CR46","doi-asserted-by":"publisher","first-page":"513","DOI":"10.1039\/C7SC02664A","volume":"9","author":"Z Wu","year":"2018","unstructured":"Wu Z, Ramsundar B, Feinberg EN, Gomes J, Geniesse C, Pappu AS, Leswing K, Pande V (2018) Moleculenet: a benchmark for molecular machine learning. Chem Sci 9(2):513\u2013530","journal-title":"Chem Sci"},{"key":"1068_CR47","unstructured":"Fey M, Lenssen JE (2019) Fast graph representation learning with pytorch geometric. arXiv preprint arXiv:1903.02428"},{"issue":"3","key":"1068_CR48","doi-asserted-by":"publisher","first-page":"585","DOI":"10.3390\/molecules29030585","volume":"29","author":"BG Torre","year":"2024","unstructured":"Torre BG, Albericio F (2024) The pharmaceutical industry in 2023: an analysis of fda drug approvals from the perspective of molecules. Molecules 29(3):585","journal-title":"Molecules"},{"issue":"20","key":"1068_CR49","doi-asserted-by":"publisher","first-page":"9900","DOI":"10.1021\/acs.joc.5b01370","volume":"80","author":"L Guasch","year":"2015","unstructured":"Guasch L, Peach ML, Nicklaus MC (2015) Tautomerism of warfarin: combined chemoinformatics, quantum chemical, and nmr investigation. J Org Chem 80(20):9900\u20139909","journal-title":"J Org Chem"},{"issue":"12","key":"1068_CR50","doi-asserted-by":"publisher","first-page":"2562","DOI":"10.1021\/acs.jcim.5b00654","volume":"55","author":"S Riniker","year":"2015","unstructured":"Riniker S, Landrum GA (2015) Better informed distance geometry: using what we know to improve conformation generation. J Chem Inf Model 55(12):2562\u20132574","journal-title":"J Chem Inf Model"},{"issue":"5\u20136","key":"1068_CR51","doi-asserted-by":"publisher","first-page":"490","DOI":"10.1002\/(SICI)1096-987X(199604)17:5\/6<490::AID-JCC1>3.0.CO;2-P","volume":"17","author":"TA Halgren","year":"1996","unstructured":"Halgren TA (1996) Merck molecular force field. i. basis, form, scope, parameterization, and performance of mmff94. J Comput Chem 17(5\u20136):490\u2013519","journal-title":"J Comput Chem"},{"issue":"1","key":"1068_CR52","first-page":"126858","volume":"2015","author":"IV Oferkin","year":"2015","unstructured":"Oferkin IV, Katkova EV, Sulimov AV, Kutov DC, Sobolev SI, Voevodin VV, Sulimov VB (2015) Evaluation of docking target functions by the comprehensive investigation of protein-ligand energy minima. Adv Bioinform 2015(1):126858","journal-title":"Adv Bioinform"},{"issue":"12","key":"1068_CR53","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1039\/p298700000s1","volume":"2","author":"FH Allen","year":"1987","unstructured":"Allen FH, Kennard O, Watson DG, Brammer L, Orpen AG, Taylor R (1987) Tables of bond lengths determined by x-ray and neutron diffraction. Part 1. Bond lengths in organic compounds. J Chem Soc Perkin Trans 2(12):1\u201319","journal-title":"J Chem Soc Perkin Trans"},{"issue":"11","key":"1068_CR54","doi-asserted-by":"publisher","first-page":"935","DOI":"10.1038\/nrd1549","volume":"3","author":"DB Kitchen","year":"2004","unstructured":"Kitchen DB, Decornez H, Furr JR, Bajorath J (2004) Docking and scoring in virtual screening for drug discovery: methods and applications. Nat Rev Drug Discov 3(11):935\u2013949","journal-title":"Nat Rev Drug Discov"},{"issue":"22","key":"1068_CR55","doi-asserted-by":"publisher","first-page":"3219","DOI":"10.1016\/0040-4020(80)80168-2","volume":"36","author":"J Gasteiger","year":"1980","unstructured":"Gasteiger J, Marsili M (1980) Iterative partial equalization of orbital electronegativity-a rapid access to atomic charges. Tetrahedron 36(22):3219\u20133228","journal-title":"Tetrahedron"},{"key":"1068_CR56","unstructured":"Gilmer J, Schoenholz SS, Riley PF, Vinyals O, Dahl GE (2017) Neural message passing for quantum chemistry. In: International Conference on Machine Learning, PMLR, p 1263\u20131272"},{"issue":"9","key":"1068_CR57","first-page":"1","volume":"14","author":"W McKinney","year":"2011","unstructured":"McKinney W et al (2011) pandas: a foundational python library for data analysis and statistics. Python High Perform Sci Comput 14(9):1\u20139","journal-title":"Python High Perform Sci Comput"},{"key":"1068_CR58","first-page":"2825","volume":"12","author":"F Pedregosa","year":"2011","unstructured":"Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V et al (2011) Scikit-learn: machine learning in python. J Mach Learn Res 12:2825\u20132830","journal-title":"J Mach Learn Res"},{"key":"1068_CR59","unstructured":"Paszke A, Gross S, Massa F, Lerer A, Bradbury J, Chanan G, Killeen T, Lin Z, Gimelshein N, Antiga L, et al (2019) Pytorch: An imperative style, high-performance deep learning library. Advances in neural information processing systems 32"},{"key":"1068_CR60","first-page":"171","volume":"1","author":"F Wilcoxon","year":"1970","unstructured":"Wilcoxon F, Katti S, Wilcox RA et al (1970) Critical values and probability levels for the Wilcoxon rank sum test and the Wilcoxon signed rank test. Sel Tables Math Stat 1:171\u2013259","journal-title":"Sel Tables Math Stat"},{"issue":"01","key":"1068_CR61","first-page":"2007","volume":"3","author":"H Abdi","year":"2007","unstructured":"Abdi H et al (2007) Bonferroni and \u0161id\u00e1k corrections for multiple comparisons. Encycl Meas Stat 3(01):2007","journal-title":"Encycl Meas Stat"},{"key":"1068_CR62","unstructured":"Glorot X, Bengio Y (2010) Understanding the difficulty of training deep feedforward neural networks. In: Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, JMLR Workshop and Conference Proceedings, p 249\u2013256"},{"key":"1068_CR63","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s13321-020-0408-x","volume":"12","author":"N Schaduangrat","year":"2020","unstructured":"Schaduangrat N, Lampa S, Simeon S, Gleeson MP, Spjuth O, Nantasenamat C (2020) Towards reproducible computational drug discovery. J Cheminform 12:1\u201330","journal-title":"J Cheminform"},{"key":"1068_CR64","doi-asserted-by":"publisher","first-page":"116250","DOI":"10.1016\/j.taap.2022.116250","volume":"454","author":"L Ye","year":"2022","unstructured":"Ye L, Ngan DK, Xu T, Liu Z, Zhao J, Sakamuru S, Zhang L, Zhao T, Xia M, Simeonov A et al (2022) Prediction of drug-induced liver injury and cardiotoxicity using chemical structure and in vitro assay data. Toxicol Appl Pharmacol 454:116250","journal-title":"Toxicol Appl Pharmacol"},{"key":"1068_CR65","unstructured":"Huang K, Fu T, Gao W, Zhao Y, Roohani Y, Leskovec J, Coley CW, Xiao C, Sun J, Zitnik M (2021) Therapeutics data commons: Machine learning datasets and tasks for drug discovery and development. arXiv preprint arXiv:2102.09548"},{"key":"1068_CR66","doi-asserted-by":"publisher","first-page":"18","DOI":"10.1016\/j.ymeth.2023.11.014","volume":"221","author":"J Wang","year":"2024","unstructured":"Wang J, Zhang L, Sun J, Yang X, Wu W, Chen W, Zhao Q (2024) Predicting drug-induced liver injury using graph attention mechanism and molecular fingerprints. Methods 221:18\u201326","journal-title":"Methods"},{"issue":"5","key":"1068_CR67","doi-asserted-by":"publisher","first-page":"865","DOI":"10.1021\/acs.chemrestox.4c00532","volume":"38","author":"H Chutia","year":"2025","unstructured":"Chutia H, Borah GS, Mahanta HJ, Nagamani S (2025) Boostdili: extreme gradient boost-powered drug-induced liver injury prediction and structural alerts generation. Chem Res Toxicol 38(5):865\u2013876","journal-title":"Chem Res Toxicol"},{"issue":"6","key":"1068_CR68","doi-asserted-by":"publisher","first-page":"384","DOI":"10.1093\/bib\/bbac384","volume":"23","author":"B-W Zhao","year":"2022","unstructured":"Zhao B-W, Su X-R, Hu P-W, Ma Y-P, Zhou X, Hu L (2022) A geometric deep learning framework for drug repositioning over heterogeneous information networks. Brief Bioinform 23(6):384","journal-title":"Brief Bioinform"},{"key":"1068_CR69","doi-asserted-by":"publisher","first-page":"121360","DOI":"10.1016\/j.ins.2024.121360","volume":"686","author":"B-W Zhao","year":"2025","unstructured":"Zhao B-W, Su X-R, Yang Y, Li D-X, Li G-D, Hu P-W, You Z-H, Luo X, Hu L (2025) Regulation-aware graph learning for drug repositioning over heterogeneous biological network. Inf Sci 686:121360","journal-title":"Inf Sci"},{"issue":"10","key":"1068_CR70","doi-asserted-by":"publisher","first-page":"8932","DOI":"10.1126\/sciadv.ads8932","volume":"11","author":"J Wang","year":"2025","unstructured":"Wang J, Feng J, Kang Y, Pan P, Ge J, Wang Y, Wang M, Wu Z, Zhang X, Yu J et al (2025) Discovery of antimicrobial peptides with notable antibacterial potency by an llm-based foundation model. Sci Adv 11(10):8932","journal-title":"Sci Adv"},{"key":"1068_CR71","unstructured":"Walters P (2023) We need better benchmarks for machine learning in drug discovery. https:\/\/practicalcheminformatics.blogspot.com\/2023\/08\/we-need-better-benchmarks-for-machine.html. Accessed 06 July 2025"}],"container-title":["Journal of Cheminformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-025-01068-3.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s13321-025-01068-3\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-025-01068-3.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,9,9]],"date-time":"2025-09-09T15:38:34Z","timestamp":1757432314000},"score":1,"resource":{"primary":{"URL":"https:\/\/jcheminf.biomedcentral.com\/articles\/10.1186\/s13321-025-01068-3"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,8,18]]},"references-count":71,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2025,12]]}},"alternative-id":["1068"],"URL":"https:\/\/doi.org\/10.1186\/s13321-025-01068-3","relation":{"has-preprint":[{"id-type":"doi","id":"10.26434\/chemrxiv-2024-d12gk-v2","asserted-by":"object"},{"id-type":"doi","id":"10.26434\/chemrxiv-2024-d12gk","asserted-by":"object"}]},"ISSN":["1758-2946"],"issn-type":[{"value":"1758-2946","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,8,18]]},"assertion":[{"value":"7 February 2025","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"30 July 2025","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"18 August 2025","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare no Competing interests.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"124"}}