{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,15]],"date-time":"2026-06-15T06:09:02Z","timestamp":1781503742177,"version":"3.54.1"},"reference-count":66,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2025,9,26]],"date-time":"2025-09-26T00:00:00Z","timestamp":1758844800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2025,9,26]],"date-time":"2025-09-26T00:00:00Z","timestamp":1758844800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100007051","name":"Uppsala University","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100007051","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Cheminform"],"abstract":"<jats:sec>\n            <jats:title>Abstract<\/jats:title>\n            <jats:p>Drug exposure, a key determinant of drug safety and efficacy, is governed by pharmacokinetic (PK) parameters such as volume of distribution (VDss), clearance (CL), half-life (t\u00bd), fraction unbound in plasma (fu), and mean residence time (MRT). In this study, we developed machine learning models to predict human PK parameters for 1,283 unique compounds using molecular structure, physicochemical properties, and predicted animal PK data. Our approach involved a two-stage modeling pipeline. First, we trained models to predict rat, dog, and monkey PK parameters (VDss, CL, fu) from chemical structure and properties for 371 compounds. These models were used to predict animal PK values for 1,283 unique compounds with human PK data. These animal PK predictions were then integrated with molecular descriptors and fingerprints to build Random Forest models for human PK parameters. The models demonstrated consistent performance across nested cross-validation and external validation sets, with predictive accuracy for VDss comparable to proprietary models developed by AstraZeneca. Notably, human VDss and CL predictions achieved external R<jats:sup>2<\/jats:sup> values of 0.39 and 0.46, respectively. To support broad accessibility and integration into early drug discovery workflows such as Design-Make-Test-Analyze (DMTA), we developed PKSmart (<jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" xlink:href=\"https:\/\/broad.io\/PKSmart\" ext-link-type=\"uri\">https:\/\/broad.io\/PKSmart<\/jats:ext-link>), a freely available web application. All code and models are also open source, enabling local deployment. To our knowledge, this represents the first public suite of PK prediction models with performance on par with industry standard models.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Scientific contribution<\/jats:title>\n            <jats:p>This study introduces the first publicly available pharmacokinetic (PK) models that match industry-standard predictions, utilizing molecular structural fingerprints, physicochemical properties, and predicted animal PK data to model human pharmacokinetics. Our approach is validated through repeated nested cross-validation and an external test set, including comparing predictions to an industry standard model. The models are released via a web-hosted application (<jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" xlink:href=\"https:\/\/broad.io\/PKSmart\" ext-link-type=\"uri\">https:\/\/broad.io\/PKSmart<\/jats:ext-link>) for wider accessibility and utility in drug development processes.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Graphical Abstract<\/jats:title>\n          <\/jats:sec>","DOI":"10.1186\/s13321-025-01066-5","type":"journal-article","created":{"date-parts":[[2025,9,26]],"date-time":"2025-09-26T10:21:29Z","timestamp":1758882089000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":5,"title":["PKSmart: an open-source computational model to predict intravenous pharmacokinetics of small molecules"],"prefix":"10.1186","volume":"17","author":[{"given":"Srijit","family":"Seal","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Maria-Anna","family":"Trapotsi","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Manas","family":"Mahale","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Vigneshwari","family":"Subramanian","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Nigel","family":"Greene","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Ola","family":"Spjuth","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Andreas","family":"Bender","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"297","published-online":{"date-parts":[[2025,9,26]]},"reference":[{"key":"1066_CR1","doi-asserted-by":"publisher","first-page":"985","DOI":"10.1007\/s40262-015-0287-x","volume":"54","author":"N Goyal","year":"2015","unstructured":"Goyal N (2015) The role of drug exposure in clinical development: to what extent is pharmacokinetic assessment needed in a drug development programme? Clin Pharmacokinet 54:985\u2013987. https:\/\/doi.org\/10.1007\/s40262-015-0287-x","journal-title":"Clin Pharmacokinet"},{"key":"1066_CR2","doi-asserted-by":"publisher","first-page":"201","DOI":"10.1016\/B978-0-323-35214-7.00019-6","volume-title":"Principles of pharmacokinetics. Fetal neonatal physiology 2-volume set","author":"RM Ward","year":"2017","unstructured":"Ward RM, Kern SE (2017) Principles of pharmacokinetics. Fetal neonatal physiology 2-volume set. Elsevier, Amsterdam, pp 201\u2013207"},{"key":"1066_CR3","doi-asserted-by":"publisher","first-page":"1178","DOI":"10.1124\/dmd.117.077040","volume":"45","author":"FL Wood","year":"2017","unstructured":"Wood FL, Houston JB, Hallifax D (2017) Clearance prediction methodology needs fundamental improvement: trends common to rat and human hepatocytes\/microsomes and implications for experimental methodology. Drug Metab Dispos 45:1178\u20131188. https:\/\/doi.org\/10.1124\/dmd.117.077040","journal-title":"Drug Metab Dispos"},{"key":"1066_CR4","doi-asserted-by":"publisher","first-page":"133","DOI":"10.1007\/BF03190588","volume":"29","author":"GW Caldwell","year":"2004","unstructured":"Caldwell GW, Masucci JA, Yan Z, Hageman W (2004) Allometric scaling of pharmacokinetic parameters in drug discovery: can human CL, Vss and t\u00bd be predicted from in-vivo rat data? Eur J Drug Metab Pharmacokinet 29:133\u2013143. https:\/\/doi.org\/10.1007\/BF03190588","journal-title":"Eur J Drug Metab Pharmacokinet"},{"key":"1066_CR5","doi-asserted-by":"publisher","first-page":"e63","DOI":"10.1038\/psp.2013.41","volume":"2","author":"HM Jones","year":"2013","unstructured":"Jones HM, Rowland-Yeo K (2013) Basic concepts in physiologically based pharmacokinetic modeling in drug discovery and development. CPT Pharmacometrics Syst Pharmacol 2:e63. https:\/\/doi.org\/10.1038\/psp.2013.41","journal-title":"CPT Pharmacometrics Syst Pharmacol"},{"key":"1066_CR6","doi-asserted-by":"publisher","first-page":"972","DOI":"10.3109\/00498254.2011.598582","volume":"41","author":"Y Huh","year":"2011","unstructured":"Huh Y, Smith DE, Rose Feng M (2011) Interspecies scaling and prediction of human clearance: comparison of small- and macro-molecule drugs. Xenobiotica 41:972\u2013987. https:\/\/doi.org\/10.3109\/00498254.2011.598582","journal-title":"Xenobiotica"},{"key":"1066_CR7","doi-asserted-by":"publisher","first-page":"1474","DOI":"10.1023\/A:1011974226596","volume":"15","author":"WL Chiou","year":"1998","unstructured":"Chiou WL, Robbie G, Chung SM et al (1998) Correlation of plasma clearance of 54 extensively metabolized drugs between humans and rats: mean allometric coefficient of 0.66. Pharm Res 15:1474\u20131479. https:\/\/doi.org\/10.1023\/A:1011974226596","journal-title":"Pharm Res"},{"key":"1066_CR8","volume-title":"Physiol pharmacokinet model simulations princ methods, appl pharm ind","author":"SA Peters","year":"2012","unstructured":"Peters SA (2012) Physiologically-based pharmacokinetic (PBPK) modeling and simulations: principles, methods, and applications in the pharmaceutical industry. In: Peters SA (ed) Physiol pharmacokinet model simulations princ methods, appl pharm ind. John Wiley & Sons, Hoboken"},{"key":"1066_CR9","doi-asserted-by":"publisher","first-page":"1259","DOI":"10.1002\/jps.20322","volume":"94","author":"T Rodgers","year":"2005","unstructured":"Rodgers T, Leahy D, Rowland M (2005) Physiologically based pharmacokinetic modeling 1: predicting the tissue distribution of moderate-to-strong bases. J Pharm Sci 94:1259\u20131276. https:\/\/doi.org\/10.1002\/jps.20322","journal-title":"J Pharm Sci"},{"key":"1066_CR10","doi-asserted-by":"publisher","first-page":"1238","DOI":"10.1002\/jps.20502","volume":"95","author":"T Rodgers","year":"2006","unstructured":"Rodgers T, Rowland M (2006) Physiologically based pharmacokinetic modelling 2: predicting the tissue distribution of acids, very weak bases, neutrals and zwitterions. J Pharm Sci 95:1238\u20131257. https:\/\/doi.org\/10.1002\/jps.20502","journal-title":"J Pharm Sci"},{"key":"1066_CR11","doi-asserted-by":"publisher","first-page":"918","DOI":"10.1007\/s11095-006-9210-3","volume":"24","author":"T Rodgers","year":"2007","unstructured":"Rodgers T, Rowland M (2007) Mechanistic approaches to volume of distribution predictions: understanding the processes. Pharm Res 24:918\u2013933. https:\/\/doi.org\/10.1007\/s11095-006-9210-3","journal-title":"Pharm Res"},{"key":"1066_CR12","doi-asserted-by":"publisher","first-page":"1203","DOI":"10.1002\/jps.2600680948","volume":"68","author":"S \u00d8ie","year":"1979","unstructured":"\u00d8ie S, Tozer TN (1979) Effect of altered plasma protein binding on apparent volume of distribution. J Pharm Sci 68:1203\u20131205. https:\/\/doi.org\/10.1002\/jps.2600680948","journal-title":"J Pharm Sci"},{"key":"1066_CR13","doi-asserted-by":"publisher","first-page":"391","DOI":"10.1002\/j.1552-4604.1983.tb02753.x","volume":"23","author":"DJ Greenblatt","year":"1983","unstructured":"Greenblatt DJ, Abernethy DR, Divoll M (1983) Is volume of distribution at steady state a meaningful kinetic variable? J Clin Pharmacol 23:391\u2013400. https:\/\/doi.org\/10.1002\/j.1552-4604.1983.tb02753.x","journal-title":"J Clin Pharmacol"},{"key":"1066_CR14","doi-asserted-by":"publisher","first-page":"2245","DOI":"10.1021\/acs.jmedchem.8b01263","volume":"62","author":"DA Smith","year":"2019","unstructured":"Smith DA, Beaumont K, Maurer TS, Di L (2019) Clearance in drug design. J Med Chem 62:2245\u20132255. https:\/\/doi.org\/10.1021\/acs.jmedchem.8b01263","journal-title":"J Med Chem"},{"key":"1066_CR15","doi-asserted-by":"publisher","DOI":"10.1016\/B978-008055232-3.60032-7","author":"E Flynn","year":"2007","unstructured":"Flynn E (2007) Clinical pharmacokinetics. xPharm Compr Pharmacol Ref 1\u20132. https:\/\/doi.org\/10.1016\/B978-008055232-3.60032-7","journal-title":"xPharm Compr Pharmacol Ref 1\u20132"},{"key":"1066_CR16","doi-asserted-by":"publisher","first-page":"25","DOI":"10.1177\/1091581811429865","volume":"31","author":"T Grabowski","year":"2012","unstructured":"Grabowski T, Jaroszewski JJ, Gad SC, Feder M (2012) Correlation between in silico physicochemical characteristics of drugs and their mean residence time in human and dog. Int J Toxicol 31:25\u201333. https:\/\/doi.org\/10.1177\/1091581811429865","journal-title":"Int J Toxicol"},{"key":"1066_CR17","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s13321-015-0054-x","volume":"7","author":"AA Freitas","year":"2015","unstructured":"Freitas AA, Limbu K, Ghafourian T (2015) Predicting volume of distribution with decision tree-based regression methods using predicted tissue:plasma partition coefficients. J Cheminform 7:1\u201317. https:\/\/doi.org\/10.1186\/s13321-015-0054-x","journal-title":"J Cheminform"},{"key":"1066_CR18","doi-asserted-by":"publisher","first-page":"71","DOI":"10.1007\/s12539-014-0166-4","volume":"6","author":"B Louis","year":"2014","unstructured":"Louis B, Agrawal VK (2014) Prediction of human volume of distribution values for drugs using linear and nonlinear quantitative structure pharmacokinetic relationship models. Interdiscip Sci Comput Life Sci 6:71\u201383. https:\/\/doi.org\/10.1007\/s12539-014-0166-4","journal-title":"Interdiscip Sci Comput Life Sci"},{"key":"1066_CR19","doi-asserted-by":"publisher","first-page":"2664","DOI":"10.1016\/j.bbagen.2016.05.019","volume":"1860","author":"J Lu","year":"2016","unstructured":"Lu J, Lu D, Zhang X et al (2016) Estimation of elimination half-lives of organic chemicals in humans using gradient boosting machine. Biochim Biophys Acta-Gen Subj 1860:2664\u20132671. https:\/\/doi.org\/10.1016\/j.bbagen.2016.05.019","journal-title":"Biochim Biophys Acta-Gen Subj"},{"key":"1066_CR20","doi-asserted-by":"publisher","DOI":"10.62186\/001c.125154","author":"A Gupta","year":"2024","unstructured":"Gupta A, Stead TS, Ganti L (2024) Determining a meaningful R-squared value in clinical medicine. Acad Med Surg. https:\/\/doi.org\/10.62186\/001c.125154","journal-title":"Acad Med Surg"},{"key":"1066_CR21","doi-asserted-by":"publisher","first-page":"383","DOI":"10.1016\/j.jmgm.2005.10.004","volume":"24","author":"CW Yap","year":"2006","unstructured":"Yap CW, Li ZR, Chen YZ (2006) Quantitative structure-pharmacokinetic relationships for drug clearance by using statistical learning methods. J Mol Graph Model 24:383\u2013395. https:\/\/doi.org\/10.1016\/j.jmgm.2005.10.004","journal-title":"J Mol Graph Model"},{"key":"1066_CR22","doi-asserted-by":"publisher","first-page":"572","DOI":"10.1002\/cmdc.201700582","volume":"13","author":"L Sun","year":"2018","unstructured":"Sun L, Yang H, Li J et al (2018) In silico prediction of compounds binding to human plasma proteins by QSAR models. ChemMedChem 13:572\u2013581. https:\/\/doi.org\/10.1002\/cmdc.201700582","journal-title":"ChemMedChem"},{"key":"1066_CR23","doi-asserted-by":"publisher","first-page":"103962","DOI":"10.1016\/j.chemolab.2020.103962","volume":"199","author":"Y Yuan","year":"2020","unstructured":"Yuan Y, Chang S, Zhang Z et al (2020) A novel strategy for prediction of human plasma protein binding using machine learning techniques. Chemom Intell Lab Syst 199:103962. https:\/\/doi.org\/10.1016\/j.chemolab.2020.103962","journal-title":"Chemom Intell Lab Syst"},{"key":"1066_CR24","doi-asserted-by":"publisher","first-page":"1790","DOI":"10.1007\/s11095-013-1023-6","volume":"30","author":"XW Zhu","year":"2013","unstructured":"Zhu XW, Sedykh A, Zhu H et al (2013) The use of pseudo-equilibrium constant affords improved QSAR models of human plasma protein binding. Pharm Res 30:1790\u20131798. https:\/\/doi.org\/10.1007\/s11095-013-1023-6","journal-title":"Pharm Res"},{"key":"1066_CR25","doi-asserted-by":"publisher","first-page":"5302","DOI":"10.1021\/acs.molpharmaceut.8b00785","volume":"15","author":"R Watanabe","year":"2018","unstructured":"Watanabe R, Esaki T, Kawashima H et al (2018) Predicting fraction unbound in human plasma from chemical structure: improved accuracy in the low value ranges. Mol Pharm 15:5302\u20135311. https:\/\/doi.org\/10.1021\/acs.molpharmaceut.8b00785","journal-title":"Mol Pharm"},{"key":"1066_CR26","doi-asserted-by":"publisher","first-page":"4893","DOI":"10.1021\/acs.jcim.9b00460","volume":"59","author":"S Schneckener","year":"2019","unstructured":"Schneckener S, Grimbs S, Hey J et al (2019) Prediction of oral bioavailability in rats: transferring insights from in vitro correlations to (deep) machine learning models using in silico model outputs and chemical structure parameters. J Chem Inf Model 59:4893\u20134905. https:\/\/doi.org\/10.1021\/acs.jcim.9b00460","journal-title":"J Chem Inf Model"},{"key":"1066_CR27","doi-asserted-by":"publisher","first-page":"1488","DOI":"10.1021\/acs.molpharmaceut.2c00027","volume":"19","author":"O Obrezanova","year":"2022","unstructured":"Obrezanova O, Martinsson A, Whitehead T et al (2022) Prediction of in vivo pharmacokinetic parameters and time-exposure curves in rats using machine learning from the chemical structure. Mol Pharm 19:1488\u20131504. https:\/\/doi.org\/10.1021\/acs.molpharmaceut.2c00027","journal-title":"Mol Pharm"},{"key":"1066_CR28","doi-asserted-by":"publisher","first-page":"1366","DOI":"10.1080\/00498254.2021.2011471","volume":"51","author":"U Fagerholm","year":"2021","unstructured":"Fagerholm U, Hellberg S, Alvarsson J et al (2021) In silico prediction of volume of distribution of drugs in man using conformal prediction performs on par with animal data-based models. Xenobiotica 51:1366\u20131371. https:\/\/doi.org\/10.1080\/00498254.2021.2011471","journal-title":"Xenobiotica"},{"key":"1066_CR29","doi-asserted-by":"publisher","first-page":"113","DOI":"10.1080\/00498254.2022.2049397","volume":"52","author":"U Fagerholm","year":"2022","unstructured":"Fagerholm U, Hellberg S, Alvarsson J, Spjuth O (2022) In silico predictions of the human pharmacokinetics\/toxicokinetics of 65 chemicals from various classes using conformal prediction methodology. Xenobiotica 52:113\u2013118. https:\/\/doi.org\/10.1080\/00498254.2022.2049397","journal-title":"Xenobiotica"},{"key":"1066_CR30","doi-asserted-by":"publisher","first-page":"4520","DOI":"10.1021\/acs.molpharmaceut.1c00718","volume":"18","author":"F Miljkovi\u0107","year":"2021","unstructured":"Miljkovi\u0107 F, Martinsson A, Obrezanova O et al (2021) Machine learning models for human in vivo pharmacokinetic parameters with in-house validation. Mol Pharm 18:4520\u20134530. https:\/\/doi.org\/10.1021\/acs.molpharmaceut.1c00718","journal-title":"Mol Pharm"},{"issue":"3","key":"1066_CR31","doi-asserted-by":"publisher","first-page":"53","DOI":"10.1016\/j.slasd.2022.12.003","volume":"28","author":"A Liu","year":"2023","unstructured":"Liu A, Seal S, Yang H, Bender A (2023) Using chemical and biological data to predict drug toxicity. SLAS Discov 28(3):53\u201364. https:\/\/doi.org\/10.1016\/j.slasd.2022.12.003","journal-title":"SLAS Discov"},{"key":"1066_CR32","doi-asserted-by":"publisher","first-page":"254","DOI":"10.1038\/s41592-024-02528-8","volume":"22","author":"S Seal","year":"2025","unstructured":"Seal S, Trapotsi MA, Spjuth O et al (2025) Cell painting: a decade of discovery and innovation in cellular imaging. Nat Methods 22:254\u2013268. https:\/\/doi.org\/10.1038\/s41592-024-02528-8","journal-title":"Nat Methods"},{"key":"1066_CR33","doi-asserted-by":"publisher","DOI":"10.1101\/2024.01.10.575128","author":"S Seal","year":"2024","unstructured":"Seal S, Williams DP, Hosseini-Gerami L et al (2024) Improved early detection of drug-induced liver injury by integrating predicted in vivo and in vitro data. bioRxiv. https:\/\/doi.org\/10.1101\/2024.01.10.575128","journal-title":"bioRxiv"},{"key":"1066_CR34","doi-asserted-by":"publisher","DOI":"10.1021\/ACS.JCIM.3C01834","author":"S Seal","year":"2024","unstructured":"Seal S, Spjuth O, Hosseini-Gerami L et al (2024) Insights into drug cardiotoxicity from biological and chemical data: the first public classifiers for FDA drug-induced cardiotoxicity rank. J Chem Inf Model. https:\/\/doi.org\/10.1021\/ACS.JCIM.3C01834","journal-title":"J Chem Inf Model"},{"key":"1066_CR35","doi-asserted-by":"publisher","DOI":"10.1101\/2025.01.22.634152","author":"JD Ewald","year":"2025","unstructured":"Ewald JD, Titterton KL, B\u00e4uerle A et al (2025) Cell painting for cytotoxicity and mode-of-action analysis in primary human hepatocytes. bioRxiv. https:\/\/doi.org\/10.1101\/2025.01.22.634152","journal-title":"bioRxiv"},{"key":"1066_CR36","doi-asserted-by":"publisher","first-page":"102566","DOI":"10.1016\/j.sbi.2023.102566","volume":"80","author":"B Baillif","year":"2023","unstructured":"Baillif B, Cole J, McCabe P, Bender A (2023) Deep generative models for 3D molecular structure. Curr Opin Struct Biol 80:102566. https:\/\/doi.org\/10.1016\/j.sbi.2023.102566","journal-title":"Curr Opin Struct Biol"},{"key":"1066_CR37","doi-asserted-by":"publisher","first-page":"580","DOI":"10.1007\/s00204-006-0091-3","volume":"80","author":"PJ O\u2019Brien","year":"2006","unstructured":"O\u2019Brien PJ, Irwin W, Diaz D et al (2006) High concordance of drug-induced human hepatotoxicity with in vitro cytotoxicity measured in a novel cell-based model using high content screening. Arch Toxicol 80:580\u2013604. https:\/\/doi.org\/10.1007\/s00204-006-0091-3","journal-title":"Arch Toxicol"},{"key":"1066_CR38","doi-asserted-by":"publisher","DOI":"10.1021\/ACS.JCIM.3C01777","author":"RI Horne","year":"2024","unstructured":"Horne RI, Wilson-Godber J, D\u00edaz AG et al (2024) Using generative modeling to endow with potency initially inert compounds with good bioavailability and low toxicity. J Chem Inf Model. https:\/\/doi.org\/10.1021\/ACS.JCIM.3C01777","journal-title":"J Chem Inf Model"},{"key":"1066_CR39","doi-asserted-by":"publisher","first-page":"1385","DOI":"10.1124\/dmd.108.020479","volume":"36","author":"RS Obach","year":"2008","unstructured":"Obach RS, Lombardo F, Waters NJ (2008) Trend analysis of a database of intravenous pharmacokinetic parameters in humans for 670 drug compounds. Drug Metab Dispos 36:1385\u20131405. https:\/\/doi.org\/10.1124\/dmd.108.020479","journal-title":"Drug Metab Dispos"},{"issue":"11","key":"1066_CR40","doi-asserted-by":"publisher","first-page":"1466","DOI":"10.1124\/dmd.118.082966","volume":"46","author":"F Lombardo","year":"2018","unstructured":"Lombardo F, Berellini G, Obach RS (2018) Trend analysis of a database of intravenous pharmacokinetic parameters in humans for 1352 drug compounds. Drug Metab Dispos 46(11):1466\u20131477","journal-title":"Drug Metab Dispos"},{"issue":"2","key":"1066_CR41","doi-asserted-by":"publisher","first-page":"178","DOI":"10.1177\/0091270012440282","volume":"53","author":"F Lombardo","year":"2013","unstructured":"Lombardo F, Waters NJ, Argikar UA, Dennehy MK, Zhan J, Gunduz M, Harriman SP, Berellini G, Rajlic IL, Obach RS (2013) Comprehensive assessment of human pharmacokinetic prediction based on in vivo animal pharmacokinetic data, Part 2: clearance. J Clin Pharmacol 53(2):178\u2013191","journal-title":"J Clin Pharmacol"},{"key":"1066_CR42","unstructured":"Landrum G (2006) RDKit: open-source cheminformatics. In: http:\/\/www.rdkit.org. Accessed 2 Mar 2022."},{"key":"1066_CR43","unstructured":"New Drugs at FDA: CDER\u2019s new molecular entities and new therapeutic biological products | FDA. https:\/\/www.fda.gov\/drugs\/development-approval-process-drugs\/new-drugs-fda-cders-new-molecular-entities-and-new-therapeutic-biological-products. Accessed 02 Nov 2022."},{"key":"1066_CR44","doi-asserted-by":"publisher","first-page":"D1074","DOI":"10.1093\/nar\/gkx1037","volume":"46","author":"DS Wishart","year":"2018","unstructured":"Wishart DS, Feunang YD, Guo AC et al (2018) DrugBank 5.0: a major update to the DrugBank database for 2018. Nucleic Acids Res 46:D1074\u2013D1082. https:\/\/doi.org\/10.1093\/nar\/gkx1037","journal-title":"Nucleic Acids Res"},{"key":"1066_CR45","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s13321-018-0258-y","volume":"10","author":"H Moriwaki","year":"2018","unstructured":"Moriwaki H, Tian YS, Kawashita N, Takagi T (2018) Mordred: a molecular descriptor calculator. J Cheminform 10:1\u201314. https:\/\/doi.org\/10.1186\/s13321-018-0258-y","journal-title":"J Cheminform"},{"key":"1066_CR46","first-page":"2825","volume":"12","author":"F Pedregosa","year":"2011","unstructured":"Pedregosa F, Michel V, Grisel O et al (2011) Scikit-learn: machine learning in python. J Mach Learn Res 12:2825","journal-title":"J Mach Learn Res"},{"key":"1066_CR47","doi-asserted-by":"publisher","first-page":"100024","DOI":"10.1016\/j.namjnl.2025.100024","volume":"1","author":"S Seal","year":"2025","unstructured":"Seal S, Rabbit RR (2025) Advice for bad computational toxicologists. NAM J 1:100024. https:\/\/doi.org\/10.1016\/j.namjnl.2025.100024","journal-title":"NAM J"},{"key":"1066_CR48","unstructured":"SciPy. https:\/\/scipy.org\/. Accessed 12 Aug 2023."},{"key":"1066_CR49","doi-asserted-by":"publisher","first-page":"181","DOI":"10.1021\/acs.jcim.8b00597","volume":"59","author":"R Liu","year":"2019","unstructured":"Liu R, Wallqvist A (2019) Molecular similarity-based domain applicability metric efficiently identifies out-of-domain compounds. J Chem Inf Model 59:181\u2013189. https:\/\/doi.org\/10.1021\/acs.jcim.8b00597","journal-title":"J Chem Inf Model"},{"issue":"5","key":"1066_CR50","doi-asserted-by":"publisher","first-page":"1488","DOI":"10.1021\/ACS.MOLPHARMACEUT.2C00027","volume":"19","author":"O Obrezanova","year":"2022","unstructured":"Obrezanova O, Martinsson A, Whitehead T, Mahmoud S, Bender A, Miljkovi\u0107 F, Grabowski P, Irwin B, Oprisiu I, Conduit G, Segall M, Smith GF, Williamson B, Winiwarter S, Greene N (2022) Prediction of in vivo pharmacokinetic parameters and time-exposure curves in rats using machine learning from the chemical structure. Mol Pharm 19(5):1488\u20131504. https:\/\/doi.org\/10.1021\/ACS.MOLPHARMACEUT.2C00027","journal-title":"Mol Pharm"},{"key":"1066_CR51","doi-asserted-by":"publisher","first-page":"591","DOI":"10.12688\/f1000research.8357.2","volume":"5","author":"S Jasial","year":"2016","unstructured":"Jasial S, Hu Y, Vogt M, Bajorath J (2016) Activity-relevant similarity values for fingerprints and implications for similarity searching. F1000Res 5:591. https:\/\/doi.org\/10.12688\/f1000research.8357.2","journal-title":"F1000Res"},{"key":"1066_CR52","doi-asserted-by":"publisher","first-page":"2275","DOI":"10.1021\/ci4004078","volume":"53","author":"D Dimova","year":"2013","unstructured":"Dimova D, Stumpfe D, Bajorath J (2013) Quantifying the fingerprint descriptor dependence of structure-activity relationship information on a large scale. J Chem Inf Model 53:2275\u20132281. https:\/\/doi.org\/10.1021\/ci4004078","journal-title":"J Chem Inf Model"},{"key":"1066_CR53","doi-asserted-by":"publisher","first-page":"352","DOI":"10.1208\/s12248-014-9691-7","volume":"17","author":"L Di","year":"2015","unstructured":"Di L, Obach RS (2015) Addressing the challenges of low clearance in drug research. AAPS J 17:352\u2013357. https:\/\/doi.org\/10.1208\/s12248-014-9691-7","journal-title":"AAPS J"},{"key":"1066_CR54","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s12967-019-1976-2","volume":"17","author":"CHC Leenaars","year":"2019","unstructured":"Leenaars CHC, Kouwenaar C, Stafleu FR et al (2019) Animal to human translation: a systematic scoping review of reported concordance rates. J Transl Med 17:1\u201322. https:\/\/doi.org\/10.1186\/s12967-019-1976-2","journal-title":"J Transl Med"},{"key":"1066_CR55","doi-asserted-by":"publisher","first-page":"390","DOI":"10.1016\/j.tips.2020.03.004","volume":"41","author":"M Davies","year":"2020","unstructured":"Davies M, Jones RDO, Grime K et al (2020) Improving the accuracy of predicted human pharmacokinetics: lessons learned from the AstraZeneca drug pipeline over two decades. Trends Pharmacol Sci 41:390\u2013408. https:\/\/doi.org\/10.1016\/j.tips.2020.03.004","journal-title":"Trends Pharmacol Sci"},{"key":"1066_CR56","doi-asserted-by":"publisher","first-page":"58","DOI":"10.1124\/DMD.113.053397FULLARTICLECORRECTION","volume":"47","author":"TS Chan","year":"2019","unstructured":"Chan TS, Yu H, Moore A et al (2019) Meeting the challenge of predicting hepatic clearance of compounds slowly metabolized by cytochrome P450 using a novel hepatocyte model, HepatoPac. Drug Metab Dispos 47:58\u201366. https:\/\/doi.org\/10.1124\/DMD.113.053397FULLARTICLECORRECTION","journal-title":"Drug Metab Dispos"},{"key":"1066_CR57","doi-asserted-by":"publisher","first-page":"2500","DOI":"10.1016\/j.xphs.2019.02.009","volume":"108","author":"CM Bowman","year":"2019","unstructured":"Bowman CM, Benet LZ (2019) In vitro-in vivo extrapolation and hepatic clearance-dependent underprediction. J Pharm Sci 108:2500\u20132504. https:\/\/doi.org\/10.1016\/j.xphs.2019.02.009","journal-title":"J Pharm Sci"},{"key":"1066_CR58","doi-asserted-by":"publisher","first-page":"1834","DOI":"10.1016\/j.xphs.2021.01.020","volume":"110","author":"H Iwata","year":"2021","unstructured":"Iwata H, Matsuo T, Mamada H et al (2021) Prediction of total drug clearance in humans using animal data: proposal of a multimodal learning method based on deep learning. J Pharm Sci 110:1834\u20131841. https:\/\/doi.org\/10.1016\/j.xphs.2021.01.020","journal-title":"J Pharm Sci"},{"key":"1066_CR59","doi-asserted-by":"publisher","first-page":"3968","DOI":"10.1021\/acs.jcim.9b00300","volume":"59","author":"Y Wang","year":"2019","unstructured":"Wang Y, Liu H, Fan Y et al (2019) In silico prediction of human intravenous pharmacokinetic parameters with improved accuracy. J Chem Inf Model 59:3968\u20133980. https:\/\/doi.org\/10.1021\/acs.jcim.9b00300","journal-title":"J Chem Inf Model"},{"issue":"7","key":"1066_CR60","doi-asserted-by":"publisher","first-page":"2299","DOI":"10.1021\/acs.molpharmaceut.9b01294","volume":"17","author":"Y Kosugi","year":"2020","unstructured":"Kosugi Y, Hosea N (2020) Direct comparison of total clearance prediction: computational machine learning model versus bottom-up approach using in vitro assay. Mol Pharm 17(7):2299\u20132309. https:\/\/doi.org\/10.1021\/acs.molpharmaceut.9b01294","journal-title":"Mol Pharm"},{"key":"1066_CR61","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/s41598-019-55325-1","volume":"9","author":"R Watanabe","year":"2019","unstructured":"Watanabe R, Ohashi R, Esaki T et al (2019) Development of an in silico prediction system of human renal excretion and clearance from chemical structure information incorporating fraction unbound in plasma as a descriptor. Sci Rep 9:1\u201311. https:\/\/doi.org\/10.1038\/s41598-019-55325-1","journal-title":"Sci Rep"},{"key":"1066_CR62","doi-asserted-by":"publisher","first-page":"429","DOI":"10.2165\/00003088-199120060-00001","volume":"20","author":"TM Ludden","year":"1991","unstructured":"Ludden TM (1991) Nonlinear pharmacokinetics: clinical implications. Clin Pharmacokinet 20:429\u2013446. https:\/\/doi.org\/10.2165\/00003088-199120060-00001","journal-title":"Clin Pharmacokinet"},{"key":"1066_CR63","doi-asserted-by":"publisher","DOI":"10.1101\/2024.07.02.601740","author":"US Saha","year":"2024","unstructured":"Saha US, Vendruscolo M, Carpenter AE et al (2024) Step forward cross validation for bioactivity prediction: out of distribution validation in drug discovery. bioRxiv. https:\/\/doi.org\/10.1101\/2024.07.02.601740","journal-title":"bioRxiv"},{"key":"1066_CR64","doi-asserted-by":"publisher","first-page":"759","DOI":"10.1021\/ACS.CHEMRESTOX.5C00033","volume":"38","author":"S Seal","year":"2025","unstructured":"Seal S, Mahale M, Garc\u00eda-Orteg\u00f3n M et al (2025) Machine learning for toxicity prediction using chemical structures: pillars for success in the real world. Chem Res Toxicol 38:759\u2013807. https:\/\/doi.org\/10.1021\/ACS.CHEMRESTOX.5C00033","journal-title":"Chem Res Toxicol"},{"key":"1066_CR65","doi-asserted-by":"publisher","DOI":"10.5281\/zenodo.10535844","author":"G Liu","year":"2024","unstructured":"Liu G, Seal S, Arevalo J et al (2024) Learning molecular representation in a cell. ArXiv. https:\/\/doi.org\/10.5281\/zenodo.10535844"},{"key":"1066_CR66","doi-asserted-by":"publisher","first-page":"1356","DOI":"10.1021\/acscentsci.1c00546","volume":"7","author":"AP Soleimany","year":"2021","unstructured":"Soleimany AP, Amini A, Goldman S et al (2021) Evidential deep learning for guided molecular property prediction and discovery. ACS Cent Sci 7:1356\u20131367. https:\/\/doi.org\/10.1021\/acscentsci.1c00546","journal-title":"ACS Cent Sci"}],"container-title":["Journal of Cheminformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-025-01066-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s13321-025-01066-5\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-025-01066-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,9,26]],"date-time":"2025-09-26T10:21:31Z","timestamp":1758882091000},"score":1,"resource":{"primary":{"URL":"https:\/\/jcheminf.biomedcentral.com\/articles\/10.1186\/s13321-025-01066-5"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,9,26]]},"references-count":66,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2025,12]]}},"alternative-id":["1066"],"URL":"https:\/\/doi.org\/10.1186\/s13321-025-01066-5","relation":{"references":[{"id-type":"doi","id":"10.5281\/zenodo.10535844","asserted-by":"subject"}]},"ISSN":["1758-2946"],"issn-type":[{"value":"1758-2946","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,9,26]]},"assertion":[{"value":"14 January 2025","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"26 July 2025","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"26 September 2025","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"All authors have approved the final version of the manuscript.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare no competing interests.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"147"}}