{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,13]],"date-time":"2026-04-13T23:45:26Z","timestamp":1776123926961,"version":"3.50.1"},"reference-count":58,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2023,9,5]],"date-time":"2023-09-05T00:00:00Z","timestamp":1693872000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2023,9,5]],"date-time":"2023-09-05T00:00:00Z","timestamp":1693872000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"name":"Shanghai Sailing Program","award":["22YF1460800"],"award-info":[{"award-number":["22YF1460800"]}]},{"DOI":"10.13039\/501100002858","name":"China Postdoctoral Science Foundation","doi-asserted-by":"publisher","award":["2022M720153"],"award-info":[{"award-number":["2022M720153"]}],"id":[{"id":"10.13039\/501100002858","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"crossref","award":["82204278"],"award-info":[{"award-number":["82204278"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"crossref","award":["82273855"],"award-info":[{"award-number":["82273855"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]},{"name":"Lingang Laboratory","award":["LG202102-01-02"],"award-info":[{"award-number":["LG202102-01-02"]}]},{"name":"National Key Research and Development Program of China","award":["2022YFC3400504"],"award-info":[{"award-number":["2022YFC3400504"]}]},{"name":"SIMM-SHUTCM Traditional Chinese Medicine Innovation Joint Research Program","award":["E2G805H"],"award-info":[{"award-number":["E2G805H"]}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Cheminform"],"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Lipophilicity is a fundamental physical property that significantly affects various aspects of drug behavior, including solubility, permeability, metabolism, distribution, protein binding, and toxicity. Accurate prediction of lipophilicity, measured by the logD7.4 value (the distribution coefficient between n-octanol and buffer at physiological pH 7.4), is crucial for successful drug discovery and design. However, the limited availability of data for logD modeling poses a significant challenge to achieving satisfactory generalization capability. To address this challenge, we have developed a novel logD7.4 prediction model called RTlogD, which leverages knowledge from multiple sources. RTlogD combines pre-training on a chromatographic retention time (RT) dataset since the RT is influenced by lipophilicity. Additionally, microscopic pKa values are incorporated as atomic features, providing valuable insights into ionizable sites and ionization capacity. Furthermore, logP is integrated as an auxiliary task within a multitask learning framework. We conducted ablation studies and presented a detailed analysis, showcasing the effectiveness and interpretability of RT, pKa, and logP in the RTlogD model. Notably, our RTlogD model demonstrated superior performance compared to commonly used algorithms and prediction tools. These results underscore the potential of the RTlogD model to improve the accuracy and generalization of logD prediction in drug discovery and design. In summary, the RTlogD model addresses the challenge of limited data availability in logD modeling by leveraging knowledge from RT, microscopic pKa, and logP. Incorporating these factors enhances the predictive capabilities of our model, and it holds promise for real-world applications in drug discovery and design scenarios.<\/jats:p>\n                <jats:p><jats:bold>Graphical Abstract<\/jats:bold><\/jats:p>","DOI":"10.1186\/s13321-023-00754-4","type":"journal-article","created":{"date-parts":[[2023,9,5]],"date-time":"2023-09-05T09:01:44Z","timestamp":1693904504000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":34,"title":["LogD7.4 prediction enhanced by transferring knowledge from chromatographic retention time, microscopic pKa and logP"],"prefix":"10.1186","volume":"15","author":[{"given":"Yitian","family":"Wang","sequence":"first","affiliation":[]},{"given":"Jiacheng","family":"Xiong","sequence":"additional","affiliation":[]},{"given":"Fu","family":"Xiao","sequence":"additional","affiliation":[]},{"given":"Wei","family":"Zhang","sequence":"additional","affiliation":[]},{"given":"Kaiyang","family":"Cheng","sequence":"additional","affiliation":[]},{"given":"Jingxin","family":"Rao","sequence":"additional","affiliation":[]},{"given":"Buying","family":"Niu","sequence":"additional","affiliation":[]},{"given":"Xiaochu","family":"Tong","sequence":"additional","affiliation":[]},{"given":"Ning","family":"Qu","sequence":"additional","affiliation":[]},{"given":"Runze","family":"Zhang","sequence":"additional","affiliation":[]},{"given":"Dingyan","family":"Wang","sequence":"additional","affiliation":[]},{"given":"Kaixian","family":"Chen","sequence":"additional","affiliation":[]},{"given":"Xutong","family":"Li","sequence":"additional","affiliation":[]},{"given":"Mingyue","family":"Zheng","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2023,9,5]]},"reference":[{"key":"754_CR1","doi-asserted-by":"crossref","first-page":"235","DOI":"10.1517\/17460441003605098","volume":"5","author":"MJ Waring","year":"2010","unstructured":"Waring MJ (2010) Lipophilicity in drug discovery. Expert Opin Drug Discov 5:235\u2013248","journal-title":"Expert Opin Drug Discov"},{"key":"754_CR2","first-page":"3","volume":"70","author":"E Rutkowska","year":"2013","unstructured":"Rutkowska E, Pajak K, Jozwiak K (2013) Lipophilicity\u2013methods of determination and its role in medicinal chemistry. Acta Pol Pharm 70:3\u201318","journal-title":"Acta Pol Pharm"},{"key":"754_CR3","doi-asserted-by":"crossref","first-page":"4872","DOI":"10.1016\/j.bmcl.2008.07.071","volume":"18","author":"JD Hughes","year":"2008","unstructured":"Hughes JD, Blagg J, Price DA, Bailey S, Decrescenzo GA, Devraj RV, Ellsworth E, Fobian YM, Gibbs ME, Gilles RW et al (2008) Physiochemical drug properties associated with in vivo toxicological outcomes. Bioorg Med Chem Lett 18:4872\u20134875","journal-title":"Bioorg Med Chem Lett"},{"key":"754_CR4","first-page":"24","volume":"41","author":"C Challener","year":"2017","unstructured":"Challener C (2017) Oral delivery of biologic APIs: the challenge continues. PharmTech Home 41:24\u201328","journal-title":"PharmTech Home"},{"key":"754_CR5","doi-asserted-by":"crossref","first-page":"522","DOI":"10.1021\/acsmedchemlett.8b00047","volume":"9","author":"F Broccatelli","year":"2018","unstructured":"Broccatelli F, Aliagas I, Zheng H (2018) Why decreasing lipophilicity alone is often not a reliable strategy for extending IV half-life. ACS Med Chem Lett 9:522\u2013527","journal-title":"ACS Med Chem Lett"},{"key":"754_CR6","doi-asserted-by":"crossref","first-page":"863","DOI":"10.1517\/17460441.2012.714363","volume":"7","author":"JA Arnott","year":"2012","unstructured":"Arnott JA, Planey SL (2012) The influence of lipophilicity in drug discovery and design. Expert Opin Drug Discov 7:863\u2013875","journal-title":"Expert Opin Drug Discov"},{"key":"754_CR7","doi-asserted-by":"crossref","first-page":"635","DOI":"10.1007\/s11224-011-9741-z","volume":"22","author":"M Remko","year":"2011","unstructured":"Remko M, Boh\u00e1\u010d A, Kov\u00e1\u010dikov\u00e1 L (2011) Molecular structure, pKa, lipophilicity, solubility, absorption, polar surface area, and blood brain barrier penetration of some antiangiogenic agents. Struct Chem 22:635\u2013648","journal-title":"Struct Chem"},{"key":"754_CR8","doi-asserted-by":"crossref","first-page":"556","DOI":"10.1021\/mp0700209","volume":"4","author":"SK Bhal","year":"2007","unstructured":"Bhal SK, Kassam K, Peirson IG, Pearl GM (2007) The rule of five revisited: applying log D in place of log P in drug-likeness filters. Mol Pharm 4:556\u2013560","journal-title":"Mol Pharm"},{"key":"754_CR9","doi-asserted-by":"crossref","first-page":"3714","DOI":"10.1021\/acs.jcim.9b00541","volume":"59","author":"ZY Yang","year":"2019","unstructured":"Yang ZY, Yang ZJ, Dong J, Wang LL, Zhang LX, Ding JJ, Ding XQ, Lu AP, Hou TJ, Cao DS (2019) Structural analysis and identification of colloidal aggregators in drug discovery. J Chem Inf Model 59:3714\u20133726","journal-title":"J Chem Inf Model"},{"key":"754_CR10","doi-asserted-by":"crossref","first-page":"181","DOI":"10.1016\/j.ejps.2015.05.008","volume":"76","author":"A Andr\u00e9s","year":"2015","unstructured":"Andr\u00e9s A, Ros\u00e9s M, R\u00e0fols C, Bosch E, Espinosa S, Segarra V, Huerta JM (2015) Setup and validation of shake-flask procedures for the determination of partition coefficients (log D) from low drug amounts. Eur J Pharm Sci 76:181\u2013191","journal-title":"Eur J Pharm Sci"},{"key":"754_CR11","doi-asserted-by":"crossref","first-page":"47","DOI":"10.1016\/S0021-9673(02)00064-X","volume":"952","author":"SF Donovan","year":"2002","unstructured":"Donovan SF, Pescatore MC (2002) Method for measuring the logarithm of the octanol\u2013water partition coefficient by using short octadecyl\u2013poly (vinyl alcohol) high-performance liquid chromatography columns. J Chromatogr A 952:47\u201361","journal-title":"J Chromatogr A"},{"key":"754_CR12","doi-asserted-by":"crossref","first-page":"293","DOI":"10.1016\/j.talanta.2016.10.038","volume":"162","author":"C R\u00e0fols","year":"2017","unstructured":"R\u00e0fols C, Subirats X, Rubio J, Ros\u00e9s M, Bosch E (2017) Lipophilicity of amphoteric and zwitterionic compounds: a comparative study of determination methods. Talanta 162:293\u2013299","journal-title":"Talanta"},{"key":"754_CR13","doi-asserted-by":"crossref","first-page":"75","DOI":"10.1186\/s13321-021-00557-5","volume":"13","author":"V Venkatraman","year":"2021","unstructured":"Venkatraman V (2021) FP-ADMET: a compendium of fingerprint-based ADMET prediction models. J Cheminform 13:75","journal-title":"J Cheminform"},{"key":"754_CR14","doi-asserted-by":"crossref","first-page":"W5","DOI":"10.1093\/nar\/gkab255","volume":"49","author":"G Xiong","year":"2021","unstructured":"Xiong G, Wu Z, Yi J, Fu L, Yang Z, Hsieh C, Yin M, Zeng X, Wu C, Lu A et al (2021) ADMETlab 2.0: an integrated online platform for accurate and comprehensive predictions of ADMET properties. Nucleic Acids Res 49:W5\u2013W14","journal-title":"Nucleic Acids Res"},{"key":"754_CR15","doi-asserted-by":"crossref","first-page":"17","DOI":"10.1186\/s13321-018-0271-1","volume":"10","author":"M Lapins","year":"2018","unstructured":"Lapins M, Arvidsson S, Lampa S, Berg A, Schaal W, Alvarsson J, Spjuth O (2018) A confidence predictor for logD using conformal regression and a support-vector machine. J Cheminform 10:17","journal-title":"J Cheminform"},{"key":"754_CR16","doi-asserted-by":"crossref","first-page":"13345","DOI":"10.1007\/s00521-021-05961-4","volume":"33","author":"M Galushka","year":"2021","unstructured":"Galushka M, Swain C, Browne F, Mulvenna MD, Bond R, Gray D (2021) Prediction of chemical compounds properties using a deep learning model. Neural Comput Appl 33:13345\u201313366","journal-title":"Neural Comput Appl"},{"key":"754_CR17","doi-asserted-by":"crossref","first-page":"63","DOI":"10.1021\/acs.jcim.9b00718","volume":"60","author":"L Fu","year":"2020","unstructured":"Fu L, Liu L, Yang ZJ, Li P, Ding JJ, Yun YH, Lu AP, Hou TJ, Cao DS (2020) Systematic modeling of log D(7.4) based on ensemble machine learning, group contribution, and matched molecular pair analysis. J Chem Inf Model 60:63\u201376","journal-title":"J Chem Inf Model"},{"key":"754_CR18","doi-asserted-by":"crossref","first-page":"8749","DOI":"10.1021\/acs.jmedchem.9b00959","volume":"63","author":"Z Xiong","year":"2020","unstructured":"Xiong Z, Wang D, Liu X, Zhong F, Wan X, Li X, Li Z, Luo X, Chen K, Jiang H et al (2020) Pushing the boundaries of molecular representation for drug discovery with the graph attention mechanism. J Med Chem 63:8749\u20138760","journal-title":"J Med Chem"},{"key":"754_CR19","doi-asserted-by":"crossref","first-page":"15","DOI":"10.1186\/s13321-020-0414-z","volume":"12","author":"B Tang","year":"2020","unstructured":"Tang B, Kramer ST, Fang M, Qiu Y, Wu Z, Xu D (2020) A self-attention based message passing neural network for predicting molecular lipophilicity and aqueous solubility. J Cheminform 12:15","journal-title":"J Cheminform"},{"key":"754_CR20","doi-asserted-by":"crossref","first-page":"27955","DOI":"10.1021\/acsomega.1c03839","volume":"6","author":"T Hasebe","year":"2021","unstructured":"Hasebe T (2021) Knowledge-embedded message-passing neural networks: improving molecular property prediction with human knowledge. ACS Omega 6:27955\u201327967","journal-title":"ACS Omega"},{"key":"754_CR21","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1016\/B978-0-12-821092-5.00012-7","volume-title":"Applications of artificial intelligence in process systems engineering","author":"Y Su","year":"2021","unstructured":"Su Y, Shen W (2021) Deep learning in QSPR modeling for the prediction of critical properties. In: Ren J, Shen W, Man Y, Dong L (eds) Applications of artificial intelligence in process systems engineering. Elsevier, Amsterdam, pp 11\u201337"},{"key":"754_CR22","doi-asserted-by":"crossref","first-page":"e16678","DOI":"10.1002\/aic.16678","volume":"65","author":"Y Su","year":"2019","unstructured":"Su Y, Wang Z, Jin S, Shen W, Ren J, Eden MR (2019) An architecture of deep learning in QSPR modeling for the prediction of critical properties using molecular signatures. AlChE J 65:e16678","journal-title":"AlChE J"},{"key":"754_CR23","doi-asserted-by":"crossref","first-page":"1702","DOI":"10.1016\/j.drudis.2020.07.001","volume":"25","author":"AH G\u00f6ller","year":"2020","unstructured":"G\u00f6ller AH, Kuhnke L, Montanari F, Bonin A, Schneckener S, Ter Laak A, Wichard J, Lobell M, Hillisch A (2020) Bayer\u2019s in silico ADMET platform: a journey of machine learning over the past two decades. Drug Discov Today 25:1702\u20131709","journal-title":"Drug Discov Today"},{"key":"754_CR24","volume-title":"Systems medicine: integrative, qualitative and computational approaches","author":"O Wolkenhauer","year":"2020","unstructured":"Wolkenhauer O (2020) Systems medicine: integrative, qualitative and computational approaches. Academic Press, Cambridge"},{"key":"754_CR25","doi-asserted-by":"crossref","first-page":"8835","DOI":"10.1021\/acs.jmedchem.9b02187","volume":"63","author":"EN Feinberg","year":"2020","unstructured":"Feinberg EN, Joshi E, Pande VS, Cheng AC (2020) Improvement in ADMET prediction with multitask deep featurization. J Med Chem 63:8835\u20138848","journal-title":"J Med Chem"},{"key":"754_CR26","doi-asserted-by":"crossref","first-page":"5975","DOI":"10.1021\/acs.jcim.2c01290","volume":"62","author":"J Wu","year":"2022","unstructured":"Wu J, Wang J, Wu Z, Zhang S, Deng Y, Kang Y, Cao D, Hsieh CY, Hou T (2022) ALipSol: an attention-driven mixture-of-experts model for lipophilicity and solubility prediction. J Chem Inf Model 62:5975\u20135987","journal-title":"J Chem Inf Model"},{"key":"754_CR27","doi-asserted-by":"crossref","first-page":"771","DOI":"10.1007\/s10822-021-00397-3","volume":"35","author":"TD Bergazin","year":"2021","unstructured":"Bergazin TD, Tielker N, Zhang Y, Mao J, Gunner MR, Francisco K, Ballatore C, Kast SM, Mobley DL (2021) Evaluation of log P, pK(a), and log D predictions from the SAMPL7 blind challenge. J Comput Aided Mol Des 35:771\u2013802","journal-title":"J Comput Aided Mol Des"},{"key":"754_CR28","doi-asserted-by":"crossref","first-page":"1171","DOI":"10.2174\/1568026033452078","volume":"3","author":"DJ Livingstone","year":"2003","unstructured":"Livingstone DJ (2003) Theoretical property predictions. Curr Top Med Chem 3:1171\u20131192","journal-title":"Curr Top Med Chem"},{"key":"754_CR29","doi-asserted-by":"crossref","first-page":"1345","DOI":"10.1109\/TKDE.2009.191","volume":"22","author":"SJ Pan","year":"2010","unstructured":"Pan SJ, Yang Q (2010) A survey on transfer learning. IEEE Trans Knowl Data Eng 22:1345\u20131359","journal-title":"IEEE Trans Knowl Data Eng"},{"key":"754_CR30","doi-asserted-by":"crossref","first-page":"41","DOI":"10.1023\/A:1007379606734","volume":"28","author":"R Caruana","year":"1997","unstructured":"Caruana R (1997) Multitask learning. Mach Learn 28:41\u201375","journal-title":"Mach Learn"},{"key":"754_CR31","doi-asserted-by":"crossref","first-page":"253","DOI":"10.1007\/s10822-022-00450-9","volume":"36","author":"I Aliagas","year":"2022","unstructured":"Aliagas I, Gobbi A, Lee ML, Sellers BD (2022) Comparison of logP and logD correction models trained with public and proprietary data sets. J Comput Aided Mol Des 36:253\u2013262","journal-title":"J Comput Aided Mol Des"},{"key":"754_CR32","doi-asserted-by":"publisher","DOI":"10.4855\/arXiv.2011.12117","author":"N Lukashina","year":"2020","unstructured":"Lukashina N, Alenicheva A, Vlasova E, Kondiukov A, Khakimova A, Magerramov E, Churikov N, Shpilman A (2020) Lipophilicity prediction with multitask learning and molecular substructures representation. arXiv. https:\/\/doi.org\/10.4855\/arXiv.2011.12117","journal-title":"arXiv"},{"key":"754_CR33","doi-asserted-by":"crossref","first-page":"6185","DOI":"10.3390\/molecules26206185","volume":"26","author":"O Wieder","year":"2021","unstructured":"Wieder O, Kuenemann M, Wieder M, Seidel T, Meyer C, Bryant SD, Langer T (2021) Improved lipophilicity and aqueous solubility prediction with composite graph neural networks. Molecules 26:6185","journal-title":"Molecules"},{"key":"754_CR34","doi-asserted-by":"crossref","first-page":"e08563","DOI":"10.1016\/j.heliyon.2021.e08563","volume":"7","author":"J Parinet","year":"2021","unstructured":"Parinet J (2021) Predicting reversed-phase liquid chromatographic retention times of pesticides by deep neural networks. Heliyon 7:e08563","journal-title":"Heliyon"},{"key":"754_CR35","doi-asserted-by":"crossref","first-page":"1906","DOI":"10.1021\/acs.jcim.2c01373","volume":"63","author":"ZM Win","year":"2023","unstructured":"Win ZM, Cheong AMY, Hopkins WS (2023) Using machine learning to predict partition coefficient (Log P) and distribution coefficient (Log D) with molecular descriptors and liquid chromatography retention time. J Chem Inf Model 63:1906\u20131913","journal-title":"J Chem Inf Model"},{"key":"754_CR36","doi-asserted-by":"crossref","first-page":"5811","DOI":"10.1038\/s41467-019-13680-7","volume":"10","author":"X Domingo-Almenara","year":"2019","unstructured":"Domingo-Almenara X, Guijas C, Billings E, Montenegro-Burke JR, Uritboonthai W, Aisporna AE, Chen E, Benton HP, Siuzdak G (2019) The METLIN small molecule dataset for machine learning-based retention time prediction. Nat Commun 10:5811","journal-title":"Nat Commun"},{"key":"754_CR37","doi-asserted-by":"crossref","first-page":"2345","DOI":"10.1021\/acs.jcim.2c01564","volume":"63","author":"Y-J Duan","year":"2023","unstructured":"Duan Y-J, Fu L, Zhang X-C, Long T-Z, He Y-H, Liu Z-Q, Lu A-P, Deng Y-F, Hsieh C-Y, Hou T-J et al (2023) Improved GNNs for Log D7.4 prediction by transferring knowledge from low-fidelity data. J Chem Inf Model 63:2345\u20132359","journal-title":"J Chem Inf Model"},{"key":"754_CR38","doi-asserted-by":"crossref","first-page":"1136","DOI":"10.1021\/ci025515j","volume":"42","author":"IV Tetko","year":"2002","unstructured":"Tetko IV, Tanchuk VY (2002) Application of associative neural networks for prediction of lipophilicity in ALOGPS 2.1 program. J Chem Inf Comput Sci 42:1136\u20131145","journal-title":"J Chem Inf Comput Sci"},{"key":"754_CR39","unstructured":"ChemAxon Marvin Suite;. ChemAxon Inc, 2017."},{"key":"754_CR40","doi-asserted-by":"crossref","first-page":"D930","DOI":"10.1093\/nar\/gky1075","volume":"47","author":"D Mendez","year":"2019","unstructured":"Mendez D, Gaulton A, Bento AP, Chambers J, De Veij M, F\u00e9lix E, Magari\u00f1os MP, Mosquera JF, Mutowo P, Nowotka M (2019) ChEMBL: towards direct deposition of bioassay data. Nucleic Acids Res 47:D930\u2013D940","journal-title":"Nucleic Acids Res"},{"key":"754_CR41","first-page":"4","volume":"1","author":"G Landrum","year":"2013","unstructured":"Landrum G (2013) Rdkit documentation. Release 1:4","journal-title":"Release"},{"key":"754_CR42","doi-asserted-by":"crossref","first-page":"513","DOI":"10.1039\/C7SC02664A","volume":"9","author":"Z Wu","year":"2018","unstructured":"Wu Z, Ramsundar B, Feinberg EN, Gomes J, Geniesse C, Pappu AS, Leswing K, Pande V (2018) MoleculeNet: a benchmark for molecular machine learning. Chem Sci 9:513\u2013530","journal-title":"Chem Sci"},{"key":"754_CR43","unstructured":"The Physical Properties Database (PHYSPROP) by Syracuse Research Corporation (SRC) https:\/\/www.srcinc.com\/what-we-do\/environmental\/scientific-databases.html"},{"key":"754_CR44","doi-asserted-by":"crossref","first-page":"46","DOI":"10.1021\/ci010056s","volume":"42","author":"WD Ihlenfeldt","year":"2002","unstructured":"Ihlenfeldt WD, Voigt JH, Bienfait B, Oellien F, Nicklaus MC (2002) Enhanced CACTVS browser of the open NCI database. J Chem Inf Comput Sci 42:46\u201357","journal-title":"J Chem Inf Comput Sci"},{"key":"754_CR45","doi-asserted-by":"crossref","first-page":"533","DOI":"10.1007\/s10822-011-9440-2","volume":"25","author":"I Sushko","year":"2011","unstructured":"Sushko I, Novotarskyi S, Korner R, Pandey AK, Rupp M, Teetz W, Brandmaier S, Abdelaziz A, Prokopenko VV, Tanchuk VY et al (2011) Online chemical modeling environment (OCHEM): web platform for data storage, model development and publishing of chemical information. J Comput Aided Mol Des 25:533\u2013554","journal-title":"J Comput Aided Mol Des"},{"key":"754_CR46","doi-asserted-by":"crossref","first-page":"21","DOI":"10.1016\/j.ejps.2012.10.019","volume":"48","author":"S Martel","year":"2013","unstructured":"Martel S, Gillerat F, Carosati E, Maiarelli D, Tetko IV, Mannhold R, Carrupt PA (2013) Large, chemically diverse dataset of logP measurements for benchmarking studies. Eur J Pharm Sci 48:21\u201329","journal-title":"Eur J Pharm Sci"},{"key":"754_CR47","doi-asserted-by":"crossref","first-page":"31","DOI":"10.17849\/insm-47-01-31-39.1","volume":"47","author":"SJ Rigatti","year":"2017","unstructured":"Rigatti SJ (2017) Random forest. J Insur Med 47:31\u201339","journal-title":"J Insur Med"},{"key":"754_CR48","doi-asserted-by":"crossref","first-page":"1565","DOI":"10.1038\/nbt1206-1565","volume":"24","author":"WS Noble","year":"2006","unstructured":"Noble WS (2006) What is a support vector machine? Nat Biotechnol 24:1565\u20131567","journal-title":"Nat Biotechnol"},{"key":"754_CR49","doi-asserted-by":"crossref","first-page":"31","DOI":"10.1109\/2.485891","volume":"29","author":"AK Jain","year":"1996","unstructured":"Jain AK, Mao J, Mohiuddin KM (1996) Artificial neural networks: a tutorial. Computer 29:31\u201344","journal-title":"Computer"},{"issue":"1","key":"754_CR50","first-page":"1","volume":"04\u20132","author":"T Chen","year":"2015","unstructured":"Chen T, He T, Benesty M, Khotilovich V, Tang Y, Cho H, Chen K, Mitchell R, Cano I, Zhou T (2015) Xgboost: extreme gradient boosting. R Package Version 04\u20132(1):1\u20134","journal-title":"R Package Version"},{"key":"754_CR51","first-page":"2825","volume":"12","author":"F Pedregosa","year":"2011","unstructured":"Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V (2011) Scikit-learn: machine learning in python. J Mach Learn Res 12:2825\u20132830","journal-title":"J Mach Learn Res"},{"key":"754_CR52","doi-asserted-by":"crossref","first-page":"27233","DOI":"10.1021\/acsomega.1c04017","volume":"6","author":"M Li","year":"2021","unstructured":"Li M, Zhou J, Hu J, Fan W, Zhang Y, Gu Y, Karypis G (2021) Dgl-lifesci: an open-source toolkit for deep learning on graphs in life science. ACS Omega 6:27233\u201327238","journal-title":"ACS Omega"},{"key":"754_CR53","doi-asserted-by":"crossref","first-page":"792","DOI":"10.1093\/bioinformatics\/btab714","volume":"38","author":"J Xiong","year":"2022","unstructured":"Xiong J, Li Z, Wang G, Fu Z, Zhong F, Xu T, Liu X, Huang Z, Liu X, Chen K et al (2022) Multi-instance learning of graph neural networks for aqueous pKa prediction. Bioinformatics 38:792\u2013798","journal-title":"Bioinformatics"},{"key":"754_CR54","doi-asserted-by":"crossref","first-page":"2200","DOI":"10.1021\/acs.analchem.0c04071","volume":"93","author":"Q Yang","year":"2021","unstructured":"Yang Q, Ji H, Lu H, Zhang Z (2021) Prediction of liquid chromatographic retention time with graph neural networks to assist in small molecule identification. Anal Chem 93:2200\u20132206","journal-title":"Anal Chem"},{"key":"754_CR55","doi-asserted-by":"crossref","first-page":"2068","DOI":"10.1021\/acs.jcim.7b00146","volume":"57","author":"B Ramsundar","year":"2017","unstructured":"Ramsundar B, Liu B, Wu Z, Verras A, Tudor M, Sheridan RP, Pande V (2017) Is Multitask deep learning practical for pharma? J Chem Inf Model 57:2068\u20132076","journal-title":"J Chem Inf Model"},{"key":"754_CR56","doi-asserted-by":"crossref","first-page":"742","DOI":"10.1021\/ci100050t","volume":"50","author":"D Rogers","year":"2010","unstructured":"Rogers D, Hahn M (2010) Extended-connectivity fingerprints. J Chem Inf Model 50:742\u2013754","journal-title":"J Chem Inf Model"},{"key":"754_CR57","first-page":"2579","volume":"9","author":"M Lvd","year":"2008","unstructured":"Lvd M, Hinton GE (2008) Visualizing data using t-SNE. J Mach Learn Res 9:2579\u20132605","journal-title":"J Mach Learn Res"},{"key":"754_CR58","doi-asserted-by":"publisher","DOI":"10.4855\/arXiv.1706.05098","author":"S Ruder","year":"2017","unstructured":"Ruder S (2017) An overview of multi-task learning in deep neural networks. arXiv. https:\/\/doi.org\/10.4855\/arXiv.1706.05098","journal-title":"arXiv"}],"container-title":["Journal of Cheminformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-023-00754-4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s13321-023-00754-4\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-023-00754-4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,11,18]],"date-time":"2023-11-18T17:29:12Z","timestamp":1700328552000},"score":1,"resource":{"primary":{"URL":"https:\/\/jcheminf.biomedcentral.com\/articles\/10.1186\/s13321-023-00754-4"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,9,5]]},"references-count":58,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2023,12]]}},"alternative-id":["754"],"URL":"https:\/\/doi.org\/10.1186\/s13321-023-00754-4","relation":{},"ISSN":["1758-2946"],"issn-type":[{"value":"1758-2946","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,9,5]]},"assertion":[{"value":"12 June 2023","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"25 August 2023","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"5 September 2023","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare no competing interests.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"76"}}