{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,10]],"date-time":"2026-06-10T12:28:20Z","timestamp":1781094500910,"version":"3.54.1"},"reference-count":99,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2024,11,14]],"date-time":"2024-11-14T00:00:00Z","timestamp":1731542400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2024,11,14]],"date-time":"2024-11-14T00:00:00Z","timestamp":1731542400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"name":"Czech Science Foundation Grant","award":["22-17367O"],"award-info":[{"award-number":["22-17367O"]}]},{"name":"Ministry of Education, Youth and Sports of the Czech Republic","award":["LM2023052"],"award-info":[{"award-number":["LM2023052"]}]},{"DOI":"10.13039\/100018694","name":"HORIZON EUROPE Marie Sklodowska-Curie Actions","doi-asserted-by":"publisher","award":["955879"],"award-info":[{"award-number":["955879"]}],"id":[{"id":"10.13039\/100018694","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Dutch National Growth Fund","award":["NGFOP2201"],"award-info":[{"award-number":["NGFOP2201"]}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Cheminform"],"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>\n                    Building reliable and robust quantitative structure\u2013property relationship (QSPR) models is a challenging task. First, the experimental data needs to be obtained, analyzed and curated. Second, the number of available methods is continuously growing and evaluating different algorithms and methodologies can be arduous. Finally, the last hurdle that researchers face is to ensure the reproducibility of their models and facilitate their transferability into practice. In this work, we introduce QSPRpred, a toolkit for analysis of bioactivity data sets and QSPR modelling, which attempts to address the aforementioned challenges. QSPRpred\u2019s modular Python API enables users to intuitively describe different parts of a modelling workflow using a plethora of pre-implemented components, but also integrates customized implementations in a \u201cplug-and-play\u201d manner. QSPRpred data sets and models are directly serializable, which means they can be readily reproduced and put into operation after training as the models are saved with all required data pre-processing steps to make predictions on new compounds directly from SMILES strings. The general-purpose character of QSPRpred is also demonstrated by inclusion of support for multi-task and proteochemometric modelling. The package is extensively documented and comes with a large collection of tutorials to help new users. In this paper, we describe all of QSPRpred\u2019s functionalities and also conduct a small benchmarking case study to illustrate how different components can be leveraged to compare a diverse set of models. QSPRpred is fully open-source and available at\n                    <jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" ext-link-type=\"uri\" xlink:href=\"https:\/\/github.com\/CDDLeiden\/QSPRpred\">https:\/\/github.com\/CDDLeiden\/QSPRpred<\/jats:ext-link>\n                    .\n                  <\/jats:p>\n                  <jats:p>\n                    <jats:bold>Scientific Contribution<\/jats:bold>\n                  <\/jats:p>\n                  <jats:p>QSPRpred aims to provide a complex, but comprehensive Python API to conduct all tasks encountered in QSPR modelling from data preparation and analysis to model creation and model deployment. In contrast to similar packages, QSPRpred offers a wider and more exhaustive range of capabilities and integrations with many popular packages that also go beyond QSPR modelling. A significant contribution of QSPRpred is also in its automated and highly standardized serialization scheme, which significantly improves reproducibility and transferability of models.<\/jats:p>","DOI":"10.1186\/s13321-024-00908-y","type":"journal-article","created":{"date-parts":[[2024,11,14]],"date-time":"2024-11-14T11:04:42Z","timestamp":1731582282000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":18,"title":["QSPRpred: a Flexible Open-Source Quantitative Structure-Property Relationship Modelling Tool"],"prefix":"10.1186","volume":"16","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-9718-7806","authenticated-orcid":false,"given":"Helle W.","family":"van den Maagdenberg","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8771-1731","authenticated-orcid":false,"given":"Martin","family":"\u0160\u00edcho","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5104-1959","authenticated-orcid":false,"given":"David Alencar","family":"Araripe","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9387-1427","authenticated-orcid":false,"given":"Sohvi","family":"Luukkonen","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9879-1004","authenticated-orcid":false,"given":"Linde","family":"Schoenmaker","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0003-2083-0159","authenticated-orcid":false,"given":"Michiel","family":"Jespers","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7554-9220","authenticated-orcid":false,"given":"Olivier J. M.","family":"B\u00e9quignon","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1568-0881","authenticated-orcid":false,"given":"Marina Gorostiola","family":"Gonz\u00e1lez","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0008-5661-1157","authenticated-orcid":false,"given":"Remco L.","family":"van den Broek","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0058-3678","authenticated-orcid":false,"given":"Andrius","family":"Bernatavicius","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1664-7314","authenticated-orcid":false,"given":"J. G. Coen","family":"van Hasselt","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1314-3484","authenticated-orcid":false,"given":"Piet. H.","family":"van der Graaf","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0717-1817","authenticated-orcid":false,"given":"Gerard J. P.","family":"van Westen","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"297","published-online":{"date-parts":[[2024,11,14]]},"reference":[{"issue":"11","key":"908_CR1","doi-asserted-by":"publisher","first-page":"3525","DOI":"10.1039\/D0CS00098A","volume":"49","author":"EN Muratov","year":"2020","unstructured":"Muratov EN, Bajorath J, Sheridan RP, Tetko IV, Filimonov D, Poroikov V, Oprea TI, Baskin II, Varnek A, Roitberg A, Isayev O, Curtalolo S, Fourches D, Cohen Y, Aspuru-Guzik A, Winkler DA, Agrafiotis D, Cherkasov A, Tropsha A (2020) QSAR without borders. Chem Soc Rev 49(11):3525\u20133564. https:\/\/doi.org\/10.1039\/D0CS00098A","journal-title":"Chem Soc Rev"},{"key":"908_CR2","doi-asserted-by":"publisher","unstructured":"Hansch C, Fujita T (1964) p-$$\\upsigma$$-$$\\upphi$$ Analysis. A Method for the Correlation of Biological Activity and Chemical Structure. Journal of the American Chemical Society 86(8), 1616\u20131626 https:\/\/doi.org\/10.1021\/ja01062a035","DOI":"10.1021\/ja01062a035"},{"issue":"9","key":"908_CR3","doi-asserted-by":"publisher","first-page":"949","DOI":"10.1080\/17460441.2021.1909567","volume":"16","author":"J Jim\u00e9nez-Luna","year":"2021","unstructured":"Jim\u00e9nez-Luna J, Grisoni F, Weskamp N, Schneider G (2021) Artificial intelligence in drug discovery: recent advances and future perspectives. Exp Opin Drug Disc 16(9):949\u2013959. https:\/\/doi.org\/10.1080\/17460441.2021.1909567","journal-title":"Exp Opin Drug Disc"},{"issue":"1","key":"908_CR4","doi-asserted-by":"publisher","first-page":"2000113","DOI":"10.1002\/minf.202000113","volume":"40","author":"VM Alves","year":"2021","unstructured":"Alves VM, Bobrowski T, Melo-Filho CC, Korn D, Auerbach S, Schmitt C, Muratov EN, Tropsha A (2021) QSAR Modeling of SARS-CoV Mpro Inhibitors Identifies Sufugolix, Cenicriviroc, Proglumetacin, and other Drugs as Candidates for Repurposing against SARS-CoV-2. Mol Inform 40(1):2000113. https:\/\/doi.org\/10.1002\/minf.202000113","journal-title":"Mol Inform"},{"issue":"21","key":"908_CR5","doi-asserted-by":"publisher","first-page":"5172","DOI":"10.3390\/molecules25215172","volume":"25","author":"E Tejera","year":"2020","unstructured":"Tejera E, Munteanu CR, L\u00f3pez-Cort\u00e9s A, Cabrera-Andrade A, P\u00e9rez-Castillo Y (2020) Drugs Repurposing Using QSAR, Docking and Molecular Dynamics for Possible Inhibitors of the SARS-CoV-2 Mpro Protease. Molecules 25(21):5172. https:\/\/doi.org\/10.3390\/molecules25215172","journal-title":"Molecules"},{"issue":"4","key":"908_CR6","doi-asserted-by":"publisher","first-page":"856","DOI":"10.1007\/s11095-015-1832-x","volume":"33","author":"PAJ V\u00e4litalo","year":"2016","unstructured":"V\u00e4litalo PAJ, Griffioen K, Rizk ML, Visser SAG, Danhof M, Rao G, Graaf PH, Hasselt JGC (2016) Structure-Based Prediction of Anti-infective Drug Concentrations in the Human Lung Epithelial Lining Fluid. Pharmac Res 33(4):856\u2013867. https:\/\/doi.org\/10.1007\/s11095-015-1832-x","journal-title":"Pharmac Res"},{"key":"908_CR7","doi-asserted-by":"publisher","DOI":"10.1016\/j.molliq.2021.117631","volume":"344","author":"K Paduszy\u0144ski","year":"2021","unstructured":"Paduszy\u0144ski K, Klebowski K, Kr\u00f3likowska M (2021) Predicting melting point of ionic liquids using QSPR approach: Literature review and new models. J Mol Liq 344:117631. https:\/\/doi.org\/10.1016\/j.molliq.2021.117631","journal-title":"J Mol Liq"},{"issue":"8","key":"908_CR8","doi-asserted-by":"publisher","first-page":"2209","DOI":"10.1016\/j.drudis.2022.03.017","volume":"27","author":"N Pillai","year":"2022","unstructured":"Pillai N, Dasgupta A, Sudsakorn S, Fretland J, Mavroudis PD (2022) Machine Learning guided early drug discovery of small molecules. Drug Disc Today 27(8):2209\u20132215. https:\/\/doi.org\/10.1016\/j.drudis.2022.03.017","journal-title":"Drug Disc Today"},{"issue":"D1","key":"908_CR9","doi-asserted-by":"publisher","first-page":"930","DOI":"10.1093\/nar\/gky1075","volume":"47","author":"D Mendez","year":"2019","unstructured":"Mendez D, Gaulton A, Bento AP, Chambers J, De Veij M, F\u00e9lix E, Magari\u00f1os M, Mosquera J, Mutowo P, Nowotka M, Gordillo-Mara\u00f1\u00f3n M, Hunter F, Junco L, Mugumbate G, Rodriguez-Lopez M, Atkinson F, Bosc N, Radoux C, Segura-Cabrera A, Hersey A, Leach A (2019) ChEMBL: towards direct deposition of bioassay data. Nucleic Acids Res 47(D1):930\u2013940. https:\/\/doi.org\/10.1093\/nar\/gky1075","journal-title":"Nucleic Acids Res"},{"issue":"D1","key":"908_CR10","doi-asserted-by":"publisher","first-page":"1373","DOI":"10.1093\/nar\/gkac956","volume":"51","author":"S Kim","year":"2023","unstructured":"Kim S, Chen J, Cheng T, Gindulyte A, He J, He S, Li Q, Shoemaker BA, Thiessen PA, Yu B, Zaslavsky L, Zhang J, Bolton EE (2023) PubChem 2023 update. Nucleic Acids Res 51(D1):1373\u20131380. https:\/\/doi.org\/10.1093\/nar\/gkac956","journal-title":"Nucleic Acids Res"},{"issue":"12","key":"908_CR11","doi-asserted-by":"publisher","first-page":"4977","DOI":"10.1021\/jm4004285","volume":"57","author":"A Cherkasov","year":"2014","unstructured":"Cherkasov A, Muratov EN, Fourches D, Varnek A, Baskin II, Cronin M, Dearden J, Gramatica P, Martin YC, Todeschini R, Consonni V, Kuz\u2019min VE, Cramer R, Benigni R, Yang C, Rathman J, Terfloth L, Gasteiger J, Richard A, Tropsha A (2014) QSAR Modeling: where have you been? Where are you going to? J Med Chem 57(12):4977\u20135010. https:\/\/doi.org\/10.1021\/jm4004285","journal-title":"J Med Chem"},{"key":"908_CR12","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/s41573-023-00832-0","volume":"1","author":"A Tropsha","year":"2023","unstructured":"Tropsha A, Isayev O, Varnek A, Schneider G, Cherkasov A (2023) Integrating QSAR modelling and deep learning in drug discovery: the emergence of deep QSAR. Nat Rev Drug Disc 1:1\u201315. https:\/\/doi.org\/10.1038\/s41573-023-00832-0","journal-title":"Nat Rev Drug Disc"},{"key":"908_CR13","doi-asserted-by":"publisher","first-page":"89","DOI":"10.1016\/j.ddtec.2020.08.003","volume":"32","author":"BJ Bongers","year":"2019","unstructured":"Bongers BJ, IJzerman AP, Van\u00a0Westen GJP (2019) Proteochemometrics: recent developments in bioactivity and selectivity modeling. Drug Disc Today 32:89\u201398. https:\/\/doi.org\/10.1016\/j.ddtec.2020.08.003","journal-title":"Drug Disc Today"},{"key":"908_CR14","doi-asserted-by":"publisher","unstructured":"Cort\u00e9s-Ciriano I, UI Ain Q, Subramanian V, Lenselink BE, M\u00e9ndez-Lucio O, IJzerman PA, Wohlfahrt G, Prusis P, Malliavin ET, Westen GJPV, Bender A (2015) Polypharmacology modelling using proteochemometrics (PCM): recent methodological developments, applications to target families, and future prospects. MedChemComm 6(1):24\u201350. https:\/\/doi.org\/10.1039\/C4MD00216D","DOI":"10.1039\/C4MD00216D"},{"issue":"9","key":"908_CR15","doi-asserted-by":"publisher","first-page":"4283","DOI":"10.1021\/acs.jcim.9b01204","volume":"60","author":"L Burggraaff","year":"2020","unstructured":"Burggraaff L, Lenselink EB, Jespers W, Engelen J, Bongers BJ, Gorostiola\u00a0Gonz\u00e1lez M, Liu R, Hoos HH, Vlijmen HWT, IJzerman AP, Westen G.J.P (2020) Successive statistical and structure-based modeling to identify chemically novel kinase inhibitors. J Chem Inform Model 60(9):4283\u20134295. https:\/\/doi.org\/10.1021\/acs.jcim.9b01204","journal-title":"J Chem Inform Model"},{"issue":"1","key":"908_CR16","doi-asserted-by":"publisher","first-page":"74","DOI":"10.1186\/s13321-023-00745-5","volume":"15","author":"M Gorostiola\u00a0Gonz\u00e1lez","year":"2023","unstructured":"Gorostiola\u00a0Gonz\u00e1lez M, Broek R.L, Braun TGM, Chatzopoulou M, Jespers W, IJzerman AP, Heitman LH, Westen GJP (2023) 3DDPDs: describing protein dynamics for proteochemometric bioactivity prediction. A case for (mutant) G protein-coupled receptors. J Cheminform 15(1):74. https:\/\/doi.org\/10.1186\/s13321-023-00745-5","journal-title":"J Cheminform"},{"issue":"2","key":"908_CR17","doi-asserted-by":"publisher","first-page":"240","DOI":"10.1021\/acs.jcim.1c00889","volume":"62","author":"J Born","year":"2022","unstructured":"Born J, Huynh T, Stroobants A, Cornell WD, Manica M (2022) Active Site Sequence Representations of Human Kinases Outperform Full Sequence Representations for Affinity Prediction and Inhibitor Generation: 3D Effects in a 1D Model. J Chem Inform Model 62(2):240\u2013257. https:\/\/doi.org\/10.1021\/acs.jcim.1c00889","journal-title":"J Chem Inform Model"},{"key":"908_CR18","doi-asserted-by":"publisher","unstructured":"Lenselink EB, Dijke N, Bongers B, Papadatos G, Vlijmen HWT, Kowalczyk W, IJzerman AP, Westen GJP (2017) Beyond the hype: deep neural networks outperform established methods using a ChEMBL bioactivity benchmark set. Journal of Cheminformatics 9:45. https:\/\/doi.org\/10.1186\/s13321-017-0232-0","DOI":"10.1186\/s13321-017-0232-0"},{"issue":"1","key":"908_CR19","doi-asserted-by":"publisher","first-page":"11","DOI":"10.1186\/s13321-020-0413-0","volume":"12","author":"B Playe","year":"2020","unstructured":"Playe B, Stoven V (2020) Evaluation of deep and shallow learning methods in chemogenomics for the prediction of drugs specificity. J Cheminform 12(1):11. https:\/\/doi.org\/10.1186\/s13321-020-0413-0","journal-title":"J Cheminform"},{"issue":"1","key":"908_CR20","doi-asserted-by":"publisher","first-page":"16","DOI":"10.1186\/s13321-023-00689-w","volume":"15","author":"H Atas Guvenilir","year":"2023","unstructured":"Atas Guvenilir H, Do\u011fan T (2023) How to approach machine learning-based prediction of drug\/compound-target interactions. J Cheminform 15(1):16. https:\/\/doi.org\/10.1186\/s13321-023-00689-w","journal-title":"J Cheminform"},{"issue":"4","key":"908_CR21","doi-asserted-by":"publisher","first-page":"1657","DOI":"10.1021\/acs.jcim.1c00086","volume":"61","author":"A Lopez-del Rio","year":"2021","unstructured":"Lopez-del Rio A, Picart-Armada S, Perera-Lluna A (2021) Balancing data on deep learning-based proteochemometric activity classification. J Chem Inform Model 61(4):1657\u20131669. https:\/\/doi.org\/10.1021\/acs.jcim.1c00086","journal-title":"J Chem Inform Model"},{"issue":"12","key":"908_CR22","doi-asserted-by":"publisher","first-page":"3688","DOI":"10.1021\/acs.jcim.3c00132","volume":"63","author":"S Luukkonen","year":"2023","unstructured":"Luukkonen S, Meijer E, Tricarico GA, Hofmans J, Stouten PFW, Westen GJP, Lenselink EB (2023) Large-scale modeling of sparse protein kinase activity data. J Chem Inform Model 63(12):3688\u20133696. https:\/\/doi.org\/10.1021\/acs.jcim.3c00132","journal-title":"J Chem Inform Model"},{"issue":"9","key":"908_CR23","doi-asserted-by":"publisher","first-page":"1011301","DOI":"10.1371\/journal.pcbi.1011301","volume":"19","author":"GK Kanev","year":"2023","unstructured":"Kanev GK, Zhang Y, Kooistra AJ, Bender A, Leurs R, Bailey D, W\u00fcrdinger T, Graaf CD, Esch IJPD, Westerman BA (2023) Predicting the target landscape of kinase inhibitors using 3D convolutional neural networks. PLOS Comput Biol 19(9):1011301. https:\/\/doi.org\/10.1371\/journal.pcbi.1011301","journal-title":"PLOS Comput Biol"},{"issue":"1","key":"908_CR24","doi-asserted-by":"publisher","first-page":"17816","DOI":"10.1038\/s41598-023-45086-3","volume":"13","author":"T Janela","year":"2023","unstructured":"Janela T, Bajorath J (2023) Rationalizing general limitations in assessing and comparing methods for compound potency prediction. Scientific Reports 13(1):17816. https:\/\/doi.org\/10.1038\/s41598-023-45086-3","journal-title":"Scientific Reports"},{"key":"908_CR25","doi-asserted-by":"publisher","unstructured":"McElfresh D, Khandagale S, Valverde JC, Feuer VP, Hegde B, Ramakrishnan GC, Goldblum M, White C (2023) When Do Neural Nets Outperform Boosted Trees on Tabular Data? arXiv. https:\/\/doi.org\/10.48550\/arXiv.2305.02997","DOI":"10.48550\/arXiv.2305.02997"},{"key":"908_CR26","doi-asserted-by":"publisher","unstructured":"Grinsztajn L, Oyallon E, Varoquaux G (2022) Why do tree-based models still outperform deep learning on tabular data? arXiv. https:\/\/doi.org\/10.48550\/arXiv.2207.08815","DOI":"10.48550\/arXiv.2207.08815"},{"issue":"8","key":"908_CR27","doi-asserted-by":"publisher","first-page":"3370","DOI":"10.1021\/acs.jcim.9b00237","volume":"59","author":"K Yang","year":"2019","unstructured":"Yang K, Swanson K, Jin W, Coley C, Eiden P, Gao H, Guzman-Perez A, Hopper T, Kelley B, Mathea M, Palmer A, Settels V, Jaakkola T, Jensen K, Barzilay R (2019) Analyzing learned molecular representations for property prediction. J Chem Inform Model 59(8):3370\u20133388. https:\/\/doi.org\/10.1021\/acs.jcim.9b00237","journal-title":"J Chem Inform Model"},{"issue":"1","key":"908_CR28","doi-asserted-by":"publisher","first-page":"6395","DOI":"10.1038\/s41467-023-41948-6","volume":"14","author":"J Deng","year":"2023","unstructured":"Deng J, Yang Z, Wang H, Ojima I, Samaras D, Wang F (2023) A systematic study of key elements underlying molecular property prediction. Nat Commun 14(1):6395. https:\/\/doi.org\/10.1038\/s41467-023-41948-6","journal-title":"Nat Commun"},{"issue":"2","key":"908_CR29","doi-asserted-by":"publisher","first-page":"513","DOI":"10.1039\/C7SC02664A","volume":"9","author":"Z Wu","year":"2018","unstructured":"Wu Z, Ramsundar B, Feinberg EN, Gomes J, Geniesse C, Pappu AS, Leswing K, Pande V (2018) MoleculeNet: a benchmark for molecular machine learning. Chem Sci 9(2):513\u2013530. https:\/\/doi.org\/10.1039\/C7SC02664A","journal-title":"Chem Sci"},{"key":"908_CR30","doi-asserted-by":"publisher","unstructured":"Tossou P, Wognum C, Craig M, Mary H, Noutahi E (2023) Real-World Molecular Out-Of-Distribution: Specification and Investigation. ChemRxiv. https:\/\/doi.org\/10.26434\/chemrxiv-2023-q11q4-v2","DOI":"10.26434\/chemrxiv-2023-q11q4-v2"},{"key":"908_CR31","unstructured":"Steshin S (2023) Lo-Hi: Practical ML Drug Discovery Benchmark. arXiv. http:\/\/arxiv.org\/abs\/2310.06399 Accessed 2023-12-11"},{"issue":"23","key":"908_CR32","doi-asserted-by":"publisher","first-page":"5938","DOI":"10.1021\/acs.jcim.2c01073","volume":"62","author":"D Tilborg","year":"2022","unstructured":"Tilborg D, Alenicheva A, Grisoni F (2022) Exposing the limitations of molecular machine learning with activity cliffs. J Chem Inform Model 62(23):5938\u20135951. https:\/\/doi.org\/10.1021\/acs.jcim.2c01073","journal-title":"J Chem Inform Model"},{"issue":"1","key":"908_CR33","doi-asserted-by":"publisher","first-page":"73","DOI":"10.1186\/s13321-023-00743-7","volume":"15","author":"D Boldini","year":"2023","unstructured":"Boldini D, Grisoni F, Kuhn D, Friedrich L, Sieber SA (2023) Practical guidelines for the use of gradient boosting for molecular property prediction. J Cheminform 15(1):73. https:\/\/doi.org\/10.1186\/s13321-023-00743-7","journal-title":"J Cheminform"},{"issue":"6","key":"908_CR34","doi-asserted-by":"publisher","first-page":"1127","DOI":"10.1021\/acs.jcim.6b00088","volume":"56","author":"P Gramatica","year":"2016","unstructured":"Gramatica P, Sangion A (2016) A historical excursus on the statistical validation parameters for QSAR models: a clarification concerning metrics and terminology. J Chem Inform Model 56(6):1127\u20131131. https:\/\/doi.org\/10.1021\/acs.jcim.6b00088","journal-title":"J Chem Inform Model"},{"key":"908_CR35","unstructured":"Walters P (2023) Comparing Classification Models - You\u2019re Probably Doing It Wrong. https:\/\/practicalcheminformatics.blogspot.com\/2023\/11\/comparing-classification-models-youre.html Accessed 2023-12-07"},{"key":"908_CR36","unstructured":"Walters P (2023) We Need Better Benchmarks for Machine Learning in Drug Discovery. http:\/\/practicalcheminformatics.blogspot.com\/2023\/08\/we-need-better-benchmarks-for-machine.html Accessed 2023-12-08"},{"key":"908_CR37","doi-asserted-by":"publisher","DOI":"10.1021\/acs.jcim.4c00049","author":"GA Landrum","year":"2024","unstructured":"Landrum GA, Riniker S (2024) Combining IC50 or Ki Values from Different Sources Is a Source of Significant Noise. J Chem Inform Model. https:\/\/doi.org\/10.1021\/acs.jcim.4c00049","journal-title":"J Chem Inform Model"},{"key":"908_CR38","unstructured":"Sciences ENAo, Affairs PaG, Science E, Information BoRD, Sciences DoEaP, Statistics CoAaT, Analytics BoMS, Studies DoEaL, Board NaRS, Education DoBaSS, Statistics CoN, Behavioral C, Science CoRaRi (2019) Understanding Reproducibility and Replicability. In: Reproducibility and Replicability in Science. National Academies Press (US), Washington (DC). https:\/\/www.ncbi.nlm.nih.gov\/books\/NBK547546\/"},{"issue":"6377","key":"908_CR39","doi-asserted-by":"publisher","first-page":"725","DOI":"10.1126\/science.359.6377.725","volume":"359","author":"M Hutson","year":"2018","unstructured":"Hutson M (2018) Artificial intelligence faces reproducibility crisis. Science (New York, N.Y.) 359(6377):725\u2013726. https:\/\/doi.org\/10.1126\/science.359.6377.725","journal-title":"Science (New York, N.Y.)"},{"issue":"1","key":"908_CR40","doi-asserted-by":"publisher","first-page":"9","DOI":"10.1186\/s13321-020-0408-x","volume":"12","author":"N Schaduangrat","year":"2020","unstructured":"Schaduangrat N, Lampa S, Simeon S, Gleeson MP, Spjuth O, Nantasenamat C (2020) Towards reproducible computational drug discovery. J Cheminform 12(1):9. https:\/\/doi.org\/10.1186\/s13321-020-0408-x","journal-title":"J Cheminform"},{"issue":"1","key":"908_CR41","doi-asserted-by":"publisher","first-page":"62","DOI":"10.1186\/s13321-019-0385-0","volume":"11","author":"RD Clark","year":"2019","unstructured":"Clark RD (2019) A path to next-generation reproducibility in cheminformatics. J Cheminform. 11(1):62. https:\/\/doi.org\/10.1186\/s13321-019-0385-0","journal-title":"J Cheminform."},{"issue":"1","key":"908_CR42","doi-asserted-by":"publisher","first-page":"62","DOI":"10.1186\/s13321-023-00730-y","volume":"15","author":"CT Hoyt","year":"2023","unstructured":"Hoyt CT, Zdrazil B, Guha R, Jeliazkova N, Martinez-Mayorga K, Nittinger E (2023) Improving reproducibility and reusability in the Journal of Cheminformatics. J Cheminform 15(1):62. https:\/\/doi.org\/10.1186\/s13321-023-00730-y","journal-title":"J Cheminform"},{"issue":"3","key":"908_CR43","doi-asserted-by":"publisher","first-page":"673","DOI":"10.1021\/acs.jcim.7b00523","volume":"58","author":"M Patel","year":"2018","unstructured":"Patel M, Chilton ML, Sartini A, Gibson L, Barber C, Covey-Crump L, Przybylak KR, Cronin MTD, Madden JC (2018) Assessment and Reproducibility of Quantitative Structure-Activity Relationship Models by the Nonexpert. Journal of Chemical Information and Modeling 58(3):673\u2013682. https:\/\/doi.org\/10.1021\/acs.jcim.7b00523","journal-title":"Journal of Chemical Information and Modeling"},{"key":"908_CR44","doi-asserted-by":"publisher","unstructured":"Berthold MR, Cebron N, Dill F, Gabriel TR, K\u00f6tter T, Meinl T, Ohl P, Sieb C, Thiel K, Wiswedel B (2008) KNIME: The Konstanz Information Miner. In: Preisach, C., Burkhardt, H., Schmidt-Thieme, L., Decker, R. (eds.) Data Analysis, Machine Learning and Applications. Studies in Classification, Data Analysis, and Knowledge Organization, pp. 319\u2013326. Springer, Berlin, Heidelberg. https:\/\/doi.org\/10.1007\/978-3-540-78246-9_38","DOI":"10.1007\/978-3-540-78246-9_38"},{"key":"908_CR45","unstructured":"KNIME: Create a New Python based KNIME Extension (2024). https:\/\/docs.knime.com\/latest\/pure_python_node_extensions_guide\/index.html#introduction Accessed 2024-02-26"},{"key":"908_CR46","volume-title":"Deep Learning for the Life Sciences: Applying Deep Learning to Genomics, Microscopy, Drug Discovery, and More, 1st","author":"B Ramsundar","year":"2019","unstructured":"Ramsundar B, Eastman P, Walters P, Pande V (2019) Deep Learning for the Life Sciences: Applying Deep Learning to Genomics, Microscopy, Drug Discovery, and More, 1st, edition. O\u2019Reilly Media, Sebastopol, CA","edition":"edition"},{"issue":"4","key":"908_CR47","doi-asserted-by":"publisher","first-page":"1955","DOI":"10.1021\/acs.jcim.9b01053","volume":"60","author":"AJ Minnich","year":"2020","unstructured":"Minnich AJ, McLoughlin K, Tse M, Deng J, Weber A, Murad N, Madej BD, Ramsundar B, Rush T, Calad-Thomson S, Brase J, Allen JE (2020) AMPL: A Data-Driven Modeling Pipeline for Drug Discovery. J Chem Inform Model 60(4):1955\u20131968. https:\/\/doi.org\/10.1021\/acs.jcim.9b01053","journal-title":"J Chem Inform Model"},{"key":"908_CR48","unstructured":"Gao Z, Ji X, Zhao G, Wang H, Zheng H, Ke G, Zhang L (2023) Uni-QSAR: an Auto-ML Tool for Molecular Property Prediction. https:\/\/arxiv.org\/abs\/2304.12239"},{"key":"908_CR49","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/s41467-024-51321-w","volume":"15","author":"L Shuqi","year":"2024","unstructured":"Shuqi L, Gao Z, He D, Zhang L, Ke G (2024) Data-driven quantum chemical property prediction leveraging 3d conformations with uni-mol+. Nat Commun 15:1. https:\/\/doi.org\/10.1038\/s41467-024-51321-w","journal-title":"Nat Commun"},{"key":"908_CR50","doi-asserted-by":"publisher","DOI":"10.1101\/2022.12.13.520154","author":"G Turon","year":"2022","unstructured":"Turon G, Hlozek J, Woodland JG, Chibale K, Duran-Frigola M (2022) First fully-automated AI\/ML virtual screening cascade implemented at a drug discovery centre in Africa. BioRxiv. https:\/\/doi.org\/10.1101\/2022.12.13.520154","journal-title":"BioRxiv"},{"key":"908_CR51","doi-asserted-by":"publisher","DOI":"10.1021\/acs.jcim.3c00523","author":"J Lanini","year":"2023","unstructured":"Lanini J, Santarossa G, Sirockin F, Lewis R, Fechner N, Misztela H, Lewis S, Maziarz K, Stanley M, Segler M, Stiefl N, Schneider N (2023) PREFER: a new predictive modeling framework for molecular discovery. J Chem Inform Model. https:\/\/doi.org\/10.1021\/acs.jcim.3c00523","journal-title":"J Chem Inform Model"},{"key":"908_CR52","unstructured":"Feurer M, Klein A, Eggensperger K, Springenberg J, Blum M, Hutter F (2015) Efficient and Robust Automated Machine Learning. In: Advances in Neural Information Processing Systems, vol. 28. Curran Associates, Inc., Red Hook, New York, USA"},{"issue":"14","key":"908_CR53","doi-asserted-by":"publisher","first-page":"5365","DOI":"10.1021\/acs.jcim.4c00457","volume":"64","author":"L Mervin","year":"2024","unstructured":"Mervin L, Voronov A, Kabeshov M, Engkvist O (2024) QSARtuna: An Automated QSAR Modeling Platform for Molecular Property Prediction in Drug Design. J Chem Inform Model 64(14):5365\u20135374. https:\/\/doi.org\/10.1021\/acs.jcim.4c00457","journal-title":"J Chem Inform Model"},{"key":"908_CR54","doi-asserted-by":"publisher","unstructured":"Bjerrum EJ, Bachorz RA, Bitton A, Choung O, Chen Y, Esposito C, et al (2023) Scikit-Mol brings cheminformatics to Scikit-Learn. ChemRxiv. https:\/\/doi.org\/10.26434\/chemrxiv-2023-fzqwd","DOI":"10.26434\/chemrxiv-2023-fzqwd"},{"issue":"1","key":"908_CR55","doi-asserted-by":"publisher","first-page":"45","DOI":"10.1186\/s13321-015-0086-2","volume":"7","author":"DS Murrell","year":"2015","unstructured":"Murrell DS, Cortes-Ciriano I, Westen GJP, Stott IP, Bender A, Malliavin TE, Glen RC (2015) Chemically Aware Model Builder (camb): an R package for property and bioactivity modelling of small molecules. J Cheminform 7(1):45. https:\/\/doi.org\/10.1186\/s13321-015-0086-2","journal-title":"J Cheminform"},{"key":"908_CR56","unstructured":"Murrell DS, Cortes-Ciriano I, Westen GJP, Stott IP, Bender A, Malliavin TE, Glen RC (2021) cambDI\/camb. cambDI. https:\/\/github.com\/cambDI\/camb Accessed 2024-02-26"},{"issue":"1","key":"908_CR57","doi-asserted-by":"publisher","first-page":"3","DOI":"10.1186\/s13321-022-00672-x","volume":"15","author":"OJ Bequignon","year":"2023","unstructured":"Bequignon OJ, Bongers BJ (2023) Papyrus: a large-scale curated dataset aimed at bioactivity predictions. J Cheminform 15(1):3. https:\/\/doi.org\/10.1186\/s13321-022-00672-x","journal-title":"J Cheminform"},{"key":"908_CR58","doi-asserted-by":"publisher","unstructured":"McKinney W (2010) Data Structures for Statistical Computing in Python. Proceedings of the 9th Python in Science Conference, 56\u201361 https:\/\/doi.org\/10.25080\/Majora-92bf1922-00a","DOI":"10.25080\/Majora-92bf1922-00a"},{"key":"908_CR59","unstructured":"\u0160\u00ed\u00adcho M (2023) martin-sicho\/papyrus-scaffold-visualizer. https:\/\/github.com\/martin-sicho\/papyrus-scaffold-visualizer Accessed 2023-12-12"},{"key":"908_CR60","unstructured":"RDKit: Open-source cheminformatics. (2024). https:\/\/www.rdkit.org"},{"issue":"1","key":"908_CR61","doi-asserted-by":"publisher","first-page":"4","DOI":"10.1186\/s13321-018-0258-y","volume":"10","author":"H Moriwaki","year":"2018","unstructured":"Moriwaki H, Tian Y-S, Kawashita N, Takagi T (2018) Mordred: a molecular descriptor calculator. J Cheminform 10(1):4. https:\/\/doi.org\/10.1186\/s13321-018-0258-y","journal-title":"J Cheminform"},{"issue":"7","key":"908_CR62","doi-asserted-by":"publisher","first-page":"1337","DOI":"10.1021\/ci800038f","volume":"48","author":"H Hong","year":"2008","unstructured":"Hong H, Xie Q, Ge W, Qian F, Fang H, Shi L, Su Z, Perkins R, Tong W (2008) Mold2, Molecular Descriptors from 2D Structures for Chemoinformatics and Toxicoinformatics. J Chem Inform Model 48(7):1337\u20131344. https:\/\/doi.org\/10.1021\/ci800038f","journal-title":"J Chem Inform Model"},{"issue":"7","key":"908_CR63","doi-asserted-by":"publisher","first-page":"1466","DOI":"10.1002\/jcc.21707","volume":"32","author":"CW Yap","year":"2011","unstructured":"Yap CW (2011) PaDEL-descriptor: An open source software to calculate molecular descriptors and fingerprints. Journal of Computational Chemistry 32(7):1466\u20131474. https:\/\/doi.org\/10.1002\/jcc.21707","journal-title":"Journal of Computational Chemistry"},{"issue":"6","key":"908_CR64","doi-asserted-by":"publisher","first-page":"1273","DOI":"10.1021\/ci010132r","volume":"42","author":"JL Durant","year":"2002","unstructured":"Durant JL, Leland BA, Henry DR, Nourse JG (2002) Reoptimization of MDL keys for use in drug discovery. J Chem Inform Comput Sci 42(6):1273\u20131280. https:\/\/doi.org\/10.1021\/ci010132r","journal-title":"J Chem Inform Comput Sci"},{"issue":"7","key":"908_CR65","doi-asserted-by":"publisher","first-page":"1126","DOI":"10.1021\/jm00390a003","volume":"30","author":"S Hellberg","year":"1987","unstructured":"Hellberg S, Sjoestroem M, Skagerberg B, Wold S (1987) Peptide quantitative structure-activity relationships, a multivariate approach. J Med Chem 30(7):1126\u20131135. https:\/\/doi.org\/10.1021\/jm00390a003","journal-title":"J Med Chem"},{"issue":"5","key":"908_CR66","doi-asserted-by":"publisher","first-page":"703","DOI":"10.1089\/cmb.2008.0173","volume":"16","author":"AG Georgiev","year":"2009","unstructured":"Georgiev AG (2009) Interpretable numerical descriptors of amino acid space. J Comput Biol 16(5):703\u2013723. https:\/\/doi.org\/10.1089\/cmb.2008.0173","journal-title":"J Comput Biol"},{"issue":"6","key":"908_CR67","doi-asserted-by":"publisher","first-page":"775","DOI":"10.1002\/bip.20296","volume":"80","author":"H Mei","year":"2005","unstructured":"Mei H, Liao ZH, Zhou Y, Li SZ (2005) A new set of amino acid descriptors and its application in peptide QSARs. Peptide Sci 80(6):775\u2013786. https:\/\/doi.org\/10.1002\/bip.20296","journal-title":"Peptide Sci"},{"key":"908_CR68","unstructured":"B\u00e9quignon Olivier JM (2023) ProDEC. https:\/\/github.com\/OlivierBeq\/ProDEC\/tree\/master Accessed 2023-12-12"},{"issue":"1","key":"908_CR69","doi-asserted-by":"publisher","first-page":"539","DOI":"10.1038\/msb.2011.75","volume":"7","author":"F Sievers","year":"2011","unstructured":"Sievers F, Wilm A, Dineen D, Gibson TJ, Karplus K, Li W, Lopez R, McWilliam H, Remmert M, S\u00f6ding J, Thompson JD, Higgins DG (2011) Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. Mol Syst Biol 7(1):539. https:\/\/doi.org\/10.1038\/msb.2011.75","journal-title":"Mol Syst Biol"},{"issue":"4","key":"908_CR70","doi-asserted-by":"publisher","first-page":"772","DOI":"10.1093\/molbev\/mst010","volume":"30","author":"K Katoh","year":"2013","unstructured":"Katoh K, Standley DM (2013) MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol 30(4):772\u2013780. https:\/\/doi.org\/10.1093\/molbev\/mst010","journal-title":"Mol Biol Evol"},{"key":"908_CR71","unstructured":"Homola D (2023) boruta_py. scikit-learn-contrib. https:\/\/github.com\/scikit-learn-contrib\/boruta_py Accessed 2023-11-28"},{"issue":"4","key":"908_CR72","doi-asserted-by":"publisher","first-page":"772","DOI":"10.1093\/molbev\/mst010","volume":"30","author":"MB Kursa","year":"2010","unstructured":"Kursa MB, Rudnicki WR (2010) Feature Selection with the Boruta Package. J Stat Softw 36:1\u201313. https:\/\/doi.org\/10.18637\/jss.v036.i11","journal-title":"J Stat Softw"},{"issue":"4","key":"908_CR73","doi-asserted-by":"publisher","first-page":"1111","DOI":"10.3390\/molecules26041111","volume":"26","author":"A R\u00e1cz","year":"2021","unstructured":"R\u00e1cz A, Bajusz D, H\u00e9berger K (2021) Effect of Dataset Size and Train\/Test Split Ratios in QSAR\/QSPR Multiclass Classification. Molecules 26(4):1111. https:\/\/doi.org\/10.3390\/molecules26041111","journal-title":"Molecules"},{"issue":"85","key":"908_CR74","first-page":"2825","volume":"12","author":"F Pedregosa","year":"2011","unstructured":"Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V, Vanderplas J, Passos A, Cournapeau D, Brucher M, Perrot M, Duchesnay E (2011) Scikit-learn: machine learning in Python. J Mach Learn Res 12(85):2825\u20132830","journal-title":"J Mach Learn Res"},{"key":"908_CR75","doi-asserted-by":"publisher","unstructured":"Tricarico GA, Hofmans J, Lenselink EB, L\u00f3pez-Ramos M, Dr\u00e9anic M-P, Stouten PFW (2022) Construction of balanced, chemically dissimilar training, validation and test sets for machine learning on molecular datasets https:\/\/doi.org\/10.26434\/chemrxiv-2022-m8l33-v2","DOI":"10.26434\/chemrxiv-2022-m8l33-v2"},{"key":"908_CR76","unstructured":"Paszke A, Gross S, Massa F, Lerer A, Bradbury J, Chanan G, Killeen T, Lin Z, Gimelshein N, Antiga L, Desmaison A, K\u00f6pf A, Yang E, DeVito Z, Raison M, Tejani A, Chilamkurthy S, Steiner B, Fang L, Bai J, Chintala S (2019) PyTorch: An Imperative Style, High-Performance Deep Learning Library. arXiv. http:\/\/arxiv.org\/abs\/1912.01703"},{"key":"908_CR77","doi-asserted-by":"publisher","unstructured":"Iosipoi L, Vakhrushev A (2022) SketchBoost: Fast Gradient Boosted Decision Tree for Multioutput Problems. arXiv. https:\/\/doi.org\/10.48550\/arXiv.2211.12858","DOI":"10.48550\/arXiv.2211.12858"},{"key":"908_CR78","doi-asserted-by":"publisher","unstructured":"Heid E, Greenman KP, Chung Y, Li S-C, Graff DE, Vermeire FH, Wu H, Green WH, McGill CJ (2023) Chemprop: A Machine Learning Package for Chemical Property Prediction. ChemRxiv. https:\/\/doi.org\/10.26434\/chemrxiv-2023-3zcfl-v3","DOI":"10.26434\/chemrxiv-2023-3zcfl-v3"},{"issue":"1","key":"908_CR79","doi-asserted-by":"publisher","first-page":"43","DOI":"10.1186\/s13321-024-00837-w","volume":"16","author":"SJJ Guesn\u00e9","year":"2024","unstructured":"Guesn\u00e9 SJJ, Hanser T, Werner S, Boobier S, Scott S (2024) Mind your prevalence! J Cheminform 16(1):43. https:\/\/doi.org\/10.1186\/s13321-024-00837-w","journal-title":"J Cheminform"},{"key":"908_CR80","unstructured":"QSPRpred: Documentation (2024). https:\/\/cddleiden.github.io\/QSPRpred\/docs\/index.html Accessed 2023-12-18"},{"key":"908_CR81","doi-asserted-by":"publisher","unstructured":"Akiba T, Sano S, Yanase T, Ohta T, Koyama M (2019) Optuna: A Next-generation Hyperparameter Optimization Framework. In: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp. 2623\u20132631. ACM, Anchorage AK USA. https:\/\/doi.org\/10.1145\/3292500.3330701","DOI":"10.1145\/3292500.3330701"},{"key":"908_CR82","unstructured":"B\u00e9quignon OJM (2023) MLChemAD. https:\/\/github.com\/OlivierBeq\/MLChemAD Accessed 2024-02-25"},{"key":"908_CR83","unstructured":"McCorkindale W, Elijo\u00e9ius R (2022) molplotly. https:\/\/github.com\/wjm41\/molplotly"},{"issue":"1","key":"908_CR84","doi-asserted-by":"publisher","DOI":"10.1038\/sdata.2016.18","volume":"3","author":"MD Wilkinson","year":"2016","unstructured":"Wilkinson MD, Dumontier M, Aalbersberg IJ, Appleton G, Axton M, Baak A, Blomberg N, Boiten J-W, Silva Santos LB, Bourne PE, Bouwman J, Brookes AJ, Clark T, Crosas M, Dillo I, Dumon O, Edmunds S, Evelo CT, Finkers R, Gonzalez-Beltran A, Gray AJG, Groth P, Goble C, Grethe JS, Heringa J, Hoen PAC, Hooft R, Kuhn T, Kok R, Kok J, Lusher SJ, Martone ME, Mons A, Packer AL, Persson B, Rocca-Serra P, Roos M, Schaik R, Sansone S-A, Schultes E, Sengstag T, Slater T, Strawn G, Swertz MA, Thompson M, Lei J, Mulligen E, Velterop J, Waagmeester A, Wittenburg P, Wolstencroft K, Zhao J, Mons B (2016) The FAIR Guiding Principles for scientific data management and stewardship. Sci Data 3(1):160018. https:\/\/doi.org\/10.1038\/sdata.2016.18","journal-title":"Sci Data"},{"key":"908_CR85","unstructured":"B\u00e9quignon OJM (2023) ml2json. https:\/\/github.com\/OlivierBeq\/ml2json Accessed 2023-12-12"},{"key":"908_CR86","unstructured":"Aguilar D (2023) jsonpickle. https:\/\/github.com\/jsonpickle\/jsonpickle Accessed 2023-12-12"},{"key":"908_CR87","unstructured":"QSPRpred: Style guide (2024). https:\/\/github.com\/CDDLeiden\/QSPRpred\/blob\/main\/docs\/style_guide.py Accessed 2023-12-18"},{"key":"908_CR88","unstructured":"Biewald L (2020) Experiment Tracking with Weights and Biases. https:\/\/www.wandb.com\/"},{"key":"908_CR89","doi-asserted-by":"publisher","DOI":"10.1016\/j.jbi.2020.103484","volume":"108","author":"Z Zhao","year":"2020","unstructured":"Zhao Z, Qin J, Gou Z, Zhang Y, Yang Y (2020) Multi-task learning models for predicting active compounds. J Biomed Inform 108:103484. https:\/\/doi.org\/10.1016\/j.jbi.2020.103484","journal-title":"J Biomed Inform"},{"issue":"4","key":"908_CR90","doi-asserted-by":"publisher","first-page":"1800108","DOI":"10.1002\/minf.201800108","volume":"38","author":"S Sosnin","year":"2019","unstructured":"Sosnin S, Vashurina M, Withnall M, Karpov P, Fedorov M, Tetko IV (2019) A survey of multi-task learning methods in chemoinformatics. Mol Inform 38(4):1800108. https:\/\/doi.org\/10.1002\/minf.201800108","journal-title":"Mol Inform"},{"key":"908_CR91","doi-asserted-by":"publisher","unstructured":"Chen T, Guestrin C (2016) XGBoost: A Scalable Tree Boosting System. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery And Data Mining, pp. 785\u2013794. ACM, San Francisco California USA. https:\/\/doi.org\/10.1145\/2939672.2939785","DOI":"10.1145\/2939672.2939785"},{"key":"908_CR92","doi-asserted-by":"publisher","unstructured":"Vandenhende S, Georgoulis S, Van\u00a0Gansbeke W, Proesmans M, Dai D, Van\u00a0Gool L (2022) Multi-Task Learning for Dense Prediction Tasks: A Survey. IEEE Transactions on Pattern Analysis and Machine Intelligence 44(7), 3614\u20133633. https:\/\/doi.org\/10.1109\/TPAMI.2021.3054719 . Conference Name: IEEE Transactions on Pattern Analysis and Machine Intelligence","DOI":"10.1109\/TPAMI.2021.3054719"},{"issue":"1","key":"908_CR93","doi-asserted-by":"publisher","first-page":"26","DOI":"10.1186\/s13321-018-0281-z","volume":"10","author":"A Le\u00f3n","year":"2018","unstructured":"Le\u00f3n A, Chen B, Gillet VJ (2018) Effect of missing data on multitask prediction methods. Journal of Cheminformatics 10(1):26. https:\/\/doi.org\/10.1186\/s13321-018-0281-z","journal-title":"Journal of Cheminformatics"},{"key":"908_CR94","doi-asserted-by":"publisher","DOI":"10.1021\/acs.molpharmaceut.3c00812","author":"F Lombardo","year":"2024","unstructured":"Lombardo F, Bentzien J, Berellini G, Muegge I (2024) Prediction of Human Clearance Using In Silico Models with Reduced Bias. Mol Pharm. https:\/\/doi.org\/10.1021\/acs.molpharmaceut.3c00812","journal-title":"Mol Pharm"},{"key":"908_CR95","doi-asserted-by":"publisher","unstructured":"\u0160\u00ed\u00adcho M, Luukkonen S, Maagdenberg HW, Schoenmaker L, B\u00e9quignon OJM, Westen GJP (2023) DrugEx: Deep Learning Models and Tools for Exploration of Drug-Like Chemical Space. Journal of Chemical Information and Modeling 63(12):3629\u20133636. https:\/\/doi.org\/10.1021\/acs.jcim.3c00434","DOI":"10.1021\/acs.jcim.3c00434"},{"key":"908_CR96","unstructured":"Noutahi, E., Wognum, C., Mary, H., Hounwanou, H., Kovary, K.M., Gilmour, D., thibaultvarin-r, Burns, J., St-Laurent, J., t, DomInvivo, Maheshkar, S., rbyrne-momatx: datamol-io\/molfeat: 0.9.4. Zenodo (2023). https:\/\/zenodo.org\/records\/8373019"},{"key":"908_CR97","unstructured":"Tietz M, Fan TJ, Nouri D, Bossan B (2017) Developers: skorch: A scikit-learn compatible neural network library that wraps PyTorch. https:\/\/skorch.readthedocs.io\/en\/stable\/"},{"key":"908_CR98","doi-asserted-by":"publisher","unstructured":"Norinder U, Carlsson L, Boyer S, Eklund M (2014) Introducing conformal prediction in predictive modeling. a transparent and flexible alternative to applicability domain determination. Journal of chemical information and modeling. 54. https:\/\/doi.org\/10.1021\/ci5001168","DOI":"10.1021\/ci5001168"},{"issue":"1","key":"908_CR99","doi-asserted-by":"publisher","first-page":"73","DOI":"10.1186\/s13321-021-00550-y","volume":"13","author":"M Sicho","year":"2021","unstructured":"Sicho M, Liu X, Svozil D, Westen GJP (2021) GenUI: interactive and extensible open source software platform for de novo molecular generation and cheminformatics. J Cheminform 13(1):73. https:\/\/doi.org\/10.1186\/s13321-021-00550-y","journal-title":"J Cheminform"}],"container-title":["Journal of Cheminformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-024-00908-y.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s13321-024-00908-y\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-024-00908-y.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,11,14]],"date-time":"2024-11-14T12:04:43Z","timestamp":1731585883000},"score":1,"resource":{"primary":{"URL":"https:\/\/jcheminf.biomedcentral.com\/articles\/10.1186\/s13321-024-00908-y"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,11,14]]},"references-count":99,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2024,12]]}},"alternative-id":["908"],"URL":"https:\/\/doi.org\/10.1186\/s13321-024-00908-y","relation":{"has-preprint":[{"id-type":"doi","id":"10.26434\/chemrxiv-2024-m9989","asserted-by":"object"}]},"ISSN":["1758-2946"],"issn-type":[{"value":"1758-2946","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,11,14]]},"assertion":[{"value":"28 February 2024","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"17 September 2024","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"14 November 2024","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interest"}}],"article-number":"128"}}