{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,23]],"date-time":"2026-03-23T15:08:21Z","timestamp":1774278501833,"version":"3.50.1"},"reference-count":32,"publisher":"Springer Science and Business Media LLC","issue":"3","license":[{"start":{"date-parts":[[2021,2,17]],"date-time":"2021-02-17T00:00:00Z","timestamp":1613520000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2021,2,17]],"date-time":"2021-02-17T00:00:00Z","timestamp":1613520000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Comput Aided Mol Des"],"published-print":{"date-parts":[[2021,3]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Machine learning (ML) enables modeling of quantitative structure\u2013activity relationships (QSAR) and compound potency predictions. Recently, multi-target QSAR models have been gaining increasing attention. Simultaneous compound potency predictions for multiple targets can be carried out using ensembles of independently derived target-based QSAR models or in a more integrated and advanced manner using multi-target deep neural networks (MT-DNNs). Herein, single-target and multi-target ML models were systematically compared on a large scale in compound potency value predictions for 270 human targets. By design, this large-magnitude evaluation has been a special feature of our study. To these ends, MT-DNN, single-target DNN\u00a0(ST-DNN), support vector regression (SVR), and random forest regression (RFR) models were implemented. Different test systems were defined to benchmark these ML methods under conditions of varying complexity. Source compounds were divided into training and test sets in a compound- or analog series-based manner taking target information into account. Data partitioning approaches used for model training and evaluation were shown to influence the relative performance of ML methods, especially for the most challenging compound data sets. For example, the performance of MT-DNNs with per-target models yielded superior performance compared to single-target models. For a test compound or its analogs, the availability of potency measurements for multiple targets affected model performance, revealing the influence of ML synergies.<\/jats:p>","DOI":"10.1007\/s10822-021-00376-8","type":"journal-article","created":{"date-parts":[[2021,2,19]],"date-time":"2021-02-19T08:13:21Z","timestamp":1613722401000},"page":"285-295","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":15,"title":["Evaluation of multi-target deep neural network models for compound potency prediction under increasingly challenging test conditions"],"prefix":"10.1007","volume":"35","author":[{"given":"Raquel","family":"Rodr\u00edguez-P\u00e9rez","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0557-5714","authenticated-orcid":false,"given":"J\u00fcrgen","family":"Bajorath","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2021,2,17]]},"reference":[{"key":"376_CR1","doi-asserted-by":"publisher","first-page":"1538","DOI":"10.1016\/j.drudis.2018.05.010","volume":"23","author":"Y Lo","year":"2018","unstructured":"Lo Y, Rensi SE, Torng W, Altman RB (2018) Machine learning in chemoinformatics and drug discovery. Drug Discov Today 23:1538\u20131546","journal-title":"Drug Discov Today"},{"key":"376_CR2","doi-asserted-by":"publisher","first-page":"1413","DOI":"10.1021\/ci200409x","volume":"52","author":"A Varnek","year":"2012","unstructured":"Varnek A, Baskin I (2012) Machine learning methods for property prediction in cheminformatics: quo vadis? J Chem Inf Model 52:1413\u20131437","journal-title":"J Chem Inf Model"},{"key":"376_CR3","doi-asserted-by":"publisher","first-page":"4977","DOI":"10.1021\/jm4004285","volume":"57","author":"A Cherkasov","year":"2014","unstructured":"Cherkasov A, Muratov E, Fourches D, Varnek A, Baskin II, Cronin M, Dearden J, Gramatica P, Martin YC, Todeschini R, Consonni V, Kuzmin VE, Cramer R, Benigni R, Yang C, Rathman J, Terfloth L, Gasteiger J, Richard A, Tropsha A (2014) QSAR modeling: where have you been? Where are you going to? J Med Chem 57:4977\u20135010","journal-title":"J Med Chem"},{"key":"376_CR4","doi-asserted-by":"publisher","first-page":"1947","DOI":"10.1021\/ci034160g","volume":"43","author":"V Svetnik","year":"2003","unstructured":"Svetnik V, Liaw A, Tong C, Culberson JC, Sheridan RP, Feuston BP (2003) Random forest: a classification and regression tool for compound classification and QSAR modeling. J Chem Inf Comput Sci 43:1947\u20131958","journal-title":"J Chem Inf Comput Sci"},{"key":"376_CR5","doi-asserted-by":"publisher","first-page":"6371","DOI":"10.1021\/acsomega.7b01079","volume":"2","author":"R Rodr\u00edguez-P\u00e9rez","year":"2017","unstructured":"Rodr\u00edguez-P\u00e9rez R, Vogt M, Bajorath J (2017) Support vector machine classification and regression prioritize different structural features for binary compound activity and potency value prediction. ACS Omega 2:6371\u20136379","journal-title":"ACS Omega"},{"key":"376_CR6","doi-asserted-by":"publisher","first-page":"1241","DOI":"10.1016\/j.drudis.2018.01.039","volume":"23","author":"H Chen","year":"2018","unstructured":"Chen H, Engkvist O, Wang Y, Olivecrona M, Blaschke T (2018) The Rise of deep learning in drug discovery. Drug Discov Today 23:1241\u20131250","journal-title":"Drug Discov Today"},{"key":"376_CR7","doi-asserted-by":"publisher","first-page":"263","DOI":"10.1021\/ci500747n","volume":"55","author":"J Ma","year":"2015","unstructured":"Ma J, Sheridan RP, Liaw A, Dahl GE, Svetnik V (2015) Deep neural nets as a method for quantitative structure-activity relationships. J Chem Inf Model 55:263\u2013274","journal-title":"J Chem Inf Model"},{"key":"376_CR8","doi-asserted-by":"publisher","first-page":"4713","DOI":"10.1021\/acsomega.8b00462","volume":"3","author":"R Rodr\u00edguez-P\u00e9rez","year":"2018","unstructured":"Rodr\u00edguez-P\u00e9rez R, Miyao T, Jasial S, Vogt M, Bajorath J (2018) Prediction of compound profiling matrices using machine learning. ACS Omega 3:4713\u20134723","journal-title":"ACS Omega"},{"key":"376_CR9","doi-asserted-by":"publisher","first-page":"45","DOI":"10.1186\/s13321-017-0232-0","volume":"9","author":"EB Lenselink","year":"2017","unstructured":"Lenselink EB, Dijke N, Bongers B, Papadatos G, van Vlijmen HWT, Kowalczyk W, Ijzerman AP, van Westen GJP (2017) Beyond the hype: deep neural networks outperform established methods using a ChEMBL bioactivity benchmark set. J Cheminf 9:45","journal-title":"J Cheminf"},{"key":"376_CR10","doi-asserted-by":"publisher","first-page":"42","DOI":"10.1186\/s13321-017-0226-y","volume":"9","author":"A Koutsoukas","year":"2017","unstructured":"Koutsoukas A, Monaghan KJ, Li X, Huan J (2017) Deep-learning: investigating deep neural networks hyper-parameters and comparison of performance to shallow methods for modeling bioactivity data. J Cheminf 9:42","journal-title":"J Cheminf"},{"key":"376_CR11","doi-asserted-by":"publisher","first-page":"5441","DOI":"10.1039\/C8SC00148K","volume":"9","author":"A Mayr","year":"2018","unstructured":"Mayr A, Klambauer G, Unterthiner T, Steijaert M, Wegner JK, Ceulemans H, Clevert D, Hochreiter S (2018) Large-scale comparison of machine learning methods for drug target prediction on ChEMBL. Chem Sci 9:5441\u20135451","journal-title":"Chem Sci"},{"key":"376_CR12","doi-asserted-by":"publisher","first-page":"1900154","DOI":"10.1002\/minf.201900154","volume":"39","author":"Y Kato","year":"2019","unstructured":"Kato Y, Hamada S, Goto H (2019) Validation study of QSAR\/DNN models using the competition datasets. Mol Inform 39:1900154","journal-title":"Mol Inform"},{"key":"376_CR13","doi-asserted-by":"publisher","first-page":"2017","DOI":"10.1016\/j.drudis.2019.07.006","volume":"24","author":"A Lavecchia","year":"2019","unstructured":"Lavecchia A (2019) Deep learning in drug discovery: opportunities, challenges and future prospects. Drug Discov Today 24:2017\u20132032","journal-title":"Drug Discov Today"},{"key":"376_CR14","doi-asserted-by":"publisher","first-page":"785","DOI":"10.1080\/17460441.2016.1201262","volume":"11","author":"II Baskin","year":"2016","unstructured":"Baskin II, Winkler D, Tetko IV (2016) A renaissance of neural networks in drug discovery. Expert Opin Drug Discov 11:785\u2013795","journal-title":"Expert Opin Drug Discov"},{"key":"376_CR15","doi-asserted-by":"publisher","first-page":"95","DOI":"10.1007\/978-1-4615-5529-2_5","volume-title":"Learning to learn","author":"R Caruana","year":"1998","unstructured":"Caruana R (1998) Multitask learning. In: Thrun S, Pratt L (eds) Learning to learn. Springer, New York, pp 95\u2013133"},{"key":"376_CR16","doi-asserted-by":"publisher","first-page":"4367","DOI":"10.1021\/acsomega.9b00298","volume":"4","author":"R Rodr\u00edguez-P\u00e9rez","year":"2019","unstructured":"Rodr\u00edguez-P\u00e9rez R, Bajorath J (2019) Multitask machine learning for classifying highly and weakly potent kinase inhibitors. ACS Omega 4:4367\u20134375","journal-title":"ACS Omega"},{"key":"376_CR17","doi-asserted-by":"publisher","first-page":"2490","DOI":"10.1021\/acs.jcim.7b00087","volume":"57","author":"Y Xu","year":"2017","unstructured":"Xu Y, Ma J, Liaw A, Sheridan RP, Svetnik V (2017) Demystifying multitask deep neural networks for quantitative structure-activity relationships. J Chem Inf Model 57:2490\u20132504","journal-title":"J Chem Inf Model"},{"key":"376_CR18","doi-asserted-by":"publisher","first-page":"12033","DOI":"10.1021\/acsomega.8b01682","volume":"3","author":"R Rodr\u00edguez-P\u00e9rez","year":"2018","unstructured":"Rodr\u00edguez-P\u00e9rez R, Bajorath J (2018) Prediction of compound profiling matrices, part II: relative performance of multitask deep learning and random forest classification on the basis of varying amounts of training data. ACS Omega 3:12033\u201312040","journal-title":"ACS Omega"},{"key":"376_CR19","doi-asserted-by":"publisher","first-page":"26","DOI":"10.1186\/s13321-018-0281-z","volume":"10","author":"A Vega de Le\u00f3n","year":"2018","unstructured":"Vega de Le\u00f3n A, Chen B, Gillet VJ (2018) Effect of missing data on multitask prediction methods. J Cheminf 10:26","journal-title":"J Cheminf"},{"key":"376_CR20","doi-asserted-by":"publisher","first-page":"D1100","DOI":"10.1093\/nar\/gkr777","volume":"40","author":"A Gaulton","year":"2012","unstructured":"Gaulton A, Bellis LJ, Bento AP, Chambers J, Davies M, Hersey A, Light Y, McGlinchey S, Michalovich D, Al-Lazikani B, Overington JP (2012) ChEMBL: a large-scale bioactivity database for drug discovery. Nucleic Acids Res 40:D1100\u2013D1107","journal-title":"Nucleic Acids Res"},{"key":"376_CR21","doi-asserted-by":"publisher","first-page":"7667","DOI":"10.1021\/acs.jmedchem.6b00906","volume":"59","author":"D Stumpfe","year":"2016","unstructured":"Stumpfe D, Dimova D, Bajorath J (2016) Computational method for the systematic identification of analog series and key compounds representing series and their biological activity profiles. J Med Chem 59:7667\u20137676","journal-title":"J Med Chem"},{"key":"376_CR22","doi-asserted-by":"publisher","first-page":"339","DOI":"10.1021\/ci900450m","volume":"10","author":"J Hussain","year":"2010","unstructured":"Hussain J, Rea C (2010) Computationally efficient algorithm to identify matched molecular pairs (MMPs) in large data sets. J Chem Inf Model 10:339\u2013348","journal-title":"J Chem Inf Model"},{"key":"376_CR23","doi-asserted-by":"publisher","first-page":"742","DOI":"10.1021\/ci100050t","volume":"50","author":"D Rogers","year":"2010","unstructured":"Rogers D, Hahn M (2010) Extended connectivity fingerprints. J Chem Inf Model 50:742\u2013754","journal-title":"J Chem Inf Model"},{"key":"376_CR24","volume-title":"OpenEye Scientific Software","author":"OEChem Toolkit","year":"2019","unstructured":"OEChem Toolkit (2019) OpenEye Scientific Software. Santa Fe, NM"},{"key":"376_CR25","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4757-3264-1","volume-title":"The nature of statistical learning theory","author":"VN Vapnik","year":"2000","unstructured":"Vapnik VN (2000) The nature of statistical learning theory, 2nd edn. Springer, New York","edition":"2"},{"key":"376_CR26","doi-asserted-by":"publisher","first-page":"199","DOI":"10.1023\/B:STCO.0000035301.49549.88","volume":"14","author":"AJ Smola","year":"2004","unstructured":"Smola AJ, Sch\u00f6lkopf B (2004) A tutorial on support vector regression. Stat Comput 14:199\u2013222","journal-title":"Stat Comput"},{"key":"376_CR27","doi-asserted-by":"publisher","first-page":"5","DOI":"10.1023\/A:1010933404324","volume":"45","author":"L Breiman","year":"2001","unstructured":"Breiman L (2001) Random forests. Mach Learn 45:5\u201332","journal-title":"Mach Learn"},{"key":"376_CR28","volume-title":"Deep learning","author":"I Goodfellow","year":"2016","unstructured":"Goodfellow I, Bengio Y, Courville A (2016) Deep learning. MIT Press, Cambridge, MA"},{"key":"376_CR29","unstructured":"Nielsen MA (2015) Neural networks and deep learning. Determination Press"},{"key":"376_CR30","first-page":"2825","volume":"12","author":"F Pedregosa","year":"2011","unstructured":"Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V, Vanderplas J, Passos A, Cournapeau D, Brucher M, Perrot M, Duchesnay E (2011) Scikit-learn: machine learning in Python. J Mach Learn Res 12:2825\u20132830","journal-title":"J Mach Learn Res"},{"key":"376_CR31","volume-title":"TensorFlow: a system for large-scale machine learning","author":"M Abadi","year":"2016","unstructured":"Abadi M, Barham P, Chen J, Chen Z, Davis A, Dean J, Devin M, Ghemawat S, Irving G, Isard M, Kudlur M, Levenberg J, Monga R, Moore S, Murray DG, Steiner B, Tucker P, Vasudevan V, Warden P, Wicke M, Yu Y, Zheng X (2016) TensorFlow: a system for large-scale machine learning. 12th USENIX Symposium on operating systems design and implementation (OSDI 16), Savannah, GA"},{"key":"376_CR32","unstructured":"Chollet FK (2015) https:\/\/github.com\/keras-team\/keras. Accessed 10 Nov, 2020"}],"container-title":["Journal of Computer-Aided Molecular Design"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s10822-021-00376-8.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1007\/s10822-021-00376-8\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s10822-021-00376-8.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,4,6]],"date-time":"2021-04-06T07:06:39Z","timestamp":1617692799000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1007\/s10822-021-00376-8"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,2,17]]},"references-count":32,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2021,3]]}},"alternative-id":["376"],"URL":"https:\/\/doi.org\/10.1007\/s10822-021-00376-8","relation":{},"ISSN":["0920-654X","1573-4951"],"issn-type":[{"value":"0920-654X","type":"print"},{"value":"1573-4951","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,2,17]]},"assertion":[{"value":"10 December 2020","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"3 February 2021","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"17 February 2021","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}]}}