{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,4]],"date-time":"2026-06-04T00:41:08Z","timestamp":1780533668412,"version":"3.54.1"},"reference-count":65,"publisher":"Springer Science and Business Media LLC","issue":"7","license":[{"start":{"date-parts":[[2020,4,16]],"date-time":"2020-04-16T00:00:00Z","timestamp":1586995200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2020,4,16]],"date-time":"2020-04-16T00:00:00Z","timestamp":1586995200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100002347","name":"Bundesministerium f\u00fcr Bildung und Forschung","doi-asserted-by":"publisher","award":["031A262C"],"award-info":[{"award-number":["031A262C"]}],"id":[{"id":"10.13039\/501100002347","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100002347","name":"Bundesministerium f\u00fcr Bildung und Forschung","doi-asserted-by":"publisher","award":["031A262C"],"award-info":[{"award-number":["031A262C"]}],"id":[{"id":"10.13039\/501100002347","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100006188","name":"Einstein Stiftung Berlin","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100006188","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Comput Aided Mol Des"],"published-print":{"date-parts":[[2020,7]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>In drug development, late stage toxicity issues of a compound are the main cause of failure in clinical trials. In silico methods are therefore of high importance to guide the early design process to reduce time, costs and animal testing. Technical advances and the ever growing amount of available toxicity data enabled machine learning, especially neural networks, to impact the field of predictive toxicology. In this study, cytotoxicity prediction, one of the earliest handles in drug discovery, is investigated using a deep learning approach trained on a highly consistent in-house data set of over 34,000 compounds with a share of less than 5% of cytotoxic molecules. The model reached a balanced accuracy of over 70%, similar to previously reported studies using Random Forest. Albeit yielding good results, neural networks are often described as a black box lacking deeper mechanistic understanding of the underlying model. To overcome this absence of interpretability, a Deep Taylor Decomposition method is investigated to identify substructures that may be responsible for the cytotoxic effects, the so-called toxicophores. Furthermore, this study introduces cytotoxicity maps which provide a visual structural interpretation of the relevance of these substructures. Using this approach could be helpful in drug development to predict the potential toxicity of a compound as well as to generate new insights into the toxic mechanism. Moreover, it could also help to de-risk and optimize compounds.<\/jats:p>","DOI":"10.1007\/s10822-020-00310-4","type":"journal-article","created":{"date-parts":[[2020,4,16]],"date-time":"2020-04-16T02:02:41Z","timestamp":1587002561000},"page":"731-746","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":60,"title":["Revealing cytotoxic substructures in molecules using deep learning"],"prefix":"10.1007","volume":"34","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-8833-7617","authenticated-orcid":false,"given":"Henry E.","family":"Webel","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8881-920X","authenticated-orcid":false,"given":"Talia B.","family":"Kimber","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2688-2868","authenticated-orcid":false,"given":"Silke","family":"Radetzki","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3114-7975","authenticated-orcid":false,"given":"Martin","family":"Neuenschwander","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1602-2330","authenticated-orcid":false,"given":"Marc","family":"Nazar\u00e9","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3760-580X","authenticated-orcid":false,"given":"Andrea","family":"Volkamer","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"297","published-online":{"date-parts":[[2020,4,16]]},"reference":[{"key":"310_CR1","unstructured":"CAS. CAS REGISTRY. https:\/\/www.cas.org\/support\/documentation\/chemical-substances"},{"issue":"2","key":"310_CR2","doi-asserted-by":"publisher","first-page":"83","DOI":"10.14573\/altex.1603091","volume":"33","author":"T Hartung","year":"2016","unstructured":"Hartung T (2016) Making big sense from big data in toxicology by read-across. ALTEX-Altern Anim Exp 33(2):83\u201393. https:\/\/doi.org\/10.14573\/altex.1603091","journal-title":"ALTEX-Altern Anim Exp"},{"issue":"7","key":"310_CR3","doi-asserted-by":"publisher","first-page":"475","DOI":"10.1038\/nrd4609","volume":"14","author":"MJ Waring","year":"2015","unstructured":"Waring MJ, Arrowsmith J, Leach AR, Leeson PD, Mandrell S, Owen RM, Pairaudeau G, Pennie WD, Pickett SD, Wang J et al (2015) An analysis of the attrition of drug candidates from four major pharmaceutical companies. Nat Rev Drug Discov 14(7):475.\u00a0https:\/\/doi.org\/10.1038\/nrd4609","journal-title":"Nat Rev Drug Discov"},{"issue":"2","key":"310_CR4","doi-asserted-by":"publisher","first-page":"188","DOI":"10.2174\/138620710790596736","volume":"13","author":"JM McKim","year":"2010","unstructured":"McKim JM (2010) Building a tiered approach to in vitro predictive toxicity screening: a focus on assays with in vivo relevance. Combinatorial Chem High Throughput screen 13(2):188\u2013206.\u00a0https:\/\/doi.org\/10.2174\/138620710790596736","journal-title":"Combinatorial Chem High Throughput screen"},{"key":"310_CR5","unstructured":"BMEL - \u00dcbersicht: BMEL informiert \u00fcber Tierschutz - Verwendung von Versuchstieren im Jahr 2016. https:\/\/www.bmel.de\/DE\/Tier\/Tierschutz\/_texte\/Versuchstierzahlen2016.html#doc10323474bodyText6"},{"issue":"10","key":"310_CR6","doi-asserted-by":"publisher","first-page":"2445","DOI":"10.1007\/s00204-015-1618-2","volume":"90","author":"P Carri\u00f3","year":"2016","unstructured":"Carri\u00f3 P, Sanz F, Pastor M (2016) Toward a unifying strategy for the structure-based prediction of toxicological endpoints. Archiv Toxicol 90(10):2445\u20132460.\u00a0https:\/\/doi.org\/10.1007\/s00204-015-1618-2","journal-title":"Archiv Toxicol"},{"key":"310_CR7","unstructured":"Regulation (EC) No 1907\/2006 of the European Parliament and of the Council of 18 December 2006 concerning the Registration, Evaluation, Authorisation and Restriction of Chemicals (REACH). https:\/\/ec.europa.eu\/environment\/chemicals\/reach\/reach_en.htm"},{"key":"310_CR8","doi-asserted-by":"crossref","unstructured":"Graves A, Mohamed A, Hinton GE (2013) Speech recognition with deep recurrent neural networks. CoRR, abs\/1303.5778, arXiv:1303.5778","DOI":"10.1109\/ICASSP.2013.6638947"},{"key":"310_CR9","first-page":"1097","volume-title":"Advances in neural information processing systems","author":"A Krizhevsky","year":"2012","unstructured":"Krizhevsky A, Sutskever I, Hinton GE (2012) ImageNet classification with deep convolutional neural networks. In: Pereira F, Burges CJC, Bottou L, Weinberger KQ (eds) Advances in neural information processing systems. Curran Associates, Inc., Red Hook, pp 1097\u20131105. https:\/\/papers.nips.cc\/paper\/4824-imagenet-classification-with-deep-convolutional-neural-networks.pdf"},{"issue":"7","key":"310_CR10","doi-asserted-by":"publisher","first-page":"eaap7885","DOI":"10.1126\/sciadv.aap7885","volume":"4","author":"M Popova","year":"2018","unstructured":"Popova M, Isayev O, Tropsha A (2018) Deep reinforcement learning for de novo drug design. Sci Adv 4(7):eaap7885. https:\/\/doi.org\/10.1126\/sciadv.aap7885","journal-title":"Sci Adv"},{"issue":"1","key":"310_CR11","doi-asserted-by":"publisher","first-page":"120","DOI":"10.1021\/acscentsci.7b00512","volume":"4","author":"HS Segler Marwin","year":"2018","unstructured":"Segler Marwin HS, Thierry K, Christian T, Waller Mark P (2018) Generating focused molecule libraries for drug discovery with recurrent neural networks. ACS Central Sci 4(1):120\u2013131. https:\/\/doi.org\/10.1021\/acscentsci.7b00512","journal-title":"ACS Central Sci"},{"issue":"6","key":"310_CR12","doi-asserted-by":"publisher","first-page":"1194","DOI":"10.1021\/acs.jcim.7b00690","volume":"58","author":"P Evgeny","year":"2018","unstructured":"Evgeny P, Arip A, Yan I, Vladimir A, Benjamin S-L, Al\u00e1n A-G, Alex Z (2018) Reinforced adversarial neural computer for De Novo molecular design. J Chem Inform Model 58(6):1194\u20131204. https:\/\/doi.org\/10.1021\/acs.jcim.7b00690","journal-title":"J Chem Inform Model"},{"issue":"1\u20132","key":"310_CR13","doi-asserted-by":"publisher","first-page":"1700123","DOI":"10.1002\/minf.201700123","volume":"37","author":"B Thomas","year":"2018","unstructured":"Thomas B, Marcus O, Ola E, J\u00fcrgen B, Hongming C (2018) Application of generative autoencoder in De Novo molecular design. Mol Inform 37(1\u20132):1700123. https:\/\/doi.org\/10.1002\/minf.201700123","journal-title":"Mol Inform"},{"issue":"2","key":"310_CR14","doi-asserted-by":"publisher","first-page":"268","DOI":"10.1021\/acscentsci.7b00572","volume":"4","author":"G-B Rafael","year":"2018","unstructured":"Rafael G-B, Wei Jennifer N, David D, Miguel Hern\u00e1ndez-Lobato Jos\u00e9, Benjam\u00edn S\u00e1nchez-Lengeling, Dennis Sheberla, Jorge A-I, Hirzel Timothy D, Adams Ryan P, Al\u00e1n A-G (2018) Automatic chemical design using a data-driven continuous representation of molecules. ACS Central Sci 4(2):268\u2013276. https:\/\/doi.org\/10.1021\/acscentsci.7b00572","journal-title":"ACS Central Sci"},{"issue":"6","key":"310_CR15","doi-asserted-by":"publisher","first-page":"2545","DOI":"10.1021\/acs.jcim.9b00266","volume":"59","author":"C Mater Adam","year":"2019","unstructured":"Mater Adam C, Coote Michelle L (2019) Deep learning in chemistry. J Chem Inform Model 59(6):2545\u20132559. https:\/\/doi.org\/10.1021\/acs.jcim.9b00266","journal-title":"J Chem Inform Model"},{"key":"310_CR16","doi-asserted-by":"publisher","unstructured":"Hu Y, Stumpfe D, Bajorath J (2013) Advancing the activity cliff concept. F1000Research, 2, ISSN 2046-1402. https:\/\/doi.org\/10.12688\/f1000research.2-199.v1","DOI":"10.12688\/f1000research.2-199.v1"},{"issue":"10","key":"310_CR17","doi-asserted-by":"publisher","first-page":"1294","DOI":"10.1016\/j.chembiol.2016.07.023","volume":"23","author":"KM Gayvert","year":"2016","unstructured":"Gayvert KM, Madhukar NS, Elemento O (2016) A data-driven approach to predicting successes and failures of clinical trials. Cell Chem Biol 23(10):1294\u20131301. https:\/\/doi.org\/10.1016\/j.chembiol.2016.07.023","journal-title":"Cell Chem Biol"},{"issue":"2","key":"310_CR18","doi-asserted-by":"publisher","first-page":"263","DOI":"10.1021\/ci500747n","volume":"55","author":"M Junshui","year":"2015","unstructured":"Junshui M, Sheridan RP, Andy L, Dahl GE, Vladimir S (2015) Deep neural nets as a method for quantitative structure-activity relationships. J Chem Inform Model 55(2):263\u2013274. https:\/\/doi.org\/10.1021\/ci500747n","journal-title":"J Chem Inform Model"},{"issue":"6","key":"310_CR19","doi-asserted-by":"publisher","first-page":"914","DOI":"10.3390\/ijms17060914","volume":"17","author":"N Serena","year":"2016","unstructured":"Serena N, Francesca G, Viviana C, Robert T (2016) In silico prediction of cytochrome P450-drug interaction: QSARs for CYP3A4 and CYP2C9. Int J Mol Sci 17(6):914. https:\/\/doi.org\/10.3390\/ijms17060914","journal-title":"Int J Mol Sci"},{"key":"310_CR20","unstructured":"Bender A (2019) \u2019AI\u2019 in toxicology (in silico toxicology): The Pieces Don\u2019t Yet Fit Together, http:\/\/www.drugdiscovery.net\/tag\/insilicotox\/"},{"issue":"11","key":"310_CR21","doi-asserted-by":"publisher","first-page":"3007","DOI":"10.1021\/acschembio.6b00538","volume":"11","author":"LH Mervin","year":"2016","unstructured":"Mervin LH, Qing C, Barrett IP, Firth MA, Murray D, McWilliams L, Haddrick M, Wigglesworth M, Engkvist O, Bender A (2016) Understanding cytotoxicity and cytostaticity in a high-throughput screening collection. ACS Chem Biol 11(11):3007\u20133023. https:\/\/doi.org\/10.1021\/acschembio.6b00538","journal-title":"ACS Chem Biol"},{"key":"310_CR22","doi-asserted-by":"publisher","unstructured":"Riss TL, Moravec RA, Niles AL (2011) Cytotoxicity testing: measuring viable cells, dead cells, and detecting mechanism of cell death. In: Mammalian cell viability, pp 103\u2013114. Springer.\u00a0https:\/\/doi.org\/10.1007\/978-1-61779-108-6_12","DOI":"10.1007\/978-1-61779-108-6_12"},{"key":"310_CR23","doi-asserted-by":"publisher","DOI":"10.1093\/nar\/gky318","author":"B Priyanka","year":"2018","unstructured":"Priyanka B, Eckert AO, Schrey AK, Preissner R (2018) ProTox-II: a webserver for the prediction of toxicity of chemicals. Nucleic Acids Res. https:\/\/doi.org\/10.1093\/nar\/gky318","journal-title":"Nucleic Acids Res"},{"issue":"1","key":"310_CR24","doi-asserted-by":"publisher","first-page":"73","DOI":"10.1039\/C6TX00252H","volume":"6","author":"F Svensson","year":"2017","unstructured":"Svensson F, Norinder U, Bender A (2017) Modelling compound cytotoxicity using conformal prediction and PubChem HTS data. Toxicol Res 6(1):73\u201380. https:\/\/doi.org\/10.1039\/C6TX00252H","journal-title":"Toxicol Res"},{"issue":"1","key":"310_CR25","doi-asserted-by":"publisher","first-page":"11","DOI":"10.1186\/1758-2946-2-11","volume":"2","author":"SR Langdon","year":"2010","unstructured":"Langdon SR, Mulgrew J, Paolini GV, Van Hoorn WP (2010) Predicting cytotoxicity from heterogeneous data sources with Bayesian learning. J Cheminform 2(1):11. https:\/\/doi.org\/10.1186\/1758-2946-2-11","journal-title":"J Cheminform"},{"issue":"1","key":"310_CR26","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1371\/journal.pone.0191838","volume":"13","author":"AA Lagunin","year":"2018","unstructured":"Lagunin AA, Dubovskaja VI, Rudik AV, Pogodin PV, Druzhilovskiy DS, Gloriozova TA, Filimonov DA, Sastry NG (2018) CLC-Pred: a freely available web-service for in silico prediction of human cell line cytotoxicity for drug-like compounds. PLoS ONE 13(1):1\u201313. https:\/\/doi.org\/10.1371\/journal.pone.0191838","journal-title":"PLoS ONE"},{"key":"310_CR27","volume-title":"Deep learning","author":"I Goodfellow","year":"2016","unstructured":"Goodfellow I, Bengio Y, Courville A (2016) Deep learning. MIT Press, Cambridge.\u00a0https:\/\/www.deeplearningbook.org\/"},{"key":"310_CR28","first-page":"1","volume":"27","author":"T Unterthiner","year":"2014","unstructured":"Unterthiner T, Mayr A, Klambauer G, Steijaert M, Wegner J\u00f6rg K, Ceulemans H, Hochreiter S (2014) Deep learning as an opportunity in virtual screening. Proc Deep Learn Workshop at NIPS 27:1\u20139. https:\/\/pdfs.semanticscholar.org\/95f7\/b2c0fe75f08e3ce0d2ac4315166f4239db5c.pdf","journal-title":"Proc Deep Learn Workshop at NIPS"},{"issue":"24","key":"310_CR29","doi-asserted-by":"publisher","first-page":"5441","DOI":"10.1039\/c8sc00148k","volume":"9","author":"A Mayr","year":"2018","unstructured":"Mayr A, Klambauer G, Unterthiner T, Steijaert M, Wegner JK, Ceulemans H, Clevert DA, Hochreiter S (2018) Large-scale comparison of machine learning methods for drug target prediction on ChEMBL. Chem Sci 9(24):5441\u20135451.\u00a0https:\/\/doi.org\/10.1039\/c8sc00148k","journal-title":"Chem Sci"},{"key":"310_CR30","doi-asserted-by":"publisher","first-page":"513","DOI":"10.1039\/C7SC02664A","volume":"9","author":"Z Wu","year":"2018","unstructured":"Wu Z, Ramsundar B, Feinberg EN, Gomes J, Geniesse C, Pappu AS, Leswing K, Pande V (2018) Moleculenet: a benchmark for molecular machine learning. Chem. Sci. 9:513\u2013530. https:\/\/doi.org\/10.1039\/C7SC02664A","journal-title":"Chem. Sci."},{"issue":"4","key":"310_CR31","doi-asserted-by":"publisher","first-page":"283","DOI":"10.1021\/acscentsci.6b00367","volume":"3","author":"H Altae-Tran","year":"2017","unstructured":"Altae-Tran H, Ramsundar B, Pappu AS, Pande V (2017) Low data drug discovery with one-shot learning. ACS Central Sci 3(4):283\u2013293. https:\/\/doi.org\/10.1021\/acscentsci.6b00367","journal-title":"ACS Central Sci"},{"key":"310_CR32","doi-asserted-by":"publisher","first-page":"50","DOI":"10.1021\/ci100176x","volume":"1204","author":"D Fourches","year":"2010","unstructured":"Fourches D, Muratov E, Tropsha A (2010) Trust, but verify: On the importance of chemical structure curation in cheminformatics and QSAR modeling research. J Chem Inform Model 1204:50\u20131189.\u00a0https:\/\/doi.org\/10.1021\/ci100176x","journal-title":"J Chem Inform Model"},{"key":"310_CR33","doi-asserted-by":"publisher","first-page":"80","DOI":"10.3389\/fenvs.2015.00080","volume":"3","author":"A Mayr","year":"2016","unstructured":"Mayr A, Klambauer G, Unterthiner T, Hochreiter S (2016) DeepTox: toxicity prediction using deep learning. Front Environ Sci 3:80.\u00a0https:\/\/doi.org\/10.3389\/fenvs.2015.00080","journal-title":"Front Environ Sci"},{"issue":"4","key":"310_CR34","doi-asserted-by":"publisher","first-page":"1324","DOI":"10.1021\/acs.jcim.8b00825","volume":"59","author":"RP Sheridan","year":"2019","unstructured":"Sheridan RP (2019) Interpretation of QSAR models by coloring atoms according to changes in predicted activity: how robust is it? J Chem Inform Model 59(4):1324\u20131337. https:\/\/doi.org\/10.1021\/acs.jcim.8b00825","journal-title":"J Chem Inform Model"},{"key":"310_CR35","doi-asserted-by":"publisher","unstructured":"Preuer K, Klambauer G, Rippmann F, Hochreiter S, Unterthiner T (2019) Interpretable deep learning in drug discovery, pp 331\u2013345. Springer International Publishing, Cham, https:\/\/doi.org\/10.1007\/978-3-030-28954-6_18","DOI":"10.1007\/978-3-030-28954-6_18"},{"key":"310_CR36","doi-asserted-by":"publisher","DOI":"10.1021\/acs.molpharmaceut.9b00520","author":"M Manica","year":"2019","unstructured":"Manica M, Oskooei A, Born J, Subramanian V, S\u00e1ez-Rodr\u00edguez J, Rodr\u00edguez Mart\u00ednez M (2019) Toward explainable anticancer compound sensitivity prediction via multimodal attention-based convolutional encoders. Mol Pharm. https:\/\/doi.org\/10.1021\/acs.molpharmaceut.9b00520","journal-title":"Mol Pharm"},{"key":"310_CR37","doi-asserted-by":"publisher","first-page":"96","DOI":"10.1016\/j.jmgm.2018.06.005","volume":"84","author":"J Hochuli","year":"2018","unstructured":"Hochuli J, Helbling A, Skaist T, Ragoza M, Koes DR (2018) Visualizing convolutional neural network protein-ligand scoring. J Mol Graph Model 84:96\u2013108. https:\/\/doi.org\/10.1016\/j.jmgm.2018.06.005","journal-title":"J Mol Graph Model"},{"issue":"16","key":"310_CR38","doi-asserted-by":"publisher","first-page":"953","DOI":"10.1002\/jcc.25168","volume":"39","author":"P \u017duvela","year":"2018","unstructured":"\u017duvela P, David J, Wong MW (2018) Interpretation of ANN-based QSAR models for prediction of antioxidant activity of flavonoids. J Comput Chem 39(16):953\u2013963. https:\/\/doi.org\/10.1002\/jcc.25168","journal-title":"J Comput Chem"},{"key":"310_CR39","doi-asserted-by":"publisher","first-page":"211","DOI":"10.1016\/j.patcog.2016.11.008","volume":"65","author":"G Montavon","year":"2017","unstructured":"Montavon G, Lapuschkin S, Binder A, Samek W, M\u00fcller KR (2017) Explaining nonlinear classification decisions with deep Taylor decomposition. Pattern Recognit 65:211\u2013222.\u00a0https:\/\/doi.org\/10.1016\/j.patcog.2016.11.008","journal-title":"Pattern Recognit"},{"issue":"1","key":"310_CR40","doi-asserted-by":"publisher","first-page":"43","DOI":"10.1186\/1758-2946-5-43","volume":"5","author":"S Riniker","year":"2013","unstructured":"Riniker S, Landrum GA (2013) Similarity maps: a visualization strategy for molecular fingerprints and machine-learning methods. J Cheminform 5(1):43. https:\/\/doi.org\/10.1186\/1758-2946-5-43","journal-title":"J Cheminform"},{"issue":"2","key":"310_CR41","doi-asserted-by":"publisher","first-page":"401","DOI":"10.1007\/s11030-009-9187-z","volume":"14","author":"M Lisurek","year":"2010","unstructured":"Lisurek M, Rupp B, Wichard J, Neuenschwander M, von Kries JP, Frank R, Rademann J, K\u00fchne R (2010) Design of chemical libraries with potentially bioactive molecules applying a maximum common substructure concept. Mol Divers 14(2):401\u2013408. https:\/\/doi.org\/10.1007\/s11030-009-9187-z","journal-title":"Mol Divers"},{"issue":"7","key":"310_CR42","doi-asserted-by":"publisher","first-page":"2719","DOI":"10.1021\/jm901137j","volume":"53","author":"JB Baell","year":"2010","unstructured":"Baell JB, Holloway GA (2010) New Substructure filters for removal of pan assay interference compounds (PAINS) from screening libraries and for their exclusion in bioassays. J Med Chem 53(7):2719\u20132740. https:\/\/doi.org\/10.1021\/jm901137j","journal-title":"J Med Chem"},{"key":"310_CR43","unstructured":"Spence MTZ, Johnson I (2010) The molecular probes handbook: a guide to fluorescent probes and labeling technologies. Live technologies corporation, 11th edn, ISBN 978-0-9829279-1-5"},{"key":"310_CR44","unstructured":"RDKit, online. RDKit: Open-source cheminformatics. http:\/\/www.rdkit.org"},{"key":"310_CR45","unstructured":"Atkinson F. standardiser 0.1.9, 8 2017. https:\/\/pypi.org\/project\/standardiser\/"},{"issue":"1","key":"310_CR46","doi-asserted-by":"publisher","first-page":"60","DOI":"10.1186\/s13321-016-0173-z","volume":"8","author":"M G\u00fctlein","year":"2016","unstructured":"G\u00fctlein M, Kramer S (2016) Filtered circular fingerprints improve either prediction or runtime performance while retaining interpretability. J Cheminform 8(1):60. https:\/\/doi.org\/10.1186\/s13321-016-0173-z","journal-title":"J Cheminform"},{"key":"310_CR47","doi-asserted-by":"publisher","first-page":"1929","DOI":"10.5555\/2627435.2670313","volume":"15","author":"N Srivastava","year":"2014","unstructured":"Srivastava N, Hinton G, Krizhevsky A, Sutskever I, Salakhutdinov R (2014) Dropout: a simple way to prevent neural networks from overfitting. J Mach Learn Res 15:1929\u20131958. https:\/\/doi.org\/10.5555\/2627435.2670313","journal-title":"J Mach Learn Res"},{"key":"310_CR48","unstructured":"Kingma DP, Adam JB (2014) A method for stochastic optimization. arXiv preprint arXiv:1412.6980"},{"issue":"6","key":"310_CR49","doi-asserted-by":"publisher","first-page":"1947","DOI":"10.1021\/ci034160g","volume":"43","author":"V Svetnik","year":"2003","unstructured":"Svetnik V, Liaw A, Tong C, Christopher Culberson J, Sheridan RP, Feuston BP (2003) Random forest: a classification and regression tool for compound classification and QSAR modeling. J Chem Inform Comput Sci 43(6):1947\u20131958. https:\/\/doi.org\/10.1021\/ci034160g","journal-title":"J Chem Inform Comput Sci"},{"key":"310_CR50","first-page":"2825","volume":"12","author":"F Pedregosa","year":"2011","unstructured":"Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V, Vanderplas J, Passos A, Cournapeau D, Brucher M, Perrot M, Duchesnay E (2011) Scikit-learn: machine learning in python. J Mach Learn Res 12:2825\u20132830. https:\/\/arxiv.org\/abs\/1201.0490v4","journal-title":"J Mach Learn Res"},{"key":"310_CR51","doi-asserted-by":"publisher","unstructured":"Brodersen KH, Ong CS, Stephan KE, Buhmann JM (Aug 2010) The balanced accuracy and its posterior distribution. In 2010 20th International Conference on Pattern Recognition, pp 3121\u20133124, https:\/\/doi.org\/10.1109\/ICPR.2010.764","DOI":"10.1109\/ICPR.2010.764"},{"issue":"3","key":"310_CR52","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1371\/journal.pone.0118432","volume":"10","author":"T Saito","year":"2015","unstructured":"Saito T, Rehmsmeier M (2015) The precision-recall plot is more informative than the ROC plot when evaluating binary classifiers on imbalanced datasets. PLoS ONE 10(3):1\u201321. https:\/\/doi.org\/10.1371\/journal.pone.0118432","journal-title":"PLoS ONE"},{"issue":"7","key":"310_CR53","doi-asserted-by":"publisher","first-page":"e0130140","DOI":"10.1371\/journal.pone.0130140","volume":"10","author":"S Bach","year":"2015","unstructured":"Bach S, Binder A, Montavon G, Klauschen F, M\u00fcller K-R, Samek W (2015) On pixel-wise explanations for non-linear classifier decisions by layer-wise relevance propagation. PLoS ONE 10(7):e0130140. https:\/\/doi.org\/10.1371\/journal.pone.0130140","journal-title":"PLoS ONE"},{"key":"310_CR54","unstructured":"Chollet F et al. (2015) Keras. https:\/\/keras.io"},{"issue":"93","key":"310_CR55","first-page":"1","volume":"20","author":"M Alber","year":"2019","unstructured":"Alber M, Lapuschkin S, Seegerer P, H\u00e4gele M, Sch\u00fctt KT, Montavon G, Samek W, M\u00fcller K-R, D\u00e4hne S, Kindermans PJ (2019) iNNvestigate neural networks. J Mach Learn Res 20(93):1\u20138. https:\/\/arxiv.org\/abs\/1808.04260v1","journal-title":"J Mach Learn Res"},{"key":"310_CR56","unstructured":"\u2018LOPAC\u00ae1280 library\u2019 from Sigma-Aldrich, https:\/\/www.sigmaaldrich.com\/life-science\/cell-biology\/bioactive-small-molecules\/lopac1280-navigator.html"},{"key":"310_CR57","unstructured":"\u2018FDA Approved Drug Library L1300\u2019 from Selleckchem, https:\/\/www.selleckchem.com\/screening\/fda-approved-drug-library.html"},{"key":"310_CR58","unstructured":"Landrum G (2018) Working with unbalanced data, part I . http:\/\/rdkit.blogspot.com\/2018\/11\/working-with-unbalanced-data-part-i.html"},{"issue":"D1","key":"310_CR59","doi-asserted-by":"publisher","first-page":"D945","DOI":"10.1093\/nar\/gkw1074","volume":"45","author":"A Gaulton","year":"2017","unstructured":"Gaulton A, Hersey A, Nowotka M, Bento AP, Chambers J, Mendez D, Mutowo P, Atkinson F, Bellis LJ, Cibri\u00e1n-Uhalte E et al (2017) The ChEMBL database in 2017. Nucleic Acids Res 45(D1):D945\u2013D954.\u00a0https:\/\/doi.org\/10.1093\/nar\/gkw1074","journal-title":"Nucleic Acids Res"},{"issue":"14","key":"310_CR60","doi-asserted-by":"publisher","first-page":"2508","DOI":"10.1093\/bioinformatics\/bty135","volume":"34","author":"C Ji","year":"2018","unstructured":"Ji C, Svensson F, Zoufir A, Bender A (2018) eMolTox: prediction of molecular toxicity with confidence. Bioinformatics 34(14):2508\u20132509. https:\/\/doi.org\/10.1093\/bioinformatics\/bty135","journal-title":"Bioinformatics"},{"key":"310_CR61","doi-asserted-by":"publisher","unstructured":"Cruz-Monteagudo M, Medina-Franco JL, P\u00e9rez-Castillo Y, Nicolotti O, Nat\u00e1lia M, Cordeiro DS, Borges F (2014) Activity cliffs in drug discovery: Dr Jekyll or Mr Hyde?, ISSN 18785832. https:\/\/doi.org\/10.1016\/j.drudis.2014.02.003","DOI":"10.1016\/j.drudis.2014.02.003"},{"key":"310_CR62","unstructured":"Bahdanau D, Cho KH, Bengio Y (2015) Neural machine translation by jointly learning to align and translate. In 3rd International Conference on Learning Representations, ICLR 2015-Conference Track Proceedings. International Conference on Learning Representations, ICLR. https:\/\/arxiv.org\/abs\/1409.0473"},{"issue":"11","key":"310_CR63","doi-asserted-by":"publisher","first-page":"865","DOI":"10.1080\/1062936X.2016.1250229","volume":"27","author":"T Hanser","year":"2016","unstructured":"Hanser T, Barber C, Marchaland JF, Werner S (2016) Applicability domain: towards a more formal definition. SAR QSAR Environ Res 27(11):865\u2013881. https:\/\/doi.org\/10.1080\/1062936X.2016.1250229","journal-title":"SAR QSAR Environ Res"},{"key":"310_CR64","unstructured":"Kimber TB, Engelke S, Tetko IV, Bruno E, Godin G (2018) Synergy effect between convolutional neural networks and the multiplicity of SMILES for improvement of molecular prediction. arXiv preprint https:\/\/arxiv.org\/abs\/1812.04439"},{"key":"310_CR65","doi-asserted-by":"publisher","first-page":"1692","DOI":"10.1039\/C8SC04175J","volume":"10","author":"R Winter","year":"2019","unstructured":"Winter R, Montanari F, No\u00e9 F, Clevert DA (2019) Learning continuous and data-driven molecular descriptors by translating equivalent chemical representations. Chem. Sci. 10:1692\u20131701. https:\/\/doi.org\/10.1039\/C8SC04175J","journal-title":"Chem. Sci."}],"container-title":["Journal of Computer-Aided Molecular Design"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10822-020-00310-4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s10822-020-00310-4\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10822-020-00310-4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,4,15]],"date-time":"2021-04-15T23:57:44Z","timestamp":1618531064000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s10822-020-00310-4"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,4,16]]},"references-count":65,"journal-issue":{"issue":"7","published-print":{"date-parts":[[2020,7]]}},"alternative-id":["310"],"URL":"https:\/\/doi.org\/10.1007\/s10822-020-00310-4","relation":{},"ISSN":["0920-654X","1573-4951"],"issn-type":[{"value":"0920-654X","type":"print"},{"value":"1573-4951","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,4,16]]},"assertion":[{"value":"4 February 2020","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"16 March 2020","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"16 April 2020","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Compliance with ethical standards"}},{"value":"The authors declare that they have no conflict of interest.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflicts of interest"}}]}}