{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,9]],"date-time":"2026-05-09T09:38:24Z","timestamp":1778319504504,"version":"3.51.4"},"reference-count":97,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2020,9,4]],"date-time":"2020-09-04T00:00:00Z","timestamp":1599177600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2020,9,4]],"date-time":"2020-09-04T00:00:00Z","timestamp":1599177600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100000266","name":"Engineering and Physical Sciences Research Council","doi-asserted-by":"crossref","award":["EP\/S004963\/1"],"award-info":[{"award-number":["EP\/S004963\/1"]}],"id":[{"id":"10.13039\/501100000266","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Cheminform"],"published-print":{"date-parts":[[2020,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>We address the problem of generating novel molecules with desired interaction properties as a multi-objective optimization problem. Interaction binding models are learned from binding data using graph convolution networks (GCNs). Since the experimentally obtained property scores are recognised as having potentially gross errors, we adopted a robust loss for the model. Combinations of these terms, including drug likeness and synthetic accessibility, are then optimized using reinforcement learning based on a graph convolution policy approach. Some of the molecules generated, while legitimate chemically, can have excellent drug-likeness scores but appear unusual. We provide an example based on the binding potency of small molecules to dopamine transporters. We extend our method successfully to use a multi-objective reward function, in this case for generating novel molecules that bind with dopamine transporters but not with those for norepinephrine. Our method should be generally applicable to the generation in silico of molecules with desirable properties.<\/jats:p>","DOI":"10.1186\/s13321-020-00454-3","type":"journal-article","created":{"date-parts":[[2020,9,4]],"date-time":"2020-09-04T09:02:47Z","timestamp":1599210167000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":61,"title":["DeepGraphMolGen, a multi-objective, computational strategy for generating molecules with desirable properties: a graph convolution and reinforcement learning approach"],"prefix":"10.1186","volume":"12","author":[{"given":"Yash","family":"Khemchandani","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Stephen","family":"O\u2019Hagan","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Soumitra","family":"Samanta","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Neil","family":"Swainston","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Timothy J.","family":"Roberts","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Danushka","family":"Bollegala","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5838-7963","authenticated-orcid":false,"given":"Douglas B.","family":"Kell","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2020,9,4]]},"reference":[{"issue":"1","key":"454_CR1","doi-asserted-by":"crossref","first-page":"972","DOI":"10.1080\/14686996.2017.1401424","volume":"18","author":"X Yang","year":"2017","unstructured":"Yang X, Zhang J, Yoshizoe K, Terayama K, Tsuda K (2017) ChemTS: an efficient python library for de novo molecular generation. Sci Technol Adv Mater 18(1):972\u2013976","journal-title":"Sci Technol Adv Mater"},{"issue":"10","key":"454_CR2","doi-asserted-by":"crossref","first-page":"1120","DOI":"10.1038\/nmat4717","volume":"15","author":"R G\u00f3mez-Bombarelli","year":"2016","unstructured":"G\u00f3mez-Bombarelli R, Aguilera-Iparraguirre J, Hirzel TD, Duvenaud D, Maclaurin D, Blood-Forsythe MA, Chae HS, Einzinger M, Ha DG, Wu T et al (2016) Design of efficient molecular organic light-emitting diodes by a high-throughput virtual screening and experimental approach. Nat Mater 15(10):1120","journal-title":"Nat Mater"},{"issue":"2","key":"454_CR3","doi-asserted-by":"crossref","first-page":"268","DOI":"10.1021\/acscentsci.7b00572","volume":"4","author":"R G\u00f3mez-Bombarelli","year":"2018","unstructured":"G\u00f3mez-Bombarelli R, Wei JN, Duvenaud D, Hern\u00e1ndez-Lobato JM, S\u00e1nchez-Lengeling B, Sheberla D, Aguilera-Iparraguirre J, Hirzel TD, Adams RP, Aspuru-Guzik A (2018) Automatic chemical design using a data-driven continuous representation of molecules. ACS Cent Sci 4(2):268\u2013276","journal-title":"ACS Cent Sci"},{"issue":"6400","key":"454_CR4","doi-asserted-by":"crossref","first-page":"360","DOI":"10.1126\/science.aat2663","volume":"361","author":"B Sanchez-Lengeling","year":"2018","unstructured":"Sanchez-Lengeling B, Aspuru-Guzik A (2018) Inverse molecular design using machine learning: generative models for matter engineering. Science 361(6400):360\u2013365","journal-title":"Science"},{"issue":"9","key":"454_CR5","doi-asserted-by":"crossref","first-page":"3098","DOI":"10.1021\/acs.molpharmaceut.7b00346","volume":"14","author":"A Kadurin","year":"2017","unstructured":"Kadurin A, Nikolenko S, Khrabrov K, Aliper A, Zhavoronkov A (2017) druGAN: an advanced generative adversarial autoencoder model for de novo generation of new molecules with desired molecular properties in silico. Mol Pharm 14(9):3098\u20133104","journal-title":"Mol Pharm"},{"issue":"1","key":"454_CR6","doi-asserted-by":"crossref","first-page":"285","DOI":"10.1007\/s10994-017-5685-x","volume":"107","author":"I Olier","year":"2018","unstructured":"Olier I, Sadawi N, Bickerton GR, Vanschoren J, Grosan C, Soldatova L, King RD (2018) Meta-QSAR: a large-scale application of meta-learning to drug design and discovery. Mach Learn 107(1):285\u2013311","journal-title":"Mach Learn"},{"issue":"7","key":"454_CR7","doi-asserted-by":"crossref","first-page":"eaap7885","DOI":"10.1126\/sciadv.aap7885","volume":"4","author":"M Popova","year":"2018","unstructured":"Popova M, Isayev O, Tropsha A (2018) Deep reinforcement learning for de novo drug design. Sci Adv 4(7):eaap7885","journal-title":"Sci Adv"},{"key":"454_CR8","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1038\/s41578-018-0005-z","volume":"3","author":"DP Tabor","year":"2018","unstructured":"Tabor DP, Roch LM, Saikin SK, Kreisbeck C, Sheberla D, Montoya JH, Dwaraknath S, Aykol M, Ortiz C, Tribukait H et al (2018) Accelerating the discovery of materials for clean energy in the era of smart automation. Nat Rev Mater 3:5\u201320","journal-title":"Nat Rev Mater"},{"issue":"2","key":"454_CR9","doi-asserted-by":"crossref","first-page":"1720","DOI":"10.1021\/acs.analchem.9b02348","volume":"92","author":"SM Colby","year":"2020","unstructured":"Colby SM, Nu\u00f1ez JR, Hodas NO, Corley CD, Renslow RR (2020) Deep learning to generate in silico chemical property libraries and candidate molecules for small molecule identification in complex samples. Anal Chem 92(2):1720\u20131729","journal-title":"Anal Chem"},{"key":"454_CR10","doi-asserted-by":"publisher","DOI":"10.1080\/17460441.2020.1745183","author":"II Baskin","year":"2020","unstructured":"Baskin II (2020) The power of deep learning to ligand-based novel drug discovery. Expert Opin Drug Discov. https:\/\/doi.org\/10.1080\/17460441.2020.1745183","journal-title":"Expert Opin Drug Discov"},{"issue":"1","key":"454_CR11","doi-asserted-by":"crossref","first-page":"29","DOI":"10.1021\/acs.jcim.9b00694","volume":"60","author":"SH Hong","year":"2020","unstructured":"Hong SH, Ryu S, Lim J, Kim WY (2020) Molecular generative model based on an adversarially regularized autoencoder. J Chem Inf Model 60(1):29\u201336","journal-title":"J Chem Inf Model"},{"issue":"4","key":"454_CR12","doi-asserted-by":"crossref","first-page":"1153","DOI":"10.1039\/C9SC04503A","volume":"11","author":"J Lim","year":"2020","unstructured":"Lim J, Hwang SY, Moon S, Kim S, Kim WY (2020) Scaffold-based molecular design with a graph generative model. Chem Sci 11(4):1153\u20131164","journal-title":"Chem Sci"},{"issue":"9","key":"454_CR13","doi-asserted-by":"crossref","first-page":"2531","DOI":"10.1039\/C9SC03414E","volume":"11","author":"AS Rifaioglu","year":"2020","unstructured":"Rifaioglu AS, Nalbat E, Atalay V, Martin MJ, Cetin-Atalay R, Do\u011fan T (2020) DEEPScreen: high performance drug-target interaction prediction with convolutional neural networks using 2-D structural compound representations. Chem Sci 11(9):2531\u20132557","journal-title":"Chem Sci"},{"issue":"1","key":"454_CR14","doi-asserted-by":"crossref","first-page":"14","DOI":"10.1186\/s13321-020-00419-6","volume":"12","author":"J Yasonik","year":"2020","unstructured":"Yasonik J (2020) Multiobjective de novo drug design with recurrent neural networks and nondominated sorting. J Cheminform 12(1):14","journal-title":"J Cheminform"},{"issue":"3","key":"454_CR15","doi-asserted-by":"crossref","first-page":"227","DOI":"10.1248\/cpb.c19-00625","volume":"68","author":"A Yoshimori","year":"2020","unstructured":"Yoshimori A, Kawasaki E, Kanai C, Tasaka T (2020) Strategies for design of molecular structures with a desired pharmacophore using deep reinforcement learning. Chem Pharm Bull (Tokyo) 68(3):227\u2013233","journal-title":"Chem Pharm Bull (Tokyo)"},{"issue":"2","key":"454_CR16","doi-asserted-by":"crossref","first-page":"143","DOI":"10.1038\/s41587-020-0418-2","volume":"38","author":"WP Walters","year":"2020","unstructured":"Walters WP, Murcko M (2020) Assessing the impact of generative AI on medicinal chemistry. Nat Biotechnol 38(2):143\u2013145","journal-title":"Nat Biotechnol"},{"issue":"2","key":"454_CR17","doi-asserted-by":"crossref","first-page":"577","DOI":"10.1039\/C9SC04026A","volume":"11","author":"RR Griffiths","year":"2020","unstructured":"Griffiths RR, Hern\u00e1ndez-Lobato JM (2020) Constrained Bayesian optimization for automatic chemical design using variational autoencoders. Chem Sci 11(2):577\u2013586","journal-title":"Chem Sci"},{"key":"454_CR18","doi-asserted-by":"crossref","first-page":"809","DOI":"10.3389\/fchem.2019.00809","volume":"7","author":"TFGG Cova","year":"2019","unstructured":"Cova TFGG, Pais AACC (2019) Deep learning for deep chemistry: optimizing the prediction of chemical patterns. Front Chem 7:809","journal-title":"Front Chem"},{"issue":"5","key":"454_CR19","doi-asserted-by":"crossref","first-page":"1370","DOI":"10.1016\/j.matt.2019.08.017","volume":"1","author":"J Noh","year":"2019","unstructured":"Noh J, Kim J, Stein HS, Sanchez-Lengeling B, Gregoire JM, Aspuru-Guzik A, Jung Y (2019) Inverse design of solid-state materials via a continuous representation. Matter 1(5):1370\u20131384","journal-title":"Matter"},{"issue":"12","key":"454_CR20","doi-asserted-by":"crossref","first-page":"1006","DOI":"10.2533\/chimia.2019.1006","volume":"73","author":"F Grisoni","year":"2019","unstructured":"Grisoni F, Schneider G (2019) De novo molecular design with generative long short-term memory. Chimia 73(12):1006\u20131011","journal-title":"Chimia"},{"issue":"12","key":"454_CR21","doi-asserted-by":"crossref","first-page":"1129","DOI":"10.1002\/cmdc.201900097","volume":"14","author":"F Grisoni","year":"2019","unstructured":"Grisoni F, Merk D, Friedrich L, Schneider G (2019) Design of natural-product-inspired multitarget ligands by machine learning. ChemMedChem 14(12):1129\u20131134","journal-title":"ChemMedChem"},{"issue":"1\u20132","key":"454_CR22","doi-asserted-by":"crossref","first-page":"1700111","DOI":"10.1002\/minf.201700111","volume":"37","author":"A Gupta","year":"2018","unstructured":"Gupta A, M\u00fcller AT, Huisman BJH, Fuchs JA, Schneider P, Schneider G (2018) Generative Recurrent Networks for de novo drug design. Mol Inform 37(1\u20132):1700111","journal-title":"Mol Inform"},{"issue":"1\u20132","key":"454_CR23","doi-asserted-by":"crossref","first-page":"1700153","DOI":"10.1002\/minf.201700153","volume":"37","author":"D Merk","year":"2018","unstructured":"Merk D, Friedrich L, Grisoni F, Schneider G (2018) De novo design of bioactive small molecules by artificial intelligence. Mol Inform 37(1\u20132):1700153","journal-title":"Mol Inform"},{"issue":"1\u20132","key":"454_CR24","first-page":"188031","volume":"37","author":"G Schneider","year":"2018","unstructured":"Schneider G (2018) Generative models for artificially-intelligent molecular design. Mol Inform 37(1\u20132):188031","journal-title":"Mol Inform"},{"key":"454_CR25","doi-asserted-by":"crossref","first-page":"353","DOI":"10.1038\/s41573-019-0050-3","volume":"19","author":"P Schneider","year":"2020","unstructured":"Schneider P, Walters WP, Plowright AT, Sieroka N, Listgarten J, Goodnow RA, Fisher J, Jansen JM, Duca JS, Rush TS et al (2020) Rethinking drug design in the artificial intelligence era. Nat Rev Drug Discov 19:353\u2013364","journal-title":"Nat Rev Drug Discov"},{"issue":"7","key":"454_CR26","doi-asserted-by":"crossref","first-page":"307","DOI":"10.1038\/s42256-019-0067-7","volume":"1","author":"A Button","year":"2019","unstructured":"Button A, Merk D, Hiss JA, Schneider G (2019) Automated de novo molecular design by hybrid machine intelligence and rule-driven chemical synthesis. Nat mach Intell 1(7):307\u2013315","journal-title":"Nat mach Intell"},{"key":"454_CR27","doi-asserted-by":"crossref","first-page":"171","DOI":"10.1038\/s42256-020-0160-y","volume":"2","author":"M Moret","year":"2020","unstructured":"Moret M, Friedrich L, Grisoni F, Merk D, Schneider G (2020) Generative molecular design in low data regimes. Nat Mach Intell 2:171\u2013180","journal-title":"Nat Mach Intell"},{"issue":"7","key":"454_CR28","doi-asserted-by":"crossref","first-page":"3166","DOI":"10.1021\/acs.jcim.9b00325","volume":"59","author":"N St\u00e5hl","year":"2019","unstructured":"St\u00e5hl N, Falkman G, Karlsson A, Mathiason G, Bostr\u00f6m J (2019) Deep reinforcement learning for multiparameter optimization in de novo drug design. J Chem Inf Model 59(7):3166\u20133176","journal-title":"J Chem Inf Model"},{"issue":"1","key":"454_CR29","doi-asserted-by":"crossref","first-page":"20","DOI":"10.1186\/s13321-019-0341-z","volume":"11","author":"J Ar\u00fas-Pous","year":"2019","unstructured":"Ar\u00fas-Pous J, Blaschke T, Ulander S, Reymond JL, Chen H, Engkvist O (2019) Exploring the GDB-13 chemical space using deep generative models. J Cheminform 11(1):20","journal-title":"J Cheminform"},{"issue":"3","key":"454_CR30","doi-asserted-by":"crossref","first-page":"722","DOI":"10.1021\/ar500432k","volume":"48","author":"JL Reymond","year":"2015","unstructured":"Reymond JL (2015) The Chemical Space Project. Acc Chem Res 48(3):722\u2013730","journal-title":"Acc Chem Res"},{"issue":"1","key":"454_CR31","doi-asserted-by":"crossref","first-page":"3","DOI":"10.1002\/(SICI)1098-1128(199601)16:1<3::AID-MED1>3.0.CO;2-6","volume":"16","author":"RS Bohacek","year":"1996","unstructured":"Bohacek RS, McMartin C, Guida WC (1996) The art and practice of structure-based drug design: a molecular modeling perspective. Med Res Rev 16(1):3\u201350","journal-title":"Med Res Rev"},{"issue":"2","key":"454_CR32","doi-asserted-by":"crossref","first-page":"374","DOI":"10.1021\/ci0255782","volume":"43","author":"P Ertl","year":"2003","unstructured":"Ertl P (2003) Cheminformatics analysis of organic substituents: identification of the most common substituents, calculation of substituent properties, and automatic identification of drug-like bioisosteric groups. J Chem Inf Comput Sci 43(2):374\u2013380","journal-title":"J Chem Inf Comput Sci"},{"issue":"1","key":"454_CR33","doi-asserted-by":"crossref","first-page":"1700503","DOI":"10.1002\/biot.201700503","volume":"13","author":"S O\u2019Hagan","year":"2018","unstructured":"O\u2019Hagan S, Kell DB (2018) Analysing and navigating natural products space for generating small, diverse, but representative chemical libraries. Biotechnol J 13(1):1700503","journal-title":"Biotechnol J"},{"key":"454_CR34","unstructured":"You J, Liu B, Ying R, Pande V, Leskovec J: Graph Convolutional Policy Network for Goal-Directed Molecular Graph Generation. arXiv 2018:1806.02473v02471"},{"key":"454_CR35","doi-asserted-by":"crossref","first-page":"6553","DOI":"10.1021\/jm500577n","volume":"57","author":"D Dimova","year":"2014","unstructured":"Dimova D, Stumpfe D, Bajorath J (2014) Method for the evaluation of structure-activity relationship information associated with coordinated activity cliffs. J Med Chem 57:6553\u20136563","journal-title":"J Med Chem"},{"issue":"1","key":"454_CR36","doi-asserted-by":"crossref","first-page":"18","DOI":"10.1021\/jm401120g","volume":"57","author":"D Stumpfe","year":"2014","unstructured":"Stumpfe D, Hu Y, Dimova D, Bajorath J (2014) Recent progress in understanding activity cliffs and their utility in medicinal chemistry. J Med Chem 57(1):18\u201328","journal-title":"J Med Chem"},{"issue":"2","key":"454_CR37","doi-asserted-by":"crossref","first-page":"451","DOI":"10.1021\/ci400728r","volume":"54","author":"D Stumpfe","year":"2014","unstructured":"Stumpfe D, Dimova D, Bajorath J (2014) Composition and topology of activity cliff clusters formed by bioactive compounds. J Chem Inf Model 54(2):451\u2013461","journal-title":"J Chem Inf Model"},{"issue":"1","key":"454_CR38","doi-asserted-by":"crossref","first-page":"9","DOI":"10.1186\/1758-2946-5-9","volume":"5","author":"AL Teixeira","year":"2013","unstructured":"Teixeira AL, Leal JP, Falcao AO (2013) Random forests for feature selection in QSPR models\u2014an application for predicting standard enthalpy of formation of hydrocarbons. J Cheminform 5(1):9","journal-title":"J Cheminform"},{"issue":"6","key":"454_CR39","doi-asserted-by":"crossref","first-page":"2538","DOI":"10.1021\/acs.jcim.9b00295","volume":"59","author":"P Ambure","year":"2019","unstructured":"Ambure P, Halder AK, Gonzalez Diaz H, Cordeiro M (2019) QSAR-co: an open source software for developing robust multitasking or multitarget classification-based QSAR models. J Chem Inf Model 59(6):2538\u20132544","journal-title":"J Chem Inf Model"},{"key":"454_CR40","volume-title":"Neural networks for chemists","author":"J Zupan","year":"1993","unstructured":"Zupan J, Gasteiger J (1993) Neural networks for chemists. Verlag Chemie, Weinheim"},{"key":"454_CR41","volume-title":"Data analysis for chemists","author":"D Livingstone","year":"1995","unstructured":"Livingstone D (1995) Data analysis for chemists. Oxford University Press, Oxford"},{"issue":"4","key":"454_CR42","doi-asserted-by":"crossref","first-page":"409","DOI":"10.2174\/138620709788167926","volume":"12","author":"P Mah\u00e9","year":"2009","unstructured":"Mah\u00e9 P, Vert JP (2009) Virtual screening with support vector machines and structure kernels. Comb Chem High Throughput Screen 12(4):409\u2013423","journal-title":"Comb Chem High Throughput Screen"},{"key":"454_CR43","doi-asserted-by":"crossref","first-page":"387","DOI":"10.1007\/s10710-015-9247-3","volume":"16","author":"S O\u2019Hagan","year":"2015","unstructured":"O\u2019Hagan S, Kell DB (2015) The KNIME workflow environment and its applications in Genetic Programming and machine learning. Genetic Progr Evol Mach 16:387\u2013391","journal-title":"Genetic Progr Evol Mach"},{"issue":"7553","key":"454_CR44","doi-asserted-by":"crossref","first-page":"436","DOI":"10.1038\/nature14539","volume":"521","author":"Y LeCun","year":"2015","unstructured":"LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521(7553):436\u2013444","journal-title":"Nature"},{"key":"454_CR45","doi-asserted-by":"crossref","first-page":"85","DOI":"10.1016\/j.neunet.2014.09.003","volume":"61","author":"J Schmidhuber","year":"2015","unstructured":"Schmidhuber J (2015) Deep learning in neural networks: an overview. Neural Netw 61:85\u2013117","journal-title":"Neural Netw"},{"issue":"1","key":"454_CR46","doi-asserted-by":"crossref","first-page":"3","DOI":"10.1002\/minf.201501008","volume":"35","author":"E Gawehn","year":"2016","unstructured":"Gawehn E, Hiss JA, Schneider G (2016) Deep learning in drug discovery. Mol Inform 35(1):3\u201314","journal-title":"Mol Inform"},{"issue":"141","key":"454_CR47","doi-asserted-by":"crossref","first-page":"20170387","DOI":"10.1098\/rsif.2017.0387","volume":"15","author":"T Ching","year":"2018","unstructured":"Ching T, Himmelstein DS, Beaulieu-Jones BK, Kalinin AA, Do BT, Way GP, Ferrero E, Agapow PM, Zietz M, Hoffman MM et al (2018) Opportunities and obstacles for deep learning in biology and medicine. J R Soc Interface 15(141):20170387","journal-title":"J R Soc Interface"},{"issue":"6","key":"454_CR48","doi-asserted-by":"crossref","first-page":"2545","DOI":"10.1021\/acs.jcim.9b00266","volume":"59","author":"AC Mater","year":"2019","unstructured":"Mater AC, Coote ML (2019) Deep Learning in Chemistry. J Chem Inf Model 59(6):2545\u20132559","journal-title":"J Chem Inf Model"},{"issue":"8","key":"454_CR49","doi-asserted-by":"crossref","first-page":"1757","DOI":"10.1021\/acs.jcim.6b00601","volume":"57","author":"CW Coley","year":"2017","unstructured":"Coley CW, Barzilay R, Green WH, Jaakkola TS, Jensen KF (2017) Convolutional embedding of attributed molecular graphs for physical property prediction. J Chem Inf Model 57(8):1757\u20131772","journal-title":"J Chem Inf Model"},{"issue":"1","key":"454_CR50","doi-asserted-by":"crossref","first-page":"31","DOI":"10.1021\/ci00057a005","volume":"28","author":"D Weininger","year":"1988","unstructured":"Weininger D (1988) SMILES, a chemical language and information system.1. Introduction to methodology and encoding rules. J Chem Inf Comput Sci 28(1):31\u201336","journal-title":"J Chem Inf Comput Sci"},{"key":"454_CR51","unstructured":"Dai H, Tian Y, Dai B, Skiena S, Song L (2018) Syntax-directed variational autoencoder for structured data. arXiv. 1802.08786v08721"},{"key":"454_CR52","unstructured":"Kusner MJ, Paige B, Hern\u00e1ndez-Lobato JM (2017) Grammar Variational Autoencoder. arXiv. 1703.01925v01921"},{"issue":"1\u20132","key":"454_CR53","doi-asserted-by":"crossref","first-page":"1700123","DOI":"10.1002\/minf.201700123","volume":"37","author":"T Blaschke","year":"2018","unstructured":"Blaschke T, Olivecrona M, Engkvist O, Bajorath J, Chen HM (2018) Application of generative autoencoder in de novo molecular design. Mol Inform 37(1\u20132):1700123","journal-title":"Mol Inform"},{"issue":"6","key":"454_CR54","doi-asserted-by":"crossref","first-page":"567","DOI":"10.4155\/fmc-2018-0358","volume":"11","author":"Y Xu","year":"2019","unstructured":"Xu Y, Lin K, Wang S, Wang L, Cai C, Song C, Lai L, Pei J (2019) Deep learning for molecular generation. Future Med Chem 11(6):567\u2013597","journal-title":"Future Med Chem"},{"key":"454_CR55","doi-asserted-by":"crossref","unstructured":"O\u2019Boyle N, Dalke A (2018) DeepSMILES: an adaptation of SMILES for use in machine-learning of chemical structures. ChemRxiv. 7097960.v7097961","DOI":"10.26434\/chemrxiv.7097960.v1"},{"key":"454_CR56","volume-title":"Deep learning","author":"I Goodfellow","year":"2016","unstructured":"Goodfellow I, Bengio Y, Courville A (2016) Deep learning. MIT Press, Boston"},{"issue":"4","key":"454_CR57","doi-asserted-by":"crossref","first-page":"688","DOI":"10.1016\/j.cell.2020.01.021","volume":"180","author":"JM Stokes","year":"2020","unstructured":"Stokes JM, Yang K, Swanson K, Jin W, Cubillos-Ruiz A, Donghia NM, MacNair CR, French S, Carfrae LA, Bloom-Ackerman Z et al (2020) A deep learning approach to antibiotic discovery. Cell 180(4):688\u2013702","journal-title":"Cell"},{"key":"454_CR58","doi-asserted-by":"crossref","first-page":"16","DOI":"10.1186\/s13321-016-0127-5","volume":"8","author":"G Zahor\u00e1nszky-K\u0151halmi","year":"2016","unstructured":"Zahor\u00e1nszky-K\u0151halmi G, Bologa CG, Oprea TI (2016) Impact of similarity threshold on the topology of molecular similarity networks and clustering outcomes. J Cheminform 8:16","journal-title":"J Cheminform"},{"key":"454_CR59","unstructured":"Segler MHS, Kogej T, Tyrchan C, Waller MP (2017) Generating focussed molecule libraries for drug discovery with recurrent neural networks. arXiv. 1701.01329v01321"},{"issue":"1","key":"454_CR60","doi-asserted-by":"crossref","first-page":"22","DOI":"10.1186\/s13321-020-00425-8","volume":"12","author":"R van Deursen","year":"2020","unstructured":"van Deursen R, Ertl P, Tetko IV, Godin G (2020) GEN: highly efficient SMILES explorer using autodidactic generative examination networks. J Cheminform 12(1):22","journal-title":"J Cheminform"},{"issue":"2","key":"454_CR61","doi-asserted-by":"crossref","first-page":"85","DOI":"10.5599\/admet.5.2.376","volume":"5","author":"S O\u2019Hagan","year":"2017","unstructured":"O\u2019Hagan S, Kell DB (2017) Consensus rank orderings of molecular fingerprints illustrate the \u2018most genuine\u2019 similarities between marketed drugs and small endogenous human metabolites, but highlight exogenous natural products as the most important \u2018natural\u2019 drug transporter substrates. ADMET DMPK 5(2):85\u2013125","journal-title":"ADMET DMPK"},{"key":"454_CR62","unstructured":"Kajino H (2018) Molecular hypergraph grammar with its application to molecular optimization. arXiv. 1809.02745v02741"},{"key":"454_CR63","unstructured":"Jin W, Barzilay R, Jaakkola T. Junction tree variational autoencoder for molecular graph generation. arXiv 2018:1802.04364v04362"},{"key":"454_CR64","doi-asserted-by":"crossref","unstructured":"Zang C, Wang F (2020) MoFlow: an invertible flow model for generating molecular graphs. arXiv. 2006.10137","DOI":"10.1145\/3394486.3403104"},{"key":"454_CR65","unstructured":"Tavakoli M, Baldi P (2020) Continuous representation of molecules using graph variational autoencoder. arXiv:2004.08152v08151"},{"key":"454_CR66","unstructured":"Samanta B, De A, Ganguly N, Gomez-Rodriguez M (2018) Designing random graph models using variational autoencoders with applications to chemical design. arXiv.1802.05283"},{"key":"454_CR67","unstructured":"Flam-Shepherd D, Wu T, Aspuru-Guzik A (2020) Graph deconvolutional generation. arXiv. 2002.07087v07081"},{"issue":"8","key":"454_CR68","doi-asserted-by":"crossref","first-page":"595","DOI":"10.1007\/s10822-016-9938-8","volume":"30","author":"S Kearnes","year":"2016","unstructured":"Kearnes S, McCloskey K, Berndl M, Pande V, Riley P (2016) Molecular graph convolutions: moving beyond fingerprints. J Comput Aided Mol Des 30(8):595\u2013608","journal-title":"J Comput Aided Mol Des"},{"key":"454_CR69","unstructured":"Bresson X, Laurent T (2019) A two-step graph convolutional decoder for molecule generation. arXiv. 1906.03412"},{"key":"454_CR70","unstructured":"Kearnes S, Li L, Riley P (2019) Decoding molecular graph embeddings with reinforcement learning. arXiv. 1904.08915"},{"issue":"2","key":"454_CR71","doi-asserted-by":"crossref","first-page":"90","DOI":"10.1038\/nchem.1243","volume":"4","author":"GR Bickerton","year":"2012","unstructured":"Bickerton GR, Paolini GV, Besnard J, Muresan S, Hopkins AL (2012) Quantifying the chemical beauty of drugs. Nat Chem 4(2):90\u201398","journal-title":"Nat Chem"},{"key":"454_CR72","unstructured":"Zhang Z, Cui P, Zhu W (2018) Deep learning on graphs: a survey. arXi: 1812.04202v04201"},{"key":"454_CR73","unstructured":"Barron JT (2017) A general and adaptive robust loss function. arXiv. 1701.03077v03010"},{"key":"454_CR74","doi-asserted-by":"crossref","unstructured":"Yang K, Swanson K, Jin W, Coley C, Eiden P, Gao H, Guzman-Perez A, Hopper T, Kelley B, Mathea M et al (2019) Analyzing learned molecular representations for property prediction. arXiv. 1904.01561v01564","DOI":"10.26434\/chemrxiv.7940594.v3"},{"issue":"8","key":"454_CR75","doi-asserted-by":"crossref","first-page":"3370","DOI":"10.1021\/acs.jcim.9b00237","volume":"59","author":"K Yang","year":"2019","unstructured":"Yang K, Swanson K, Jin W, Coley C, Eiden P, Gao H, Guzman-Perez A, Hopper T, Kelley B, Mathea M et al (2019) Analyzing learned molecular representations for property prediction. J Chem Inf Model 59(8):3370\u20133388","journal-title":"J Chem Inf Model"},{"key":"454_CR76","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1016\/0003-2670(95)00170-5","volume":"313","author":"R Goodacre","year":"1995","unstructured":"Goodacre R, Trew S, Wrigley-Jones C, Saunders G, Neal MJ, Porter N, Kell DB (1995) Rapid and quantitative analysis of metabolites in fermentor broths using pyrolysis mass spectrometry with supervised learning: application to the screening of Penicillium chryosgenum fermentations for the overproduction of penicillins. Anal Chim Acta 313:25\u201343","journal-title":"Anal Chim Acta"},{"key":"454_CR77","doi-asserted-by":"crossref","unstructured":"Jarrett K, Kavukcuoglu K, Ranzato M, Lecun Y (2009) What is the best multi-stage architecture for object recognition? IEEE I Conf Comp Vis; pp. 2146\u20132153","DOI":"10.1109\/ICCV.2009.5459469"},{"key":"454_CR78","doi-asserted-by":"crossref","first-page":"201","DOI":"10.1016\/j.sigpro.2019.01.004","volume":"158","author":"S Ashkezari-Toussi","year":"2019","unstructured":"Ashkezari-Toussi S, Sadoghi-Yazdi H (2019) Robust diffusion LMS over adaptive networks. Signal Process 158:201\u2013209","journal-title":"Signal Process"},{"key":"454_CR79","unstructured":"Guimaraes GL, Sanchez-Lengeling B, Outeiral C, Farias PLC, Aspuru-Guzik A (2017) Objective-Reinforced Generative Adversarial Networks (ORGAN) for sequence generation models. arXiv. 1705.10843"},{"issue":"1","key":"454_CR80","doi-asserted-by":"crossref","first-page":"8","DOI":"10.1186\/1758-2946-1-8","volume":"1","author":"P Ertl","year":"2009","unstructured":"Ertl P, Schuffenhauer A (2009) Estimation of synthetic accessibility score of drug-like molecules based on molecular complexity and fragment contributions. J Cheminform 1(1):8","journal-title":"J Cheminform"},{"key":"454_CR81","unstructured":"Schulman J, Wolski F, Dhariwal P, Radford A, Klimov O (2017) Proximal policy optimization algorithms. arXiv. 1707.06347v06342"},{"key":"454_CR82","unstructured":"Schulman J, Moritz P, Levine S, Jordan M, Abbeel P (2015) High-dimensional continuous control using generalized advantage estimation. arXiv. 1506.02438"},{"key":"454_CR83","first-page":"1","volume":"28","author":"S Levine","year":"2013","unstructured":"Levine S, Koltun V (2013) Guided policy search. Proc ICML 28:1\u20139","journal-title":"Proc ICML"},{"key":"454_CR84","first-page":"249","volume":"9","author":"X Glorot","year":"2010","unstructured":"Glorot X, Bengio Y (2010) Understanding the difficulty of training deep feedforward neural networks. Proc AISTATs 9:249\u2013256","journal-title":"Proc AISTATs"},{"key":"454_CR85","unstructured":"Li Y, Wei C, Ma T (2019) Towards explaining the regularization effect of initial large learning rate in training neural networks. arXiv. 1907.04595v04592"},{"key":"454_CR86","doi-asserted-by":"crossref","first-page":"2324","DOI":"10.1021\/acs.jcim.5b00559","volume":"55","author":"T Sterling","year":"2015","unstructured":"Sterling T, Irwin JJ (2015) ZINC 15\u2014ligand discovery for everyone. J Chem Inf Model 55:2324\u20132337","journal-title":"J Chem Inf Model"},{"issue":"7428","key":"454_CR87","doi-asserted-by":"crossref","first-page":"215","DOI":"10.1038\/nature11691","volume":"492","author":"J Besnard","year":"2012","unstructured":"Besnard J, Ruda GF, Setola V, Abecassis K, Rodriguiz RM, Huang XP, Norval S, Sassano MF, Shin AI, Webster LA et al (2012) Automated design of ligands to polypharmacological profiles. Nature 492(7428):215\u2013220","journal-title":"Nature"},{"issue":"23","key":"454_CR88","doi-asserted-by":"crossref","first-page":"6802","DOI":"10.1021\/jm060902w","volume":"49","author":"JH Nettles","year":"2006","unstructured":"Nettles JH, Jenkins JL, Bender A, Deng Z, Davies JW, Glick M (2006) Bridging chemical and biological space: \u201ctarget fishing\u201d using 2D and 3D molecular descriptors. J Med Chem 49(23):6802\u20136810","journal-title":"J Med Chem"},{"issue":"5","key":"454_CR89","doi-asserted-by":"crossref","first-page":"1103","DOI":"10.1021\/ci300030u","volume":"52","author":"G Hu","year":"2012","unstructured":"Hu G, Kuang G, Xiao W, Li W, Liu G, Tang Y (2012) Performance evaluation of 2D fingerprint and 3D shape similarity methods in virtual screening. J Chem Inf Model 52(5):1103\u20131113","journal-title":"J Chem Inf Model"},{"issue":"6","key":"454_CR90","doi-asserted-by":"crossref","first-page":"811","DOI":"10.1590\/S0103-50532002000600013","volume":"13","author":"TI Oprea","year":"2002","unstructured":"Oprea TI (2002) On the information content of 2D and 3D descriptors for QSAR. J Brazil Chem Soc 13(6):811\u2013815","journal-title":"J Brazil Chem Soc"},{"issue":"1","key":"454_CR91","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1021\/ci960373c","volume":"37","author":"RD Brown","year":"1997","unstructured":"Brown RD, Martin YC (1997) The information content of 2D and 3D structural descriptors relevant to ligand-receptor binding. J Chem Inf Comp Sci 37(1):1\u20139","journal-title":"J Chem Inf Comp Sci"},{"issue":"7","key":"454_CR92","doi-asserted-by":"crossref","first-page":"1337","DOI":"10.1021\/ci800038f","volume":"48","author":"HX Hong","year":"2008","unstructured":"Hong HX, Xie Q, Ge WG, Qian F, Fang H, Shi LM, Su ZQ, Perkins R, Tong WD (2008) Mold2, molecular descriptors from 2D structures for chemoinformatics and toxicoinformatics. J Chem Inf Model 48(7):1337\u20131344","journal-title":"J Chem Inf Model"},{"issue":"5","key":"454_CR93","doi-asserted-by":"crossref","first-page":"355","DOI":"10.1038\/nrd3701","volume":"11","author":"MM Hann","year":"2012","unstructured":"Hann MM, Keser\u00fc GM (2012) Finding the sweet spot: the role of nature and nurture in medicinal chemistry. Nat Rev Drug Discov 11(5):355\u2013365","journal-title":"Nat Rev Drug Discov"},{"key":"454_CR94","doi-asserted-by":"crossref","first-page":"2952","DOI":"10.1021\/jm801513z","volume":"52","author":"WR Pitt","year":"2009","unstructured":"Pitt WR, Parry DM, Perry BG, Groom CR (2009) Heteroaromatic rings of the future. J Med Chem 52:2952\u20132963","journal-title":"J Med Chem"},{"issue":"10","key":"454_CR95","doi-asserted-by":"crossref","first-page":"3451","DOI":"10.1021\/jm200187y","volume":"54","author":"SD Roughley","year":"2011","unstructured":"Roughley SD, Jordan AM (2011) The medicinal chemist\u2019s toolbox: an analysis of reactions used in the pursuit of drug candidates. J Med Chem 54(10):3451\u20133479","journal-title":"J Med Chem"},{"key":"454_CR96","unstructured":"Scalia G, Grambow CA, Pernici B, Li Y-P, Green WH (2019) Evaluating scalable uncertainty estimation methods for DNN-based molecular property prediction. arXiv. 1910.03127"},{"issue":"3","key":"454_CR97","doi-asserted-by":"crossref","first-page":"1096","DOI":"10.1021\/acs.jcim.8b00839","volume":"59","author":"N Brown","year":"2019","unstructured":"Brown N, Fiscato M, Segler MHS, Vaucher AC (2019) GuacaMol: benchmarking models for de novo molecular design. J Chem Inf Model 59(3):1096\u20131108","journal-title":"J Chem Inf Model"}],"container-title":["Journal of Cheminformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-020-00454-3.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s13321-020-00454-3\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-020-00454-3.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,3]],"date-time":"2021-09-03T20:24:00Z","timestamp":1630700640000},"score":1,"resource":{"primary":{"URL":"https:\/\/jcheminf.biomedcentral.com\/articles\/10.1186\/s13321-020-00454-3"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,9,4]]},"references-count":97,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2020,12]]}},"alternative-id":["454"],"URL":"https:\/\/doi.org\/10.1186\/s13321-020-00454-3","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2020.05.25.114165","asserted-by":"object"},{"id-type":"doi","id":"10.21203\/rs.3.rs-32446\/v2","asserted-by":"object"},{"id-type":"doi","id":"10.21203\/rs.3.rs-32446\/v1","asserted-by":"object"}]},"ISSN":["1758-2946"],"issn-type":[{"value":"1758-2946","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,9,4]]},"assertion":[{"value":"29 May 2020","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"18 August 2020","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"4 September 2020","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"The authors have no conflicts of interest to report.","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"53"}}