{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,7,3]],"date-time":"2026-07-03T18:26:28Z","timestamp":1783103188928,"version":"3.54.6"},"reference-count":83,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2024,2,21]],"date-time":"2024-02-21T00:00:00Z","timestamp":1708473600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2024,2,21]],"date-time":"2024-02-21T00:00:00Z","timestamp":1708473600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Cheminform"],"abstract":"<jats:title>Abstract<\/jats:title><jats:p>REINVENT 4 is a modern open-source generative AI framework for the design of small molecules. The software utilizes recurrent neural networks and transformer architectures to drive molecule generation. These generators are seamlessly embedded within the general machine learning optimization algorithms, transfer learning, reinforcement learning and curriculum learning. REINVENT 4 enables and facilitates de novo design, R-group replacement, library design, linker design, scaffold hopping and molecule optimization. This contribution gives an overview of the software and describes its design. Algorithms and their applications are discussed in detail. REINVENT 4 is a command line tool which reads a user configuration in either TOML or JSON format. The aim of this release is to provide reference implementations for some of the most common algorithms in AI based molecule generation. An additional goal with the release is to create a framework for education and future innovation in AI based molecular design. The software is available from<jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" ext-link-type=\"uri\" xlink:href=\"https:\/\/github.com\/MolecularAI\/REINVENT4\">https:\/\/github.com\/MolecularAI\/REINVENT4<\/jats:ext-link>and released under the permissive Apache 2.0 license.<jats:bold>Scientific contribution<\/jats:bold>. The software provides an open\u2013source reference implementation for generative molecular design where the software is also being used in production to support in\u2013house drug discovery projects. The publication of the most common machine learning algorithms in one code and full documentation thereof will increase transparency of AI and foster innovation, collaboration and education.<\/jats:p>","DOI":"10.1186\/s13321-024-00812-5","type":"journal-article","created":{"date-parts":[[2024,2,21]],"date-time":"2024-02-21T13:02:14Z","timestamp":1708520534000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":254,"title":["Reinvent 4: Modern AI\u2013driven generative molecule design"],"prefix":"10.1186","volume":"16","author":[{"given":"Hannes H.","family":"Loeffler","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Jiazhen","family":"He","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Alessandro","family":"Tibo","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Jon Paul","family":"Janet","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Alexey","family":"Voronov","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Lewis H.","family":"Mervin","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Ola","family":"Engkvist","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"297","published-online":{"date-parts":[[2024,2,21]]},"reference":[{"issue":"1","key":"812_CR1","doi-asserted-by":"publisher","first-page":"90","DOI":"10.1016\/j.ejphar.2009.06.065","volume":"625","author":"S Mandal","year":"2009","unstructured":"Mandal S, Moudgil M, Mandal SK (2009) Rational drug design. Eur J Pharmacol 625(1):90\u2013100","journal-title":"Eur J Pharmacol"},{"key":"812_CR2","doi-asserted-by":"publisher","DOI":"10.3390\/ijms22041676","author":"VD Mouchlis","year":"2021","unstructured":"Mouchlis VD, Afantitis A, Serra A, Fratello M, Papadiamantis AG, Aidinis V, Lynch I, Greco D, Melagraki G (2021) Advances in de novo drug design: from conventional to machine learning methods. Int J Mol Sci. https:\/\/doi.org\/10.3390\/ijms22041676","journal-title":"Int J Mol Sci"},{"issue":"11","key":"812_CR3","doi-asserted-by":"publisher","first-page":"2707","DOI":"10.1016\/j.drudis.2021.05.019","volume":"26","author":"J Meyers","year":"2021","unstructured":"Meyers J, Fabian B, Brown N (2021) De novo molecular design and generative models. Drug Discovery Today 26(11):2707\u20132715. https:\/\/doi.org\/10.1016\/j.drudis.2021.05.019","journal-title":"Drug Discovery Today"},{"issue":"1\u20132","key":"812_CR4","doi-asserted-by":"publisher","first-page":"1700153","DOI":"10.1002\/minf.201700153","volume":"37","author":"D Merk","year":"2018","unstructured":"Merk D, Friedrich L, Grisoni F, Schneider G (2018) De novo design of bioactive small molecules by artificial intelligence. Mol Inform 37(1\u20132):1700153. https:\/\/doi.org\/10.1002\/minf.201700153","journal-title":"Mol Inform"},{"issue":"9","key":"812_CR5","doi-asserted-by":"publisher","first-page":"1038","DOI":"10.1038\/s41587-019-0224-x","volume":"37","author":"A Zhavoronkov","year":"2019","unstructured":"...Zhavoronkov A, Ivanenkov YA, Aliper A, Veselov MS, Aladinskiy VA, Aladinskaya AV, Terentiev VA, Polykovskiy DA, Kuznetsov MD, Asadulaev A, Volkov Y, Zholus A, Shayakhmetov RR, Zhebrak A, Minaeva LI, Zagribelnyy BA, Lee LH, Soll R, Madge D, Xing L, Guo T, Aspuru-Guzik A (2019) Deep learning enables rapid identification of potent ddr1 kinase inhibitors. Nature Biotechnol 37(9):1038\u20131040. https:\/\/doi.org\/10.1038\/s41587-019-0224-x","journal-title":"Nature Biotechnol"},{"issue":"10","key":"812_CR6","doi-asserted-by":"publisher","first-page":"692","DOI":"10.1002\/jcc.26826","volume":"43","author":"Q Perron","year":"2022","unstructured":"Perron Q, Mirguet O, Tajmouati H, Skiredj A, Rojas A, Gohier A, Ducrot P, Bourguignon MP, Sansilvestri-Morel P, DoHuu N, Gellibert F, Gaston-Math\u00e9 Y (2022) Deep generative models for ligand-based de novo design applied to multi-parametric optimization. J Comput Chem 43(10):692\u2013703","journal-title":"J Comput Chem"},{"issue":"1","key":"812_CR7","doi-asserted-by":"publisher","first-page":"129","DOI":"10.1038\/s42004-022-00733-0","volume":"5","author":"M Korshunova","year":"2022","unstructured":"Korshunova M, Huang N, Capuzzi S, Radchenko DS, Savych O, Moroz YS, Wells CI, Willson TM, Tropsha A, Isayev O (2022) Generative and reinforcement learning approaches for the automated de novo design of bioactive compounds. Commun Chem 5(1):129. https:\/\/doi.org\/10.1038\/s42004-022-00733-0","journal-title":"Commun Chem"},{"issue":"16","key":"812_CR8","doi-asserted-by":"publisher","first-page":"8736","DOI":"10.1021\/jacs.2c13467","volume":"145","author":"DM Anstine","year":"2023","unstructured":"Anstine DM, Isayev O (2023) Generative models as an emerging paradigm in the chemical sciences. J Am Chem Soc 145(16):8736\u20138750. https:\/\/doi.org\/10.1021\/jacs.2c13467","journal-title":"J Am Chem Soc"},{"key":"812_CR9","doi-asserted-by":"publisher","DOI":"10.1016\/j.ailsci.2023.100064\u2019www.sciencedirect.com\/science\/article\/pii\/S2667318523000089\u2019","author":"M Vogt","year":"2023","unstructured":"Vogt M (2023) Exploring chemical space - generative models and their evaluation. Artif Intell Life Sci. https:\/\/doi.org\/10.1016\/j.ailsci.2023.100064\u2019www.sciencedirect.com\/science\/article\/pii\/S2667318523000089\u2019","journal-title":"Artif Intell Life Sci"},{"issue":"4","key":"812_CR10","doi-asserted-by":"publisher","first-page":"798","DOI":"10.1016\/j.jmat.2023.05.001","volume":"9","author":"Y Liu","year":"2023","unstructured":"Liu Y, Yang Z, Yu Z, Liu Z, Liu D, Lin H, Li M, Ma S, Avdeev M, Shi S (2023) Generative artificial intelligence and its applications in materials science: Current situation and future perspectives. J Materiom 9(4):798\u2013816","journal-title":"J Materiom"},{"key":"812_CR11","doi-asserted-by":"publisher","first-page":"215","DOI":"10.1016\/j.tifs.2023.06.005","volume":"138","author":"M Al-Sarayreh","year":"2023","unstructured":"Al-Sarayreh M, Gomes Reis M, Carr A, dos Reis MM (2023) Inverse design and ai\/deep generative networks in food design: a comprehensive review. Trends Food Sci Technol 138:215\u2013228. https:\/\/doi.org\/10.1016\/j.tifs.2023.06.005","journal-title":"Trends Food Sci Technol"},{"key":"812_CR12","doi-asserted-by":"publisher","DOI":"10.1146\/annurev-pharmtox-040323-040828","author":"C Hasselgren","year":"2024","unstructured":"Hasselgren C, Oprea TI (2024) Artificial intelligence for drug discovery: Are we there yet? Ann Rev Pharmacol Toxicol. https:\/\/doi.org\/10.1146\/annurev-pharmtox-040323-040828.","journal-title":"Ann Rev Pharmacol Toxicol"},{"issue":"12","key":"812_CR13","doi-asserted-by":"publisher","DOI":"10.1016\/j.xcrm.2022.100794","volume":"3","author":"X Zeng","year":"2022","unstructured":"Zeng X, Wang F, Luo Y, gu Kang S, Tang J, Lightstone FC, Fang EF, Cornell W, Nussinov R, Cheng F, (2022) Deep generative molecular design reshapes drug discovery. Cell Rep Med 3(12):100794.","journal-title":"Cell Rep Med"},{"issue":"2","key":"812_CR14","doi-asserted-by":"publisher","first-page":"268","DOI":"10.1021\/acscentsci.7b00572","volume":"4","author":"R G\u00f3mez-Bombarelli","year":"2018","unstructured":"G\u00f3mez-Bombarelli R, Wei JN, Duvenaud D, Hern\u00e1ndez-Lobato JM, S\u00e1nchez-Lengeling B, Sheberla D, Aguilera-Iparraguirre J, Hirzel TD, Adams RP, Aspuru-Guzik A (2018) Automatic chemical design using a data-driven continuous representation of molecules. ACS Central Sci 4(2):268\u2013276. https:\/\/doi.org\/10.1021\/acscentsci.7b00572","journal-title":"ACS Central Sci"},{"key":"812_CR15","doi-asserted-by":"publisher","unstructured":"Kai L, Wei Z, Ming G (2023) in Natural Language Processing, Information Retrieval and AI (Academy and Industry Research Collaboration Center (AIRCC)). https:\/\/doi.org\/10.5121\/csit.2023.130303","DOI":"10.5121\/csit.2023.130303"},{"key":"812_CR16","doi-asserted-by":"publisher","DOI":"10.1016\/j.commatsci.2022.111612","author":"R Jabbar","year":"2022","unstructured":"Jabbar R, Jabbar R, Kamoun S (2022) Recent progress in generative adversarial networks applied to inversely designing inorganic materials: a brief review. Computat Mater Sci. https:\/\/doi.org\/10.1016\/j.commatsci.2022.111612","journal-title":"Computat Mater Sci"},{"issue":"1\u20132","key":"812_CR17","doi-asserted-by":"publisher","first-page":"1700111","DOI":"10.1002\/minf.201700111","volume":"37","author":"A Gupta","year":"2018","unstructured":"Gupta A, M\u00fcller AT, Huisman BJH, Fuchs JA, Schneider P, Schneider G (2018) Generative recurrent networks for de novo drug design. Mol Inform 37(1\u20132):1700111","journal-title":"Mol Inform"},{"issue":"1","key":"812_CR18","doi-asserted-by":"publisher","first-page":"24","DOI":"10.1186\/s13321-023-00694-z","volume":"15","author":"X Liu","year":"2023","unstructured":"Liu X, Ye K, van Vlijmen HWT, IJzerman AP, van Westen GJP, (2023) Drugex v3: scaffold-constrained drug design with graph transformer-based reinforcement learning. J Cheminform 15(1):24. https:\/\/doi.org\/10.1186\/s13321-023-00694-z","journal-title":"J Cheminform"},{"issue":"1","key":"812_CR19","doi-asserted-by":"publisher","first-page":"48","DOI":"10.1186\/s13321-017-0235-x","volume":"9","author":"M Olivecrona","year":"2017","unstructured":"Olivecrona M, Blaschke T, Engkvist O, Chen H (2017) Molecular de-novo design through deep reinforcement learning. J Cheminform 9(1):48. https:\/\/doi.org\/10.1186\/s13321-017-0235-x","journal-title":"J Cheminform"},{"issue":"12","key":"812_CR20","doi-asserted-by":"publisher","first-page":"5918","DOI":"10.1021\/acs.jcim.0c00915","volume":"60","author":"T Blaschke","year":"2020","unstructured":"Blaschke T, Ar\u00fas-Pous J, Chen H, Margreitter C, Tyrchan C, Engkvist O, Papadopoulos K, Patronov A (2020) Reinvent 2.0: an ai tool for de novo drug design. J Chem Inform Model 60(12):5918\u20135922. https:\/\/doi.org\/10.1021\/acs.jcim.0c00915","journal-title":"J Chem Inform Model"},{"issue":"12","key":"812_CR21","doi-asserted-by":"publisher","first-page":"1256","DOI":"10.1038\/s42256-022-00580-7","volume":"4","author":"J Ross","year":"2022","unstructured":"Ross J, Belgodere B, Chenthamarakshan V, Padhi I, Mroueh Y, Das P (2022) Large-scale chemical language representations capture molecular structure and properties. Nature Machine Intell 4(12):1256\u20131264. https:\/\/doi.org\/10.1038\/s42256-022-00580-7","journal-title":"Nature Machine Intell"},{"issue":"1","key":"812_CR22","doi-asserted-by":"publisher","first-page":"18","DOI":"10.1186\/s13321-022-00599-3","volume":"14","author":"J He","year":"2022","unstructured":"He J, Nittinger E, Tyrchan C, Czechtizky W, Patronov A, Bjerrum EJ, Engkvist O (2022) Transformer-based molecular optimization beyond matched molecular pairs. J Cheminform 14(1):18. https:\/\/doi.org\/10.1186\/s13321-022-00599-3","journal-title":"J Cheminform"},{"key":"812_CR23","unstructured":"Shi C, Xu M, Zhu Z, Zhang W, Zhang M, Tang J (2020) Graphaf: a flow-based autoregressive model for molecular graph generation. CoRR abs\/2001.09382. arXiv:2001.09382"},{"key":"812_CR24","doi-asserted-by":"publisher","unstructured":"Zang C, Wang F (2020) in Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (ACM, 2020). https:\/\/doi.org\/10.1145\/3394486.3403104.https:\/\/doi.org\/10.1145","DOI":"10.1145\/3394486.3403104."},{"key":"812_CR25","unstructured":"Yang L, Zhang Z, Song Y, Hong S, Xu R, Zhao Y, Shao Y, Zhang W, Cui B, Yang MH (2022) Diffusion models: a comprehensive survey of methods and applications. arXiv preprint arXiv:2209.00796 (2022). arXiv:2209.00796 [cs.LG]"},{"key":"812_CR26","unstructured":"Hoogeboom E, Satorras VG, Vignac C, Welling M (2022) Equivariant diffusion for molecule generation in 3d"},{"key":"812_CR27","doi-asserted-by":"crossref","unstructured":"Xu M, Powers A, Dror R, Ermon S, Leskovec J (2023) Geometric latent diffusion models for 3d molecule generation","DOI":"10.1609\/aaai.v38i1.27787"},{"issue":"23","key":"812_CR28","doi-asserted-by":"publisher","DOI":"10.1063\/5.0155890","volume":"122","author":"RK Luu","year":"2023","unstructured":"Luu RK, Wysokowski M, Buehler MJ (2023) Generative discovery of de novo chemical designs using diffusion modeling and transformer deep neural networks with application to deep eutectic solvents. Appl Phys Lett 122(23):234103","journal-title":"Appl Phys Lett"},{"key":"812_CR29","unstructured":"Bradshaw J, Paige B, Kusner MJ, Segler MHS, Hern\u00e1ndez-Lobato JM (2020) in Proceedings of the 34th International Conference on Neural Information Processing Systems (Curran Associates Inc., Red Hook, NY, USA, 2020). NIPS\u201920"},{"key":"812_CR30","doi-asserted-by":"publisher","DOI":"10.1002\/wcms.1608","author":"CL Bilodeau","year":"2022","unstructured":"Bilodeau CL, Jin W, Jaakkola TS, Barzilay R, Jensen KF (2022) Generative models for molecular discovery: recent advances and challenges. WIREs Computat Mol Sci. https:\/\/doi.org\/10.1002\/wcms.1608","journal-title":"WIREs Computat Mol Sci"},{"key":"812_CR31","doi-asserted-by":"publisher","DOI":"10.3389\/fphar.2020.565644","author":"D Polykovskiy","year":"2020","unstructured":"Polykovskiy D, Zhebrak A, Sanchez-Lengeling B, Golovanov S, Tatanov O, Belyaev S, Kurbanov R, Artamonov A, Aladinskiy V, Veselov M, Kadurin A, Johansson S, Chen H, Nikolenko S, Aspuru-Guzik A, Zhavoronkov A (2020) Molecular sets (moses): a benchmarking platform for molecular generation models. Front Pharmacol. https:\/\/doi.org\/10.3389\/fphar.2020.565644","journal-title":"Front Pharmacol"},{"issue":"3","key":"812_CR32","doi-asserted-by":"publisher","first-page":"1096","DOI":"10.1021\/acs.jcim.8b00839","volume":"59","author":"N Brown","year":"2019","unstructured":"Brown N, Fiscato M, Segler MH, Vaucher AC (2019) Guacamol: Benchmarking models for de novo molecular design. J Chem Inform Model 59(3):1096\u20131108. https:\/\/doi.org\/10.1021\/acs.jcim.8b00839","journal-title":"J Chem Inform Model"},{"key":"812_CR33","doi-asserted-by":"publisher","DOI":"10.1016\/j.sbi.2023.102658","author":"M Stanley","year":"2023","unstructured":"Stanley M, Segler M (2023) Fake it until you make it? generative de novo design and virtual screening of synthesizable molecules. Current Opin Struct Biol. https:\/\/doi.org\/10.1016\/j.sbi.2023.102658","journal-title":"Current Opin Struct Biol"},{"issue":"6476","key":"812_CR34","doi-asserted-by":"publisher","first-page":"397","DOI":"10.1126\/science.aay3060","volume":"367","author":"JB Zimmerman","year":"2020","unstructured":"Zimmerman JB, Anastas PT, Erythropel HC, Leitner W (2020) Designing for a green chemistry future. Science 367(6476):397\u2013400. https:\/\/doi.org\/10.1126\/science.aay3060","journal-title":"Science"},{"key":"812_CR35","doi-asserted-by":"publisher","DOI":"10.1021\/acs.jmedchem.3c01083","author":"M Beckers","year":"2023","unstructured":"Beckers M, Sturm N, Sirockin F, Fechner N, Stiefl N (2023) Prediction of small-molecule developability using large-scale in silico admet models. J Med Chem. https:\/\/doi.org\/10.1021\/acs.jmedchem.3c01083","journal-title":"J Med Chem"},{"key":"812_CR36","doi-asserted-by":"publisher","DOI":"10.1016\/j.drudis.2023.103758","author":"CS Tautermann","year":"2023","unstructured":"Tautermann CS, Borghardt JM, Pfau R, Zentgraf M, Weskamp N, Sauer A (2023) Towards holistic compound quality scores: extending ligand efficiency indices with compound pharmacokinetic characteristics. Drug Discovery Today. https:\/\/doi.org\/10.1016\/j.drudis.2023.103758","journal-title":"Drug Discovery Today"},{"issue":"5","key":"812_CR37","doi-asserted-by":"publisher","first-page":"353","DOI":"10.1038\/s41573-019-0050-3","volume":"19","author":"P Schneider","year":"2020","unstructured":"Schneider P, Walters WP, Plowright AT, Sieroka N, Listgarten J, Goodnow RA, Fisher J, Jansen JM, Duca JS, Rush TS, Zentgraf M, Hill JE, Krutoholow E, Kohler M, Blaney J, Funatsu K, Luebkemann C, Schneider G (2020) Rethinking drug design in the artificial intelligence era. Nature Rev Drug Discov 19(5):353\u2013364. https:\/\/doi.org\/10.1038\/s41573-019-0050-3","journal-title":"Nature Rev Drug Discov"},{"issue":"11","key":"812_CR38","doi-asserted-by":"publisher","first-page":"1311","DOI":"10.1038\/s41557-022-01016-w","volume":"14","author":"JS Manzano","year":"2022","unstructured":"Manzano JS, Hou W, Zalesskiy SS, Frei P, Wang H, Kitson PJ, Cronin L (2022) An autonomous portable platform for universal chemical synthesis. Nature Chem 14(11):1311\u20131318. https:\/\/doi.org\/10.1038\/s41557-022-01016-w","journal-title":"Nature Chem"},{"issue":"10","key":"812_CR39","doi-asserted-by":"publisher","first-page":"7073","DOI":"10.1021\/acs.jmedchem.2c00334","volume":"65","author":"B Goldman","year":"2022","unstructured":"Goldman B, Kearnes S, Kramer T, Riley P, Walters WP (2022) Defining levels of automated chemical design. J Med Chem 65(10):7073\u20137087. https:\/\/doi.org\/10.1021\/acs.jmedchem.2c00334","journal-title":"J Med Chem"},{"key":"812_CR40","volume-title":"Reinforcement learning: an introduction (2018)","author":"RS Sutton","year":"2018","unstructured":"Sutton RS, Barto AG (2018) Reinforcement learning: an introduction (2018). MIT press, Cambridge"},{"issue":"7","key":"812_CR41","doi-asserted-by":"publisher","first-page":"7885","DOI":"10.1126\/sciadv.aap7885","volume":"4","author":"M Popova","year":"2018","unstructured":"Popova M, Isayev O, Tropsha A (2018) Deep reinforcement learning for de novo drug design. Sci Adv 4(7):7885. https:\/\/doi.org\/10.1126\/sciadv.aap7885","journal-title":"Sci Adv"},{"issue":"3","key":"812_CR42","doi-asserted-by":"publisher","first-page":"695","DOI":"10.1021\/acs.jcim.2c01191","volume":"63","author":"YA Ivanenkov","year":"2023","unstructured":"Ivanenkov YA, Polykovskiy D, Bezrukov D, Zagribelnyy B, Aladinskiy V, Kamya P, Aliper A, Ren F, Zhavoronkov A (2023) Chemistry42: an ai-driven platform for molecular design and optimization. J Chem Inform Model 63(3):695\u2013701. https:\/\/doi.org\/10.1021\/acs.jcim.2c01191","journal-title":"J Chem Inform Model"},{"key":"812_CR43","unstructured":"Gao W, Fu T, Sun J, Coley CW (2022) Sample efficiency matters: a benchmark for practical molecular optimization"},{"issue":"11","key":"812_CR44","doi-asserted-by":"publisher","first-page":"3238","DOI":"10.1021\/acs.jcim.2c01355","volume":"63","author":"T Cieplinski","year":"2023","unstructured":"Cieplinski T, Danel T, Podlewska S, Jastrzebski S (2023) Generative models should at least be able to design molecules that dock well: a new benchmark. J Chem Inform Model 63(11):3238\u20133247. https:\/\/doi.org\/10.1021\/acs.jcim.2c01355","journal-title":"J Chem Inform Model"},{"issue":"4","key":"812_CR45","doi-asserted-by":"publisher","first-page":"386","DOI":"10.1038\/s42256-023-00636-2","volume":"5","author":"M Mokaya","year":"2023","unstructured":"Mokaya M, Imrie F, van Hoorn WP, Kalisz A, Bradley AR, Deane CM (2023) Testing the limits of smiles-based de novo molecular generation with curriculum and deep reinforcement learning. Nature Machine Intell 5(4):386\u2013394. https:\/\/doi.org\/10.1038\/s42256-023-00636-2","journal-title":"Nature Machine Intell"},{"issue":"9","key":"812_CR46","doi-asserted-by":"publisher","first-page":"2046","DOI":"10.1021\/acs.jcim.1c00469","volume":"62","author":"V Fialkov\u00e1","year":"2022","unstructured":"Fialkov\u00e1 V, Zhao J, Papadopoulos K, Engkvist O, Bjerrum EJ, Kogej T, Patronov A (2022) Libinvent: Reaction-based generative scaffold decoration for in silico library design. J Chem Inform Model 62(9):2046\u20132063. https:\/\/doi.org\/10.1021\/acs.jcim.1c00469","journal-title":"J Chem Inform Model"},{"key":"812_CR47","doi-asserted-by":"publisher","first-page":"392","DOI":"10.1039\/D2DD00115B","volume":"2","author":"J Guo","year":"2023","unstructured":"Guo J, Knuth F, Margreitter C, Janet JP, Papadopoulos K, Engkvist O, Patronov A (2023) Link-invent: generative linker design with reinforcement learning. Digital Discov 2:392\u2013408. https:\/\/doi.org\/10.1039\/D2DD00115B","journal-title":"Digital Discov"},{"issue":"1","key":"812_CR48","doi-asserted-by":"publisher","first-page":"68","DOI":"10.1186\/s13321-020-00473-0","volume":"12","author":"T Blaschke","year":"2020","unstructured":"Blaschke T, Engkvist O, Bajorath J, Chen H (2020) Memory-assisted reinforcement learning for diverse molecular de novo design. J Chem Inform 12(1):68. https:\/\/doi.org\/10.1186\/s13321-020-00473-0","journal-title":"J Chem Inform"},{"issue":"2","key":"812_CR49","doi-asserted-by":"publisher","first-page":"270","DOI":"10.1162\/neco.1989.1.2.270","volume":"1","author":"RJ Williams","year":"1989","unstructured":"Williams RJ, Zipser D (1989) A learning algorithm for continually running fully recurrent neural networks. Neural Computation 1(2):270\u2013280","journal-title":"Neural Computation"},{"issue":"1","key":"812_CR50","doi-asserted-by":"publisher","first-page":"20","DOI":"10.1186\/s13321-019-0341-z","volume":"11","author":"J Ar\u00fas-Pous","year":"2019","unstructured":"Ar\u00fas-Pous J, Blaschke T, Ulander S, Reymond JL, Chen H, Engkvist O (2019) Exploring the gdb-13 chemical space using deep generative models. J Cheminform 11(1):20. https:\/\/doi.org\/10.1186\/s13321-019-0341-z","journal-title":"J Cheminform"},{"issue":"1","key":"812_CR51","doi-asserted-by":"publisher","first-page":"71","DOI":"10.1186\/s13321-019-0393-0","volume":"11","author":"J Ar\u00fas-Pous","year":"2019","unstructured":"Ar\u00fas-Pous J, Johansson SV, Prykhodko O, Bjerrum EJ, Tyrchan C, Reymond JL, Chen H, Engkvist O (2019) Randomized smiles strings improve the quality of molecular generative models. J Cheminform 11(1):71. https:\/\/doi.org\/10.1186\/s13321-019-0393-0","journal-title":"J Cheminform"},{"issue":"1","key":"812_CR52","doi-asserted-by":"publisher","first-page":"26","DOI":"10.1186\/s13321-021-00497-0","volume":"13","author":"J He","year":"2021","unstructured":"He J, You H, Sandstr\u00f6m E, Nittinger E, Bjerrum EJ, Tyrchan C, Czechtizky W, Engkvist O (2021) Molecular optimization by capturing chemist\u2019s intuition using deep neural networks. J Cheminform 13(1):26. https:\/\/doi.org\/10.1186\/s13321-021-00497-0","journal-title":"J Cheminform"},{"issue":"2","key":"812_CR53","doi-asserted-by":"publisher","first-page":"260","DOI":"10.1109\/TIT.1967.1054010","volume":"13","author":"A Viterbi","year":"1967","unstructured":"Viterbi A (1967) Error bounds for convolutional codes and an asymptotically optimum decoding algorithm. IEEE Trans Inform Theor 13(2):260\u2013269","journal-title":"IEEE Trans Inform Theor"},{"key":"812_CR54","doi-asserted-by":"publisher","DOI":"10.26434\/chemrxiv-2023-v25xb","author":"A Tibo","year":"2023","unstructured":"Tibo A, He J, Janet JP, Nittinger E, Engkvist O (2023) Exhaustive local chemical space exploration using a transformer model. ChemRxiv. https:\/\/doi.org\/10.26434\/chemrxiv-2023-v25xb","journal-title":"ChemRxiv"},{"issue":"D1","key":"812_CR55","doi-asserted-by":"publisher","first-page":"D1373","DOI":"10.1093\/nar\/gkac956","volume":"51","author":"S Kim","year":"2022","unstructured":"Kim S, Chen J, Cheng T, Gindulyte A, He J, He S, Li Q, Shoemaker BA, Thiessen PA, Yu B, Zaslavsky L, Zhang J, Bolton EE (2022) PubChem 2023 update. Nucl Acids Res 51(D1):D1373\u2013D1380. https:\/\/doi.org\/10.1093\/nar\/gkac956","journal-title":"Nucl Acids Res"},{"issue":"3","key":"812_CR56","doi-asserted-by":"publisher","first-page":"769","DOI":"10.1016\/j.drudis.2020.12.003","volume":"26","author":"M Elbadawi","year":"2021","unstructured":"Elbadawi M, Gaisford S, Basit AW (2021) Advanced machine-learning techniques in drug discovery. Drug Discovery Today 26(3):769\u2013777","journal-title":"Drug Discovery Today"},{"key":"812_CR57","doi-asserted-by":"publisher","DOI":"10.26434\/chemrxiv-2023-j88dg","author":"M Dodds","year":"2023","unstructured":"Dodds M, Guo J, L\u00f6hr T, Tibo A, Engkvist O, Janet JP (2023) Sample efficient reinforcement learning with active learning for molecular design. ChemRxiv. https:\/\/doi.org\/10.26434\/chemrxiv-2023-j88dg","journal-title":"ChemRxiv"},{"issue":"6","key":"812_CR58","doi-asserted-by":"publisher","first-page":"555","DOI":"10.1038\/s42256-022-00494-4","volume":"4","author":"J Guo","year":"2022","unstructured":"Guo J, Fialkov\u00e1 V, Arango JD, Margreitter C, Janet JP, Papadopoulos K, Engkvist O, Patronov A (2022) Improving de novo molecular design with curriculum learning. Nature Machine Intell 4(6):555\u2013563. https:\/\/doi.org\/10.1038\/s42256-022-00494-4","journal-title":"Nature Machine Intell"},{"key":"812_CR59","doi-asserted-by":"crossref","unstructured":"Guo J, Schwaller P (2023) Augmented memory: Capitalizing on experience replay to accelerate de novo molecular design","DOI":"10.26434\/chemrxiv-2023-qmqmq-v2"},{"key":"812_CR60","unstructured":"Smith SL, Kindermans PJ, Ying C, Le QV (2018) Don\u2019t decay the learning rate, increase the batch size"},{"key":"812_CR61","unstructured":"Hoffer E, Hubara I, Soudry D (2017) in Advances in Neural Information Processing Systems, vol.\u00a030, ed. by I.\u00a0Guyon, U.V. Luxburg, S.\u00a0Bengio, H.\u00a0Wallach, R.\u00a0Fergus, S.\u00a0Vishwanathan, R.\u00a0Garnett (Curran Associates, Inc.,). https:\/\/proceedings.neurips.cc\/paper_files\/paper\/2017\/file\/a5e0ff62be0b08456fc7f1e88812af3d-Paper.pdf"},{"issue":"9","key":"812_CR62","doi-asserted-by":"publisher","first-page":"759","DOI":"10.1038\/s42256-021-00368-1","volume":"3","author":"MA Skinnider","year":"2021","unstructured":"Skinnider MA, Stacey RG, Wishart DS, Foster LJ (2021) Chemical language models enable navigation in sparsely populated chemical space. Nature Machine Intell 3(9):759\u2013770. https:\/\/doi.org\/10.1038\/s42256-021-00368-1","journal-title":"Nature Machine Intell"},{"issue":"1","key":"812_CR63","doi-asserted-by":"publisher","first-page":"3","DOI":"10.1016\/S0169-409X(00)00129-0","volume":"46","author":"CA Lipinski","year":"2001","unstructured":"Lipinski CA, Lombardo F, Dominy BW, Feeney PJ (2001) Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings. Advanced Drug Delivery Rev 46(1):3\u201326","journal-title":"Advanced Drug Delivery Rev"},{"issue":"2","key":"812_CR64","doi-asserted-by":"publisher","first-page":"90","DOI":"10.1038\/nchem.1243","volume":"4","author":"GR Bickerton","year":"2012","unstructured":"Bickerton GR, Paolini GV, Besnard J, Muresan S, Hopkins AL (2012) Quantifying the chemical beauty of drugs. Nature Chem 4(2):90\u201398. https:\/\/doi.org\/10.1038\/nchem.1243","journal-title":"Nature Chem"},{"issue":"1","key":"812_CR65","doi-asserted-by":"publisher","first-page":"89","DOI":"10.1186\/s13321-021-00563-7","volume":"13","author":"J Guo","year":"2021","unstructured":"Guo J, Janet JP, Bauer MR, Nittinger E, Giblin KA, Papadopoulos K, Voronov A, Patronov A, Engkvist O, Margreitter C (2021) Dockstream: a docking wrapper to enhance de novo molecular design. J Cheminform 13(1):89. https:\/\/doi.org\/10.1186\/s13321-021-00563-7","journal-title":"J Cheminform"},{"issue":"2","key":"812_CR66","doi-asserted-by":"publisher","first-page":"455","DOI":"10.1002\/jcc.21334","volume":"31","author":"O Trott","year":"2010","unstructured":"Trott O, Olson AJ (2010) Autodock vina: Improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading. J Computat Chem 31(2):455\u2013461. https:\/\/doi.org\/10.1002\/jcc.21334","journal-title":"J Computat Chem"},{"issue":"8","key":"812_CR67","doi-asserted-by":"publisher","first-page":"3891","DOI":"10.1021\/acs.jcim.1c00203","volume":"61","author":"J Eberhardt","year":"2021","unstructured":"Eberhardt J, Santos-Martins D, Tillack AF, Forli S (2021) Autodock vina 1.2.0: New docking methods, expanded force field, and python bindings. J Chem Inform Model 61(8):3891\u20133898. https:\/\/doi.org\/10.1021\/acs.jcim.1c00203","journal-title":"J Chem Inform Model"},{"issue":"4","key":"812_CR68","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1371\/journal.pcbi.1003571","volume":"10","author":"S Ruiz-Carmona","year":"2014","unstructured":"Ruiz-Carmona S, Alvarez-Garcia D, Foloppe N, Garmendia-Doval AB, Juhos S, Schmidtke P, Barril X, Hubbard RE, Morley SD (2014) rdock: a fast, versatile and open source program for docking ligands to proteins and nucleic acids. PLOS Computat Biol 10(4):1\u20137. https:\/\/doi.org\/10.1371\/journal.pcbi.1003571","journal-title":"PLOS Computat Biol"},{"issue":"8","key":"812_CR69","doi-asserted-by":"publisher","first-page":"897","DOI":"10.1007\/s10822-012-9584-8","volume":"26","author":"M McGann","year":"2012","unstructured":"McGann M (2012) Fred and hybrid docking performance on standardized datasets. J Computer-Aided Mol Design 26(8):897\u2013906. https:\/\/doi.org\/10.1007\/s10822-012-9584-8","journal-title":"J Computer-Aided Mol Design"},{"issue":"21","key":"812_CR70","doi-asserted-by":"publisher","first-page":"6177","DOI":"10.1021\/jm051256o.10.1021\/jm051256","volume":"49","author":"RA Friesner","year":"2006","unstructured":"Friesner RA, Murphy RB, Repasky MP, Frye LL, Greenwood JR, Halgren TA, Sanschagrin PC, Mainz DT (2006) Extra precision glide: docking and scoring incorporating a model of hydrophobic enclosure for protein-ligand complexes. J Med Chem 49(21):6177\u20136196. https:\/\/doi.org\/10.1021\/jm051256o.10.1021\/jm051256","journal-title":"J Med Chem"},{"issue":"3","key":"812_CR71","doi-asserted-by":"publisher","first-page":"727","DOI":"10.1006\/jmbi.1996.0897","volume":"267","author":"G Jones","year":"1997","unstructured":"Jones G, Willett P, Glen RC, Leach AR, Taylor R (1997) Development and validation of a genetic algorithm for flexible docking11edited by f e cohen. J Mol Biol 267(3):727\u2013748","journal-title":"J Mol Biol"},{"issue":"4","key":"812_CR72","doi-asserted-by":"publisher","first-page":"688","DOI":"10.1016\/j.cell.2020.01.021","volume":"180","author":"JM Stokes","year":"2020","unstructured":"Stokes JM, Yang K, Swanson K, Jin W, Cubillos-Ruiz A, Donghia NM, MacNair CR, French S, Carfrae LA, Bloom-Ackermann Z, Tran VM, Chiappino-Pepe A, Badran AH, Andrews IW, Chory EJ, Church GM, Brown ED, Jaakkola TS, Barzilay R, Collins JJ (2020) A deep learning approach to antibiotic discovery. Cell 180(4):688-702.e13. https:\/\/doi.org\/10.1016\/j.cell.2020.01.021","journal-title":"Cell"},{"issue":"9","key":"812_CR73","doi-asserted-by":"publisher","first-page":"2101","DOI":"10.1021\/acs.jcim.1c00975","volume":"62","author":"E Heid","year":"2022","unstructured":"Heid E, Green WH (2022) Machine learning of reaction properties via learned representations of the condensed graph of reaction. J Chem Inform Model 62(9):2101\u20132110. https:\/\/doi.org\/10.1021\/acs.jcim.1c00975","journal-title":"J Chem Inform Model"},{"issue":"21","key":"812_CR74","doi-asserted-by":"publisher","first-page":"4951","DOI":"10.1093\/bioinformatics\/btac614","volume":"38","author":"JH Moore","year":"2022","unstructured":"Moore JH, Bauer MR, Guo J, Patronov A, Engkvist O, Margreitter C (2022) Icolos: a workflow manager for structure-based post-processing of de novo generated small molecules. Bioinformatics 38(21):4951\u20134952","journal-title":"Bioinformatics"},{"issue":"1","key":"812_CR75","doi-asserted-by":"publisher","first-page":"74","DOI":"10.1021\/jm0603365","volume":"50","author":"PCD Hawkins","year":"2007","unstructured":"Hawkins PCD, Skillman AG, Nicholls A (2007) Comparison of shape-matching and docking as virtual screening tools. J Med Chem 50(1):74\u201382. https:\/\/doi.org\/10.1021\/jm0603365","journal-title":"J Med Chem"},{"issue":"1","key":"812_CR76","doi-asserted-by":"publisher","first-page":"8","DOI":"10.1186\/1758-2946-1-8","volume":"1","author":"P Ertl","year":"2009","unstructured":"Ertl P, Schuffenhauer A (2009) Estimation of synthetic accessibility score of drug-like molecules based on molecular complexity and fragment contributions. J Cheminform 1(1):8. https:\/\/doi.org\/10.1186\/1758-2946-1-8","journal-title":"J Cheminform"},{"issue":"5","key":"812_CR77","doi-asserted-by":"publisher","first-page":"902","DOI":"10.1021\/acs.jcim.8b00173","volume":"58","author":"A Dalke","year":"2018","unstructured":"Dalke A, Hert J, Kramer C (2018) mmpdb: an open-source matched molecular pair platform for large multiproperty data sets. J Chem Inform Model 58(5):902\u2013910. https:\/\/doi.org\/10.1021\/acs.jcim.8b00173","journal-title":"J Chem Inform Model"},{"key":"812_CR78","doi-asserted-by":"publisher","DOI":"10.1016\/j.sbi.2023.102537","author":"S Luukkonen","year":"2023","unstructured":"Luukkonen S, van den Maagdenberg HW, Emmerich MT, van Westen GJ (2023) Artificial intelligence in multi-objective drug design. Current Opin Struct Biol. https:\/\/doi.org\/10.1016\/j.sbi.2023.102537","journal-title":"Current Opin Struct Biol"},{"issue":"2","key":"812_CR79","doi-asserted-by":"publisher","DOI":"10.1016\/j.patter.2023.100678","volume":"4","author":"JC Fromer","year":"2023","unstructured":"Fromer JC, Coley CW (2023) Computer-aided multi-objective optimization in small molecule discovery. Patterns 4(2):100678. https:\/\/doi.org\/10.1016\/j.patter.2023.100678","journal-title":"Patterns"},{"issue":"D1","key":"812_CR80","doi-asserted-by":"publisher","first-page":"D930","DOI":"10.1093\/nar\/gky1075","volume":"47","author":"D Mendez","year":"2019","unstructured":"Mendez, D, Gaulton A, Bento AP, Chambers J, De Veij M, F\u00e9lix E, Magari\u00f1os MP, Mosquera JF, Mutowo P, Nowotka M, Gordillo-Mara\u00f1\u00f3n M, Hunter F, Junco L, Mugumbate G, Rodriguez-Lopez M, Atkinson F, Bosc N, Radoux CJ, Segura-Cabrera A, Hersey A, Leach AR, (2019) Chembl: towards direct deposition of bioassay data. Nucl Acids Res 47(D1):D930\u2013D940. https:\/\/doi.org\/10.1093\/nar\/gky1075","journal-title":"Nucl Acids Res"},{"key":"812_CR81","unstructured":"Abadi M, Agarwal A, Barham P, Brevdo E, Chen Z, Citro C, Corrado GS, Davis A, Dean J, Devin M, Ghemawat S, Goodfellow I, Harp A, Irving G, Isard M, Jia Y, Jozefowicz R, Kaiser L, Kudlur M, Levenberg J, Man\u00e9 D, Monga R, Moore S, Murray D, Olah C, Schuster M, Shlens J, Steiner B, Sutskever I, Talwar K, Tucker P, Vanhoucke V, Vasudevan V, Vi\u00e9gas F, Vinyals O, Warden P, Wattenberg M, Wicke M, Yu Y, Zheng X (2015) TensorFlow: Large-scale machine learning on heterogeneous systems. https:\/\/www.tensorflow.org\/. Software available from tensorflow.org"},{"issue":"14","key":"812_CR82","doi-asserted-by":"publisher","first-page":"4095","DOI":"10.1016\/j.bmcl.2010.05.070","volume":"20","author":"M Angiolini","year":"2010","unstructured":"Angiolini M, Banfi P, Casale E, Casuscelli F, Fiorelli C, Saccardo MB, Silvagni M, Zuccotto F (2010) Structure-based optimization of potent pdk1 inhibitors. Bioorganic Med Chem Lett 20(14):4095\u20134099. https:\/\/doi.org\/10.1016\/j.bmcl.2010.05.070","journal-title":"Bioorganic Med Chem Lett"},{"issue":"15","key":"812_CR83","doi-asserted-by":"publisher","first-page":"2887","DOI":"10.1021\/jm9602928","volume":"39","author":"GW Bemis","year":"1996","unstructured":"Bemis GW, Murcko MA (1996) The properties of known drugs 1 molecular frameworks. J Med Chem 39(15):2887\u20132893. https:\/\/doi.org\/10.1021\/jm9602928","journal-title":"J Med Chem"}],"container-title":["Journal of Cheminformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-024-00812-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s13321-024-00812-5\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-024-00812-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,11,12]],"date-time":"2024-11-12T03:04:39Z","timestamp":1731380679000},"score":1,"resource":{"primary":{"URL":"https:\/\/jcheminf.biomedcentral.com\/articles\/10.1186\/s13321-024-00812-5"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,2,21]]},"references-count":83,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2024,12]]}},"alternative-id":["812"],"URL":"https:\/\/doi.org\/10.1186\/s13321-024-00812-5","relation":{},"ISSN":["1758-2946"],"issn-type":[{"value":"1758-2946","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,2,21]]},"assertion":[{"value":"23 November 2023","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"9 February 2024","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"21 February 2024","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare no competing interests.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"20"}}