{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,27]],"date-time":"2026-02-27T04:28:29Z","timestamp":1772166509939,"version":"3.50.1"},"reference-count":37,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2021,11,27]],"date-time":"2021-11-27T00:00:00Z","timestamp":1637971200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2021,11,27]],"date-time":"2021-11-27T00:00:00Z","timestamp":1637971200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/100009619","name":"japan agency for medical research and development","doi-asserted-by":"publisher","award":["Platform Project for Supporting Drug Discovery and Life Science Research (Basis for Supporting Innovative Drug Discovery and Life Science Research (BINDS)) from AMED under Grant Number JP20am0101112"],"award-info":[{"award-number":["Platform Project for Supporting Drug Discovery and Life Science Research (Basis for Supporting Innovative Drug Discovery and Life Science Research (BINDS)) from AMED under Grant Number JP20am0101112"]}],"id":[{"id":"10.13039\/100009619","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001691","name":"japan society for the promotion of science","doi-asserted-by":"publisher","award":["KAKENHI Grant Numbers 20H00620"],"award-info":[{"award-number":["KAKENHI Grant Numbers 20H00620"]}],"id":[{"id":"10.13039\/501100001691","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Cheminform"],"published-print":{"date-parts":[[2021,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>\n                    The hit-to-lead process makes the physicochemical properties of the hit molecules that show the desired type of activity obtained in the screening assay more drug-like. Deep learning-based molecular generative models are expected to contribute to the hit-to-lead process. The simplified molecular input line entry system (SMILES), which is a string of alphanumeric characters representing the chemical structure of a molecule, is one of the most commonly used representations of molecules, and molecular generative models based on SMILES have achieved significant success. However, in contrast to molecular graphs, during the process of generation, SMILES are not considered as valid SMILES. Further, it is quite difficult to generate molecules starting from a certain molecule, thus making it difficult to apply SMILES to the hit-to-lead process. In this study, we have developed a SMILES-based generative model that can be generated starting from a certain molecule. This method generates partial SMILES and inserts it into the original SMILES using Monte Carlo Tree Search and a Recurrent Neural Network. We validated our method using a molecule dataset obtained from the ZINC database and successfully generated molecules that were both well optimized for the objectives of the quantitative estimate of drug-likeness (QED) and penalized octanol-water partition coefficient (PLogP) optimization. The source code is available at\n                    <jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" ext-link-type=\"uri\" xlink:href=\"https:\/\/github.com\/sekijima-lab\/mermaid\">https:\/\/github.com\/sekijima-lab\/mermaid<\/jats:ext-link>\n                    .\n                  <\/jats:p>","DOI":"10.1186\/s13321-021-00572-6","type":"journal-article","created":{"date-parts":[[2021,11,27]],"date-time":"2021-11-27T05:02:28Z","timestamp":1637989348000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":24,"title":["MERMAID: an open source automated hit-to-lead method based on deep reinforcement learning"],"prefix":"10.1186","volume":"13","author":[{"given":"Daiki","family":"Erikawa","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3114-7895","authenticated-orcid":false,"given":"Nobuaki","family":"Yasuo","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3806-9535","authenticated-orcid":false,"given":"Masakazu","family":"Sekijima","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2021,11,27]]},"reference":[{"key":"572_CR1","unstructured":"PhRMA: Biopharmaceuticals in perspective summer 2019 (2019). https:\/\/www.phrma.org\/-\/media\/Project\/PhRMA\/PhRMA-Org\/PhRMA-Org\/PDF\/P-R\/PhRMA_2019_ChartPack_Final.pdf (visited: 2021-3-22)"},{"issue":"12","key":"572_CR2","first-page":"877","volume":"13","author":"A Mullard","year":"2014","unstructured":"Mullard A (2014) New drugs cost US $2.6 billion to develop. Nat Rev Drug Discov 13(12):877","journal-title":"Nat Rev Drug Discov"},{"key":"572_CR3","doi-asserted-by":"publisher","unstructured":"Varma H, Lo D, Stockwell B (2010) High-throughput and high-content screening for huntington\u2019s disease therapeutics. In: Neurobiology of Huntington\u2019s Disease. CRC Press, Amsterdam; pp. 121\u201314. https:\/\/doi.org\/10.1201\/ebk0849390005-c5","DOI":"10.1201\/ebk0849390005-c5"},{"issue":"4","key":"572_CR4","doi-asserted-by":"publisher","first-page":"273","DOI":"10.1038\/nrd3139","volume":"9","author":"G Schneider","year":"2010","unstructured":"Schneider G (2010) Virtual screening: an endless staircase? Nat Rev Drug Discov 9(4):273\u2013276. https:\/\/doi.org\/10.1038\/nrd3139","journal-title":"Nat Rev Drug Discov"},{"key":"572_CR5","doi-asserted-by":"publisher","first-page":"17209","DOI":"10.1038\/srep17209","volume":"5","author":"S Chiba","year":"2015","unstructured":"Chiba S, Ikeda K, Ishida T, Gromiha MM, Taguchi Y, Iwadate M, Umeyama H, Hsin K-Y, Kitano H, Yamamoto K, Sugaya N, Kato K, Okuno T, Chikenji G, Mochizuki M, Yasuo N, Yoshino R, Yanagisawa K, Ban T, Teramoto R, Ramakrishnan C, Thangakani AM, Velmurugan D, Prathipati P, Ito J, Tsuchiya Y, Mizuguchi K, Honma T, Sekijima M (2015) Identification of potential inhibitors based on compound proposal contest: tyrosine-protein kinase Yes as a target. Sci Rep 5:17209","journal-title":"Sci Rep"},{"issue":"1","key":"572_CR6","doi-asserted-by":"publisher","first-page":"12038","DOI":"10.1038\/s41598-017-10275-4","volume":"7","author":"S Chiba","year":"2017","unstructured":"Chiba S, Ishida T, Ikeda K, Mochizuki M, Teramoto R, Taguchi Y, Iwadate M, Umeyama H, Ramakrishnan C, Thangakani AM, Velmurugan D, Gromiha MM, Okuno T, Kato K, Minami S, Chikenji G, Suzuki SD, Yanagisawa K, Shin W-H, Kihara D, Yamamoto KZ, Moriwaki Y, Yasuo N, Yoshino R, Zozulya S, Borysko P, Stavniichuk R, Honma T, Hirokawa T, Akiyama Y, Sekijima M (2017) An iterative compound screening contest method for identifying target protein inhibitors using the tyrosine-protein kinase yes. Sci Rep 7(1):12038","journal-title":"Sci Rep"},{"key":"572_CR7","doi-asserted-by":"publisher","DOI":"10.1038\/s41598-019-55069-y","author":"S Chiba","year":"2019","unstructured":"Chiba S, Ohue M, Gryniukova A, Borysko P, Zozulya S, Yasuo N, Yoshino R, Ikeda K, Shin W-H, Kihara D, Iwadate M, Umeyama H, Ichikawa T, Teramoto R, Hsin K-Y, Gupta V, Kitano H, Sakamoto M, Higuchi A, Miura N, Yura K, Mochizuki M, Ramakrishnan C, Thangakani AM, Velmurugan D, Gromiha MM, Nakane I, Uchida N, Hakariya H, Tan M, Nakamura HK, Suzuki SD, Ito T, Kawatani M, Kudoh K, Takashina S, Yamamoto KZ, Moriwaki Y, Oda K, Kobayashi D, Okuno T, Minami S, Chikenji G, Prathipati P, Nagao C, Mohsen A, Ito M, Mizuguchi K, Honma T, Ishida T, Hirokawa T, Akiyama Y, Sekijima M (2019) A prospective compound screening contest identified broader inhibitors for sirtuin 1. Sci Rep. https:\/\/doi.org\/10.1038\/s41598-019-55069-y","journal-title":"Sci Rep"},{"issue":"5","key":"572_CR8","first-page":"89","volume":"3","author":"V Rao","year":"2011","unstructured":"Rao V, Srinivas K (2011) Modern drug discovery process: an in silico approach. J Bioinform Sequence Anal. 3(5):89\u201394","journal-title":"J Bioinform Sequence Anal"},{"issue":"2\u20133","key":"572_CR9","doi-asserted-by":"publisher","first-page":"115","DOI":"10.1002\/minf.201400132","volume":"34","author":"H Li","year":"2015","unstructured":"Li H, Leung K-S, Wong M-H, Ballester PJ (2015) Improving AutoDock vina using random forest: The growing accuracy of binding affinity prediction by the effective exploitation of larger data sets. Mol Inform 34(2\u20133):115\u2013126. https:\/\/doi.org\/10.1002\/minf.201400132","journal-title":"Mol Inform"},{"issue":"4","key":"572_CR10","doi-asserted-by":"publisher","first-page":"942","DOI":"10.1021\/acs.jcim.6b00740","volume":"57","author":"M Ragoza","year":"2017","unstructured":"Ragoza M, Hochuli J, Idrobo E, Sunseri J, Koes DR (2017) Protein\u2013ligand scoring with convolutional neural networks. J Chem Inform Modeling 57(4):942\u2013957. https:\/\/doi.org\/10.1021\/acs.jcim.6b00740","journal-title":"J Chem Inform Modeling"},{"issue":"3","key":"572_CR11","doi-asserted-by":"publisher","first-page":"1050","DOI":"10.1021\/acs.jcim.8b00673","volume":"59","author":"N Yasuo","year":"2019","unstructured":"Yasuo N, Sekijima M (2019) Improved method of structure-based virtual screening via interaction-energy-based learning. J Chem Inform Modeling 59(3):1050\u20131061. https:\/\/doi.org\/10.1021\/acs.jcim.8b00673","journal-title":"J Chem Inform Modeling"},{"key":"572_CR12","doi-asserted-by":"crossref","unstructured":"Yasuo N, Nakashima Y, Sekijima M (2018) CoDe-DTI: collaborative deep learning-based drug-target interaction predictior. In: 2018 IEEE International Conference on Bioinformatics and Biomedicine (BIBM). IEEE, NewYork, pp. 792\u2013797","DOI":"10.1109\/BIBM.2018.8621368"},{"issue":"4","key":"572_CR13","doi-asserted-by":"publisher","first-page":"828","DOI":"10.1039\/C9ME00039A","volume":"4","author":"DC Elton","year":"2019","unstructured":"Elton DC, Boukouvalas Z, Fuge MD, Chung PW (2019) Deep learning for molecular design-a review of the state of the art. Mol Syst Design Eng 4(4):828\u2013849","journal-title":"Mol Syst Design Eng"},{"key":"572_CR14","doi-asserted-by":"publisher","DOI":"10.1039\/C9ME00039A","author":"D Elton","year":"2019","unstructured":"Elton D, Boukouvalas Z, Fuge M, Chung P (2019) Deep learning for molecular design\u2014a review of the state of the art. Mol Syst Design Eng. https:\/\/doi.org\/10.1039\/C9ME00039A","journal-title":"Mol Syst Design Eng"},{"issue":"6400","key":"572_CR15","doi-asserted-by":"publisher","first-page":"360","DOI":"10.1126\/science.aat2663","volume":"361","author":"B Sanchez-Lengeling","year":"2018","unstructured":"Sanchez-Lengeling B, Aspuru-Guzik A (2018) Inverse molecular design using machine learning: generative models for matter engineering. Science 361(6400):360. https:\/\/doi.org\/10.1126\/science.aat2663","journal-title":"Science"},{"issue":"2","key":"572_CR16","doi-asserted-by":"publisher","first-page":"268","DOI":"10.1021\/acscentsci.7b00572","volume":"4","author":"R G\u00f3mez-Bombarelli","year":"2018","unstructured":"G\u00f3mez-Bombarelli R, Wei JN, Duvenaud D, Hern\u00e1ndez-Lobato JM, S\u00e1nchez-Lengeling B, Sheberla D, Aguilera-Iparraguirre J, Hirzel TD, Adams RP, Aspuru-Guzik A (2018) Automatic chemical design using a data-driven continuous representation of molecules. ACS Central Sci 4(2):268\u2013276. https:\/\/doi.org\/10.1021\/acscentsci.7b00572","journal-title":"ACS Central Sci"},{"issue":"1","key":"572_CR17","doi-asserted-by":"publisher","first-page":"120","DOI":"10.1021\/acscentsci.7b00512","volume":"4","author":"MHS Segler","year":"2018","unstructured":"Segler MHS, Kogej T, Tyrchan C, Waller MP (2018) Generating focused molecule libraries for drug discovery with recurrent neural networks. ACS Central Sci 4(1):120\u2013131. https:\/\/doi.org\/10.1021\/acscentsci.7b00512","journal-title":"ACS Central Sci"},{"key":"572_CR18","doi-asserted-by":"publisher","first-page":"8016","DOI":"10.1039\/C9SC01928F","volume":"10","author":"R Winter","year":"2019","unstructured":"Winter R, Montanari F, Steffen A, Briem H, No\u00e9 F, Clevert D-A (2019) Efficient multi-objective molecular optimization in a continuous latent space. Chem Sci. 10:8016\u20138024. https:\/\/doi.org\/10.1039\/C9SC01928F","journal-title":"Chem Sci."},{"issue":"12","key":"572_CR19","doi-asserted-by":"publisher","first-page":"5682","DOI":"10.1021\/acs.jcim.0c00599","volume":"60","author":"K Gao","year":"2020","unstructured":"Gao K, Nguyen DD, Tu M, Wei G-W (2020) Generative network complex for the automated generation of drug-like molecules. J Chem Inform Model 60(12):5682\u20135698. https:\/\/doi.org\/10.1021\/acs.jcim.0c00599","journal-title":"J Chem Inform Model"},{"issue":"1","key":"572_CR20","doi-asserted-by":"publisher","first-page":"972","DOI":"10.1080\/14686996.2017.1401424","volume":"18","author":"X Yang","year":"2017","unstructured":"Yang X, Zhang J, Yoshizoe K, Terayama K, Tsuda K (2017) Chemts: an efficient python library for de novo molecular generation. Sci Technol Adv Mater 18(1):972\u2013976. https:\/\/doi.org\/10.1080\/14686996.2017.1401424","journal-title":"Sci Technol Adv Mater"},{"key":"572_CR21","unstructured":"Goodfellow IJ, Pouget-Abadie J, Mirza M, Xu B, Warde-Farley D, Ozair S, Courville A, Bengio Y (2014) Generative adversarial nets. In: Proceedings of the 27th International Conference on Neural Information Processing Systems - Volume 2. NIPS\u201914. MIT Press, Cambridge, pp. 2672\u20132680"},{"key":"572_CR22","unstructured":"Kingma DP, Welling M (2013) Auto-encoding variational Bayes. cite arxiv:1312.6114. http:\/\/arxiv.org\/abs\/1312.6114"},{"key":"572_CR23","first-page":"2323","volume":"80","author":"W Jin","year":"2018","unstructured":"Jin W, Barzilay R, Jaakkola T (2018) Junction tree variational autoencoder for molecular graph generation 80:2323\u20132332","journal-title":"Junction tree variational autoencoder for molecular graph generation"},{"issue":"1","key":"572_CR24","doi-asserted-by":"publisher","first-page":"10752","DOI":"10.1038\/s41598-019-47148-x","volume":"9","author":"Z Zhou","year":"2019","unstructured":"Zhou Z, Kearnes S, Li L, Zare RN, Riley P (2019) Optimization of molecules via deep reinforcement learning. Sci Rep 9(1):10752. https:\/\/doi.org\/10.1038\/s41598-019-47148-x","journal-title":"Sci Rep"},{"key":"572_CR25","unstructured":"Shi C, Xu M, Zhu Z, Zhang W, Zhang M, Tang J (2020) GraphAF: a flow-based autoregressive model for molecular graph generation"},{"key":"572_CR26","doi-asserted-by":"crossref","unstructured":"Simonovsky M, Komodakis N (2018) Graphvae: towards generation of small graphs using variational autoencoders. In: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4\u20137, 2018, Proceedings, Part I. pp. 412\u2013422","DOI":"10.1007\/978-3-030-01418-6_41"},{"key":"572_CR27","unstructured":"De\u00a0Cao N, Kipf T (2018) MolGAN: an implicit generative model for small molecular graphs. ICML 2018 workshop on Theoretical Foundations and Applications of Deep Generative Models"},{"key":"572_CR28","unstructured":"Jin W, Barzilay R, Jaakkola T (2020) Hierarchical generation of molecular graphs using structural motifs"},{"key":"572_CR29","unstructured":"You J, Liu B, Ying R, Pande V, Leskovec J (2018) Graph convolutional policy network for goal-directed molecular graph generation. In: Proceedings of the 32nd International Conference on Neural Information Processing Systems. NIPS\u201918. Curran Associates Inc., Red Hook, pp. 6412\u20136422"},{"key":"572_CR30","doi-asserted-by":"crossref","unstructured":"Coulom R (2006) Efficient selectivity and backup operators in monte-carlo tree search. Proceedings of the 5th international conference on Computers and games, 72\u201383","DOI":"10.1007\/978-3-540-75538-8_7"},{"issue":"1","key":"572_CR31","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1109\/TCIAIG.2012.2186810","volume":"4","author":"CB Browne","year":"2012","unstructured":"Browne CB, Powley E, Whitehouse D, Lucas SM, Cowling PI, Rohlfshagen P, Tavener S, Perez D, Samothrakis S, Colton S (2012) A survey of monte carlo tree search methods. IEEE Trans Comput Intell AI Games 4(1):1\u201343. https:\/\/doi.org\/10.1109\/TCIAIG.2012.2186810","journal-title":"IEEE Trans Comput Intell AI Games"},{"key":"572_CR32","doi-asserted-by":"crossref","unstructured":"Kocsis L, Szepesv\u00e1ri C (2006) Bandit based monte-carlo planning. In: F\u00fcrnkranz J, Scheffer T, Spiliopoulou M, eds. Machine Learning: ECML. Springer, Berlin, pp. 282\u2013293","DOI":"10.1007\/11871842_29"},{"issue":"8","key":"572_CR33","doi-asserted-by":"publisher","first-page":"1735","DOI":"10.1162\/neco.1997.9.8.1735","volume":"9","author":"S Hochreiter","year":"1997","unstructured":"Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput. 9(8):1735\u20131780. https:\/\/doi.org\/10.1162\/neco.1997.9.8.1735","journal-title":"Neural Comput."},{"key":"572_CR34","unstructured":"Kingma DP, Ba J (2017) Adam: a method for stochastic Opoimization. http:\/\/arxiv.org\/abs\/1412.69801412.6980"},{"key":"572_CR35","doi-asserted-by":"publisher","first-page":"90","DOI":"10.1038\/nchem.1243","volume":"4","author":"R Bickerton","year":"2012","unstructured":"Bickerton R, Paolini G, Besnard J, Muresan S, Hopkins A (2012) Quantifying the chemical beauty of drugs. Nat Chem 4:90\u20138. https:\/\/doi.org\/10.1038\/nchem.1243","journal-title":"Nat Chem"},{"issue":"1","key":"572_CR36","doi-asserted-by":"publisher","first-page":"2","DOI":"10.1186\/s13321-019-0404-1","volume":"12","author":"L Maziarka","year":"2020","unstructured":"Maziarka L, Pocha A, Kaczmarczyk J, Rataj K, Danel T, Warchol M (2020) Mol-cyclegan: a generative model for molecular optimization. J Cheminform 12(1):2. https:\/\/doi.org\/10.1186\/s13321-019-0404-1","journal-title":"J Cheminform"},{"key":"572_CR37","doi-asserted-by":"publisher","unstructured":"Senn H, Thiel W (2009) Qm\/mm methods for biomolecular systems. angew chem int ed 48:1198. Angewandte Chemie (International ed. in English) 48, 1198\u2013229. https:\/\/doi.org\/10.1002\/anie.200802019","DOI":"10.1002\/anie.200802019"}],"container-title":["Journal of Cheminformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-021-00572-6.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s13321-021-00572-6\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-021-00572-6.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,11,27]],"date-time":"2021-11-27T17:09:51Z","timestamp":1638032991000},"score":1,"resource":{"primary":{"URL":"https:\/\/jcheminf.biomedcentral.com\/articles\/10.1186\/s13321-021-00572-6"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,11,27]]},"references-count":37,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2021,12]]}},"alternative-id":["572"],"URL":"https:\/\/doi.org\/10.1186\/s13321-021-00572-6","relation":{"has-preprint":[{"id-type":"doi","id":"10.26434\/chemrxiv.14450313.v1","asserted-by":"object"}]},"ISSN":["1758-2946"],"issn-type":[{"value":"1758-2946","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,11,27]]},"assertion":[{"value":"8 July 2021","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"15 November 2021","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"27 November 2021","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare no competing interests.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"94"}}