{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,21]],"date-time":"2026-05-21T18:59:47Z","timestamp":1779389987290,"version":"3.53.1"},"reference-count":86,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2023,4,8]],"date-time":"2023-04-08T00:00:00Z","timestamp":1680912000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2023,4,8]],"date-time":"2023-04-08T00:00:00Z","timestamp":1680912000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"name":"Lingang Laboratory","award":["LG-QS-202204-01"],"award-info":[{"award-number":["LG-QS-202204-01"]}]},{"name":"Lingang Laboratory","award":["LG202102-01-02"],"award-info":[{"award-number":["LG202102-01-02"]}]},{"DOI":"10.13039\/501100004739","name":"Youth Innovation Promotion Association of the Chinese Academy of Sciences","doi-asserted-by":"publisher","award":["2023296"],"award-info":[{"award-number":["2023296"]}],"id":[{"id":"10.13039\/501100004739","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["82204278"],"award-info":[{"award-number":["82204278"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["T2225002"],"award-info":[{"award-number":["T2225002"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100002858","name":"China Postdoctoral Science Foundation","doi-asserted-by":"publisher","award":["2022M720153"],"award-info":[{"award-number":["2022M720153"]}],"id":[{"id":"10.13039\/501100002858","id-type":"DOI","asserted-by":"publisher"}]},{"name":"National Key Research and Development Program of China","award":["2022YFC3400504"],"award-info":[{"award-number":["2022YFC3400504"]}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Cheminform"],"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Artificial intelligence (AI)-based molecular design methods, especially deep generative models for generating novel molecule structures, have gratified our imagination to explore unknown chemical space without relying on brute-force exploration. However, whether designed by AI or human experts, the molecules need to be accessibly synthesized and biologically evaluated, and the trial-and-error process remains a resources-intensive endeavor. Therefore, AI-based drug design methods face a major challenge of how to prioritize the molecular structures with potential for subsequent drug development. This study indicates that common filtering approaches based on traditional screening metrics fail to differentiate AI-designed molecules. To address this issue, we propose a novel molecular filtering method, MolFilterGAN, based on a\u00a0progressively augmented generative adversarial network. Comparative analysis shows that MolFilterGAN outperforms conventional screening approaches based on drug-likeness or synthetic ability metrics. Retrospective analysis of AI-designed discoidin domain receptor 1 (DDR1) inhibitors shows that MolFilterGAN significantly increases the efficiency of molecular triaging. Further evaluation of MolFilterGAN on eight external ligand sets suggests that MolFilterGAN is useful in triaging or enriching bioactive compounds across a wide range of target types. These results highlighted the importance of MolFilterGAN in evaluating molecules integrally and further accelerating molecular discovery especially combined with advanced AI generative models.<\/jats:p>","DOI":"10.1186\/s13321-023-00711-1","type":"journal-article","created":{"date-parts":[[2023,4,8]],"date-time":"2023-04-08T17:14:47Z","timestamp":1680974087000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":22,"title":["MolFilterGAN: a progressively augmented generative adversarial network for triaging AI-designed molecules"],"prefix":"10.1186","volume":"15","author":[{"given":"Xiaohong","family":"Liu","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Wei","family":"Zhang","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Xiaochu","family":"Tong","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Feisheng","family":"Zhong","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Zhaojun","family":"Li","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Zhaoping","family":"Xiong","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Jiacheng","family":"Xiong","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Xiaolong","family":"Wu","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Zunyun","family":"Fu","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Xiaoqin","family":"Tan","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Zhiguo","family":"Liu","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Sulin","family":"Zhang","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Hualiang","family":"Jiang","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Xutong","family":"Li","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Mingyue","family":"Zheng","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"297","published-online":{"date-parts":[[2023,4,8]]},"reference":[{"key":"711_CR1","doi-asserted-by":"publisher","first-page":"e1395","DOI":"10.1002\/wcms.1395","volume":"9","author":"D Xue","year":"2018","unstructured":"Xue D, Gong Y, Yang Z et al (2018) Advances and challenges in deep generative models for de novo molecule generation. Wiley Interdiscip Rev Comput Mol Sci 9:e1395. https:\/\/doi.org\/10.1002\/wcms.1395","journal-title":"Wiley Interdiscip Rev Comput Mol Sci"},{"key":"711_CR2","doi-asserted-by":"publisher","first-page":"8749","DOI":"10.1021\/acs.jmedchem.9b00959","volume":"63","author":"Z Xiong","year":"2020","unstructured":"Xiong Z, Wang D, Liu X et al (2020) Pushing the boundaries of molecular representation for drug discovery with the graph attention mechanism. J Med Chem 63:8749\u20138760. https:\/\/doi.org\/10.1021\/acs.jmedchem.9b00959","journal-title":"J Med Chem"},{"key":"711_CR3","doi-asserted-by":"publisher","first-page":"268","DOI":"10.1021\/acscentsci.7b00572","volume":"4","author":"R Gomez-Bombarelli","year":"2018","unstructured":"Gomez-Bombarelli R, Wei JN, Duvenaud D et al (2018) Automatic chemical design using a data-driven continuous representation of molecules. ACS Cent Sci 4:268\u2013276. https:\/\/doi.org\/10.1021\/acscentsci.7b00572","journal-title":"ACS Cent Sci"},{"key":"711_CR4","doi-asserted-by":"crossref","unstructured":"Simonovsky M, Komodakis N (2018) Graphvae: towards generation of small graphs using variational autoencoders. In: Artificial neural networks and machine learning\u2013ICANN 2018: 27th international conference on artificial neural networks, Rhodes, Greece, October 4\u20137, 2018, proceedings, part I 27, pp 412\u2013422","DOI":"10.1007\/978-3-030-01418-6_41"},{"key":"711_CR5","unstructured":"Cao ND, Kipf T (2018) MolGAN: an implicit generative model for small molecular graphs. arXiv:1805.11973"},{"key":"711_CR6","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s13321-019-0397-9","volume":"11","author":"O Prykhodko","year":"2019","unstructured":"Prykhodko O, Johansson SV, Kotsias P-C et al (2019) A de novo molecular generation method using latent vector based generative adversarial network. J Cheminform 11:1\u201313. https:\/\/doi.org\/10.1186\/s13321-019-0397-9","journal-title":"J Cheminform"},{"key":"711_CR7","doi-asserted-by":"publisher","first-page":"120","DOI":"10.1021\/acscentsci.7b00512","volume":"4","author":"MHS Segler","year":"2018","unstructured":"Segler MHS, Kogej T, Tyrchan C et al (2018) Generating focused molecule libraries for drug discovery with recurrent neural networks. ACS Cent Sci 4:120\u2013131. https:\/\/doi.org\/10.1021\/acscentsci.7b00512","journal-title":"ACS Cent Sci"},{"key":"711_CR8","unstructured":"Bjerrum EJ, Threlfall R (2017) Molecular generation with recurrent neural networks (RNNs). arXiv:1705.04612"},{"key":"711_CR9","doi-asserted-by":"publisher","unstructured":"Gupta A, Muller AT, Huisman BJH et al (2018) Generative recurrent networks for de novo drug design. Mol Inform. https:\/\/doi.org\/10.1002\/minf.201700111","DOI":"10.1002\/minf.201700111"},{"key":"711_CR10","doi-asserted-by":"publisher","unstructured":"Merk D, Friedrich L, Grisoni F et al (2018) De novo design of bioactive small molecules by artificial intelligence. Mol Inform. https:\/\/doi.org\/10.1002\/minf.201700153","DOI":"10.1002\/minf.201700153"},{"key":"711_CR11","doi-asserted-by":"crossref","unstructured":"Zang C, Wang F (2020) MoFlow: an invertible flow model for generating molecular graphs. In: Proceedings of the 26th ACM SIGKDD international conference on knowledge discovery & data mining, pp 617\u2013626","DOI":"10.1145\/3394486.3403104"},{"key":"711_CR12","unstructured":"Shi C, Xu M, Zhu Z et al (2020) Graphaf: a flow-based autoregressive model for molecular graph generation. arXiv:2001.09382"},{"key":"711_CR13","doi-asserted-by":"publisher","first-page":"2064","DOI":"10.1021\/acs.jcim.1c00600","volume":"62","author":"V Bagal","year":"2021","unstructured":"Bagal V, Aggarwal R, Vinod P et al (2021) MolGPT: molecular generation using a transformer-decoder model. J Chem Inf Model 62:2064\u20132076. https:\/\/doi.org\/10.1021\/acs.jcim.1c00600","journal-title":"J Chem Inf Model"},{"key":"711_CR14","doi-asserted-by":"publisher","first-page":"18","DOI":"10.1186\/s13321-022-00599-3","volume":"14","author":"J He","year":"2022","unstructured":"He J, Nittinger E, Tyrchan C et al (2022) Transformer-based molecular optimization beyond matched molecular pairs. J Cheminform 14:18. https:\/\/doi.org\/10.1186\/s13321-022-00599-3","journal-title":"J Cheminform"},{"key":"711_CR15","unstructured":"Shi C, Luo S, Xu M et al (2021) Learning gradient fields for molecular conformation generation. In: International conference on machine learning, pp 9558\u20139568"},{"key":"711_CR16","unstructured":"Xu M, Yu L, Song Y et al (2022) Geodiff: a geometric diffusion model for molecular conformation generation. arXiv:2203.02923"},{"key":"711_CR17","doi-asserted-by":"publisher","first-page":"43","DOI":"10.1021\/acs.jcim.8b00263","volume":"59","author":"S Kang","year":"2019","unstructured":"Kang S, Cho K (2019) Conditional molecular design with deep generative models. J Chem Inf Model 59:43\u201352. https:\/\/doi.org\/10.1021\/acs.jcim.8b00263","journal-title":"J Chem Inf Model"},{"key":"711_CR18","doi-asserted-by":"publisher","first-page":"1182","DOI":"10.1021\/acs.jcim.8b00751","volume":"59","author":"B Sattarov","year":"2019","unstructured":"Sattarov B, Baskin II, Horvath D et al (2019) De novo molecular design by combining deep autoencoder recurrent neural networks with generative topographic mapping. J Chem Inf Model 59:1182\u20131196. https:\/\/doi.org\/10.1021\/acs.jcim.8b00751","journal-title":"J Chem Inf Model"},{"key":"711_CR19","doi-asserted-by":"publisher","first-page":"4398","DOI":"10.1021\/acs.molpharmaceut.8b00839","volume":"15","author":"D Polykovskiy","year":"2018","unstructured":"Polykovskiy D, Zhebrak A, Vetrov D et al (2018) Entangled conditional adversarial autoencoder for de novo drug discovery. Mol Pharm 15:4398\u20134405. https:\/\/doi.org\/10.1021\/acs.molpharmaceut.8b00839","journal-title":"Mol Pharm"},{"key":"711_CR20","unstructured":"Dai H, Tian Y, Dai B et al (2018) Syntax-directed variational autoencoder for structured data. arXiv:1802.08786"},{"key":"711_CR21","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s13321-019-0404-1","volume":"12","author":"\u0141 Maziarka","year":"2020","unstructured":"Maziarka \u0141, Pocha A, Kaczmarczyk J et al (2020) Mol-CycleGAN: a generative model for molecular optimization. J Cheminform 12:1\u201318. https:\/\/doi.org\/10.1186\/s13321-019-0404-1","journal-title":"J Cheminform"},{"key":"711_CR22","doi-asserted-by":"publisher","first-page":"14011","DOI":"10.1021\/acs.jmedchem.1c00927","volume":"64","author":"X Tong","year":"2021","unstructured":"Tong X, Liu X, Tan X et al (2021) Generative models for de novo drug design. J Med Chem 64:14011\u201314027. https:\/\/doi.org\/10.1021\/acs.jmedchem.1c00927","journal-title":"J Med Chem"},{"key":"711_CR23","doi-asserted-by":"publisher","first-page":"577","DOI":"10.1039\/C9SC04026A","volume":"11","author":"R-R Griffiths","year":"2020","unstructured":"Griffiths R-R, Hern\u00e1ndez-Lobato JM (2020) Constrained Bayesian optimization for automatic chemical design using variational autoencoders. Chem Sci 11:577\u2013586. https:\/\/doi.org\/10.1039\/C9SC04026A","journal-title":"Chem Sci"},{"key":"711_CR24","doi-asserted-by":"crossref","unstructured":"Yoshikawa N, Terayama K, Honma T et al (2018) Population-based de novo molecule generation, using grammatical evolution. arXiv:1804.02134v1","DOI":"10.1246\/cl.180665"},{"key":"711_CR25","doi-asserted-by":"publisher","first-page":"12482","DOI":"10.1021\/acs.jmedchem.2c01179","volume":"65","author":"J Wang","year":"2022","unstructured":"Wang J, Wang X, Sun H et al (2022) ChemistGA: a chemical synthesizable accessible molecular generation algorithm for real-world drug discovery. J Med Chem 65:12482\u201312496. https:\/\/doi.org\/10.1021\/acs.jmedchem.2c01179","journal-title":"J Med Chem"},{"key":"711_CR26","unstructured":"Lee SY, Choi S, Chung S-Y (2019) Sample-efficient deep reinforcement learning via episodic backward update. arXiv:1805.12375v2"},{"key":"711_CR27","doi-asserted-by":"publisher","first-page":"35","DOI":"10.1186\/s13321-019-0355-6","volume":"11","author":"X Liu","year":"2019","unstructured":"Liu X, Ye K, van Vlijmen HWT et al (2019) An exploration strategy improves the diversity of de novo ligands using deep reinforcement learning: a case for the adenosine A2A receptor. J Cheminform 11:35. https:\/\/doi.org\/10.1186\/s13321-019-0355-6","journal-title":"J Cheminform"},{"key":"711_CR28","doi-asserted-by":"publisher","unstructured":"Popova M, Isayev O, Tropsha A (2018) Deep reinforcement learning for de novo drug design. Sci Adv 4:eaap7885. https:\/\/doi.org\/10.1126\/sciadv.aap7885","DOI":"10.1126\/sciadv.aap7885"},{"key":"711_CR29","doi-asserted-by":"publisher","first-page":"1038","DOI":"10.1038\/s41587-019-0224-x","volume":"37","author":"A Zhavoronkov","year":"2019","unstructured":"Zhavoronkov A, Ivanenkov YA, Aliper A et al (2019) Deep learning enables rapid identification of potent DDR1 kinase inhibitors. Nat Biotechnol 37:1038\u20131040. https:\/\/doi.org\/10.1038\/s41587-019-0224-x","journal-title":"Nat Biotechnol"},{"key":"711_CR30","doi-asserted-by":"crossref","unstructured":"Zhou Z, Kearnes S, Li L et al (2019) Optimization of molecules via deep reinforcement learning. arXiv:1810.08678","DOI":"10.1038\/s41598-019-47148-x"},{"key":"711_CR31","doi-asserted-by":"publisher","first-page":"914","DOI":"10.1038\/s42256-021-00403-1","volume":"3","author":"J Wang","year":"2021","unstructured":"Wang J, Hsieh C-Y, Wang M et al (2021) Multi-constraint molecular generation based on conditional transformer, knowledge distillation and reinforcement learning. Nat Mach Intell 3:914\u2013922. https:\/\/doi.org\/10.1038\/s42256-021-00403-1","journal-title":"Nat Mach Intell"},{"key":"711_CR32","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s13321-017-0235-x","volume":"9","author":"M Olivecrona","year":"2017","unstructured":"Olivecrona M, Blaschke T, Engkvist O et al (2017) Molecular de-novo design through deep reinforcement learning. J Cheminform 9:1\u201314. https:\/\/doi.org\/10.1186\/s13321-017-0235-x","journal-title":"J Cheminform"},{"key":"711_CR33","doi-asserted-by":"publisher","first-page":"1337","DOI":"10.1021\/acs.jmedchem.9b01721","volume":"63","author":"Y Yang","year":"2020","unstructured":"Yang Y, Zhang R, Li Z et al (2020) Discovery of highly potent, selective, and orally efficacious p300\/CBP histone acetyltransferases inhibitors. J Med Chem 63:1337\u20131360. https:\/\/doi.org\/10.1021\/acs.jmedchem.9b01721","journal-title":"J Med Chem"},{"key":"711_CR34","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s13321-020-00446-3","volume":"12","author":"X Li","year":"2020","unstructured":"Li X, Xu Y, Yao H et al (2020) Chemical space exploration based on recurrent neural networks: applications in discovering kinase inhibitors. J Cheminform 12:1\u201313. https:\/\/doi.org\/10.1186\/s13321-020-00446-3","journal-title":"J Cheminform"},{"key":"711_CR35","doi-asserted-by":"publisher","unstructured":"Tan X, Jiang X, He Y et al (2020) Automated design and optimization of multitarget schizophrenia drug candidates by deep learning. Eur J Med Chem 204:112572. https:\/\/doi.org\/10.1016\/j.ejmech.2020.112572","DOI":"10.1016\/j.ejmech.2020.112572"},{"key":"711_CR36","doi-asserted-by":"publisher","first-page":"8723","DOI":"10.1021\/acs.jmedchem.9b00855","volume":"63","author":"X Li","year":"2020","unstructured":"Li X, Li Z, Wu X et al (2020) Deep learning enhancing kinome-wide polypharmacology profiling: model construction and experiment validation. J Med Chem 63:8723\u20138737. https:\/\/doi.org\/10.1021\/acs.jmedchem.9b00855","journal-title":"J Med Chem"},{"key":"711_CR37","doi-asserted-by":"publisher","unstructured":"Xiong J, Xiong Z, Chen K et al (2021) Graph neural networks for automated de novo drug design. Drug Discovery Today 26:1382\u20131393. https:\/\/doi.org\/10.1016\/j.drudis.2021.02.011","DOI":"10.1016\/j.drudis.2021.02.011"},{"key":"711_CR38","doi-asserted-by":"publisher","DOI":"10.1016\/j.ymeth.2023.01.004","author":"J Wang","year":"2023","unstructured":"Wang J, Mao J, Wang M et al (2023) Explore drug-like space with deep generative models. Methods. https:\/\/doi.org\/10.1016\/j.ymeth.2023.01.004","journal-title":"Methods"},{"key":"711_CR39","doi-asserted-by":"publisher","unstructured":"Bilodeau C, Jin W, Jaakkola T et al (2022) Generative models for molecular discovery: recent advances and challenges. Wiley Interdiscip Rev Comput Mol Sci 12:e1608. https:\/\/doi.org\/10.1002\/wcms.1608","DOI":"10.1002\/wcms.1608"},{"key":"711_CR40","doi-asserted-by":"publisher","first-page":"746","DOI":"10.3390\/biom12060746","volume":"12","author":"B Tang","year":"2022","unstructured":"Tang B, He F, Liu D et al (2022) AI-aided design of novel targeted covalent inhibitors against SARS-CoV-2. Biomolecules 12:746. https:\/\/doi.org\/10.3390\/biom12060746","journal-title":"Biomolecules"},{"key":"711_CR41","doi-asserted-by":"publisher","first-page":"7555","DOI":"10.1080\/07391102.2021.1905559","volume":"40","author":"AM Andrianov","year":"2022","unstructured":"Andrianov AM, Nikolaev GI, Shuldov NA et al (2022) Application of deep learning and molecular modeling to identify small drug-like compounds as potential HIV-1 entry inhibitors. J Biomol Struct Dyn 40:7555\u20137573. https:\/\/doi.org\/10.1080\/07391102.2021.1905559","journal-title":"J Biomol Struct Dyn"},{"key":"711_CR42","doi-asserted-by":"publisher","unstructured":"Bjerrum EJ, Sattarov B (2018) Improving chemical autoencoder latent space and molecular de novo generation diversity with heteroencoders. Biomolecules. https:\/\/doi.org\/10.3390\/biom8040131","DOI":"10.3390\/biom8040131"},{"key":"711_CR43","unstructured":"Kusner MJ, Paige B, Hernandez-Lobato JM (2017) Grammar variational autoencoder. arXiv:1703.01925"},{"key":"711_CR44","unstructured":"Samanta B, De A, Ganguly N et al (2018) Nevae: designing random graph models using variational autoencoders with applications to chemical design. arXiv:1802.05283v1"},{"key":"711_CR45","doi-asserted-by":"publisher","first-page":"567","DOI":"10.4155\/fmc-2018-0358","volume":"11","author":"Y Xu","year":"2019","unstructured":"Xu Y, Lin K, Wang S et al (2019) Deep learning for molecular generation. Fut Med Chem 11:567\u2013597. https:\/\/doi.org\/10.4155\/fmc-2018-0358","journal-title":"Fut Med Chem"},{"key":"711_CR46","doi-asserted-by":"publisher","first-page":"1096","DOI":"10.1021\/acs.jcim.8b00839","volume":"59","author":"N Brown","year":"2019","unstructured":"Brown N, Fiscato M, Segler MHS et al (2019) GuacaMol: benchmarking models for de novo molecular design. J Chem Inf Model 59:1096\u20131108. https:\/\/doi.org\/10.1021\/acs.jcim.8b00839","journal-title":"J Chem Inf Model"},{"key":"711_CR47","doi-asserted-by":"publisher","first-page":"5714","DOI":"10.1021\/acs.jcim.0c00174","volume":"60","author":"W Gao","year":"2020","unstructured":"Gao W, Coley CW (2020) The synthesizability of molecules proposed by generative models. J Chem Inf Model 60:5714\u20135723. https:\/\/doi.org\/10.1021\/acs.jcim.0c00174","journal-title":"J Chem Inf Model"},{"key":"711_CR48","unstructured":"Gottipati SK, Sattarov B, Niu S et al (2020) Learning to navigate the synthetically accessible chemical space using reinforcement learning. In: International conference on machine learning, pp 3668\u20133679"},{"key":"711_CR49","doi-asserted-by":"publisher","first-page":"337","DOI":"10.1016\/j.ddtec.2004.11.007","volume":"1","author":"CA Lipinski","year":"2004","unstructured":"Lipinski CA (2004) Lead- and drug-like compounds: the rule-of-five revolution. Drug Discov Today Technol 1:337\u2013341. https:\/\/doi.org\/10.1016\/j.ddtec.2004.11.007","journal-title":"Drug Discov Today Technol"},{"key":"711_CR50","doi-asserted-by":"publisher","first-page":"980","DOI":"10.1038\/nsb1203-980","volume":"10","author":"H Berman","year":"2003","unstructured":"Berman H, Henrick K, Nakamura H (2003) Announcing the worldwide Protein Data Bank. Nat Struct Biol 10:980. https:\/\/doi.org\/10.1038\/nsb1203-980","journal-title":"Nat Struct Biol"},{"key":"711_CR51","doi-asserted-by":"publisher","first-page":"90","DOI":"10.1038\/nchem.1243","volume":"4","author":"GR Bickerton","year":"2012","unstructured":"Bickerton GR, Paolini GV, Besnard J et al (2012) Quantifying the chemical beauty of drugs. Nat Chem 4:90\u201398. https:\/\/doi.org\/10.1038\/nchem.1243","journal-title":"Nat Chem"},{"key":"711_CR52","doi-asserted-by":"publisher","first-page":"8","DOI":"10.1186\/1758-2946-1-8","volume":"1","author":"P Ertl","year":"2009","unstructured":"Ertl P, Schuffenhauer A (2009) Estimation of synthetic accessibility score of drug-like molecules based on molecular complexity and fragment contributions. J Cheminform 1:8. https:\/\/doi.org\/10.1186\/1758-2946-1-8","journal-title":"J Cheminform"},{"key":"711_CR53","doi-asserted-by":"publisher","first-page":"6752","DOI":"10.1021\/jm901241e","volume":"52","author":"F Lovering","year":"2009","unstructured":"Lovering F, Bikker J, Humblet C (2009) Escape from flatland: increasing saturation as an approach to improving clinical success. J Med Chem 52:6752\u20136756. https:\/\/doi.org\/10.1021\/jm901241e","journal-title":"J Med Chem"},{"key":"711_CR54","doi-asserted-by":"publisher","first-page":"1839","DOI":"10.1016\/j.drudis.2020.07.017","volume":"25","author":"W Wei","year":"2020","unstructured":"Wei W, Cherukupalli S, Jing L et al (2020) Fsp3: a new parameter for drug-likeness. Drug Discovery Today 25:1839\u20131845. https:\/\/doi.org\/10.1016\/j.drudis.2020.07.017","journal-title":"Drug Discovery Today"},{"key":"711_CR55","doi-asserted-by":"publisher","first-page":"10026","DOI":"10.1021\/acs.jmedchem.9b00004","volume":"62","author":"YA Ivanenkov","year":"2019","unstructured":"Ivanenkov YA, Zagribelnyy BA, Aladinskiy VA (2019) Are we opening the door to a new era of medicinal chemistry or being collapsed to a chemical singularity? J Med Chem 62:10026\u201310043. https:\/\/doi.org\/10.1021\/acs.jmedchem.9b00004","journal-title":"J Med Chem"},{"key":"711_CR56","doi-asserted-by":"publisher","unstructured":"Polykovskiy D, Zhebrak A, Sanchez-Lengeling B et al (2020) Molecular sets (MOSES): a benchmarking platform for molecular generation models. Front Pharmacol 11:565644. https:\/\/doi.org\/10.3389\/fphar.2020.565644","DOI":"10.3389\/fphar.2020.565644"},{"key":"711_CR57","doi-asserted-by":"publisher","first-page":"585","DOI":"10.3389\/fgene.2018.00585","volume":"9","author":"Q Hu","year":"2018","unstructured":"Hu Q, Feng M, Lai L et al (2018) Prediction of drug-likeness using deep autoencoder neural networks. Front Genet 9:585. https:\/\/doi.org\/10.3389\/fgene.2018.00585","journal-title":"Front Genet"},{"key":"711_CR58","doi-asserted-by":"publisher","first-page":"827","DOI":"10.1007\/s11030-020-10065-7","volume":"25","author":"SA Hooshmand","year":"2021","unstructured":"Hooshmand SA, Jamalkandi SA, Alavi SM et al (2021) Distinguishing drug\/non-drug-like small molecules in drug discovery using deep belief network. Mol Diversity 25:827\u2013838. https:\/\/doi.org\/10.1007\/s11030-020-10065-7","journal-title":"Mol Diversity"},{"key":"711_CR59","doi-asserted-by":"publisher","first-page":"554","DOI":"10.1039\/D1SC05248A","volume":"13","author":"K Lee","year":"2022","unstructured":"Lee K, Jang J, Seo S et al (2022) Drug-likeness scoring based on unsupervised learning. Chem Sci 13:554\u2013565. https:\/\/doi.org\/10.1039\/D1SC05248A","journal-title":"Chem Sci"},{"key":"711_CR60","doi-asserted-by":"publisher","first-page":"457","DOI":"10.1038\/s42256-020-0209-y","volume":"2","author":"W Beker","year":"2020","unstructured":"Beker W, Wo\u0142os A, Szymku\u0107 S et al (2020) Minimal-uncertainty prediction of general drug-likeness based on Bayesian neural networks. Nat Mach Intell 2:457\u2013465. https:\/\/doi.org\/10.1038\/s42256-020-0209-y","journal-title":"Nat Mach Intell"},{"key":"711_CR61","unstructured":"Mao X, Su Z, Siang Tan P et al (2019) Is discriminator a good feature extractor? arXiv:1912.00789"},{"key":"711_CR62","unstructured":"Donahue J, Kr\u00e4henb\u00fchl P, Darrell T (2016) Adversarial feature learning. arXiv:1605.09782"},{"key":"711_CR63","doi-asserted-by":"publisher","first-page":"31","DOI":"10.1021\/ci00057a005","volume":"28","author":"D Weininger","year":"1988","unstructured":"Weininger D (1988) SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules. J Chem Inf Comput Sci 28:31\u201336. https:\/\/doi.org\/10.1021\/ci00057a005","journal-title":"J Chem Inf Comput Sci"},{"key":"711_CR64","doi-asserted-by":"publisher","first-page":"3567","DOI":"10.1039\/c8sc05372c","volume":"10","author":"JH Jensen","year":"2019","unstructured":"Jensen JH (2019) A graph-based genetic algorithm and generative model\/Monte Carlo tree search for the exploration of chemical space. Chem Sci 10:3567\u20133572. https:\/\/doi.org\/10.1039\/c8sc05372c","journal-title":"Chem Sci"},{"key":"711_CR65","doi-asserted-by":"publisher","first-page":"2324","DOI":"10.1021\/acs.jcim.5b00559","volume":"55","author":"T Sterling","year":"2015","unstructured":"Sterling T, Irwin JJ (2015) ZINC 15\u2014ligand discovery for everyone. J Chem Inf Model 55:2324\u20132337. https:\/\/doi.org\/10.1021\/acs.jcim.5b00559","journal-title":"J Chem Inf Model"},{"key":"711_CR66","unstructured":"Real Database (2020) https:\/\/enamine.net\/library-synthesis\/real-compounds\/real-database"},{"key":"711_CR67","doi-asserted-by":"publisher","first-page":"D930","DOI":"10.1093\/nar\/gky1075","volume":"47","author":"D Mendez","year":"2019","unstructured":"Mendez D, Gaulton A, Bento AP et al (2019) ChEMBL: towards direct deposition of bioassay data. Nucl Acids Res 47:D930\u2013D940. https:\/\/doi.org\/10.1093\/nar\/gky1075","journal-title":"Nucl Acids Res"},{"key":"711_CR68","doi-asserted-by":"publisher","unstructured":"Jianhua S, Xiaoying X, Feng C et al (2003) Virtual screening on natural products for discovering active compounds and target information. Curr Med Chem 10:2327\u20132342. https:\/\/doi.org\/10.2174\/0929867033456729","DOI":"10.2174\/0929867033456729"},{"key":"711_CR69","unstructured":"Li Y (2017) Deep reinforcement learning: an overview. arXiv:1701.07274"},{"key":"711_CR70","doi-asserted-by":"crossref","unstructured":"Yu L, Zhang W, Wang J et al (2017) Seqgan: sequence generative adversarial nets with policy gradient. In: Proceedings of the AAAI conference on artificial intelligence","DOI":"10.1609\/aaai.v31i1.10804"},{"key":"711_CR71","doi-asserted-by":"crossref","unstructured":"Kim Y (2014) Convolutional neural networks for sentence classification. In: Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP), Doha, Qatar, pp 1746\u20131751","DOI":"10.3115\/v1\/D14-1181"},{"key":"711_CR72","unstructured":"Zhang X, LeCun Y (2015) Text understanding from scratch. arXiv:1502.01710"},{"key":"711_CR73","doi-asserted-by":"publisher","first-page":"229","DOI":"10.1023\/A:1022672621406","volume":"8","author":"RJ Williams","year":"1992","unstructured":"Williams RJ (1992) Simple statistical gradient-following algorithms for connectionist reinforcement learning. Mach Learn 8:229\u2013256. https:\/\/doi.org\/10.1023\/A:1022672621406","journal-title":"Mach Learn"},{"key":"711_CR74","doi-asserted-by":"publisher","first-page":"D1074","DOI":"10.1093\/nar\/gkx1037","volume":"46","author":"DS Wishart","year":"2018","unstructured":"Wishart DS, Feunang YD, Guo AC et al (2018) DrugBank 5.0: a major update to the DrugBank database for 2018. Nucl Acids Res 46:D1074\u2013D1082. https:\/\/doi.org\/10.1093\/nar\/gkx1037","journal-title":"Nucl Acids Res"},{"key":"711_CR75","doi-asserted-by":"publisher","first-page":"D932","DOI":"10.1093\/nar\/gkw993","volume":"45","author":"O Ursu","year":"2017","unstructured":"Ursu O, Holmes J, Knockel J et al (2017) DrugCentral: online drug compendium. Nucl Acids Res 45:D932\u2013D939. https:\/\/doi.org\/10.1093\/nar\/gkw993","journal-title":"Nucl Acids Res"},{"key":"711_CR76","doi-asserted-by":"publisher","first-page":"D1137","DOI":"10.1093\/nar\/gkx1088","volume":"46","author":"VB Siramshetty","year":"2018","unstructured":"Siramshetty VB, Eckert OA, Gohlke BO et al (2018) SuperDRUG2: a one stop resource for approved\/marketed drugs. Nucl Acids Res 46:D1137\u2013D1143. https:\/\/doi.org\/10.1093\/nar\/gkx1088","journal-title":"Nucl Acids Res"},{"key":"711_CR77","doi-asserted-by":"publisher","unstructured":"Griesenauer RH, Schillebeeckx C, Kinch MS (2019) CDEK: clinical drug experience knowledgebase. Database (Oxford). https:\/\/doi.org\/10.1093\/database\/baz087","DOI":"10.1093\/database\/baz087"},{"key":"711_CR78","unstructured":"Kingma DP, Ba J (2014) Adam: a method for stochastic optimization. arXiv:1412.6980"},{"key":"711_CR79","unstructured":"Maestro, Schr\u00f6dinger, LLC, New York, NY (2015)"},{"key":"711_CR80","unstructured":"LigPrep, Schr\u00f6dinger, LLC, New York, NY (2015)"},{"key":"711_CR81","unstructured":"Epik, Schr\u00f6dinger, LLC, New York, NY (2015)"},{"key":"711_CR82","doi-asserted-by":"publisher","first-page":"1739","DOI":"10.1021\/jm0306430","volume":"47","author":"RA Friesner","year":"2004","unstructured":"Friesner RA, Banks JL, Murphy RB et al (2004) Glide: a new approach for rapid, accurate docking and scoring. 1. Method and assessment of docking accuracy. J Med Chem 47:1739\u20131749. https:\/\/doi.org\/10.1021\/jm0306430","journal-title":"J Med Chem"},{"key":"711_CR83","doi-asserted-by":"publisher","first-page":"460","DOI":"10.1021\/ci500588j","volume":"55","author":"T Sander","year":"2015","unstructured":"Sander T, Freyss J, von Korff M et al (2015) DataWarrior: an open-source program for chemistry aware data visualization and analysis. J Chem Inf Model 55:460\u2013473. https:\/\/doi.org\/10.1021\/ci500588j","journal-title":"J Chem Inf Model"},{"key":"711_CR84","doi-asserted-by":"publisher","first-page":"4263","DOI":"10.1021\/acs.jcim.0c00155","volume":"60","author":"V-K Tran-Nguyen","year":"2020","unstructured":"Tran-Nguyen V-K, Jacquemard C, Rognan DJJoci, et al (2020) LIT-PCBA: an unbiased data set for machine learning and virtual screening. J Chem Inf Model 60:4263\u20134273. https:\/\/doi.org\/10.1021\/acs.jcim.0c00155","journal-title":"J Chem Inf Model"},{"key":"711_CR85","doi-asserted-by":"publisher","first-page":"2092","DOI":"10.1109\/LGRS.2017.2752750","volume":"14","author":"D Lin","year":"2017","unstructured":"Lin D, Fu K, Wang Y et al (2017) MARTA GANs: unsupervised representation learning for remote sensing image classification. IEEE Geosci Remote Sens Lett 14:2092\u20132096. https:\/\/doi.org\/10.1109\/LGRS.2017.2752750","journal-title":"IEEE Geosci Remote Sens Lett"},{"key":"711_CR86","doi-asserted-by":"publisher","first-page":"2669","DOI":"10.1109\/TGRS.2018.2876123","volume":"57","author":"M Zhang","year":"2019","unstructured":"Zhang M, Gong M, Mao Y et al (2019) Unsupervised feature extraction in hyperspectral images based on Wasserstein generative adversarial network. IEEE Trans Geosci Remote Sens 57:2669\u20132688. https:\/\/doi.org\/10.1109\/TGRS.2018.2876123","journal-title":"IEEE Trans Geosci Remote Sens"}],"container-title":["Journal of Cheminformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-023-00711-1.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s13321-023-00711-1\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-023-00711-1.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,4,8]],"date-time":"2023-04-08T17:15:35Z","timestamp":1680974135000},"score":1,"resource":{"primary":{"URL":"https:\/\/jcheminf.biomedcentral.com\/articles\/10.1186\/s13321-023-00711-1"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,4,8]]},"references-count":86,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2023,12]]}},"alternative-id":["711"],"URL":"https:\/\/doi.org\/10.1186\/s13321-023-00711-1","relation":{},"ISSN":["1758-2946"],"issn-type":[{"value":"1758-2946","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,4,8]]},"assertion":[{"value":"15 October 2022","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"14 March 2023","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"8 April 2023","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare no competing interests.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"42"}}