{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,30]],"date-time":"2026-04-30T23:26:33Z","timestamp":1777591593628,"version":"3.51.4"},"reference-count":57,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2024,12,5]],"date-time":"2024-12-05T00:00:00Z","timestamp":1733356800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2024,12,5]],"date-time":"2024-12-05T00:00:00Z","timestamp":1733356800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100003725","name":"National Research Foundation of Korea","doi-asserted-by":"publisher","award":["RS-2023-00232157"],"award-info":[{"award-number":["RS-2023-00232157"]}],"id":[{"id":"10.13039\/501100003725","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100003725","name":"National Research Foundation of Korea","doi-asserted-by":"publisher","award":["RS-2023-00232157"],"award-info":[{"award-number":["RS-2023-00232157"]}],"id":[{"id":"10.13039\/501100003725","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100003725","name":"National Research Foundation of Korea","doi-asserted-by":"publisher","award":["RS-2023-00232157"],"award-info":[{"award-number":["RS-2023-00232157"]}],"id":[{"id":"10.13039\/501100003725","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100003725","name":"National Research Foundation of Korea","doi-asserted-by":"publisher","award":["RS-2023-00232157"],"award-info":[{"award-number":["RS-2023-00232157"]}],"id":[{"id":"10.13039\/501100003725","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100003725","name":"National Research Foundation of Korea","doi-asserted-by":"publisher","award":["RS-2023-00232157"],"award-info":[{"award-number":["RS-2023-00232157"]}],"id":[{"id":"10.13039\/501100003725","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Institute of Information & communications Technology Planning & Evaluation","award":["RS-2023-00220628 to CS"],"award-info":[{"award-number":["RS-2023-00220628 to CS"]}]},{"name":"Institute of Information & communications Technology Planning & Evaluation","award":["RS-2023-00220628 to CS"],"award-info":[{"award-number":["RS-2023-00220628 to CS"]}]},{"name":"Institute of Information & communications Technology Planning & Evaluation","award":["RS-2023-00220628 to CS"],"award-info":[{"award-number":["RS-2023-00220628 to CS"]}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Cheminform"],"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>The two key components of computational molecular design are virtually generating molecules and predicting the properties of these generated molecules. This study focuses on an effective method for molecular generation through virtual synthesis and global optimization of a given objective function. Using a pre-trained graph neural network (GNN) objective function to approximate the docking energies of compounds for four target receptors, we generated highly optimized compounds with 300\u2013400\u00a0times less computational effort compared to virtual compound library screening. These optimized compounds exhibit similar synthesizability and diversity to known binders with high potency and are notably novel compared to library chemicals or known ligands. This method, called CSearch, can be effectively utilized to generate chemicals optimized for a given objective function. With the GNN function approximating docking energies, CSearch generated molecules with predicted binding poses to the target receptors similar to known inhibitors, demonstrating its effectiveness in producing drug-like binders.<\/jats:p>\n                  <jats:p>\n                    <jats:bold>Scientific Contribution<\/jats:bold>\n                    We have developed a method for effectively exploring the chemical space of drug-like molecules using a global optimization algorithm with fragment-based virtual synthesis. The compounds generated using this method optimize the given objective function efficiently and are synthesizable like commercial library compounds. Furthermore, they are diverse, novel drug-like molecules with properties similar to known inhibitors for target receptors.\n                  <\/jats:p>","DOI":"10.1186\/s13321-024-00936-8","type":"journal-article","created":{"date-parts":[[2024,12,5]],"date-time":"2024-12-05T08:14:51Z","timestamp":1733386491000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":5,"title":["CSearch: chemical space search via virtual synthesis and global optimization"],"prefix":"10.1186","volume":"16","author":[{"given":"Hakjean","family":"Kim","sequence":"first","affiliation":[]},{"given":"Seongok","family":"Ryu","sequence":"additional","affiliation":[]},{"given":"Nuri","family":"Jung","sequence":"additional","affiliation":[]},{"given":"Jinsol","family":"Yang","sequence":"additional","affiliation":[]},{"given":"Chaok","family":"Seok","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2024,12,5]]},"reference":[{"issue":"2","key":"936_CR1","doi-asserted-by":"publisher","first-page":"268","DOI":"10.1021\/acscentsci.7b00572","volume":"4","author":"R G\u00f3mez-Bombarelli","year":"2018","unstructured":"G\u00f3mez-Bombarelli R, Wei JN, Duvenaud D, Hern\u00e1ndez-Lobato JM, S\u00e1nchez-Lengeling B, Sheberla D, Aguilera-Iparraguirre J, Hirzel TD, Adams RP, Aspuru-Guzik A (2018) Automatic chemical design using a data-driven continuous representation of molecules. ACS Cent Sci 4(2):268\u2013276. https:\/\/doi.org\/10.1021\/acscentsci.7b00572","journal-title":"ACS Cent Sci"},{"key":"936_CR2","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.1812.01070","author":"W Jin","year":"2019","unstructured":"Jin W, Yang K, Barzilay R, Jaakkola T (2019) Learning multimodal graph-to-graph translation for molecular optimization. arXiv preprint. https:\/\/doi.org\/10.48550\/arXiv.1812.01070","journal-title":"arXiv preprint"},{"issue":"1","key":"936_CR3","doi-asserted-by":"publisher","first-page":"10752","DOI":"10.1038\/s41598-019-47148-x","volume":"9","author":"Z Zhou","year":"2019","unstructured":"Zhou Z, Kearnes S, Li L, Zare RN, Riley P (2019) Optimization of molecules via deep reinforcement learning. Sci Rep 9(1):10752. https:\/\/doi.org\/10.1038\/s41598-019-47148-x","journal-title":"Sci Rep"},{"issue":"36","key":"936_CR4","doi-asserted-by":"publisher","first-page":"8438","DOI":"10.1039\/c9sc01992h","volume":"1","author":"S Ryu","year":"2019","unstructured":"Ryu S, Kwon Y, Kim WY (2019) A Bayesian graph convolutional network for reliable prediction of molecular properties with uncertainty quantification. Chem Sci 1(36):8438\u20138446. https:\/\/doi.org\/10.1039\/c9sc01992h","journal-title":"Chem Sci"},{"issue":"11","key":"936_CR5","doi-asserted-by":"publisher","first-page":"1520","DOI":"10.1021\/acscentsci.8b00507","volume":"4","author":"EN Feinberg","year":"2018","unstructured":"Feinberg EN, Sur D, Wu Z, Husic BE, Mai H, Li Y, Sun S, Yang J, Ramsundar B, Pande VS (2018) PotentialNet for molecular property prediction. ACS Cent Sci 4(11):1520\u20131530. https:\/\/doi.org\/10.1021\/acscentsci.8b00507","journal-title":"ACS Cent Sci"},{"issue":"8","key":"936_CR6","doi-asserted-by":"publisher","first-page":"2020","DOI":"10.1021\/acs.jpclett.2c03906","volume":"14","author":"Z Yang","year":"2023","unstructured":"Yang Z, Zhong W, Lv Q, Dong T, Yu-Chian Chen C (2023) Geometric interaction graph neural network for predicting protein-ligand binding affinities from 3D structures (GIGN). J Phys Chem Lett 14(8):2020\u20132033. https:\/\/doi.org\/10.1021\/acs.jpclett.2c03906","journal-title":"J Phys Chem Lett"},{"key":"936_CR7","doi-asserted-by":"publisher","first-page":"753002","DOI":"10.3389\/fchem.2021.753002","volume":"9","author":"Z Wang","year":"2021","unstructured":"Wang Z, Zheng L, Liu Y, Qu Y, Li YQ, Zhao M, Mu Y, Li W (2021) OnionNet-2: a convolutional neural network model for predicting protein-ligand binding affinity based on residue-atom contacting shells. Front Chem 9:753002. https:\/\/doi.org\/10.3389\/fchem.2021.753002","journal-title":"Front Chem"},{"issue":"7893","key":"936_CR8","doi-asserted-by":"publisher","first-page":"452","DOI":"10.1038\/s41586-021-04220-9","volume":"601","author":"AA Sadybekov","year":"2022","unstructured":"Sadybekov AA, Sadybekov AV, Liu Y, Iliopoulos-Tsoutsouvas C, Huang XP, Pickett J, Houser B, Patel N, Tran NK, Tong F, Zvonok N, Jain MK, Savych O, RadChenko DS, Nikas SP, Petasis NA, Moroz YS, Roth BL, Makriyannis A, Katritch V (2022) Synthon-based ligand discovery in virtual libraries of over 11 billion compounds. Nature 601(7893):452\u2013459. https:\/\/doi.org\/10.1038\/s41586-021-04220-9","journal-title":"Nature"},{"key":"936_CR9","unstructured":"Jin W, Barzilay R, Jaakkola T (2020) Multi-Objective Molecule Generation using Interpretable Substructures. arXiv preprint arXiv.2002.03244"},{"issue":"11","key":"936_CR10","doi-asserted-by":"publisher","first-page":"101681","DOI":"10.1016\/j.isci.2020.101681","volume":"23","author":"OO Grygorenko","year":"2020","unstructured":"Grygorenko OO, Radchenko DS, Dziuba I, Chuprina A, Gubina KE, Moroz YS (2020) Generating multibillion chemical space of readily accessible screening compounds. IScience 23(11):101681. https:\/\/doi.org\/10.1016\/j.isci.2020.101681","journal-title":"IScience"},{"key":"936_CR11","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s13321-017-0235-x","volume":"9","author":"M Olivecrona","year":"2017","unstructured":"Olivecrona M, Blaschke T, Engkvist O, Chen H (2017) Molecular de-novo design through deep reinforcement learning. J Chem Inf 9:1\u201314. https:\/\/doi.org\/10.1186\/s13321-017-0235-x","journal-title":"J Chem Inf"},{"issue":"9","key":"936_CR12","doi-asserted-by":"publisher","first-page":"2064","DOI":"10.1021\/acs.jcim.1c00600","volume":"62","author":"V Bagal","year":"2022","unstructured":"Bagal V, Aggarwal R, Vinod PK, Priyakumar UD (2022) MolGPT: molecular generation using a transformer-decoder model. J Chem Inf Model 62(9):2064\u20132076. https:\/\/doi.org\/10.1021\/acs.jcim.1c00600","journal-title":"J Chem Inf Model"},{"key":"936_CR13","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2403.07902","author":"J Guan","year":"2024","unstructured":"Guan J, Zhou X, Yang Y, Yu B, Peng J, Ma J, Liu Q, Wang L, Gu Q (2024) DecompDiff: diffusion models with decomposed priors for structure-based drug design. arXiv preprint. https:\/\/doi.org\/10.48550\/arXiv.2403.07902","journal-title":"arXiv preprint"},{"key":"936_CR14","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2303.03543","author":"J Guan","year":"2023","unstructured":"Guan J, Wesley Wei Q, Peng X, Su Y, Peng J, Ma J (2023) 3D equivariant diffusion for target-aware molecule generation and affinity prediction. arXiv preprint. https:\/\/doi.org\/10.48550\/arXiv.2303.03543","journal-title":"arXiv preprint"},{"key":"936_CR15","unstructured":"Lee S, Jo J, Hwang SJ (2023) Exploring chemical space with score-based out-of-distribution generation. In: Proceedings of the 40th International Conference on Machine Learning, Proceedings of Machine Learning Research 202:18872\u201318892."},{"key":"936_CR16","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v35i1.16085","author":"T Fu","year":"2022","unstructured":"Fu T, Cao X, Li X, Glass LM, Sun J (2022) MIMOSA: multi-constraint molecule sampling for molecule optimization. arXiv preprint. https:\/\/doi.org\/10.1609\/aaai.v35i1.16085","journal-title":"arXiv preprint"},{"key":"936_CR17","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2103.10432","author":"Y Xie","year":"2021","unstructured":"Xie Y, Shi C, Zhou H, Yang Y, Zhang W, Yu Y, Li L (2021) MARS: markov molecular sampling for multi-objective drug discovery. arXiv preprint. https:\/\/doi.org\/10.48550\/arXiv.2103.10432","journal-title":"arXiv preprint"},{"issue":"2","key":"936_CR18","doi-asserted-by":"publisher","first-page":"90","DOI":"10.1038\/nchem.1243","volume":"4","author":"GR Bickerton","year":"2012","unstructured":"Bickerton GR, Paolini GV, Besnard J, Muresan S, Hopkins AL (2012) Quantifying the chemical beauty of drugs. Nat Chem 4(2):90\u201398. https:\/\/doi.org\/10.1038\/nchem.1243","journal-title":"Nat Chem"},{"issue":"1","key":"936_CR19","doi-asserted-by":"publisher","first-page":"8","DOI":"10.1186\/1758-2946-1-8","volume":"1","author":"P Ertl","year":"2009","unstructured":"Ertl P, Schuffenhauer A (2009) Estimation of synthetic accessibility score of drug-like molecules based on molecular complexity and fragment contributions. J Chem Inf 1(1):8. https:\/\/doi.org\/10.1186\/1758-2946-1-8","journal-title":"J Chem Inf"},{"issue":"10","key":"936_CR20","doi-asserted-by":"publisher","first-page":"860","DOI":"10.1038\/s43588-023-00529-9","volume":"3","author":"J Yu","year":"2023","unstructured":"Yu J, Li Z, Chen G, Kong X, Hu J, Wang D, Cao D, Li Y, Huo R, Wang G, Liu X, Jiang H, Li X, Luo X, Zheng M (2023) Computing the relative binding affinity of ligands based on a pairwise binding comparison network. Nat Comput Sci 3(10):860\u2013872. https:\/\/doi.org\/10.1038\/s43588-023-00529-9","journal-title":"Nat Comput Sci"},{"issue":"1","key":"936_CR21","doi-asserted-by":"publisher","first-page":"14","DOI":"10.1093\/bib\/bbad462","volume":"25","author":"HS Tan","year":"2024","unstructured":"Tan HS, Wang ZX, Hu G (2024) GAABind: a geometry-aware attention-based network for accurate protein-ligand binding pose and binding affinity prediction. Brief Bioinform 25(1):14. https:\/\/doi.org\/10.1093\/bib\/bbad462","journal-title":"Brief Bioinform"},{"issue":"13","key":"936_CR22","doi-asserted-by":"publisher","first-page":"3661","DOI":"10.1039\/d1sc06946b","volume":"13","author":"S Moon","year":"2022","unstructured":"Moon S, Zhung W, Yang S, Lim J, Kim WY (2022) PIGNet: a physics-informed deep learning model toward generalized drug-target interaction predictions. Chem Sci 13(13):3661\u20133673. https:\/\/doi.org\/10.1039\/d1sc06946b","journal-title":"Chem Sci"},{"issue":"22","key":"936_CR23","doi-asserted-by":"publisher","first-page":"1","DOI":"10.3390\/ijms21228424","volume":"21","author":"Y Kwon","year":"2020","unstructured":"Kwon Y, Shin WH, Ko J, Lee J (2020) AK-score: accurate protein-ligand binding affinity prediction using an ensemble of 3D-convolutional neural networks. Int J Mol Sci 21(22):1\u201316. https:\/\/doi.org\/10.3390\/ijms21228424","journal-title":"Int J Mol Sci"},{"issue":"12","key":"936_CR24","doi-asserted-by":"publisher","first-page":"5682","DOI":"10.1021\/acs.jcim.0c00599","volume":"60","author":"KF Gao","year":"2020","unstructured":"Gao KF, Nguyen DD, Tu MH, Wei GW (2020) Generative network complex for the automated generation of drug-like molecules. J Chem Inf Model 60(12):5682\u20135698. https:\/\/doi.org\/10.1021\/acs.jcim.0c00599","journal-title":"J Chem Inf Model"},{"key":"936_CR25","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.1705.04612","author":"EJ Bjerrum","year":"2017","unstructured":"Bjerrum EJ, Threlfall R (2017) Molecular generation with recurrent neural networks (RNNs). arXiv preprint. https:\/\/doi.org\/10.48550\/arXiv.1705.04612","journal-title":"arXiv preprint"},{"issue":"11","key":"936_CR26","doi-asserted-by":"publisher","first-page":"746","DOI":"10.1002\/jcc.26494","volume":"42","author":"M Druchok","year":"2021","unstructured":"Druchok M, Yarish D, Gurbych O, Maksymenko M (2021) Toward efficient generation, correction, and properties control of unique drug-like structures. J Comput Chem 42(11):746\u2013760. https:\/\/doi.org\/10.1002\/jcc.26494","journal-title":"J Comput Chem"},{"key":"936_CR27","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2111.04107","author":"P Drot\u00e1r","year":"2021","unstructured":"Drot\u00e1r P, Jamasb AR, Day B, Cangea C, Li\u00f2 P (2021) Structure-aware generation of drug-like molecules. arXiv preprint. https:\/\/doi.org\/10.48550\/arXiv.2111.04107","journal-title":"arXiv preprint"},{"issue":"1","key":"936_CR28","doi-asserted-by":"publisher","first-page":"i326","DOI":"10.1093\/bioinformatics\/btad222","volume":"39","author":"J-N Li","year":"2023","unstructured":"Li J-N, Yang G, Zhao P-C, Wei X-X, Shi J-Y (2023) CProMG: controllable protein-oriented molecule generation with desired binding affinity and drug-like properties. Bioinformatics 39(1):i326\u2013i336. https:\/\/doi.org\/10.1093\/bioinformatics\/btad222","journal-title":"Bioinformatics"},{"key":"936_CR29","doi-asserted-by":"publisher","unstructured":"Lee J, Scheraga HA, Rackovsky S (1997) New optimization method for conformational energy calculations on polypeptides: conformational space annealing. J Comput Chem 18(9):1222\u20131232. https:\/\/doi.org\/10.1002\/(SICI)1096-987X(19970715)18:9%3C1222::AID-JCC10%3E3.0.CO;2-7","DOI":"10.1002\/(SICI)1096-987X(19970715)18:9%3C1222::AID-JCC10%3E3.0.CO;2-7"},{"issue":"15","key":"936_CR30","doi-asserted-by":"publisher","first-page":"3226","DOI":"10.1002\/jcc.21905","volume":"32","author":"W-H Shin","year":"2011","unstructured":"Shin W-H, Heo L, Lee J, Ko J, Seok C, Lee J (2011) LigDockCSA: protein-ligand docking using conformational space annealing. J Comput Chem 32(15):3226\u20133232. https:\/\/doi.org\/10.1002\/jcc.21905","journal-title":"J Comput Chem"},{"issue":"9","key":"936_CR31","doi-asserted-by":"publisher","first-page":"2725","DOI":"10.1002\/prot.23101","volume":"79","author":"H Park","year":"2011","unstructured":"Park H, Ko J, Joo K, Lee J, Seok C, Lee J (2011) Refinement of protein termini in template-based modeling using conformational space annealing. Proteins 79(9):2725\u20132734. https:\/\/doi.org\/10.1002\/prot.23101","journal-title":"Proteins"},{"issue":"8","key":"936_CR32","doi-asserted-by":"publisher","first-page":"2403","DOI":"10.1002\/prot.23059","volume":"79","author":"J Lee","year":"2011","unstructured":"Lee J, Lee J, Sasaki TN, Sasai M, Seok C, Lee J (2011) De novo protein structure prediction by dynamic fragment assembly and conformational space annealing. Proteins 79(8):2403\u20132417. https:\/\/doi.org\/10.1002\/prot.23059","journal-title":"Proteins"},{"issue":"30","key":"936_CR33","doi-asserted-by":"publisher","first-page":"2647","DOI":"10.1002\/jcc.23438","volume":"34","author":"W-H Shin","year":"2013","unstructured":"Shin W-H, Kim J-K, Kim D-S, Seok C (2013) GalaxyDock2: protein-ligand docking using beta-complex and global optimization. J Comput Chem 34(30):2647\u20132656. https:\/\/doi.org\/10.1002\/jcc.23438","journal-title":"J Comput Chem"},{"issue":"1","key":"936_CR34","first-page":"1","volume":"2","author":"W-H Shin","year":"2014","unstructured":"Shin W-H, Lee G-R, Heo L, Lee H, Seok C (2014) Prediction of protein structure and interaction by GALAXY protein modeling programs. Bio Des 2(1):1\u201311","journal-title":"Bio Des"},{"issue":"1","key":"936_CR35","doi-asserted-by":"publisher","first-page":"24","DOI":"10.1186\/s13321-021-00501-7","volume":"13","author":"Y Kwon","year":"2021","unstructured":"Kwon Y, Lee J (2021) MolFinder: an evolutionary algorithm for the global optimization of molecular properties and the extensive exploration of chemical space using SMILES. J Chem Inf 13(1):24\u201324. https:\/\/doi.org\/10.1186\/s13321-021-00501-7","journal-title":"J Chem Inf"},{"issue":"10","key":"936_CR36","doi-asserted-by":"publisher","first-page":"1503","DOI":"10.1002\/cmdc.200800178","volume":"3","author":"J Degen","year":"2008","unstructured":"Degen J, Wegscheid-Gerlach C, Zaliani A, Rarey M (2008) On the art of compiling and using \u2018drug-like\u2019 chemical fragment spaces. Chem Med Chem 3(10):1503\u20131507. https:\/\/doi.org\/10.1002\/cmdc.200800178","journal-title":"Chem Med Chem"},{"issue":"1","key":"936_CR37","doi-asserted-by":"publisher","first-page":"20","DOI":"10.1186\/s13321-024-00812-5","volume":"16","author":"HH Loeffler","year":"2024","unstructured":"Loeffler HH, He J, Tibo A, Janet JP, Voronov A, Mervin LH, Engkvist O (2024) Reinvent 4: modern AI\u2013driven generative molecule design. J Chem Inf 16(1):20. https:\/\/doi.org\/10.1186\/s13321-024-00812-5","journal-title":"J Chem Inf"},{"issue":"D1","key":"936_CR38","doi-asserted-by":"publisher","first-page":"D1170","DOI":"10.1093\/nar\/gkaa920","volume":"49","author":"T Yang","year":"2021","unstructured":"Yang T, Li Z, Chen Y, Feng D, Wang G, Fu Z, Ding X, Tan X, Zhao J, Luo X, Chen K, Jiang H, Zheng M (2021) DrugSpaceX: a large screenable and synthetically tractable database extending drug space. Nucleic Acids Res 49(D1):D1170\u2013D1178. https:\/\/doi.org\/10.1093\/nar\/gkaa920","journal-title":"Nucleic Acids Res"},{"key":"936_CR39","volume-title":"An elementary mathematical theory of classification and prediction","author":"TT Tanimoto","year":"1958","unstructured":"Tanimoto TT (1958) An elementary mathematical theory of classification and prediction. International Business Machines Corporation, New York"},{"issue":"2","key":"936_CR40","doi-asserted-by":"publisher","first-page":"107","DOI":"10.1021\/c160017a018","volume":"5","author":"HL Morgan","year":"1965","unstructured":"Morgan HL (1965) The generation of a unique machine description for chemical structures-a technique developed at chemical abstracts service. J Chem Doc 5(2):107\u2013113. https:\/\/doi.org\/10.1021\/c160017a018","journal-title":"J Chem Doc"},{"issue":"31.10","key":"936_CR41","first-page":"5281","volume":"8","author":"G Landrum","year":"2013","unstructured":"Landrum G (2013) RDKit: a software suite for cheminformatics, computational chemistry, and predictive modeling. Greg Landrum 8(31.10):5281","journal-title":"Greg Landrum"},{"key":"936_CR42","unstructured":"Fragment Collection (2023) Enamine Ltd, Kyiv. https:\/\/enamine.net\/compound-collections\/fragment-collection. Accessed 23 Feb 2023"},{"key":"936_CR43","unstructured":"The PubChem Compound Database (2023) https:\/\/ftp.ncbi.nlm.nih.gov\/pubchem\/Compound\/. Accessed 24 Jul 2023."},{"issue":"31","key":"936_CR44","doi-asserted-by":"publisher","first-page":"2739","DOI":"10.1002\/jcc.26050","volume":"40","author":"J Yang","year":"2019","unstructured":"Yang J, Baek M, Seok C (2019) GalaxyDock3: protein\u2013ligand docking that considers the full ligand conformational flexibility. J Comput Chem 40(31):2739\u20132748. https:\/\/doi.org\/10.1002\/jcc.26050","journal-title":"J Comput Chem"},{"issue":"D1","key":"936_CR45","doi-asserted-by":"publisher","first-page":"1100","DOI":"10.1093\/nar\/gkr777","volume":"40","author":"A Gaulton","year":"2012","unstructured":"Gaulton A, Bellis LJ, Bento AP, Chambers J, Davies M, Hersey A, Light Y, McGlinchey S, Michalovich D, Al-Lazikani B, Overington JP (2012) ChEMBL: a large-scale bioactivity database for drug discovery. Nucleic Acids Res 40(D1):1100\u20131107. https:\/\/doi.org\/10.1093\/nar\/gkr777","journal-title":"Nucleic Acids Res"},{"issue":"1","key":"936_CR46","doi-asserted-by":"publisher","first-page":"235","DOI":"10.1093\/nar\/28.1.235","volume":"28","author":"HM Berman","year":"2000","unstructured":"Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, Shindyalov IN, Bourne PE (2000) The protein data bank. Nucleic Acids Res 28(1):235\u2013242. https:\/\/doi.org\/10.1093\/nar\/28.1.235","journal-title":"Nucleic Acids Res"},{"key":"936_CR47","unstructured":"Enamine HTS collection (2023) https:\/\/enamine.net\/compound-collections\/screening-collection\/hts-collection. Accessed 21 Feb 2023"},{"issue":"1","key":"936_CR48","doi-asserted-by":"publisher","first-page":"177","DOI":"10.1021\/ci049714","volume":"45","author":"JJ Irwin","year":"2005","unstructured":"Irwin JJ, Shoichet BK (2005) ZINC\u2014a free database of commercially available compounds for virtual screening. J Chem Inf Model 45(1):177\u2013182. https:\/\/doi.org\/10.1021\/ci049714","journal-title":"J Chem Inf Model"},{"issue":"11","key":"936_CR49","doi-asserted-by":"publisher","first-page":"2324","DOI":"10.1021\/acs.jcim.5b00559","volume":"55","author":"T Sterling","year":"2015","unstructured":"Sterling T, Irwin JJ (2015) ZINC 15\u2014ligand discovery for everyone. J Chem Inf Model 55(11):2324\u20132337. https:\/\/doi.org\/10.1021\/acs.jcim.5b00559","journal-title":"J Chem Inf Model"},{"key":"936_CR50","unstructured":"ZINC tranches (2023) https:\/\/zinc.docking.org\/tranches\/home. Accessed 22 Jun 2022"},{"key":"936_CR51","doi-asserted-by":"publisher","first-page":"4","DOI":"10.1016\/j.addr.2012.09.019","volume":"64","author":"CA Lipinski","year":"2012","unstructured":"Lipinski CA, Lombardo F, Dominy BW, Feeney PJ (2012) Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings. Adv Drug Deliv Rev 64:4\u201317. https:\/\/doi.org\/10.1016\/j.addr.2012.09.019","journal-title":"Adv Drug Deliv Rev"},{"issue":"1","key":"936_CR52","doi-asserted-by":"publisher","first-page":"198","DOI":"10.1093\/nar\/gkl999","volume":"35","author":"T Liu","year":"2007","unstructured":"Liu T, Lin Y, Wen X, Jorissen RN, Gilson MK (2007) BindingDB: a web-accessible database of experimentally determined protein-ligand binding affinities. Nucleic Acids Res 35(1):198\u2013201. https:\/\/doi.org\/10.1093\/nar\/gkl999","journal-title":"Nucleic Acids Res"},{"issue":"1","key":"936_CR53","doi-asserted-by":"publisher","first-page":"1045","DOI":"10.1093\/nar\/gkv1072","volume":"44","author":"MK Gilson","year":"2016","unstructured":"Gilson MK, Liu T, Baitaluk M, Nicola G, Hwang L, Chong J (2016) BindingDB in 2015: a public database for medicinal chemistry, computational chemistry and systems pharmacology. Nucleic Acids Res 44(1):1045\u20131053. https:\/\/doi.org\/10.1093\/nar\/gkv1072","journal-title":"Nucleic Acids Res"},{"key":"936_CR54","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1016\/j.csbj.2022.11.047","volume":"21","author":"S Kwon","year":"2023","unstructured":"Kwon S, Seok C (2023) CSAlign and CSAlign-Dock: structure alignment of ligands considering full flexibility and application to protein-ligand docking. Comput Struct Biotechnol J 21:1\u201310. https:\/\/doi.org\/10.1016\/j.csbj.2022.11.047","journal-title":"Comput Struct Biotechnol J"},{"key":"936_CR55","doi-asserted-by":"publisher","first-page":"102559","DOI":"10.1016\/j.sbi.2023.102559","volume":"79","author":"M Thomas","year":"2023","unstructured":"Thomas M, Bender A, de Graaf C (2023) Integrating structure-based approaches in generative molecular design. Curr Opin Struct Biol 79:102559. https:\/\/doi.org\/10.1016\/j.sbi.2023.102559","journal-title":"Curr Opin Struct Biol"},{"key":"936_CR56","doi-asserted-by":"publisher","DOI":"10.1101\/2024.06.03.597263","author":"O Zhang","year":"2024","unstructured":"Zhang O, Jin J, Lin H, Zhang J, Hua C, Huang Y, Zhao H, Hsieh C-Y, Hou T (2024) ECloudGen: access to broader chemical space for structure-based molecule generation. bioRxiv. https:\/\/doi.org\/10.1101\/2024.06.03.597263","journal-title":"bioRxiv"},{"key":"936_CR57","first-page":"2825","volume":"12","author":"F Pedregosa","year":"2011","unstructured":"Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V, Vanderplas J, Passos A, Cournapeau D, Brucher M, Perrot M, Duchesnay \u00c9 (2011) Scikit-learn: machine learning in Python. J Mach Learn Res 12:2825\u20132830.\u00a0","journal-title":"J Mach Learn Res"}],"container-title":["Journal of Cheminformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-024-00936-8.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s13321-024-00936-8\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-024-00936-8.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,12,5]],"date-time":"2024-12-05T09:07:01Z","timestamp":1733389621000},"score":1,"resource":{"primary":{"URL":"https:\/\/jcheminf.biomedcentral.com\/articles\/10.1186\/s13321-024-00936-8"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,12,5]]},"references-count":57,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2024,12]]}},"alternative-id":["936"],"URL":"https:\/\/doi.org\/10.1186\/s13321-024-00936-8","relation":{"has-preprint":[{"id-type":"doi","id":"10.21203\/rs.3.rs-4709199\/v1","asserted-by":"object"}]},"ISSN":["1758-2946"],"issn-type":[{"value":"1758-2946","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,12,5]]},"assertion":[{"value":"9 July 2024","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"22 November 2024","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"5 December 2024","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare no competing interests.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"137"}}