{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,28]],"date-time":"2026-02-28T02:06:54Z","timestamp":1772244414715,"version":"3.50.1"},"reference-count":37,"publisher":"Frontiers Media SA","license":[{"start":{"date-parts":[[2025,9,8]],"date-time":"2025-09-08T00:00:00Z","timestamp":1757289600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["frontiersin.org"],"crossmark-restriction":true},"short-container-title":["Front. Bioinform."],"abstract":"<jats:p>\n                    Structure-based drug design (SBDD) is enhanced by machine learning (ML) to improve both virtual screening and\n                    <jats:italic>de novo<\/jats:italic>\n                    design. Despite advances in ML tools for both strategies, screening remains bounded by time and computational cost, while generative models frequently produce invalid and synthetically inaccessible molecules. Screening time can be improved with pharmacophore search, which quickly identifies ligands in a database that match a pharmacophore query. In this work, we introduce PharmacoForge, a diffusion model for generating 3D pharmacophores conditioned on a protein pocket. Generated pharmacophore queries identify ligands that are guaranteed to be valid, commercially available molecules. We evaluate PharmacoForge against automated pharmacophore generation methods using the LIT-PCBA benchmark and ligand generative models through a docking-based evaluation framework. We further assess pharmacophore quality through a retrospective screening of the DUD-E dataset. PharmacoForge surpasses other pharmacophore generation methods in the LIT-PCBA benchmark, and resulting ligands from pharmacophore queries performed similarly to\n                    <jats:italic>de novo<\/jats:italic>\n                    generated ligands when docking to DUD-E targets and had lower strain energies compared to\n                    <jats:italic>de novo<\/jats:italic>\n                    generated ligands.\n                  <\/jats:p>","DOI":"10.3389\/fbinf.2025.1628800","type":"journal-article","created":{"date-parts":[[2025,9,8]],"date-time":"2025-09-08T09:15:22Z","timestamp":1757322922000},"update-policy":"https:\/\/doi.org\/10.3389\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["PharmacoForge: pharmacophore generation with diffusion models"],"prefix":"10.3389","volume":"5","author":[{"given":"Emma L.","family":"Flynn","sequence":"first","affiliation":[]},{"given":"Riya","family":"Shah","sequence":"additional","affiliation":[]},{"given":"Ian","family":"Dunn","sequence":"additional","affiliation":[]},{"given":"Rishal","family":"Aggarwal","sequence":"additional","affiliation":[]},{"given":"David Ryan","family":"Koes","sequence":"additional","affiliation":[]}],"member":"1965","published-online":{"date-parts":[[2025,9,8]]},"reference":[{"key":"B1","doi-asserted-by":"publisher","first-page":"301","DOI":"10.1186\/s12915-024-02096-5","article-title":"PharmRL: pharmacophore elucidation with deep geometric reinforcement learning","volume":"22","author":"Aggarwal","year":"2024","journal-title":"BMC Biol."},{"key":"B2","doi-asserted-by":"publisher","first-page":"787","DOI":"10.1016\/j.chembiol.2003.09.002","article-title":"The process of structure-based drug design","volume":"10","author":"Anderson","year":"2003","journal-title":"Chem. and Biol."},{"key":"B3","doi-asserted-by":"publisher","first-page":"2783","DOI":"10.3390\/ijms20112783","article-title":"A structure-based drug discovery paradigm","volume":"20","author":"Batool","year":"2019","journal-title":"Int. J. Mol. Sci."},{"key":"B4","doi-asserted-by":"publisher","first-page":"1807","DOI":"10.1016\/j.drudis.2020.07.024","article-title":"High-throughput screening: today\u2019s biochemical and cell-based approaches","volume":"25","author":"Blay","year":"2020","journal-title":"Drug Discov. Today"},{"key":"B5","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2311.13466","article-title":"Accelerating inference in molecular diffusion models with latent representations of protein structure","author":"Dunn","year":"2024","journal-title":"arXiv Prepr. arXiv:2311.13466"},{"key":"B6","doi-asserted-by":"publisher","first-page":"168","DOI":"10.1111\/j.1747-0285.2008.00761.x","article-title":"AutoGrow: a novel algorithm for protein inhibitor design","volume":"73","author":"Durrant","year":"2009","journal-title":"Chem. Biol. and Drug Des."},{"key":"B7","doi-asserted-by":"publisher","first-page":"4200","DOI":"10.1021\/acs.jcim.0c00411","article-title":"Three-dimensional convolutional neural networks and a cross-docked data set for structure-based drug design","volume":"60","author":"Francoeur","year":"2020","journal-title":"J. Chem. Inf. Model."},{"key":"B8","doi-asserted-by":"publisher","first-page":"939","DOI":"10.1021\/acscentsci.0c00229","article-title":"Deep docking: a deep learning platform for augmentation of structure based drug discovery","volume":"6","author":"Gentile","year":"2020","journal-title":"ACS Central Sci."},{"key":"B9","doi-asserted-by":"publisher","first-page":"672","DOI":"10.1038\/s41596-021-00659-2","article-title":"Artificial intelligence\u2013enabled virtual screening of ultra-large chemical libraries with deep docking","volume":"17","author":"Gentile","year":"2022","journal-title":"Nat. Protoc."},{"key":"B10","doi-asserted-by":"publisher","first-page":"646","DOI":"10.3390\/ph15050646","article-title":"Drug design by pharmacophore and virtual screening approach","volume":"15","author":"Giordano","year":"2022","journal-title":"Pharmaceuticals"},{"key":"B11","doi-asserted-by":"publisher","first-page":"101","DOI":"10.1021\/acs.jcim.2c00814","article-title":"Apo2ph4: a versatile workflow for the generation of receptor-based pharmacophore models for virtual screening","volume":"63","author":"Heider","year":"2022","journal-title":"J. Chem. Inf. Model."},{"key":"B12","doi-asserted-by":"publisher","first-page":"6840","DOI":"10.48550\/arXiv.2006.11239","article-title":"Denoising diffusion probabilistic models","volume":"33","author":"Ho","year":"2020","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"B13","first-page":"8867","article-title":"Equivariant diffusion for molecule generation in 3D","volume-title":"International conference on machine learning","author":"Hoogeboom","year":"2022"},{"key":"B14","doi-asserted-by":"publisher","first-page":"1239","DOI":"10.1111\/j.1476-5381.2010.01127.x","article-title":"Principles of early drug discovery","volume":"162","author":"Hughes","year":"2011","journal-title":"Br. J. Pharmacol."},{"key":"B15","doi-asserted-by":"publisher","first-page":"14577","DOI":"10.1039\/D1SC02436A","article-title":"Deep generative design with 3D pharmacophoric constraints","volume":"12","author":"Imrie","year":"2021","journal-title":"Chem. Sci."},{"key":"B16","article-title":"Learning from protein structure with geometric vector perceptrons","volume-title":"International conference on learning representations","author":"Jing","year":"2020"},{"key":"B17","doi-asserted-by":"publisher","first-page":"22799","DOI":"10.3390\/molecules201219880","article-title":"Pharmacophore models and pharmacophore-based virtual screening: concepts and applications exemplified on hydroxysteroid dehydrogenases","volume":"20","author":"Kaserer","year":"2015","journal-title":"Molecules"},{"key":"B18","doi-asserted-by":"crossref","first-page":"167","DOI":"10.1007\/7653_2015_46","article-title":"Pharmacophore modeling: methods and applications","volume-title":"Computer-aided drug discovery","author":"Koes","year":"2015"},{"key":"B19","doi-asserted-by":"publisher","first-page":"1307","DOI":"10.1021\/ci200097m","article-title":"Pharmer: efficient and exact pharmacophore search","volume":"51","author":"Koes","year":"2011","journal-title":"J. Chem. Inf. Model."},{"key":"B20","unstructured":"RDKit: open-source cheminformatics\n          \n          \n            \n              Landrum\n              G.\n            \n          \n          \n          2024"},{"key":"B21","doi-asserted-by":"publisher","first-page":"43","DOI":"10.1186\/s13321-021-00522-2","article-title":"GNINA 1.0: molecular docking with deep learning","volume":"13","author":"McNutt","year":"2021","journal-title":"J. Cheminformatics"},{"key":"B22","doi-asserted-by":"publisher","first-page":"6582","DOI":"10.1021\/jm300687e","article-title":"Directory of useful decoys, enhanced (DUD-E): better ligands and decoys for better benchmarking","volume":"55","author":"Mysinger","year":"2012","journal-title":"J. Med. Chem."},{"key":"B23","first-page":"17644","article-title":"Pocket2Mol: efficient molecular sampling based on 3D protein pockets","volume-title":"International conference on machine learning","author":"Peng","year":"2022"},{"key":"B24","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2405.03961","article-title":"Structure-based drug design by denoising voxel grids","author":"Pinheiro","year":"2024","journal-title":"arXiv Prepr. arXiv:2405.03961"},{"key":"B25","doi-asserted-by":"publisher","first-page":"2701","DOI":"10.1039\/D1SC05976A","article-title":"Generating 3D molecules conditional on receptor binding sites with deep generative models","volume":"13","author":"Ragoza","year":"2022","journal-title":"Chem. Sci."},{"key":"B26","doi-asserted-by":"publisher","first-page":"673","DOI":"10.1038\/s41586-023-05905-z","article-title":"Computational approaches streamlining drug discovery","volume":"616","author":"Sadybekov","year":"2023","journal-title":"Nature"},{"key":"B27","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2210.13695","article-title":"Structure-based drug design with equivariant diffusion models","author":"Schneuing","year":"2022"},{"key":"B28","first-page":"2256","article-title":"Deep unsupervised learning using nonequilibrium thermodynamics","volume-title":"International conference on machine learning","author":"Sohl-Dickstein","year":"2015"},{"key":"B29","article-title":"Score-based generative modeling through stochastic differential equations","volume-title":"International conference on learning representations","author":"Song","year":"2020"},{"key":"B30","doi-asserted-by":"publisher","first-page":"W442","DOI":"10.1093\/nar\/gkw287","article-title":"Pharmit: interactive exploration of chemical space","volume":"44","author":"Sunseri","year":"2016","journal-title":"Nucleic Acids Res."},{"key":"B31","doi-asserted-by":"publisher","first-page":"7369","DOI":"10.3390\/molecules26237369","article-title":"Virtual screening with GNINA 1.0","volume":"26","author":"Sunseri","year":"2021","journal-title":"Molecules"},{"key":"B32","doi-asserted-by":"publisher","first-page":"4263","DOI":"10.1021\/acs.jcim.0c00155","article-title":"LIT-PCBA: an unbiased data set for machine learning and virtual screening","volume":"60","author":"Tran-Nguyen","year":"2020","journal-title":"J. Chem. Inf. Model."},{"key":"B33","doi-asserted-by":"publisher","first-page":"455","DOI":"10.1002\/jcc.21334","article-title":"AutoDock vina: improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading","volume":"31","author":"Trott","year":"2010","journal-title":"J. Comput. Chem."},{"key":"B34","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2412.19812","article-title":"Pharmacophore-guided de novo drug design with diffusion bridge","author":"Wang","year":"2024","journal-title":"arXiv Prepr. arXiv:2412.19812"},{"key":"B35","doi-asserted-by":"publisher","first-page":"9478","DOI":"10.1021\/acs.jmedchem.2c00732","article-title":"RELATION: a deep generative model for structure-based de novo drug design","volume":"65","author":"Wang","year":"2022","journal-title":"J. Med. Chem."},{"key":"B36","doi-asserted-by":"publisher","first-page":"6234","DOI":"10.1038\/s41467-023-41454-9","article-title":"A pharmacophore-guided deep learning approach for bioactive molecular generation","volume":"14","author":"Zhu","year":"2023","journal-title":"Nat. Commun."},{"key":"B37","doi-asserted-by":"publisher","first-page":"4263","DOI":"10.1021\/acs.jcim.4c02008","article-title":"MolSnapper: conditioning diffusion for structure-based drug design","volume":"65","author":"Ziv","year":"2025","journal-title":"J. Chem. Inf. Model."}],"container-title":["Frontiers in Bioinformatics"],"original-title":[],"link":[{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/fbinf.2025.1628800\/full","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,9,8]],"date-time":"2025-09-08T09:15:24Z","timestamp":1757322924000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/fbinf.2025.1628800\/full"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,9,8]]},"references-count":37,"alternative-id":["10.3389\/fbinf.2025.1628800"],"URL":"https:\/\/doi.org\/10.3389\/fbinf.2025.1628800","relation":{"has-preprint":[{"id-type":"doi","id":"10.26434\/chemrxiv-2025-dz765-v2","asserted-by":"object"},{"id-type":"doi","id":"10.26434\/chemrxiv-2025-dz765","asserted-by":"object"}]},"ISSN":["2673-7647"],"issn-type":[{"value":"2673-7647","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,9,8]]},"article-number":"1628800"}}