{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T00:52:27Z","timestamp":1760143947894,"version":"build-2065373602"},"reference-count":51,"publisher":"MDPI AG","issue":"1","license":[{"start":{"date-parts":[[2024,3,11]],"date-time":"2024-03-11T00:00:00Z","timestamp":1710115200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/100000774","name":"Defense Threat Reduction Agency (DTRA)","doi-asserted-by":"publisher","award":["HDTRA1036045","CB100035","LLNL-JRNL-858354"],"award-info":[{"award-number":["HDTRA1036045","CB100035","LLNL-JRNL-858354"]}],"id":[{"id":"10.13039\/100000774","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000774","name":"Joint Science and Technology Office (JSTO)","doi-asserted-by":"publisher","award":["HDTRA1036045","CB100035","LLNL-JRNL-858354"],"award-info":[{"award-number":["HDTRA1036045","CB100035","LLNL-JRNL-858354"]}],"id":[{"id":"10.13039\/100000774","id-type":"DOI","asserted-by":"publisher"}]},{"name":"U.S. Department of Energy","award":["HDTRA1036045","CB100035","LLNL-JRNL-858354"],"award-info":[{"award-number":["HDTRA1036045","CB100035","LLNL-JRNL-858354"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["MAKE"],"abstract":"<jats:p>Decades of drug development research have explored a vast chemical space for highly active compounds. The exponential growth of virtual libraries enables easy access to billions of synthesizable molecules. Computational modeling, particularly molecular docking, utilizes physics-based calculations to prioritize molecules for synthesis and testing. Nevertheless, the molecular docking process often yields docking poses with favorable scores that prove to be inaccurate with experimental testing. To address these issues, several approaches using machine learning (ML) have been proposed to filter incorrect poses based on the crystal structures. However, most of the methods are limited by the availability of structure data. Here, we propose a new pose classification approach, PECAN2 (Pose Classification with 3D Atomic Network 2), without the need for crystal structures, based on a 3D atomic neural network with Point Cloud Network (PCN). The new approach uses the correlation between docking scores and experimental data to assign labels, instead of relying on the crystal structures. We validate the proposed classifier on multiple datasets including human mu, delta, and kappa opioid receptors and SARS-CoV-2 Mpro. Our results demonstrate that leveraging the correlation between docking scores and experimental data alone enhances molecular docking performance by filtering out false positives and false negatives.<\/jats:p>","DOI":"10.3390\/make6010030","type":"journal-article","created":{"date-parts":[[2024,3,11]],"date-time":"2024-03-11T08:56:41Z","timestamp":1710147401000},"page":"642-657","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":2,"title":["Enhancing Docking Accuracy with PECAN2, a 3D Atomic Neural Network Trained without Co-Complex Crystal Structures"],"prefix":"10.3390","volume":"6","author":[{"given":"Heesung","family":"Shim","sequence":"first","affiliation":[{"name":"Physical and Life Sciences, Lawrence Livermore National Laboratory, Livermore, CA 94550, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4359-8263","authenticated-orcid":false,"given":"Jonathan E.","family":"Allen","sequence":"additional","affiliation":[{"name":"Global Security Computing Applications Division, Lawrence Livermore National Laboratory, Livermore, CA 94550, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3993-9077","authenticated-orcid":false,"given":"W. F. Drew","family":"Bennett","sequence":"additional","affiliation":[{"name":"Physical and Life Sciences, Lawrence Livermore National Laboratory, Livermore, CA 94550, USA"}]}],"member":"1968","published-online":{"date-parts":[[2024,3,11]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"712","DOI":"10.1038\/s41589-022-01234-w","article-title":"Modeling the expansion of virtual screening libraries","volume":"19","author":"Lyu","year":"2023","journal-title":"Nat. Chem. Biol."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"727","DOI":"10.1006\/jmbi.1996.0897","article-title":"Development and validation of a genetic algorithm for flexible docking","volume":"267","author":"Jones","year":"1997","journal-title":"J. Mol. Biol."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"411","DOI":"10.1023\/A:1011115820450","article-title":"DOCK 4.0: Search strategies for automated molecular docking of flexible molecule databases","volume":"15","author":"Ewing","year":"2001","journal-title":"J. Comput. Aided Mol. Des."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"5895","DOI":"10.1021\/jm050038s","article-title":"PMF scoring revisited","volume":"49","author":"Muegge","year":"2006","journal-title":"J. Med. Chem."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"6296","DOI":"10.1021\/jm050436v","article-title":"DrugScore(CSD)-knowledge-based scoring function derived from small molecule crystal data with superior recognition rate of near-native ligand poses and better affinity prediction","volume":"48","author":"Velec","year":"2005","journal-title":"J. Med. Chem."},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Jin, Z., Wu, T., Chen, T., Pan, D., Wang, X., Xie, J., Quan, L., and Lyu, Q. (2023). CAPLA: Improved prediction of protein-ligand binding affinity by a deep learning approach based on a cross-attention mechanism. Bioinformatics, 39.","DOI":"10.1093\/bioinformatics\/btad049"},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"431","DOI":"10.1146\/annurev.bb.18.060189.002243","article-title":"Free energy via molecular simulation: Applications to chemical and biomolecular systems","volume":"18","author":"Beveridge","year":"1989","journal-title":"Annu. Rev. Biophys. Biophys. Chem."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"2395","DOI":"10.1021\/cr00023a004","article-title":"Free energy calculations: Applications to chemical and biochemical phenomena","volume":"93","author":"Kollman","year":"1993","journal-title":"Chem. Rev."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"1866","DOI":"10.1002\/jcc.20504","article-title":"An iterative knowledge-based scoring function to predict protein-ligand interactions: I. Derivation of interaction potentials","volume":"27","author":"Huang","year":"2006","journal-title":"J. Comput. Chem."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"425","DOI":"10.1023\/A:1007996124545","article-title":"Empirical scoring functions: I. The development of a fast empirical scoring function to estimate the binding affinity of ligands in receptor complexes","volume":"11","author":"Eldridge","year":"1997","journal-title":"J. Comput. Aided Mol. Des."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1023\/A:1016357811882","article-title":"Further development and validation of empirical scoring functions for structure-based binding affinity prediction","volume":"16","author":"Wang","year":"2002","journal-title":"J. Comput. Aided Mol. Des."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"273","DOI":"10.1007\/BF00994018","article-title":"Support-vector networks","volume":"20","author":"Cortes","year":"1995","journal-title":"Mach. Learn."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1023\/A:1010933404324","article-title":"Random Forests","volume":"45","author":"Breiman","year":"2001","journal-title":"Mach. Learn."},{"key":"ref_14","unstructured":"Mason, L., Baxter, J., Bartlett, P., and Frean, M. (1999). Boosting Algorithms as Gradient Descent in Function Space, NIPS."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"367","DOI":"10.1016\/S0167-9473(01)00065-2","article-title":"Stochastic gradient boosting","volume":"38","author":"Friedman","year":"2002","journal-title":"Comput. Stat. Data Anal."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"84","DOI":"10.1145\/3065386","article-title":"ImageNet classification with deep convolutional neural networks","volume":"60","author":"Krizhevsky","year":"2017","journal-title":"Commun. ACM"},{"key":"ref_17","unstructured":"Wallach, I., Dzamba, M., and Heifets, A. (2015). AtomNet: A Deep Convolutional Neural Network for Bioactivity Prediction in Structure-based Drug Discovery. arXiv."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"3666","DOI":"10.1093\/bioinformatics\/bty374","article-title":"Development and evaluation of a deep learning model for protein\u2013ligand binding affinity prediction","volume":"34","author":"Zielenkiewicz","year":"2018","journal-title":"Bioinformatics"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Li, Y., Rezaei, M.A., Li, C., Li, X., and Wu, D. (2019). DeepAtom: A Framework for Protein-Ligand Binding Affinity Prediction. arXiv.","DOI":"10.1109\/BIBM47256.2019.8982964"},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"15956","DOI":"10.1021\/acsomega.9b01997","article-title":"OnionNet: A Multiple-Layer Intermolecular-Contact-Based Convolutional Neural Network for Protein\u2013Ligand Binding Affinity Prediction","volume":"4","author":"Zheng","year":"2019","journal-title":"ACS Omega"},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"753002","DOI":"10.3389\/fchem.2021.753002","article-title":"OnionNet-2: A Convolutional Neural Network Model for Predicting Protein-Ligand Binding Affinity Based on Residue-Atom Contacting Shells","volume":"9","author":"Wang","year":"2021","journal-title":"Front. Chem."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"2791","DOI":"10.1021\/acs.jcim.0c00075","article-title":"RosENet: Improving Binding Affinity Prediction by Leveraging Molecular Mechanics Energies with an Ensemble of 3D Convolutional Neural Networks","volume":"60","author":"Zhang","year":"2020","journal-title":"J. Chem. Inf. Model."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"1583","DOI":"10.1021\/acs.jcim.0c01306","article-title":"Improved Protein-Ligand Binding Affinity Prediction with Structure-Based Deep Fusion Inference","volume":"61","author":"Jones","year":"2021","journal-title":"J. Chem. Inf. Model."},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Atz, K., Grisoni, F., and Schneider, G. (2021). Geometric Deep Learning on Molecular Representations. arXiv.","DOI":"10.1038\/s42256-021-00418-8"},{"key":"ref_25","unstructured":"Bronstein, M.M., Bruna, J., Cohen, T., and Veli\u010dkovi\u0107, P. (2021). Geometric Deep Learning: Grids, Groups, Graphs, Geodesics, and Gauges. arXiv."},{"key":"ref_26","unstructured":"Corso, G., St\u00e4rk, H., Jing, B., Barzilay, R., and Jaakkola, T. (2022). DiffDock: Diffusion Steps, Twists, and Turns for Molecular Docking. arXiv."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"18699","DOI":"10.1021\/acsomega.2c01404","article-title":"MegaSyn: Integrating Generative Molecular Design, Automated Analog Designer, and Synthetic Viability Prediction","volume":"7","author":"Urbina","year":"2022","journal-title":"ACS Omega"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Buttenschoen, M., Morris, G.M., and Deane, C.M. (2023). PoseBusters: AI-based docking methods fail to generate physically valid poses or generalise to novel sequences. arXiv.","DOI":"10.1039\/D3SC04185A"},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"2977","DOI":"10.1021\/jm030580l","article-title":"The PDBbind database: Collection of binding affinities for protein-ligand complexes with known three-dimensional structures","volume":"47","author":"Wang","year":"2004","journal-title":"J. Med. Chem."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"377","DOI":"10.1146\/annurev-biophys-070317-033239","article-title":"Serial Femtosecond Crystallography of G Protein-Coupled Receptors","volume":"47","author":"Stauch","year":"2018","journal-title":"Annu. Rev. Biophys."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"69","DOI":"10.3389\/fphar.2020.00069","article-title":"Predicting or Pretending: Artificial Intelligence for Protein-Ligand Interactions Lack of Sufficiently Large and Unbiased Datasets","volume":"11","author":"Yang","year":"2020","journal-title":"Front. Pharmacol."},{"key":"ref_32","unstructured":"Li, J., Guan, X., Zhang, O., Sun, K., Wang, Y., Bagni, D., and Head-Gordon, T. (2023). Leak Proof PDBBind: A Reorganized Dataset of Protein-Ligand Complexes for More Generalizable Binding Affinity Prediction. arXiv."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"2301","DOI":"10.1021\/acs.jcim.1c01510","article-title":"Pose Classification Using Three-Dimensional Atomic Structure-Based Neural Networks Applied to Ion Channel-Ligand Docking","volume":"62","author":"Shim","year":"2022","journal-title":"J. Chem. Inf. Model."},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"D198","DOI":"10.1093\/nar\/gkl999","article-title":"BindingDB: A web-accessible database of experimentally determined protein-ligand binding affinities","volume":"35","author":"Liu","year":"2007","journal-title":"Nucleic Acids Res."},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"D1334","DOI":"10.1093\/nar\/gkaa993","article-title":"TCRD and Pharos 2021: Mining the human proteome for disease biology","volume":"49","author":"Sheils","year":"2021","journal-title":"Nucleic Acids Res."},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"321","DOI":"10.1038\/nature10954","article-title":"Crystal structure of the micro-opioid receptor bound to a morphinan antagonist","volume":"485","author":"Manglik","year":"2012","journal-title":"Nature"},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"327","DOI":"10.1038\/nature10939","article-title":"Structure of the human kappa-opioid receptor in complex with JDTic","volume":"485","author":"Wu","year":"2012","journal-title":"Nature"},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"265","DOI":"10.1038\/nsmb.2965","article-title":"Structural basis for bifunctional peptide recognition at human delta-opioid receptor","volume":"22","author":"Fenalti","year":"2015","journal-title":"Nat. Struct. Mol. Biol."},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"15487","DOI":"10.1073\/pnas.0805590105","article-title":"Evolution of vertebrate opioid receptors","volume":"105","author":"Dreborg","year":"2008","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"289","DOI":"10.1038\/s41586-020-2223-y","article-title":"Structure of M(pro) from SARS-CoV-2 and discovery of its inhibitors","volume":"582","author":"Jin","year":"2020","journal-title":"Nature"},{"key":"ref_41","unstructured":"(2023, September 28). Global Online Structure Activity Relationship Database (GOSTAR), Excelra. Available online: https:\/\/www.gostardb.com\/gostar\/newui\/applications.jsp."},{"key":"ref_42","doi-asserted-by":"crossref","first-page":"5909","DOI":"10.1039\/D1CC00050K","article-title":"Discovery of SARS-CoV-2 main protease inhibitors using a synthesis-directed de novo design model","volume":"57","author":"Morris","year":"2021","journal-title":"Chem. Commun."},{"key":"ref_43","unstructured":"(2023). Molecular Operating Environment (MOE), Chemical Computing Group."},{"key":"ref_44","doi-asserted-by":"crossref","first-page":"3891","DOI":"10.1021\/acs.jcim.1c00203","article-title":"AutoDock Vina 1.2.0: New Docking Methods, Expanded Force Field, and Python Bindings","volume":"61","author":"Eberhardt","year":"2021","journal-title":"J. Chem. Inf. Model."},{"key":"ref_45","doi-asserted-by":"crossref","first-page":"324","DOI":"10.1021\/ci4005145","article-title":"Toward fully automated high performance computing drug discovery: A massively parallel virtual screening pipeline for docking and molecular mechanics\/generalized Born surface area rescoring to improve enrichment","volume":"54","author":"Zhang","year":"2014","journal-title":"J. Chem. Inf. Model."},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"26","DOI":"10.1186\/s13321-015-0078-2","article-title":"Open Drug Discovery Toolkit (ODDT): A new open-source player in the drug discovery field","volume":"7","author":"Wojcikowski","year":"2015","journal-title":"J. Cheminform."},{"key":"ref_47","unstructured":"Qi, C.R., Su, H., Mo, K., and Guibas, L.J. (2016). PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation. arXiv."},{"key":"ref_48","doi-asserted-by":"crossref","first-page":"1955","DOI":"10.1021\/acs.jcim.9b01053","article-title":"AMPL: A Data-Driven Modeling Pipeline for Drug Discovery","volume":"60","author":"Minnich","year":"2020","journal-title":"J. Chem. Inf. Model."},{"key":"ref_49","doi-asserted-by":"crossref","first-page":"2887","DOI":"10.1021\/jm9602928","article-title":"The properties of known drugs. 1. Molecular frameworks","volume":"39","author":"Bemis","year":"1996","journal-title":"J. Med. Chem."},{"key":"ref_50","doi-asserted-by":"crossref","first-page":"22496","DOI":"10.1021\/acsomega.3c00085","article-title":"SS-GNN: A Simple-Structured Graph Neural Network for Affinity Prediction","volume":"8","author":"Zhang","year":"2023","journal-title":"ACS Omega"},{"key":"ref_51","doi-asserted-by":"crossref","first-page":"81","DOI":"10.1186\/s13321-021-00560-w","article-title":"The impact of cross-docked poses on performance of machine learning classifier for protein-ligand binding pose prediction","volume":"13","author":"Shen","year":"2021","journal-title":"J. Cheminform."}],"container-title":["Machine Learning and Knowledge Extraction"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2504-4990\/6\/1\/30\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T14:11:58Z","timestamp":1760105518000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2504-4990\/6\/1\/30"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,3,11]]},"references-count":51,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2024,3]]}},"alternative-id":["make6010030"],"URL":"https:\/\/doi.org\/10.3390\/make6010030","relation":{},"ISSN":["2504-4990"],"issn-type":[{"type":"electronic","value":"2504-4990"}],"subject":[],"published":{"date-parts":[[2024,3,11]]}}}