{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,27]],"date-time":"2026-02-27T04:28:29Z","timestamp":1772166509764,"version":"3.50.1"},"reference-count":66,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2023,8,28]],"date-time":"2023-08-28T00:00:00Z","timestamp":1693180800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2023,8,28]],"date-time":"2023-08-28T00:00:00Z","timestamp":1693180800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100021821","name":"Oncode Institute","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100021821","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Cheminform"],"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>Proteochemometric (PCM) modelling is a powerful computational drug discovery tool used in bioactivity prediction of potential drug candidates relying on both chemical and protein information. In PCM features are computed to describe small molecules and proteins, which directly impact the quality of the predictive models. State-of-the-art protein descriptors, however, are calculated from the protein sequence and neglect the dynamic nature of proteins. This dynamic nature can be computationally simulated with molecular dynamics (MD). Here, novel 3D dynamic protein descriptors (3DDPDs) were designed to be applied in bioactivity prediction tasks with PCM models. As a test case, publicly available G protein-coupled receptor (GPCR) MD data from GPCRmd was used. GPCRs are membrane-bound proteins, which are activated by hormones and neurotransmitters, and constitute an important target family for drug discovery. GPCRs exist in different conformational states that allow the transmission of diverse signals and that can be modified by ligand interactions, among other factors. To translate the MD-encoded protein dynamics two types of 3DDPDs were considered: one-hot encoded residue-specific (rs) and embedding-like protein-specific (ps) 3DDPDs. The descriptors were developed by calculating distributions of trajectory coordinates and partial charges, applying dimensionality reduction, and subsequently condensing them into vectors per residue or protein, respectively. 3DDPDs were benchmarked on several PCM tasks against state-of-the-art non-dynamic protein descriptors. Our rs- and ps3DDPDs outperformed non-dynamic descriptors in regression tasks using a temporal split and showed comparable performance with a random split and in all classification tasks. Combinations of non-dynamic descriptors with 3DDPDs did not result in increased performance. Finally, the power of 3DDPDs to capture dynamic fluctuations in mutant GPCRs was explored. The results presented here show the potential of including protein dynamic information on machine learning tasks, specifically bioactivity prediction, and open opportunities for applications in drug discovery, including oncology.<\/jats:p>","DOI":"10.1186\/s13321-023-00745-5","type":"journal-article","created":{"date-parts":[[2023,8,28]],"date-time":"2023-08-28T06:03:37Z","timestamp":1693202617000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":5,"title":["3DDPDs: describing protein dynamics for proteochemometric bioactivity prediction. A case for (mutant) G protein-coupled receptors"],"prefix":"10.1186","volume":"15","author":[{"given":"Marina","family":"Gorostiola Gonz\u00e1lez","sequence":"first","affiliation":[]},{"given":"Remco L.","family":"van den Broek","sequence":"additional","affiliation":[]},{"given":"Thomas G. M.","family":"Braun","sequence":"additional","affiliation":[]},{"given":"Magdalini","family":"Chatzopoulou","sequence":"additional","affiliation":[]},{"given":"Willem","family":"Jespers","sequence":"additional","affiliation":[]},{"given":"Adriaan P.","family":"IJzerman","sequence":"additional","affiliation":[]},{"given":"Laura H.","family":"Heitman","sequence":"additional","affiliation":[]},{"given":"Gerard J. P.","family":"van Westen","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2023,8,28]]},"reference":[{"key":"745_CR1","doi-asserted-by":"publisher","DOI":"10.1016\/J.JBC.2021.100559","author":"SK Burley","year":"2021","unstructured":"Burley SK (2021) Impact of structural biologists and the Protein Data Bank on small-molecule drug discovery and development. J Biol Chem 296:100559. https:\/\/doi.org\/10.1016\/J.JBC.2021.100559","journal-title":"J Biol Chem"},{"key":"745_CR2","doi-asserted-by":"publisher","DOI":"10.1016\/j.csbj.2021.08.011","author":"P Carracedo-Reboredo","year":"2021","unstructured":"Carracedo-Reboredo P, Li\u00f1ares-Blanco J, Rodr\u00edguez-Fern\u00e1ndez N et al (2021) A review on machine learning approaches and trends in drug discovery. Comput Struct Biotechnol J 19:4538\u20134558. https:\/\/doi.org\/10.1016\/j.csbj.2021.08.011","journal-title":"Comput Struct Biotechnol J"},{"key":"745_CR3","doi-asserted-by":"publisher","DOI":"10.1038\/s41392-022-00994-0","author":"Y You","year":"2022","unstructured":"You Y, Lai X, Pan Y et al (2022) Artificial intelligence in cancer target identification and drug discovery. Signal Transduct Target Ther 7:156. https:\/\/doi.org\/10.1038\/s41392-022-00994-0","journal-title":"Sig Transduct Target Ther"},{"key":"745_CR4","doi-asserted-by":"publisher","DOI":"10.1002\/minf.202100240","author":"K Sankar","year":"2022","unstructured":"Sankar K, Trainor K, Blazer LL et al (2022) A Descriptor Set for Quantitative Structure\u2010property Relationship Prediction in Biologics. Mol Inform 41:2100240. https:\/\/doi.org\/10.1002\/minf.202100240","journal-title":"Mol Inform"},{"key":"745_CR5","doi-asserted-by":"publisher","DOI":"10.1093\/bib\/bbac075","author":"A Torkamannia","year":"2022","unstructured":"Torkamannia A, Omidi Y, Ferdousi R (2022) A review of machine learning approaches for drug synergy prediction in cancer. Brief Bioinform 23:1\u201319. https:\/\/doi.org\/10.1093\/bib\/bbac075","journal-title":"Brief Bioinform"},{"key":"745_CR6","doi-asserted-by":"publisher","DOI":"10.1159\/000518572","author":"H Satake","year":"2021","unstructured":"Satake H, Osugi T, Shiraishi A (2021) Impact of Machine Learning-Associated Research Strategies on the Identification of Peptide-Receptor Interactions in the Post-Omics Era. Neuroendocrinology 113:251\u2013261. https:\/\/doi.org\/10.1159\/000518572","journal-title":"Neuroendocrinology"},{"key":"745_CR7","doi-asserted-by":"publisher","first-page":"89","DOI":"10.1016\/j.ddtec.2020.08.003","volume":"32","author":"BJ Bongers","year":"2019","unstructured":"Bongers BJ, IJzerman AP, Van Westen GJP (2019) Proteochemometrics\u202f\u2013\u202frecent developments in bioactivity and selectivity modeling. Drug Discov Today Technol 32:89\u201398. https:\/\/doi.org\/10.1016\/j.ddtec.2020.08.003","journal-title":"Drug Discov Today Technol"},{"key":"745_CR8","doi-asserted-by":"publisher","first-page":"1350","DOI":"10.1016\/J.DRUDIS.2022.02.023","volume":"27","author":"BX Du","year":"2022","unstructured":"Du BX, Qin Y, Jiang YF et al (2022) Compound\u2013protein interaction prediction by deep learning: Databases, descriptors and models. Drug Discov Today 27:1350\u20131366. https:\/\/doi.org\/10.1016\/J.DRUDIS.2022.02.023","journal-title":"Drug Discov Today"},{"key":"745_CR9","doi-asserted-by":"publisher","DOI":"10.1016\/J.CBPA.2021.09.001","author":"A Fern\u00e1ndez-Torras","year":"2022","unstructured":"Fern\u00e1ndez-Torras A, Comajuncosa-Creus A, Duran-Frigola M, Aloy P (2022) Connecting chemistry and biology through molecular descriptors. Curr Opin Chem Biol 66:102090. https:\/\/doi.org\/10.1016\/J.CBPA.2021.09.001","journal-title":"Curr Opin Chem Biol"},{"key":"745_CR10","doi-asserted-by":"publisher","first-page":"41","DOI":"10.1186\/1758-2946-5-41","volume":"5","author":"GJP van Westen","year":"2013","unstructured":"Van Westen GJP, Swier RF, Wegner JK et al (2013) Benchmarking of protein descriptor sets in proteochemometric modeling (part 1): Comparative study of 13 amino acid descriptor sets. J Cheminform 5:41. https:\/\/doi.org\/10.1186\/1758-2946-5-41","journal-title":"J Cheminform"},{"key":"745_CR11","doi-asserted-by":"publisher","first-page":"65","DOI":"10.1007\/978-1-0716-2317-6_3","volume":"2499","author":"H Ismail","year":"2022","unstructured":"Ismail H, White C, AL-Barakati H et al (2022) FEPS: A tool for feature extraction from protein sequence. Methods mol biol 2499:65\u2013104. https:\/\/doi.org\/10.1007\/978-1-0716-2317-6_3","journal-title":"Methods Mol Biol"},{"key":"745_CR12","doi-asserted-by":"publisher","DOI":"10.1142\/9789811258589_0002","author":"N Ibtehaz","year":"2021","unstructured":"Ibtehaz N, Kihara D (2021) Application of Sequence Embedding in Protein Sequence-Based Predictions. ArXiv. https:\/\/doi.org\/10.1142\/9789811258589_0002","journal-title":"ArXiv"},{"key":"745_CR13","doi-asserted-by":"publisher","DOI":"10.1016\/j.csbj.2021.11.018","author":"DD Wang","year":"2021","unstructured":"Wang DD, Chan M-T, Yan H et al (2021) Structure-based protein-ligand interaction fingerprints for binding affinity prediction. Comput Struct Biotechnol J 19:6291\u20136300. https:\/\/doi.org\/10.1016\/j.csbj.2021.11.018","journal-title":"Comput Struct Biotechnol J"},{"key":"745_CR14","doi-asserted-by":"publisher","first-page":"3021","DOI":"10.1021\/ci400369z","volume":"53","author":"V Subramanian","year":"2013","unstructured":"Subramanian V, Prusis P, Pietil\u00e4 LO et al (2013) Visually interpretable models of kinase selectivity related features derived from field-based proteochemometrics. J Chem Inf Model 53:3021\u20133030. https:\/\/doi.org\/10.1021\/ci400369z","journal-title":"J Chem Inf Model"},{"key":"745_CR15","doi-asserted-by":"publisher","DOI":"10.1016\/j.jbc.2021.100749","author":"MD Miller","year":"2021","unstructured":"Miller MD, Phillips GN (2021) Moving beyond static snapshots: Protein dynamics and the Protein Data Bank. J Biol Chem 296:100749. https:\/\/doi.org\/10.1016\/j.jbc.2021.100749","journal-title":"J Biol Chem"},{"key":"745_CR16","doi-asserted-by":"publisher","first-page":"743","DOI":"10.1016\/j.bpj.2016.07.011","volume":"111","author":"LA Abriata","year":"2016","unstructured":"Abriata LA, Spiga E, Peraro MD (2016) Molecular Effects of Concentrated Solutes on Protein Hydration, Dynamics, and Electrostatics. Biophys J 111:743\u2013755. https:\/\/doi.org\/10.1016\/j.bpj.2016.07.011","journal-title":"Biophys J"},{"key":"745_CR17","doi-asserted-by":"publisher","DOI":"10.1021\/acs.accounts.5b00516","author":"A Stank","year":"2016","unstructured":"Stank A, Kokh DB, Fuller JC, Wade RC (2016) Protein Binding Pocket Dynamics. Acc Chem Res 49:809\u2013815. https:\/\/doi.org\/10.1021\/acs.accounts.5b00516","journal-title":"Dynamics"},{"key":"745_CR18","doi-asserted-by":"publisher","DOI":"10.1021\/acs.jcim.2c00484","author":"F Zhu","year":"2022","unstructured":"Zhu F, Yang S, Meng F et al (2022) Leveraging Protein Dynamics to Identify Functional Phosphorylation Sites using Deep Learning Models. J Chem Inf Model 62:3331\u20133345. https:\/\/doi.org\/10.1021\/acs.jcim.2c00484","journal-title":"J Chem Inf Model"},{"key":"745_CR19","doi-asserted-by":"publisher","first-page":"124","DOI":"10.1016\/j.gene.2012.11.061","volume":"518","author":"J Gao","year":"2013","unstructured":"Gao J, Huang Q, Wu D et al (2013) Study on human GPCR-inhibitor interactions by proteochemometric modeling. Gene 518:124\u2013131. https:\/\/doi.org\/10.1016\/j.gene.2012.11.061","journal-title":"Gene"},{"key":"745_CR20","doi-asserted-by":"publisher","DOI":"10.1039\/d0ra08003a","author":"CS Odoemelam","year":"2020","unstructured":"Odoemelam CS, Percival B, Wallis H et al (2020) G-Protein coupled receptors: structure and function in drug discovery. RSC Adv 10:36337. https:\/\/doi.org\/10.1039\/d0ra08003a","journal-title":"RSC Adv"},{"key":"745_CR21","doi-asserted-by":"publisher","DOI":"10.1021\/acs.chemrev.6b00177","author":"NR Latorraca","year":"2016","unstructured":"Latorraca NR, Venkatakrishnan AJ, Dror RO (2017) GPCR Dynamics: Structures in Motion. Chem Rev 117:139\u2013155. https:\/\/doi.org\/10.1021\/acs.chemrev.6b00177","journal-title":"Chem Rev"},{"key":"745_CR22","doi-asserted-by":"publisher","first-page":"147","DOI":"10.1016\/J.SBI.2019.03.015","volume":"55","author":"Y Lee","year":"2019","unstructured":"Lee Y, Lazim R, Macalino SJY, Choi S (2019) Importance of protein dynamics in the structure-based drug discovery of class A G protein-coupled receptors (GPCRs). Curr Opin Struct Biol 55:147\u2013153. https:\/\/doi.org\/10.1016\/J.SBI.2019.03.015","journal-title":"Curr Opin Struct Biol"},{"key":"745_CR23","doi-asserted-by":"publisher","first-page":"17","DOI":"10.1038\/s41592-020-0884-y","volume":"19","author":"I Rodr\u00edguez-Espigares","year":"2020","unstructured":"Rodriguez-Espigares I, Torrens-Fontanals M, S Tiemann JK et al (2020) GPCRmd uncovers the dynamics of the 3D-GPCRome. Nat Methods 17:777\u2013787. https:\/\/doi.org\/10.1038\/s41592-020-0884-y","journal-title":"Przemyslaw Miszta"},{"key":"745_CR24","doi-asserted-by":"publisher","first-page":"11","DOI":"10.1038\/s41598-022-25323-x","volume":"2","author":"BJ Bongers","year":"2021","unstructured":"Bongers BJ, Gorostiola Gonz\u00e1lez M, Wang X et al (2022) Pan-cancer functional analysis of somatic mutations in G protein-coupled receptors. Sci Rep 12:21534. https:\/\/doi.org\/10.1038\/s41598-022-25323-x","journal-title":"bioRxiv"},{"key":"745_CR25","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1096\/FJ.202200203RR","volume":"36","author":"X Wang","year":"2022","unstructured":"Wang X, Jespers W, Waal JJ et al (2022) Cancer\u2010related somatic mutations alter adenosine A 1 receptor pharmacology\u2014A focus on mutations in the loops and C\u2010terminus . FASEB J 36:1\u201316. https:\/\/doi.org\/10.1096\/FJ.202200203RR","journal-title":"FASEB J"},{"key":"745_CR26","doi-asserted-by":"publisher","DOI":"10.1016\/j.bcp.2022.115399","author":"LS den Hollander","year":"2023","unstructured":"den Hollander LS, B\u00e9quignon OJM, Wang X et al (2023) Impact of cancer-associated mutations in CC chemokine receptor 2 on receptor function and antagonism. Biochem Pharmacol 208:115399. https:\/\/doi.org\/10.1016\/j.bcp.2022.115399","journal-title":"Biochem Pharmacol"},{"key":"745_CR27","doi-asserted-by":"publisher","DOI":"10.3390\/molecules27154676","author":"C Feng","year":"2022","unstructured":"Feng C, Wang X, Jespers W et al (2022) Cancer-Associated Mutations of the Adenosine A2A Receptor Have Diverse Influences on Ligand Binding and Receptor Functions. Molecules 27:4676. https:\/\/doi.org\/10.3390\/molecules27154676","journal-title":"Molecules"},{"key":"745_CR28","doi-asserted-by":"publisher","first-page":"75","DOI":"10.1016\/j.tips.2017.11.001","volume":"39","author":"W Jespers","year":"2018","unstructured":"Jespers W, Schiedel AC, Heitman LH et al (2018) Structural Mapping of Adenosine Receptor Mutations: Ligand Binding and Signaling Mechanisms. Trends Pharmacol Sci 39:75\u201389. https:\/\/doi.org\/10.1016\/j.tips.2017.11.001","journal-title":"Trends Pharmacol Sci"},{"key":"745_CR29","doi-asserted-by":"publisher","DOI":"10.1186\/s13321-022-00672-x","author":"OJM B\u00e9quignon","year":"2023","unstructured":"B\u00e9quignon OJM, Bongers BJ, Jespers W et al (2023) Papyrus: a large-scale curated dataset aimed at bioactivity predictions. J Cheminform 15:3. https:\/\/doi.org\/10.1186\/s13321-022-00672-x","journal-title":"J Cheminform"},{"key":"745_CR30","doi-asserted-by":"publisher","first-page":"366","DOI":"10.1016\/S1043-9471(05)80049-7","volume":"25","author":"JA Ballesteros","year":"1995","unstructured":"Ballesteros JA, Weinstein H (1995) Integrated methods for the construction of three-dimensional models and computational probing of structure-function relations in G protein-coupled receptors. Methods in Neurosciences 25:366\u2013428. https:\/\/doi.org\/10.1016\/S1043-9471(05)80049-7","journal-title":"Methods Neurosci"},{"key":"745_CR31","doi-asserted-by":"publisher","first-page":"D356","DOI":"10.1093\/nar\/gkv1178","volume":"44","author":"V Isberg","year":"2016","unstructured":"Isberg V, Mordalski S, Munk C et al (2016) GPCRdb: An information system for G protein-coupled receptors. Nucleic Acids Res 44:D356\u2013D364. https:\/\/doi.org\/10.1093\/nar\/gkv1178","journal-title":"Nucleic Acids Res"},{"key":"745_CR32","doi-asserted-by":"publisher","first-page":"726","DOI":"10.1021\/acs.jcim.6b00778","volume":"57","author":"S Riniker","year":"2017","unstructured":"Riniker S (2017) Molecular Dynamics Fingerprints (MDFP): Machine Learning from MD Data to Predict Free-Energy Differences. J Chem Inf Model 57:726\u2013741. https:\/\/doi.org\/10.1021\/acs.jcim.6b00778","journal-title":"J Chem Inf Model"},{"key":"745_CR33","doi-asserted-by":"publisher","first-page":"1388","DOI":"10.1021\/acs.jcim.1c01535","volume":"62","author":"G Bolcato","year":"2022","unstructured":"Bolcato G, Heid E, Bostr\u00f6m J (2022) On the Value of Using 3D Shape and Electrostatic Similarities in Deep Generative Methods. J Chem Inf Model 62:1388\u20131398. https:\/\/doi.org\/10.1021\/acs.jcim.1c01535","journal-title":"J Chem Inf Model"},{"key":"745_CR34","doi-asserted-by":"publisher","first-page":"42","DOI":"10.1186\/1758-2946-5-42","volume":"5","author":"GJP van Westen","year":"2013","unstructured":"Van Westen GJP, Swier RF, Cortes-Ciriano I et al (2013) Benchmarking of protein descriptor sets in proteochemometric modeling (part 2): Modeling performance of 13 amino acid descriptor sets. J Cheminform 5:42. https:\/\/doi.org\/10.1186\/1758-2946-5-42","journal-title":"J Cheminform"},{"key":"745_CR35","doi-asserted-by":"publisher","first-page":"2642","DOI":"10.1093\/bioinformatics\/bty178","volume":"34","author":"KK Yang","year":"2018","unstructured":"Yang KK, Wu Z, Bedbrook CN, Arnold FH (2018) Learned protein embeddings for machine learning. Bioinformatics 34:2642\u20132648. https:\/\/doi.org\/10.1093\/bioinformatics\/bty178","journal-title":"Bioinformatics"},{"key":"745_CR36","doi-asserted-by":"publisher","DOI":"10.1016\/j.csbj.2022.01.027","author":"H Lim","year":"2022","unstructured":"Lim H, Jeon H-N, Lim S et al (2022) Evaluation of protein descriptors in computer-aided rational protein engineering tasks and its application in property prediction in SARS-CoV-2 spike glycoprotein. Comput Struct Biotechnol J 20:788\u2013798. https:\/\/doi.org\/10.1016\/j.csbj.2022.01.027","journal-title":"Comput Struct Biotechnol J"},{"key":"745_CR37","doi-asserted-by":"publisher","first-page":"45","DOI":"10.1186\/s13321-017-0232-0","volume":"9","author":"EB Lenselink","year":"2017","unstructured":"Lenselink EB, Ten Dijke N, Bongers B et al (2017) Beyond the hype: deep neural networks outperform established methods using a ChEMBL bioactivity benchmark set. J Cheminform 9:45. https:\/\/doi.org\/10.1186\/s13321-017-0232-0","journal-title":"J Cheminform"},{"key":"745_CR38","doi-asserted-by":"publisher","first-page":"19938","DOI":"10.1073\/PNAS.2008873117","volume":"117","author":"S Rackovsky","year":"2020","unstructured":"Rackovsky S, Scheraga HA (2020) The structure of protein dynamic space. Proc Natl Acad Sci USA 117:19938\u201319942. https:\/\/doi.org\/10.1073\/PNAS.2008873117","journal-title":"Proc Natl Acad Sci USA"},{"key":"745_CR39","doi-asserted-by":"publisher","first-page":"571","DOI":"10.1038\/s41586-021-03897-2","volume":"597","author":"CJ Draper-Joyce","year":"2021","unstructured":"Draper-Joyce CJ, Bhola R, Wang J et al (2021) Positive allosteric mechanisms of adenosine A1 receptor-mediated analgesia. Nature 597:571\u2013576. https:\/\/doi.org\/10.1038\/s41586-021-03897-2","journal-title":"Nature"},{"key":"745_CR40","doi-asserted-by":"publisher","first-page":"196","DOI":"10.1016\/J.EJPHAR.2015.05.013","volume":"763","author":"SM Lee","year":"2015","unstructured":"Lee SM, Booe JM, Pioszak AA (2015) Structural insights into ligand recognition and selectivity for classes A, B, and C GPCRs. Eur J Pharmacol 763:196\u2013205. https:\/\/doi.org\/10.1016\/J.EJPHAR.2015.05.013","journal-title":"Eur J Pharmacol"},{"key":"745_CR41","doi-asserted-by":"publisher","first-page":"879","DOI":"10.1038\/s41594-021-00674-7","volume":"28","author":"AS Hauser","year":"2021","unstructured":"Hauser AS, Kooistra AJ (2021) GPCR activation mechanisms across classes and macro\/microscales. Nat Struct Mol Biol 28:879\u2013888. https:\/\/doi.org\/10.1038\/s41594-021-00674-7","journal-title":"Nat Struct Mol Biol"},{"key":"745_CR42","doi-asserted-by":"publisher","first-page":"867","DOI":"10.1016\/j.cell.2017.01.042","volume":"168","author":"A Glukhova","year":"2017","unstructured":"Glukhova A, Thal DM, Nguyen AT et al (2017) Structure of the Adenosine A1 Receptor Reveals the Basis for Subtype Selectivity. Cell 168:867-877.e13. https:\/\/doi.org\/10.1016\/j.cell.2017.01.042","journal-title":"Cell"},{"key":"745_CR43","doi-asserted-by":"publisher","first-page":"3984","DOI":"10.1021\/acs.jpcb.2c00200","volume":"2022","author":"A-N Bondar","year":"2022","unstructured":"Bondar A-N (2022) Graphs of Hydrogen-Bond Networks to Dissect Protein Conformational Dynamics. J Phys Chem B 126:3973\u20133984. https:\/\/doi.org\/10.1021\/acs.jpcb.2c00200","journal-title":"J Phys Chem B"},{"key":"745_CR44","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pcbi.1010006","author":"NJ Ose","year":"2022","unstructured":"Ose NJ, Butler BM, Kumar A et al (2022) Dynamic coupling of residues within proteins as a mechanistic foundation of many enigmatic pathogenic missense variants. PLoS Comput Biol 18:e1010006. https:\/\/doi.org\/10.1371\/journal.pcbi.1010006","journal-title":"PLoS Comput Biol"},{"key":"745_CR45","doi-asserted-by":"publisher","DOI":"10.1038\/s41467-022-30936-x","author":"B Li","year":"2022","unstructured":"Li B, Roden DM, Capra JA (2022) The 3D mutational constraint on amino acid sites in the human proteome. Nat Commun 13:3273. https:\/\/doi.org\/10.1038\/s41467-022-30936-x","journal-title":"Nat Commun"},{"key":"745_CR46","doi-asserted-by":"publisher","first-page":"18962","DOI":"10.1073\/pnas.1901156116","volume":"116","author":"S Kumar","year":"2019","unstructured":"Kumar S, Clarke D, Gerstein MB (2019) Leveraging protein dynamics to identify cancer mutational hotspots using 3D structures. Proc Natl Acad Sci USA 116:18962\u201318970. https:\/\/doi.org\/10.1073\/pnas.1901156116","journal-title":"Proc Natl Acad Sci USA"},{"key":"745_CR47","doi-asserted-by":"publisher","DOI":"10.1093\/nar\/gky300","author":"CH Rodrigues","year":"2018","unstructured":"Rodrigues CH, Pires DE, Ascher DB et al (2018) DynaMut: predicting the impact of mutations on protein conformation, flexibility and stability. Nucleic Acids Res 46:W350\u2013W355. https:\/\/doi.org\/10.1093\/nar\/gky300","journal-title":"Nucleic Acids Res"},{"key":"745_CR48","doi-asserted-by":"publisher","first-page":"439","DOI":"10.1016\/j.csbj.2020.02.007","volume":"18","author":"DD Wang","year":"2020","unstructured":"Wang DD, Ou-Yang L, Xie H et al (2020) Predicting the impacts of mutations on protein-ligand binding affinity based on molecular dynamics simulations and machine learning methods. Comput Struct Biotechnol J 18:439\u2013454. https:\/\/doi.org\/10.1016\/j.csbj.2020.02.007","journal-title":"Comput Struct Biotechnol J"},{"key":"745_CR49","doi-asserted-by":"publisher","DOI":"10.1021\/acs.jctc.8b00391","author":"B Knapp","year":"2018","unstructured":"Knapp B, Ospina L, Deane CM (2018) Avoiding False Positive Conclusions in Molecular Simulation: The Importance of Replicas. J Chem Theory Comput 14:6127\u20136138 https:\/\/doi.org\/10.1021\/acs.jctc.8b00391","journal-title":"J Chem Theory Comput"},{"key":"745_CR50","doi-asserted-by":"publisher","DOI":"10.1063\/50083060","author":"Z Li","year":"2022","unstructured":"Li Z, Meidani K, Yadav P, Farimani AB (2022) Graph Neural Networks Accelerated Molecular Dynamics. J Chem Phys 156:144103. https:\/\/doi.org\/10.1063\/50083060","journal-title":"J Chem Phys"},{"key":"745_CR51","doi-asserted-by":"publisher","first-page":"7946","DOI":"10.1021\/acs.jmedchem.2c00487","volume":"2022","author":"M Volkov","year":"2022","unstructured":"Volkov M, Turk J-A, Drizard N et al (2022) On the Frustration to Predict Binding Affinities from Protein\u2212Ligand Structures with Deep Neural Networks. J Med Chem 2022:7946\u20137958. https:\/\/doi.org\/10.1021\/acs.jmedchem.2c00487","journal-title":"J Med Chem"},{"key":"745_CR52","doi-asserted-by":"publisher","DOI":"10.3390\/ijms222413474","author":"M Jane\u017ei\u010d","year":"2021","unstructured":"Jane\u017ei\u010d M, Valjavec K, Loboda KB et al (2021) Dynophore-Based Approach in Virtual Screening: A Case of Human DNA Topoisomerase II\u03b1. Int J Mol Sci 22:13474. https:\/\/doi.org\/10.3390\/ijms222413474","journal-title":"Int J Mol Sci"},{"key":"745_CR53","doi-asserted-by":"publisher","first-page":"1528","DOI":"10.1016\/j.bpj.2015.08.015","volume":"109","author":"RT McGibbon","year":"2015","unstructured":"McGibbon RT, Beauchamp KA, Harrigan MP et al (2015) MDTraj: A Modern Open Library for the Analysis of Molecular Dynamics Trajectories. Biophys J 109:1528\u20131532. https:\/\/doi.org\/10.1016\/j.bpj.2015.08.015","journal-title":"Biophys J"},{"key":"745_CR54","unstructured":"RDKit: Open-source cheminformatics; http:\/\/www.rdkit.org"},{"key":"745_CR55","doi-asserted-by":"publisher","first-page":"261","DOI":"10.1038\/s41592-019-0686-2","volume":"17","author":"P Virtanen","year":"2020","unstructured":"Virtanen P, Gommers R, Oliphant TE et al (2020) SciPy 1.0: fundamental algorithms for scientific computing in Python. Nat Methods 17:261\u2013272. https:\/\/doi.org\/10.1038\/s41592-019-0686-2","journal-title":"Nat Methods"},{"key":"745_CR56","doi-asserted-by":"publisher","first-page":"90","DOI":"10.1109\/MCSE.2007.55","volume":"9","author":"JD Hunter","year":"2007","unstructured":"Hunter JD (2007) Matplotlib: A 2D Graphics Environment. Comput Sci Eng 9:90\u201395. https:\/\/doi.org\/10.1109\/MCSE.2007.55","journal-title":"Comput Sci Eng"},{"key":"745_CR57","volume-title":"Python 3 reference manual","author":"G van Rossum","year":"2009","unstructured":"Van Rossum G, Drake FL (2009) Python 3 Reference Manual. CreateSpace, Scotts Valley, CA"},{"key":"745_CR58","first-page":"2825","volume":"12","author":"F Pedregosa","year":"2011","unstructured":"Pedregosa F, Michel V, Grisel O et al (2011) Scikit-learn: Machine Learning in Python. J Mach Learn Res 12:2825\u20132830","journal-title":"J Mach Learn Res"},{"key":"745_CR59","doi-asserted-by":"publisher","unstructured":"B\u00e9quignon OJM ProDEC v.1.0.2. Available at https:\/\/doi.org\/10.5281\/zenodo.7007058. Accessed 20 Aug 2022.","DOI":"10.5281\/zenodo.7007058"},{"key":"745_CR60","doi-asserted-by":"publisher","first-page":"1315","DOI":"10.1038\/s41592-019-0598-1","volume":"16","author":"EC Alley","year":"2019","unstructured":"Alley EC, Khimulya G, Biswas S et al (2019) Unified rational protein engineering with sequence-based deep representation learning. Nat Methods 16:1315\u20131322. https:\/\/doi.org\/10.1038\/s41592-019-0598-1","journal-title":"Nat Methods"},{"key":"745_CR61","doi-asserted-by":"publisher","first-page":"916","DOI":"10.1021\/acs.jcim.7b00403","volume":"58","author":"I Wallach","year":"2018","unstructured":"Wallach I, Heifets A (2018) Most Ligand-Based Classification Benchmarks Reward Memorization Rather than Generalization. J Chem Inf Model 58:916\u2013932. https:\/\/doi.org\/10.1021\/acs.jcim.7b00403","journal-title":"J Chem Inf Model"},{"key":"745_CR62","doi-asserted-by":"publisher","first-page":"3021","DOI":"10.2110\/joss.03021","volume":"6","author":"M Waskom","year":"2021","unstructured":"Waskom M (2021) Seaborn: Statistical Data Visualization. J Open Source Softw 6:3021. https:\/\/doi.org\/10.2110\/joss.03021","journal-title":"J Open Source Softw"},{"key":"745_CR63","doi-asserted-by":"publisher","first-page":"453","DOI":"10.1182\/blood-2017-03-735654","volume":"130","author":"MA Jensen","year":"2017","unstructured":"Jensen MA, Ferretti V, Grossman RL, Staudt LM (2017) The NCI Genomic Data Commons as an engine for precision medicine. Blood 130:453\u2013459. https:\/\/doi.org\/10.1182\/blood-2017-03-735654","journal-title":"Blood"},{"key":"745_CR64","doi-asserted-by":"publisher","first-page":"1845","DOI":"10.1021\/acs.jctc.6b00049","volume":"12","author":"S Doerr","year":"2016","unstructured":"Doerr S, Harvey MJ, No\u00e9 F, De Fabritiis G (2016) HTMD: High-Throughput Molecular Dynamics for Molecular Discovery. J Chem Theory Comput 12:1845\u20131852. https:\/\/doi.org\/10.1021\/acs.jctc.6b00049","journal-title":"J Chem Theory Comput"},{"key":"745_CR65","doi-asserted-by":"publisher","first-page":"1632","DOI":"10.1021\/ct9000685","volume":"5","author":"MJ Harvey","year":"2009","unstructured":"Harvey MJ, Giupponi G, De Fabritiis G (2009) ACEMD: Accelerating biomolecular dynamics in the microsecond time scale. J Chem Theory Comput 5:1632\u20131639. https:\/\/doi.org\/10.1021\/ct9000685","journal-title":"J Chem Theory Comput"},{"key":"745_CR66","unstructured":"The PyMOL Molecular Graphics System, Version 1.4 Schr\u00f6dinger, LLC."}],"container-title":["Journal of Cheminformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-023-00745-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s13321-023-00745-5\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-023-00745-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,11,21]],"date-time":"2023-11-21T22:40:06Z","timestamp":1700606406000},"score":1,"resource":{"primary":{"URL":"https:\/\/jcheminf.biomedcentral.com\/articles\/10.1186\/s13321-023-00745-5"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,8,28]]},"references-count":66,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2023,12]]}},"alternative-id":["745"],"URL":"https:\/\/doi.org\/10.1186\/s13321-023-00745-5","relation":{"has-preprint":[{"id-type":"doi","id":"10.26434\/chemrxiv-2023-90082","asserted-by":"object"}]},"ISSN":["1758-2946"],"issn-type":[{"value":"1758-2946","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,8,28]]},"assertion":[{"value":"11 May 2023","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"10 August 2023","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"28 August 2023","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"The authors declare no competing interests.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"74"}}