{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,11]],"date-time":"2025-06-11T04:10:24Z","timestamp":1749615024541,"version":"3.41.0"},"reference-count":33,"publisher":"Oxford University Press (OUP)","issue":"17","license":[{"start":{"date-parts":[[2016,11,10]],"date-time":"2016-11-10T00:00:00Z","timestamp":1478736000000},"content-version":"vor","delay-in-days":73,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001659","name":"Deutsche Forschungsgemeinschaft","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100001659","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100004807","name":"DFG","doi-asserted-by":"publisher","award":["SFB860 TP B9"],"award-info":[{"award-number":["SFB860 TP B9"]}],"id":[{"id":"10.13039\/100004807","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2016,9,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Motivation<\/jats:title><jats:p>Large-scale conformational changes in proteins are implicated in many important biological functions. These structural transitions can often be rationalized in terms of relative movements of rigid domains. There is a need for objective and automated methods that identify rigid domains in sets of protein structures showing alternative conformational states.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>We present a probabilistic model for detecting rigid-body movements in protein structures. Our model aims to approximate alternative conformational states by a few structural parts that are rigidly transformed under the action of a rotation and a translation. 
By using Bayesian inference and Markov chain Monte Carlo sampling, we estimate all parameters of the model, including a segmentation of the protein into rigid domains, the structures of the domains themselves, and the rigid transformations that generate the observed structures. We find that our Gibbs sampling algorithm can also estimate the optimal number of rigid domains with high efficiency and accuracy. We assess the power of our method on several thousand entries of the DynDom database and discuss applications to various complex biomolecular systems.<\/jats:p><\/jats:sec><jats:sec><jats:title>Availability and Implementation<\/jats:title><jats:p>The Python source code for protein ensemble analysis is available at: https:\/\/github.com\/thachnguyen\/motion_detection<\/jats:p><\/jats:sec><jats:sec><jats:title>Contact<\/jats:title><jats:p>mhabeck@gwdg.de<\/jats:p><\/jats:sec>","DOI":"10.1093\/bioinformatics\/btw442","type":"journal-article","created":{"date-parts":[[2016,9,1]],"date-time":"2016-09-01T07:53:39Z","timestamp":1472716419000},"page":"i710-i717","source":"Crossref","is-referenced-by-count":2,"title":["A probabilistic model for detecting rigid domains in protein structures"],"prefix":"10.1093","volume":"32","author":[{"given":"Thach","family":"Nguyen","sequence":"first","affiliation":[{"name":"Felix Bernstein Institute for Mathematical Statistics in the Biosciences, University of G\u00f6ttingen"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Michael","family":"Habeck","sequence":"additional","affiliation":[{"name":"Felix Bernstein Institute for Mathematical Statistics in the Biosciences, University of G\u00f6ttingen"},{"name":"Max Planck Institute for Biophysical Chemistry, G\u00f6ttingen 37077, 
Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2016,8,29]]},"reference":[{"key":"2023020113270404000_btw442-B1","doi-asserted-by":"crossref","first-page":"309","DOI":"10.1002\/prot.22544","article-title":"RigidFinder: a fast and sensitive method to detect rigid blocks in large macromolecular complexes","volume":"78","author":"Abyzov","year":"2010","journal-title":"Proteins"},{"key":"2023020113270404000_btw442-B2","first-page":"1368","article-title":"Bayesian partitioning of large-scale distance data","volume":"2011","author":"Adametz","year":"2011","journal-title":"Nips"},{"key":"2023020113270404000_btw442-B3","doi-asserted-by":"crossref","first-page":"296","DOI":"10.1093\/nar\/gkj046","article-title":"The Database of Macromolecular Motions: new features added at the decade mark","volume":"34","author":"Flores","year":"2006","journal-title":"Nucleic Acids Res"},{"key":"2023020113270404000_btw442-B4","doi-asserted-by":"crossref","first-page":"721","DOI":"10.1109\/TPAMI.1984.4767596","article-title":"Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images","volume":"PAMI-6","author":"Geman","year":"1984","journal-title":"IEEE Trans. Pattern Anal. Mach. 
Intell."},{"key":"2023020113270404000_btw442-B5","doi-asserted-by":"crossref","first-page":"6739","DOI":"10.1021\/bi00188a001","article-title":"Structural mechanisms for domain movements in proteins","volume":"33","author":"Gerstein","year":"1994","journal-title":"Biochemistry"},{"key":"2023020113270404000_btw442-B6","first-page":"156","article-title":"Markov chain Monte Carlo maximum likelihood","author":"Geyer","year":"1991","journal-title":"Computing Science and Statistics: Proceedings of the 23rd Symposium on the Interface"},{"key":"2023020113270404000_btw442-B7","doi-asserted-by":"crossref","first-page":"325","DOI":"10.1093\/biomet\/53.3-4.325","article-title":"Some distance properties of latent root and vector methods used in multivariate analysis","volume":"53","author":"Gower","year":"1966","journal-title":"Biometrika"},{"key":"2023020113270404000_btw442-B8","doi-asserted-by":"crossref","first-page":"719","DOI":"10.1007\/s00180-009-0156-x","article-title":"Generation of three-dimensional random rotations in fitting and matching problems","volume":"24","author":"Habeck","year":"2009","journal-title":"Comput. Stat"},{"key":"2023020113270404000_btw442-B9","doi-asserted-by":"crossref","first-page":"144","DOI":"10.1002\/(SICI)1097-0134(19980201)30:2<144::AID-PROT4>3.0.CO;2-N","article-title":"Systematic analysis of domain motions in proteins from conformational change: new results on citrate synthase and T4 lysozyme","volume":"30","author":"Hayward","year":"1998","journal-title":"Proteins"},{"key":"2023020113270404000_btw442-B10","doi-asserted-by":"crossref","first-page":"425","DOI":"10.1002\/(SICI)1097-0134(199703)27:3<425::AID-PROT10>3.0.CO;2-N","article-title":"Model-free methods of analyzing domain motions in proteins from simulation: a comparison of normal mode analysis and molecular dynamics simulation of lysozyme","volume":"27","author":"Hayward","year":"1997","journal-title":"Proteins: Struct. Funct. 
Genet"},{"key":"2023020113270404000_btw442-B11","doi-asserted-by":"crossref","first-page":"913","DOI":"10.1038\/nature06407","article-title":"A hierarchy of timescales in protein dynamics is linked to enzyme catalysis","volume":"450","author":"Henzler-Wildman","year":"2007","journal-title":"Nature"},{"key":"2023020113270404000_btw442-B12","doi-asserted-by":"crossref","first-page":"2184","DOI":"10.1093\/bioinformatics\/btn396","article-title":"Mixture models for protein structure ensembles","volume":"24","author":"Hirsch","year":"2008","journal-title":"Bioinformatics"},{"key":"2023020113270404000_btw442-B13","doi-asserted-by":"crossref","DOI":"10.1017\/CBO9780511790423","volume-title":"Probability Theory: The Logic of Science","author":"Jaynes","year":"2003"},{"key":"2023020113270404000_btw442-B14","doi-asserted-by":"crossref","first-page":"2996","DOI":"10.1093\/bioinformatics\/bts538","article-title":"CSB: a Python framework for structural bioinformatics","volume":"28","author":"Kalev","year":"2012","journal-title":"Bioinformatics"},{"key":"2023020113270404000_btw442-B15","doi-asserted-by":"crossref","first-page":"83","DOI":"10.1002\/nav.3800020109","article-title":"The Hungarian method for the assignment problem","volume":"2","author":"Kuhn","year":"1955","journal-title":"Naval Res. 
Logistics Quarterly"},{"key":"2023020113270404000_btw442-B16","doi-asserted-by":"crossref","first-page":"2947","DOI":"10.1093\/bioinformatics\/btm404","article-title":"Clustal W and Clustal X version 2.0","volume":"23","author":"Larkin","year":"2007","journal-title":"Bioinformatics"},{"key":"2023020113270404000_btw442-B17","doi-asserted-by":"crossref","first-page":"1290","DOI":"10.1093\/bioinformatics\/btg137","article-title":"The DynDom database of protein domain motions","volume":"19","author":"Lee","year":"2003","journal-title":"Bioinformatics"},{"key":"2023020113270404000_btw442-B18","doi-asserted-by":"crossref","first-page":"14845","DOI":"10.1021\/bi701848w","article-title":"Swiveling domain mechanism in pyruvate phosphate dikinase","volume":"46","author":"Lim","year":"2007","journal-title":"Biochemistry"},{"volume-title":"Information Theory, Inference, and Learning Algorithms","year":"2003","author":"MacKay","key":"2023020113270404000_btw442-B19"},{"key":"2023020113270404000_btw442-B20","first-page":"101","article-title":"How many clusters?","volume":"1","author":"McCullagh","year":"2008","journal-title":"Bayesian Anal"},{"volume-title":"Probabilistic inference using Markov chain Monte Carlo methods. Technical report CRG-TR-93-1","year":"1993","author":"Neal","key":"2023020113270404000_btw442-B21"},{"key":"2023020113270404000_btw442-B22","doi-asserted-by":"crossref","first-page":"125","DOI":"10.1023\/A:1008923215028","article-title":"Annealed importance sampling","volume":"11","author":"Neal","year":"2001","journal-title":"Stat. 
Comput"},{"key":"2023020113270404000_btw442-B23","doi-asserted-by":"crossref","first-page":"1516","DOI":"10.1016\/j.str.2015.05.022","article-title":"SPECTRUS: a dimensionality reduction approach for identifying dynamical domains in protein complexes from limited structural datasets","volume":"23","author":"Ponzoni","year":"2015","journal-title":"Structure (London, England: 1993)"},{"key":"2023020113270404000_btw442-B24","doi-asserted-by":"crossref","first-page":"201","DOI":"10.1002\/prot.22339","article-title":"A method for the analysis of domain movements in large biomolecular complexes","volume":"76","author":"Poornam","year":"2009","journal-title":"Proteins: Struct. Funct. Bioinform"},{"key":"2023020113270404000_btw442-B25","doi-asserted-by":"crossref","first-page":"53","DOI":"10.1016\/0377-0427(87)90125-7","article-title":"Silhouettes: a graphical aid to the interpretation and validation of cluster analysis","volume":"20","author":"Rousseeuw","year":"1987","journal-title":"J. Comput. Appl. Math"},{"key":"2023020113270404000_btw442-B26","doi-asserted-by":"crossref","first-page":"461","DOI":"10.1214\/aos\/1176344136","article-title":"Estimating the dimension of a model","volume":"6","author":"Schwarz","year":"1978","journal-title":"Ann. Stat"},{"key":"2023020113270404000_btw442-B27","doi-asserted-by":"crossref","first-page":"111","DOI":"10.1111\/j.2517-6161.1974.tb00994.x","article-title":"Cross-validatory choice and assessment of statistical predictions","volume":"36","author":"Stone","year":"1974","journal-title":"J. R. Stat. Soc. Ser. B (Methodol.)"},{"key":"2023020113270404000_btw442-B28","doi-asserted-by":"crossref","first-page":"2607","DOI":"10.1103\/PhysRevLett.57.2607","article-title":"Replica Monte Carlo simulation of spin glasses","volume":"57","author":"Swendsen","year":"1986","journal-title":"Phys. Rev. 
Lett"},{"key":"2023020113270404000_btw442-B29","doi-asserted-by":"crossref","first-page":"12709","DOI":"10.1021\/bi0486987","article-title":"Topological and conformational analysis of the initiation and elongation complex of T7 RNA polymerase suggests a new twist","volume":"43","author":"Theis","year":"2004","journal-title":"Biochemistry"},{"key":"2023020113270404000_btw442-B30","doi-asserted-by":"crossref","first-page":"411","DOI":"10.1111\/1467-9868.00293","article-title":"Estimating the number of clusters in a data set via the gap statistic","volume":"63","author":"Tibshirani","year":"2001","journal-title":"J. R. Stat. Soc. B"},{"key":"2023020113270404000_btw442-B31","first-page":"849","article-title":"On spectral clustering: analysis and an algorithm","volume":"14","author":"Ng","year":"2001","journal-title":"Adv. Neural Inf. Process. Syst"},{"key":"2023020113270404000_btw442-B32","doi-asserted-by":"crossref","first-page":"e0131739","DOI":"10.1371\/journal.pone.0131739","article-title":"Overfitting Bayesian mixture models with an unknown number of components","volume":"10","author":"van Havre","year":"2015","journal-title":"PLoS One"},{"key":"2023020113270404000_btw442-B33","doi-asserted-by":"crossref","first-page":"395","DOI":"10.1007\/s11222-007-9033-z","article-title":"A tutorial on spectral clustering","volume":"17","author":"Von Luxburg","year":"2007","journal-title":"Stat. 
Comput"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/32\/17\/i710\/49022980\/bioinformatics_32_17_i710.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/32\/17\/i710\/49022980\/bioinformatics_32_17_i710.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,10]],"date-time":"2025-06-10T15:55:51Z","timestamp":1749570951000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/32\/17\/i710\/2450772"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2016,8,29]]},"references-count":33,"journal-issue":{"issue":"17","published-print":{"date-parts":[[2016,9,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btw442","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"type":"print","value":"1367-4803"},{"type":"electronic","value":"1367-4811"}],"subject":[],"published-other":{"date-parts":[[2016,9,1]]},"published":{"date-parts":[[2016,8,29]]}}}