{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,30]],"date-time":"2025-10-30T22:22:46Z","timestamp":1761862966354,"version":"3.38.0"},"reference-count":33,"publisher":"Oxford University Press (OUP)","issue":"5","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2011,3,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: The analysis of multiple sequence alignments is allowing researchers to glean valuable insights into evolution, as well as identify genomic regions that may be functional, or discover novel classes of functional elements. Understanding the distribution of conservation levels that constitutes the evolutionary landscape is crucial to distinguishing functional regions from non-functional. Recent evidence suggests that a binary classification of evolutionary rates is inappropriate for this purpose and finds only highly conserved functional elements. Given that the distribution of evolutionary rates is multi-modal, determining the number of modes is of paramount concern. Through simulation, we evaluate the performance of a number of information criterion approaches derived from MCMC simulations in determining the dimension of a model.<\/jats:p><jats:p>Results: We utilize a deviance information criterion (DIC) approximation that is more robust than the approximations from other information criteria, and show our information criteria approximations do not produce superfluous modes when estimating conservation distributions under a variety of circumstances. We analyse the distribution of conservation for a multiple alignment comprising four primate species and mouse, and repeat this on two additional multiple alignments of similar species. We find evidence of six distinct classes of evolutionary rates that appear to be robust to the species used.<\/jats:p><jats:p>Availability: Source code and data are available at http:\/\/dl.dropbox.com\/u\/477240\/changept.zip<\/jats:p><jats:p>Contact: \u00a0jonathan.keith@monash.edu; christopher.oldmeadow@newcastle.edu.au<\/jats:p><jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btq716","type":"journal-article","created":{"date-parts":[[2011,1,6]],"date-time":"2011-01-06T01:42:09Z","timestamp":1294278129000},"page":"604-610","source":"Crossref","is-referenced-by-count":9,"title":["Model selection in Bayesian segmentation of multiple DNA alignments"],"prefix":"10.1093","volume":"27","author":[{"given":"Christopher","family":"Oldmeadow","sequence":"first","affiliation":[{"name":"1 Centre for Clinical Epidemiology and Biostatistics, University of Newcastle, NSW and 2School of Mathematical Sciences, Monash University, Victoria, Australia"}]},{"given":"Jonathan M.","family":"Keith","sequence":"additional","affiliation":[{"name":"1 Centre for Clinical Epidemiology and Biostatistics, University of Newcastle, NSW and 2School of Mathematical Sciences, Monash University, Victoria, Australia"}]}],"member":"286","published-online":{"date-parts":[[2011,1,5]]},"reference":[{"key":"2023012511573076300_B1","doi-asserted-by":"crossref","first-page":"67","DOI":"10.1111\/j.2517-6161.1985.tb01331.x","article-title":"Estimation and hypothesis testing in finite mixture models","volume":"47","author":"Aitkin","year":"1985","journal-title":"J. R. Stat. Soc. Ser. B"},{"key":"2023012511573076300_B2","doi-asserted-by":"crossref","first-page":"716","DOI":"10.1109\/TAC.1974.1100705","article-title":"A new look at the statistical model identification","volume":"19","author":"Akaike","year":"1974","journal-title":"IEEE Trans. Automatic Control"},{"key":"2023012511573076300_B3","article-title":"Assessing a mixture model for clustering with the integrated classification likelihood","author":"Biernacki","year":"1998","journal-title":"Rapports de recherche- INRIA."},{"key":"2023012511573076300_B4","doi-asserted-by":"crossref","first-page":"799","DOI":"10.1038\/nature05874","article-title":"Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project","volume":"447","author":"Birney","year":"2007","journal-title":"Nature"},{"key":"2023012511573076300_B5","doi-asserted-by":"crossref","first-page":"651","DOI":"10.1214\/06-BA122","article-title":"Deviance information criteria for missing data models","volume":"1","author":"Celeux","year":"2006","journal-title":"Bayesian Anal."},{"key":"2023012511573076300_B6","doi-asserted-by":"crossref","first-page":"931","DOI":"10.1038\/nature03001","article-title":"Finishing the euchromatic sequence of the human genome","volume":"431","author":"Collins","year":"2004","journal-title":"Nature"},{"key":"2023012511573076300_B7","first-page":"335","article-title":"The direct use of likelihood for significance testing","volume-title":"Proceedings of Conference on Foundational Questions in Statistical Inference, Aarhus, May 7-12, 1973.","author":"Dempster","year":"1974"},{"key":"2023012511573076300_B8","doi-asserted-by":"crossref","first-page":"406","DOI":"10.2307\/2412116","article-title":"Toward defining the course of evolution: minimum change for a specific tree topology","volume":"20","author":"Fitch","year":"1971","journal-title":"Syst. Zool."},{"volume-title":"Bayesian Data Analysis.","year":"2004","author":"Gelman","key":"2023012511573076300_B9"},{"key":"2023012511573076300_B10","doi-asserted-by":"crossref","first-page":"39","DOI":"10.1111\/j.2517-6161.1993.tb01468.x","article-title":"Modelling complexity: applications of Gibbs sampling in medicine","volume":"55","author":"Gilks","year":"1993","journal-title":"J. R. Stat. Soc. Ser. B"},{"key":"2023012511573076300_B11","doi-asserted-by":"crossref","first-page":"29","DOI":"10.1023\/B:MCAP.0000012414.14405.15","article-title":"A generalized Markov sampler","volume":"6","author":"Keith","year":"2004","journal-title":"Methodol. Comput. Appl. Probab."},{"key":"2023012511573076300_B12","doi-asserted-by":"crossref","first-page":"407","DOI":"10.1089\/cmb.2007.0173","article-title":"Delineating slowly and rapidly evolving fractions of the Drosophila genome","volume":"15","author":"Keith","year":"2008","journal-title":"J. Comput. Biol."},{"key":"2023012511573076300_B13","doi-asserted-by":"crossref","first-page":"1369","DOI":"10.1089\/cmb.2006.13.1369","article-title":"Segmenting eukaryotic genomes with the generalized gibbs sampler","volume":"13","author":"Keith","year":"2006","journal-title":"J. Comput. Biol."},{"key":"2023012511573076300_B14","first-page":"49","article-title":"Consistent estimation of the order of mixture models","volume":"62","author":"Keribin","year":"2000","journal-title":"Sankhy Indian J. Stat. Ser. A"},{"key":"2023012511573076300_B15","doi-asserted-by":"crossref","first-page":"671","DOI":"10.1126\/science.220.4598.671","article-title":"Optimization by simulated annealing","volume":"220","author":"Kirkpatrick","year":"1983","journal-title":"Science"},{"key":"2023012511573076300_B16","doi-asserted-by":"crossref","first-page":"D755","DOI":"10.1093\/nar\/gkn875","article-title":"The UCSC genome browser database: update 2009","volume":"37","author":"Kuhn","year":"2009","journal-title":"Nucleic Acids Res."},{"key":"2023012511573076300_B17","doi-asserted-by":"crossref","first-page":"803","DOI":"10.1038\/nature04338","article-title":"Genome sequence, comparative analysis and haplotype structure of the domestic dog","volume":"438","author":"Lindblad-Toh","year":"2005","journal-title":"Nature"},{"key":"2023012511573076300_B18","doi-asserted-by":"crossref","first-page":"1527","DOI":"10.1126\/science.1117806","article-title":"The functional genomics of noncoding RNA","volume":"309","author":"Mattick","year":"2005","journal-title":"Science"},{"key":"2023012511573076300_B19","doi-asserted-by":"crossref","first-page":"942","DOI":"10.1093\/molbev\/msp299","article-title":"Multiple evolutionary rate classes in animal genome evolution","volume":"27","author":"Oldmeadow","year":"2010","journal-title":"Mol. Biol. Evol."},{"key":"2023012511573076300_B20","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.tig.2005.10.003","article-title":"Rapid evolution of noncoding RNAs: lack of conservation does not mean lack of function","volume":"22","author":"Pang","year":"2006","journal-title":"Trends Genet."},{"key":"2023012511573076300_B21","doi-asserted-by":"crossref","first-page":"1245","DOI":"10.1101\/gr.6406307","article-title":"Raising the estimate of functional human sequences","volume":"17","author":"Pheasant","year":"2007","journal-title":"Genome Res."},{"key":"2023012511573076300_B22","doi-asserted-by":"crossref","first-page":"110","DOI":"10.1101\/gr.097857.109","article-title":"Detection of non-neutral substitution rates on mammalian phylogenies","volume":"20","author":"Pollard","year":"2010","journal-title":"Genome Res."},{"key":"2023012511573076300_B23","first-page":"1","article-title":"Estimating the integrated likelihood via posterior simulation using the harmonic mean identity","volume":"8","author":"Raftery","year":"2007","journal-title":"Bayesian Stat."},{"key":"2023012511573076300_B24","doi-asserted-by":"crossref","first-page":"731","DOI":"10.1111\/1467-9868.00095","article-title":"On Bayesian analysis of mixtures with an unknown number of components","volume":"59","author":"Richardson","year":"1997","journal-title":"J. R. Stat. Soc. Ser. B"},{"key":"2023012511573076300_B25","doi-asserted-by":"crossref","first-page":"461","DOI":"10.1214\/aos\/1176344136","article-title":"Estimating the dimension of a model","volume":"6","author":"Schwarz","year":"1978","journal-title":"Ann. Stat."},{"key":"2023012511573076300_B26","doi-asserted-by":"crossref","first-page":"413","DOI":"10.1089\/1066527041410472","article-title":"Combining phylogenetic and Hidden Markov Models in biosequence analysis","volume":"11","author":"Siepel","year":"2004","journal-title":"J. Comput. Biol."},{"key":"2023012511573076300_B27","doi-asserted-by":"crossref","first-page":"583","DOI":"10.1111\/1467-9868.00353","article-title":"Bayesian measures of model complexity and fit","volume":"64","author":"Spiegelhalter","year":"2002","journal-title":"J. R. Stat. Soc. Ser. B"},{"key":"2023012511573076300_B28","doi-asserted-by":"crossref","first-page":"795","DOI":"10.1111\/1467-9868.00265","article-title":"Dealing with label switching in mixture models","volume":"62","author":"Stephens","year":"2000","journal-title":"J. R. Stat. Soc."},{"key":"2023012511573076300_B29","doi-asserted-by":"crossref","first-page":"1","DOI":"10.18637\/jss.v012.i03","article-title":"R2WinBUGS: a package for running WinBUGS from R","volume":"12","author":"Sturtz","year":"2005","journal-title":"J. Stat. Softw."},{"volume-title":"Statistical Analysis of Finite Mixture Distributions.","year":"1985","author":"Titterington","key":"2023012511573076300_B30"},{"key":"2023012511573076300_B31","doi-asserted-by":"crossref","first-page":"520","DOI":"10.1038\/nature01262","article-title":"Initial sequencing and comparitive analysis of the mouse genome","volume":"420","author":"Waterston","year":"2002","journal-title":"Nature"},{"key":"2023012511573076300_B32","doi-asserted-by":"crossref","first-page":"306","DOI":"10.1007\/BF00160154","article-title":"Maximum likelihood phylogenetic estimation from DNA sequences with variable rates over sites: approximate methods","volume":"39","author":"Yang","year":"1994","journal-title":"J. Mol. Evol."},{"key":"2023012511573076300_B33","doi-asserted-by":"crossref","first-page":"367","DOI":"10.1016\/0169-5347(96)10041-0","article-title":"Among-site rate variation and its impact on phylogenetic analyses","volume":"11","author":"Yang","year":"1996","journal-title":"Trends Ecol. Evol."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/27\/5\/604\/48867500\/bioinformatics_27_5_604.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/27\/5\/604\/48867500\/bioinformatics_27_5_604.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,3,1]],"date-time":"2025-03-01T13:02:51Z","timestamp":1740834171000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/27\/5\/604\/263896"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,1,5]]},"references-count":33,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2011,3,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btq716","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"type":"electronic","value":"1367-4811"},{"type":"print","value":"1367-4803"}],"subject":[],"published-other":{"date-parts":[[2011,3,1]]},"published":{"date-parts":[[2011,1,5]]}}}