{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,8]],"date-time":"2026-05-08T10:39:14Z","timestamp":1778236754409,"version":"3.51.4"},"reference-count":94,"publisher":"Springer Science and Business Media LLC","issue":"1","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2006,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Background<\/jats:title><jats:p>Recent years have seen the emergence of genome annotation methods based on the<jats:italic>phylo-grammar<\/jats:italic>, a probabilistic model combining continuous-time Markov chains and stochastic grammars. Previously, phylo-grammars have required considerable effort to implement, limiting their adoption by computational biologists.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>We have developed an open source software tool, xrate, for working with reversible, irreversible or parametric substitution models combined with stochastic context-free grammars. xrate efficiently estimates maximum-likelihood parameters and phylogenetic trees using a novel \"phylo-EM\" algorithm that we describe. The grammar is specified in an external configuration file, allowing users to design new grammars, estimate rate parameters from training data and annotate multiple sequence alignments without the need to recompile code from source. We have used xrate to measure codon substitution rates and predict protein and RNA secondary structures.<\/jats:p><\/jats:sec><jats:sec><jats:title>Conclusion<\/jats:title><jats:p>Our results demonstrate that xrate estimates biologically meaningful rates and makes predictions whose accuracy is comparable to that of more specialized tools.<\/jats:p><\/jats:sec>","DOI":"10.1186\/1471-2105-7-428","type":"journal-article","created":{"date-parts":[[2006,10,3]],"date-time":"2006-10-03T18:27:08Z","timestamp":1159900028000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":48,"title":["XRate: a fast prototyping, training and annotation tool for phylo-grammars"],"prefix":"10.1186","volume":"7","author":[{"given":"Peter S","family":"Klosterman","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Andrew V","family":"Uzilov","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yuri R","family":"Benda\u00f1a","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Robert K","family":"Bradley","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Sharon","family":"Chao","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Carolin","family":"Kosiol","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Nick","family":"Goldman","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ian","family":"Holmes","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2006,10,3]]},"reference":[{"key":"1167_CR1","first-page":"19","volume-title":"Proceedings Genome Informatics Workshop V","author":"N Abe","year":"1994","unstructured":"Abe N, Mamitsuka H: Prediction of beta-sheet structures using stochastic tree grammars. In Proceedings Genome Informatics Workshop V. Universal Academy Press; 1994:19\u201328."},{"issue":"3","key":"1167_CR2","doi-asserted-by":"publisher","first-page":"496","DOI":"10.1101\/gr.424203","volume":"13","author":"M Alexandersson","year":"2003","unstructured":"Alexandersson M, Cawley S, Pachter L: SLAM cross-species gene finding and alignment with a generalized pair hidden Markov model. Genome Research 2003, 13(3):496\u2013502. 10.1101\/gr.424203","journal-title":"Genome Research"},{"issue":"6","key":"1167_CR3","doi-asserted-by":"publisher","first-page":"696","DOI":"10.1007\/PL00006274","volume":"45","author":"L Arvestad","year":"1997","unstructured":"Arvestad L, Bruno WJ: Estimation of reversible substitution matrices from multiple pairs of sequences. Journal of Molecular Evolution 1997, 45(6):696\u2013703. 10.1007\/PL00006274","journal-title":"Journal of Molecular Evolution"},{"key":"1167_CR4","first-page":"1","volume":"3","author":"LE Baum","year":"1972","unstructured":"Baum LE: An equality and associated maximization technique in statistical estimation for probabilistic functions of Markov processes. Inequalities 1972, 3: 1\u20138.","journal-title":"Inequalities"},{"key":"1167_CR5","first-page":"56","volume-title":"Proceedings of the Fifth International Conference on Intelligent Systems for Molecular Biology","author":"E Birney","year":"1997","unstructured":"Birney E, Durbin R: Dynamite: a flexible code generating language for dynamic programming methods used in sequence comparison. In Proceedings of the Fifth International Conference on Intelligent Systems for Molecular Biology. Edited by: Gaasterland T, Karp P, Karplus K, Ouzounis C, Sander C, Valencia A. Menlo Park, CA, AAAI Press; 1997:56\u201364."},{"key":"1167_CR6","first-page":"34","volume-title":"Proceedings of the Eleventh International Conference on Intelligent Systems for Molecular Biology","author":"J Bockhorst","year":"2003","unstructured":"Bockhorst J, Qiu Y, Glasner J, Liu M, Blattner F, Craven M: Predicting bacterial transcription units using sequence and expression data. In Proceedings of the Eleventh International Conference on Intelligent Systems for Molecular Biology. Menlo Park, CA, AAAI Press; 2003:34\u201343."},{"key":"1167_CR7","volume-title":"Introduction to Protein Structure","author":"C Branden","year":"1991","unstructured":"Branden C, Tooze J: Introduction to Protein Structure. Garland, New York; 1991."},{"key":"1167_CR8","first-page":"47","volume-title":"Proceedings of the First International Conference on Intelligent Systems for Molecular Biology","author":"M Brown","year":"1993","unstructured":"Brown M, Hughey R, Krogh A, Mian IS, Sj\u00f6lander K, Haussler D: Using Dirichlet mixture priors to derive hidden Markov models for protein families. In Proceedings of the First International Conference on Intelligent Systems for Molecular Biology. Edited by: Hunter L, Searls DB, Shavlik J. Menlo Park, CA, AAAI Press; 1993:47\u201355."},{"issue":"10","key":"1167_CR9","doi-asserted-by":"publisher","first-page":"1368","DOI":"10.1093\/oxfordjournals.molbev.a025583","volume":"13","author":"WJ Bruno","year":"1996","unstructured":"Bruno WJ: Modelling residue usage in aligned protein sequences via maximum likelihood. Molecular Biology and Evolution 1996, 13(10):1368\u20131374.","journal-title":"Molecular Biology and Evolution"},{"issue":"1","key":"1167_CR10","doi-asserted-by":"publisher","first-page":"78","DOI":"10.1006\/jmbi.1997.0951","volume":"268","author":"C Burge","year":"1997","unstructured":"Burge C, Karlin S: Prediction of complete gene structures in human genomic DNA. Journal of Molecular Biology 1997, 268(1):78\u201394. 10.1006\/jmbi.1997.0951","journal-title":"Journal of Molecular Biology"},{"key":"1167_CR11","doi-asserted-by":"publisher","first-page":"79","DOI":"10.1007\/BF02458837","volume":"51","author":"GA Churchill","year":"1989","unstructured":"Churchill GA: Stochastic models for heterogeneous DNA sequences. Bulletin of Mathematical Biology 1989, 51: 79\u201394.","journal-title":"Bulletin of Mathematical Biology"},{"key":"1167_CR12","first-page":"89","volume-title":"Atlas of Protein Sequence and Structure","author":"MO Dayhoff","year":"1972","unstructured":"Dayhoff MO, Eck RV, Park CM: A model of evolutionary change in proteins. In Atlas of Protein Sequence and Structure. Volume 5. Edited by: Dayhoff MO. National Biomedical Research Foundation, Washington, DC; 1972:89\u201399."},{"key":"1167_CR13","unstructured":"Dayhoff MO, Schwartz RM, Orcutt BC: A model of evolutionary change in proteins. In Atlas of Protein Sequence and Structure. Volume 5. Edited by: Dayhoff MO. National Biomedical Research Foundation, Washington, DC; 1978:345\u2013352."},{"key":"1167_CR14","first-page":"1","volume":"B39","author":"AP Dempster","year":"1977","unstructured":"Dempster AP, Laird NM, Rubin DB: Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society 1977, B39: 1\u201338.","journal-title":"Journal of the Royal Statistical Society"},{"key":"1167_CR15","first-page":"18","volume-title":"Proceedings of the Fifth Pacific Symposium on Biocomputing","author":"MW Dimmic","year":"2000","unstructured":"Dimmic MW, Mindell DP, Goldstein RA: Modeling evolution at the protein level using an adjustable amino acid fitness model. Proceedings of the Fifth Pacific Symposium on Biocomputing 2000, 18\u201329."},{"key":"1167_CR16","doi-asserted-by":"publisher","DOI":"10.1017\/CBO9780511790492","volume-title":"Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids","author":"R Durbin","year":"1998","unstructured":"Durbin R, Eddy S, Krogh A, Mitchison G: Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids. Cambridge University Press, Cambridge, UK; 1998."},{"key":"1167_CR17","doi-asserted-by":"crossref","unstructured":"Eddy SR: A memory-efficient dynamic programming algorithm for optimal alignment of a sequence to an RNA secondary structure. BMC Bioinformatics 2002., 3(18):","DOI":"10.1186\/1471-2105-3-18"},{"key":"1167_CR18","doi-asserted-by":"publisher","first-page":"2079","DOI":"10.1093\/nar\/22.11.2079","volume":"22","author":"SR Eddy","year":"1994","unstructured":"Eddy SR, Durbin R: RNA sequence analysis using covariance models. Nucleic Acids Research 1994, 22: 2079\u20132088.","journal-title":"Nucleic Acids Research"},{"key":"1167_CR19","doi-asserted-by":"publisher","first-page":"9","DOI":"10.1089\/cmb.1995.2.9","volume":"2","author":"SR Eddy","year":"1995","unstructured":"Eddy SR, Mitchison GJ, Durbin R: Maximum discrimination hidden Markov models of sequence consensus. Journal of Computational Biology 1995, 2: 9\u201323.","journal-title":"Journal of Computational Biology"},{"key":"1167_CR20","doi-asserted-by":"crossref","unstructured":"Engelhardt BE, Jordan MI, Muratore KE, Brenner SE: Protein molecular function prediction by Bayesian phylogenomics. PLoS Computational Biology 2005., 1(5):","DOI":"10.1371\/journal.pcbi.0010045"},{"key":"1167_CR21","doi-asserted-by":"publisher","first-page":"368","DOI":"10.1007\/BF01734359","volume":"17","author":"J Felsenstein","year":"1981","unstructured":"Felsenstein J: Evolutionary trees from DNA sequences: a maximum likelihood approach. Journal of Molecular Evolution 1981, 17: 368\u2013376. 10.1007\/BF01734359","journal-title":"Journal of Molecular Evolution"},{"key":"1167_CR22","volume-title":"Inferring Phylogenies","author":"J Felsenstein","year":"2003","unstructured":"Felsenstein J: Inferring Phylogenies. Sinauer Associates, Inc; 2003. ISBN 0878931775. ISBN 0878931775."},{"key":"1167_CR23","doi-asserted-by":"publisher","first-page":"93","DOI":"10.1093\/oxfordjournals.molbev.a025575","volume":"13","author":"J Felsenstein","year":"1996","unstructured":"Felsenstein J, Churchill GA: A hidden Markov model approach to variation among sites in rate of evolution. Molecular Biology and Evolution 1996, 13: 93\u2013104.","journal-title":"Molecular Biology and Evolution"},{"key":"1167_CR24","doi-asserted-by":"publisher","first-page":"331","DOI":"10.1089\/10665270252935494","volume":"9","author":"N Friedman","year":"2002","unstructured":"Friedman N, Ninio M, Pe'er I, Pupko T: A structural EM algorithm for phylogenetic inference. Journal of Computational Biology 2002, 9: 331\u2013353. 10.1089\/10665270252935494","journal-title":"Journal of Computational Biology"},{"key":"1167_CR25","volume-title":"Markov Chain Monte Carlo in Practice","author":"W Gilks","year":"1996","unstructured":"Gilks W, Richardson S, Spiegelhalter D: Markov Chain Monte Carlo in Practice. Chapman & Hall, London, UK; 1996."},{"issue":"2","key":"1167_CR26","doi-asserted-by":"publisher","first-page":"196","DOI":"10.1006\/jmbi.1996.0569","volume":"263","author":"N Goldman","year":"1996","unstructured":"Goldman N, Thorne JL, Jones DT: Using evolutionary trees in protein secondary structure prediction and other comparative sequence analyses. Journal of Molecular Biology 1996, 263(2):196\u2013208. 10.1006\/jmbi.1996.0569","journal-title":"Journal of Molecular Biology"},{"key":"1167_CR27","first-page":"725","volume":"11","author":"N Goldman","year":"1994","unstructured":"Goldman N, Yang Z: A codon-based model of nucleotide substitution for protein-coding DNA sequences. Molecular Biology and Evolution 1994, 11: 725\u2013735.","journal-title":"Molecular Biology and Evolution"},{"issue":"5062","key":"1167_CR28","doi-asserted-by":"publisher","first-page":"1443","DOI":"10.1126\/science.1604319","volume":"256","author":"GH Gonnet","year":"1992","unstructured":"Gonnet GH, Cohen MA, Benner SA: Exhaustive matching of the entire protein sequence database. Science 1992, 256(5062):1443\u20131445. 10.1126\/science.1604319","journal-title":"Science"},{"key":"1167_CR29","doi-asserted-by":"publisher","first-page":"4355","DOI":"10.1073\/pnas.84.13.4355","volume":"84","author":"M Gribskov","year":"1987","unstructured":"Gribskov M, McLachlan AD, Eisenberg D: Profile analysis: detection of distantly related proteins. Proceedings of the National Academy of Sciences of the USA 1987, 84: 4355\u20134358. 10.1073\/pnas.84.13.4355","journal-title":"Proceedings of the National Academy of Sciences of the USA"},{"issue":"1","key":"1167_CR30","doi-asserted-by":"publisher","first-page":"439","DOI":"10.1093\/nar\/gkg006","volume":"31","author":"S Griffiths-Jones","year":"2003","unstructured":"Griffiths-Jones S, Bateman A, Marshall M, Khanna A, Eddy SR: Rfam: an RNA family database. Nucleic Acids Research 2003, 31(1):439\u2013441. 10.1093\/nar\/gkg006","journal-title":"Nucleic Acids Research"},{"issue":"7","key":"1167_CR31","doi-asserted-by":"publisher","first-page":"910","DOI":"10.1093\/oxfordjournals.molbev.a025995","volume":"15","author":"AL Halpern","year":"1998","unstructured":"Halpern AL, Bruno WJ: Evolutionary distances for protein-coding sequences: modeling site-specific residue frequencies. Molecular Biology and Evolution 1998, 15(7):910\u2013917.","journal-title":"Molecular Biology and Evolution"},{"key":"1167_CR32","doi-asserted-by":"publisher","first-page":"160","DOI":"10.1007\/BF02101694","volume":"22","author":"M Hasegawa","year":"1985","unstructured":"Hasegawa M, Kishino H, Yano T: Dating the human-ape splitting by a molecular clock of mitochondrial DNA. Journal of Molecular Evolution 1985, 22: 160\u2013174. 10.1007\/BF02101694","journal-title":"Journal of Molecular Evolution"},{"key":"1167_CR33","first-page":"179","volume-title":"Pacific Symposium on Biocomputing","author":"J Hein","year":"2001","unstructured":"Hein J: An algorithm for statistical alignment of sequences related by a binary tree. In Pacific Symposium on Biocomputing. Edited by: Altman RB, Dunker AK, Hunter L, Laud-erdale K, Klein TE. Singapore, World Scientific; 2001:179\u2013190."},{"key":"1167_CR34","doi-asserted-by":"publisher","first-page":"265","DOI":"10.1006\/jmbi.2000.4061","volume":"302","author":"J Hein","year":"2000","unstructured":"Hein J, Wiuf C, Knudsen B, Moller MB, Wibling G: Statistical alignment: computational properties, homology testing and goodness-of-fit. Journal of Molecular Biology 2000, 302: 265\u2013279. 10.1006\/jmbi.2000.4061","journal-title":"Journal of Molecular Biology"},{"key":"1167_CR35","volume-title":"Statistical applications in Genetics and Molecular Biology","author":"A Hobolth","year":"2005","unstructured":"Hobolth A, Jensen JL: Statistical inference in evolutionary models of DNA sequences via the EM algorithm. Statistical applications in Genetics and Molecular Biology 2005., 4(1):","edition":"4"},{"key":"1167_CR36","volume-title":"BMC Bioinformatics","author":"I Holmes","year":"2004","unstructured":"Holmes I: A probabilistic model for the evolution of RNA structure. BMC Bioinformatics 2004., 5(166):","edition":"5"},{"issue":"9","key":"1167_CR37","doi-asserted-by":"publisher","first-page":"803","DOI":"10.1093\/bioinformatics\/17.9.803","volume":"17","author":"I Holmes","year":"2001","unstructured":"Holmes I, Bruno WJ: Evolutionary HMMs: a Bayesian approach to multiple alignment. Bioinformatics 2001, 17(9):803\u2013820. 10.1093\/bioinformatics\/17.9.803","journal-title":"Bioinformatics"},{"issue":"5","key":"1167_CR38","doi-asserted-by":"publisher","first-page":"757","DOI":"10.1006\/jmbi.2002.5405","volume":"317","author":"I Holmes","year":"2002","unstructured":"Holmes I, Rubin GM: An Expectation Maximization algorithm for training hidden substitution models. Journal of Molecular Biology 2002, 317(5):757\u2013768. 10.1006\/jmbi.2002.5405","journal-title":"Journal of Molecular Biology"},{"issue":"Supplement 1","key":"1167_CR39","doi-asserted-by":"publisher","first-page":"161","DOI":"10.1093\/bioinformatics\/bth917","volume":"20","author":"V Jojic","year":"2004","unstructured":"Jojic V, Jojic N, Meek C, Geiger D, Siepel A, Haussler D, Heckerman D: Efficient approximations for learning phylogenetic HMM models from data. Bioinformatics 2004, 20(Supplement 1):161\u2013168. 10.1093\/bioinformatics\/bth917","journal-title":"Bioinformatics"},{"key":"1167_CR40","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-59126-6_2","volume-title":"Tree-adjoining grammars","author":"A Joshi","year":"1997","unstructured":"Joshi A, Schabes Y: Tree-adjoining grammars. 1997."},{"key":"1167_CR41","doi-asserted-by":"publisher","first-page":"21","DOI":"10.1016\/B978-1-4832-3211-9.50009-7","volume-title":"Mammalian Protein Metabolism","author":"TH Jukes","year":"1969","unstructured":"Jukes TH, Cantor C: Evolution of protein molecules. In Mammalian Protein Metabolism. Academic Press, New York; 1969:21\u2013132."},{"issue":"5","key":"1167_CR42","doi-asserted-by":"publisher","first-page":"1027","DOI":"10.1016\/j.jmb.2004.03.016","volume":"338","author":"L Kall","year":"2004","unstructured":"Kall L, Krogh A, Sonnhammer EL: A combined transmembrane topology and signal peptide prediction method. Journal of Molecular Biology 2004, 338(5):1027\u20131036. 10.1016\/j.jmb.2004.03.016","journal-title":"Journal of Molecular Biology"},{"key":"1167_CR43","volume-title":"A First Course in Stochastic Processes","author":"S Karlin","year":"1975","unstructured":"Karlin S, Taylor H: A First Course in Stochastic Processes. Academic Press, San Diego, CA; 1975."},{"key":"1167_CR44","doi-asserted-by":"publisher","first-page":"111","DOI":"10.1007\/BF01731581","volume":"16","author":"M Kimura","year":"1980","unstructured":"Kimura M: A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences. Journal of Molecular Evolution 1980, 16: 111\u2013120. 10.1007\/BF01731581","journal-title":"Journal of Molecular Evolution"},{"key":"1167_CR45","doi-asserted-by":"publisher","first-page":"392","DOI":"10.1093\/nar\/30.1.392","volume":"30","author":"PS Klosterman","year":"2002","unstructured":"Klosterman PS, Tamura M, Holbrook SR, Brenner SE: SCOR: a structural classification of RNA database. Nucleic Acids Research 2002, 30: 392\u2013394. 10.1093\/nar\/30.1.392","journal-title":"Nucleic Acids Research"},{"issue":"6","key":"1167_CR46","doi-asserted-by":"publisher","first-page":"446","DOI":"10.1093\/bioinformatics\/15.6.446","volume":"15","author":"B Knudsen","year":"1999","unstructured":"Knudsen B, Hein J: RNA secondary structure prediction using stochastic context-free grammars and evolutionary history. Bioinformatics 1999, 15(6):446\u2013454. 10.1093\/bioinformatics\/15.6.446","journal-title":"Bioinformatics"},{"issue":"13","key":"1167_CR47","doi-asserted-by":"publisher","first-page":"3423","DOI":"10.1093\/nar\/gkg614","volume":"31","author":"B Knudsen","year":"2003","unstructured":"Knudsen B, Hein J: Pfold: RNA secondary structure prediction using stochastic context-free grammars. Nucleic Acids Research Evaluation Studies 2003, 31(13):3423\u20133428. 10.1093\/nar\/gkg614","journal-title":"Nucleic Acids Research Evaluation Studies"},{"key":"1167_CR48","doi-asserted-by":"publisher","first-page":"641","DOI":"10.1093\/protein\/8.7.641","volume":"8","author":"JM Koshi","year":"1995","unstructured":"Koshi JM, Goldstein RA: Context-dependent optimal substitution matrices. Protein Engineering 1995, 8: 641\u2013645.","journal-title":"Protein Engineering"},{"key":"1167_CR49","doi-asserted-by":"publisher","first-page":"1501","DOI":"10.1006\/jmbi.1994.1104","volume":"235","author":"A Krogh","year":"1994","unstructured":"Krogh A, Brown M, Mian IS, Sj\u00f6lander K, Haussler D: Hidden Markov models in computational biology: applications to protein modeling. Journal of Molecular Biology 1994, 235: 1501\u20131531. 10.1006\/jmbi.1994.1104","journal-title":"Journal of Molecular Biology"},{"issue":"2","key":"1167_CR50","doi-asserted-by":"publisher","first-page":"498","DOI":"10.1109\/18.910572","volume":"47","author":"FR Kschischang","year":"1998","unstructured":"Kschischang FR, Frey BJ, Loeliger H-A: Factor graphs and the sum-product algorithm. IEEE Transactions on Information Theory 1998, 47(2):498\u2013519. 10.1109\/18.910572","journal-title":"IEEE Transactions on Information Theory"},{"key":"1167_CR51","doi-asserted-by":"publisher","first-page":"35","DOI":"10.1016\/0885-2308(90)90022-X","volume":"4","author":"K Lari","year":"1990","unstructured":"Lari K, Young SJ: The estimation of stochastic context-free grammars using the inside-outside algorithm. Computer Speech and Language 1990, 4: 35\u201356. 10.1016\/0885-2308(90)90022-X","journal-title":"Computer Speech and Language"},{"key":"1167_CR52","doi-asserted-by":"publisher","first-page":"342","DOI":"10.1006\/jmbi.1996.0167","volume":"257","author":"O Lichtarge","year":"1996","unstructured":"Lichtarge O, Bourne HR, Cohen FE: An evolutionary trace method defines binding surfaces common to protein families. Journal of Molecular Biology 1996, 257: 342\u2013358. 10.1006\/jmbi.1996.0167","journal-title":"Journal of Molecular Biology"},{"key":"1167_CR53","doi-asserted-by":"publisher","first-page":"1696","DOI":"10.1093\/oxfordjournals.molbev.a026083","volume":"16","author":"P Li\u00f2","year":"1999","unstructured":"Li\u00f2 P, Goldman N: Using protein structural information in evolutionary inference: transmembrane proteins. Molecular Biology and Evolution 1999, 16: 1696\u20131710.","journal-title":"Molecular Biology and Evolution"},{"key":"1167_CR54","doi-asserted-by":"crossref","unstructured":"Lunter G, Ponting CP, Hein J: Genome-wide identification of human functional DNA using a neutral indel model. PLoS Computational Biology 2006., 2(1):","DOI":"10.1371\/journal.pcbi.0020005"},{"issue":"Suppl 1","key":"1167_CR55","doi-asserted-by":"publisher","first-page":"I216","DOI":"10.1093\/bioinformatics\/bth901","volume":"20","author":"GA Lunter","year":"2004","unstructured":"Lunter GA, Hein J: A nucleotide substitution model with nearest-neighbour interactions. Bioinformatics 2004, 20(Suppl 1):I216-I223. 10.1093\/bioinformatics\/bth901","journal-title":"Bioinformatics"},{"issue":"4","key":"1167_CR56","doi-asserted-by":"publisher","first-page":"184","DOI":"10.1145\/367177.367199","volume":"3","author":"JL McCarthy","year":"1960","unstructured":"McCarthy JL: Recursive functions of symbolic expressions and their computation by machine. Communications of the ACM 1960, 3(4):184\u2013195. 10.1145\/367177.367199","journal-title":"Communications of the ACM"},{"key":"1167_CR57","volume-title":"The EM Algorithm and Extensions","author":"GJ McLachlan","year":"1996","unstructured":"McLachlan GJ, Krishnan T: The EM Algorithm and Extensions. Wiley Interscience; 1996."},{"issue":"2","key":"1167_CR58","doi-asserted-by":"publisher","first-page":"776","DOI":"10.1093\/nar\/gkh211","volume":"32","author":"IM Meyer","year":"2004","unstructured":"Meyer IM, Durbin R: Gene structure conservation aids similarity based gene prediction. Nucleic Acids Research 2004, 32(2):776\u2013783. 10.1093\/nar\/gkh211","journal-title":"Nucleic Acids Research"},{"key":"1167_CR59","doi-asserted-by":"publisher","first-page":"226","DOI":"10.1109\/78.738259","volume":"47","author":"S Michalek","year":"1999","unstructured":"Michalek S, Timmer J: Estimating rate constants in hidden Markov models by the EM algorithm. IEEE Transactions in Signal Processing 1999, 47: 226\u2013228. 10.1109\/78.738259","journal-title":"IEEE Transactions in Signal Processing"},{"issue":"3","key":"1167_CR60","doi-asserted-by":"publisher","first-page":"529","DOI":"10.1093\/molbev\/msh043","volume":"21","author":"I Mikl\u00f3s","year":"2004","unstructured":"Mikl\u00f3s I, Lunter G, Holmes I: A long indel model for evolutionary sequence alignment. Molecular Biology and Evolution 2004, 21(3):529\u2013540. 10.1093\/molbev\/msh043","journal-title":"Molecular Biology and Evolution"},{"key":"1167_CR61","doi-asserted-by":"publisher","first-page":"2469","DOI":"10.1002\/pro.5560071126","volume":"7","author":"K Mizuguchi","year":"1998","unstructured":"Mizuguchi K, Deane CM, Blundell TL, Overington JP: HOMSTRAD: a database of protein structure alignments for homologous families. Protein Science 1998, 7: 2469\u20132471.","journal-title":"Protein Science"},{"key":"1167_CR62","doi-asserted-by":"crossref","unstructured":"Moses AM, Chiang DY, Pollard DA, Iyer VN, Eisen MB: MONKEY: identifying conserved transcription-factor binding sites in multiple alignments using a binding site-specific evolutionary model. Genome Biology 2004., 5(12):","DOI":"10.1186\/gb-2004-5-12-r98"},{"issue":"6","key":"1167_CR63","doi-asserted-by":"publisher","first-page":"761","DOI":"10.1089\/10665270050514918","volume":"7","author":"T Muller","year":"2000","unstructured":"Muller T, Vingron M: Modeling amino acid replacement. Journal of Computational Biology 2000, 7(6):761\u2013776. 10.1089\/10665270050514918","journal-title":"Journal of Computational Biology"},{"key":"1167_CR64","volume-title":"Statistical Decision Theory and Related Topics","author":"J Neyman","year":"1971","unstructured":"Neyman J: Molecular studies of evolution: a source of novel statistical problems. In Statistical Decision Theory and Related Topics. Edited by: Gupta SS, Yackel J. Academic Press, New York; 1971."},{"issue":"13","key":"1167_CR65","doi-asserted-by":"publisher","first-page":"3812","DOI":"10.1093\/nar\/gkg509","volume":"31","author":"PC Ng","year":"2003","unstructured":"Ng PC, Henikoff S: SIFT: Predicting amino acid changes that affect protein function. Nucleic Acids Research 2003, 31(13):3812\u20133814. 10.1093\/nar\/gkg509","journal-title":"Nucleic Acids Research"},{"key":"1167_CR66","volume-title":"Probabilistic Reasoning in Intelligent Systems","author":"J Pearl","year":"1988","unstructured":"Pearl J: Probabilistic Reasoning in Intelligent Systems. Morgan Kaufmann Publishers, San Mateo, California; 1988."},{"issue":"4","key":"1167_CR67","doi-asserted-by":"publisher","first-page":"e33","DOI":"10.1371\/journal.pcbi.0020033","volume":"2","author":"JS Pedersen","year":"2006","unstructured":"Pedersen JS, Bejerano G, Siepel A, Rosenbloom K, Lindblad-Toh K, Lander ES, Kent J, Miller W, Haussler D: Identification and classification of conserved RNA secondary structures in the human genome. PLoS Computational Biology 2006, 2(4):e33. 10.1371\/journal.pcbi.0020033","journal-title":"PLoS Computational Biology"},{"issue":"2","key":"1167_CR68","doi-asserted-by":"publisher","first-page":"219","DOI":"10.1093\/bioinformatics\/19.2.219","volume":"19","author":"JS Pedersen","year":"2003","unstructured":"Pedersen JS, Hein J: Gene finding with a hidden Markov model of genome structure and evolution. Bioinformatics 2003, 19(2):219\u2013227. 10.1093\/bioinformatics\/19.2.219","journal-title":"Bioinformatics"},{"issue":"16","key":"1167_CR69","doi-asserted-by":"publisher","first-page":"4925","DOI":"10.1093\/nar\/gkh839","volume":"32","author":"JS Pedersen","year":"2004","unstructured":"Pedersen JS, Meyer IM, Forsberg R, Simmonds P, Hein J: A comparative method for finding and folding RNA secondary structures within protein-coding regions. Nucleic Acids Research 2004, 32(16):4925\u20134923. 10.1093\/nar\/gkh839","journal-title":"Nucleic Acids Research"},{"issue":"7108","key":"1167_CR70","doi-asserted-by":"publisher","first-page":"167","DOI":"10.1038\/nature05113","volume":"443","author":"Katherine S Pollard","year":"2006","unstructured":"Pollard KatherineS, Salama SofleR, Lambert Nelle, Lambot Marie-Alexandra, Coppens Sandra, Pedersen JakobS, Katzman Sol, King Bryan, Onodera Courtney, Siepel Adam, Kern AndrewD, Dehay Colette, Igel Haller, Ares Manuel Jr, Vanderhaeghen Pierre, Haussler David: An RNA gene expressed during cortical development evolved rapidly in humans. Nature 2006, 443(7108):167\u2013172. 10.1038\/nature05113","journal-title":"Nature"},{"issue":"1","key":"1167_CR71","doi-asserted-by":"publisher","first-page":"187","DOI":"10.1006\/jmbi.1998.2601","volume":"287","author":"DD Pollock","year":"1999","unstructured":"Pollock DD, Taylor WR, Goldman N: Coevolving protein residues: maximum likelihood identification and relationship to structure. Journal of Molecular Biology 1999, 287(1):187\u2013198. 10.1006\/jmbi.1998.2601","journal-title":"Journal of Molecular Biology"},{"issue":"1","key":"1167_CR72","doi-asserted-by":"publisher","first-page":"4","DOI":"10.1109\/MASSP.1986.1165342","volume":"3","author":"LR Rabiner","year":"1986","unstructured":"Rabiner LR, Juang BH: An introduction to hidden Markov models. IEEE ASSP Magazine 1986, 3(1):4\u201316.","journal-title":"IEEE ASSP Magazine"},{"issue":"4","key":"1167_CR73","doi-asserted-by":"publisher","first-page":"334","DOI":"10.1093\/bioinformatics\/16.4.334","volume":"16","author":"E Rivas","year":"2000","unstructured":"Rivas E, Eddy SR: The language of RNA: a formal grammar that includes pseudoknots. Bioinformatics 2000, 16(4):334\u2013340. 10.1093\/bioinformatics\/16.4.334","journal-title":"Bioinformatics"},{"key":"1167_CR74","doi-asserted-by":"crossref","unstructured":"Rivas E, Eddy SR: Noncoding RNA gene detection using comparative sequence analysis. BMC Bioinformatics 2001., 2(8):","DOI":"10.1186\/1471-2105-2-8"},{"key":"1167_CR75","volume-title":"S-expressions. Internet Draft","author":"R Rivest","year":"1997","unstructured":"Rivest R: S-expressions. Internet Draft.1997. [http:\/\/theory.lcs.mit.edu\/~rivest\/sexp.txt]"},{"key":"1167_CR76","doi-asserted-by":"publisher","first-page":"66","DOI":"10.1016\/S0076-6879(04)83004-0","volume":"383","author":"CA Rohl","year":"2004","unstructured":"Rohl CA, Strauss CE, Misura KM, Baker D: Protein structure prediction using Rosetta. Methods in Enzymology 2004, 383: 66\u201393.","journal-title":"Methods in Enzymology"},{"key":"1167_CR77","first-page":"406","volume":"4","author":"N Saitou","year":"1987","unstructured":"Saitou N, Nei M: The neighbor-joining method: a new method for reconstructing phylogenetic trees. Molecular Biology and Evolution 1987, 4: 406\u2013425.","journal-title":"Molecular Biology and Evolution"},{"key":"1167_CR78","doi-asserted-by":"publisher","first-page":"5112","DOI":"10.1093\/nar\/22.23.5112","volume":"22","author":"Y Sakakibara","year":"1994","unstructured":"Sakakibara Y, Brown M, Hughey R, Saira Mian I, Kimmen Sj\u00f6lander, Underwood RC, Haussler D: Stochastic context-free grammars for tRNA modeling. Nucleic Acids Research 1994, 22: 5112\u20135120.","journal-title":"Nucleic Acids Research"},{"issue":"8","key":"1167_CR79","doi-asserted-by":"publisher","first-page":"1034","DOI":"10.1101\/gr.3715005","volume":"15","author":"A Siepel","year":"2005","unstructured":"Siepel A, Bejerano G, Pedersen JS, Hinrichs AS, Hou M, Rosenbloom K, Clawson H, Spieth J, Hillier LW, Richards S, Weinstock GM, Wilson RK, Gibbs RA, Kent WJ, Miller W, Haussler D: Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes. Genome Research 2005, 15(8):1034\u20131050. 10.1101\/gr.3715005","journal-title":"Genome Research"},{"issue":"2\u20133","key":"1167_CR80","doi-asserted-by":"publisher","first-page":"413","DOI":"10.1089\/1066527041410472","volume":"11","author":"A Siepel","year":"2004","unstructured":"Siepel A, Haussler D: Combining phylogenetic and hidden Markov models in biosequence analysis. Journal of Computational Biology 2004, 11(2\u20133):413\u2013428. 10.1089\/1066527041410472","journal-title":"Journal of Computational Biology"},{"issue":"3","key":"1167_CR81","doi-asserted-by":"publisher","first-page":"468","DOI":"10.1093\/molbev\/msh039","volume":"21","author":"A Siepel","year":"2004","unstructured":"Siepel A, Haussler D: Phylogenetic estimation of context-dependent substitution rates by maximum likelihood. Molecular Biology and Evolution 2004, 21(3):468\u2013488. 10.1093\/molbev\/msh039","journal-title":"Molecular Biology and Evolution"},{"issue":"1","key":"1167_CR82","doi-asserted-by":"publisher","first-page":"227","DOI":"10.1016\/j.jmb.2004.03.025","volume":"339","author":"OS Soyer","year":"2004","unstructured":"Soyer OS, Goldstein RA: Predicting functional sites in proteins: site-specific evolutionary models and their application to neurotransmitter transporters. Journal of Molecular Biology 2004, 339(1):227\u2013242. 10.1016\/j.jmb.2004.03.025","journal-title":"Journal of Molecular Biology"},{"key":"1167_CR83","doi-asserted-by":"publisher","first-page":"666","DOI":"10.1093\/oxfordjournals.molbev.a025627","volume":"13","author":"JL Thorne","year":"1996","unstructured":"Thorne JL, Goldman N, Jones DT: Combining protein evolution and secondary structure. Molecular Biology and Evolution 1996, 13: 666\u2013673.","journal-title":"Molecular Biology and Evolution"},{"key":"1167_CR84","doi-asserted-by":"publisher","first-page":"114","DOI":"10.1007\/BF02193625","volume":"33","author":"JL Thorne","year":"1991","unstructured":"Thorne JL, Kishino H, Felsenstein J: An evolutionary model for maximum likelihood alignment of DNA sequences. Journal of Molecular Evolution 1991, 33: 114\u2013124. 10.1007\/BF02193625","journal-title":"Journal of Molecular Evolution"},{"issue":"1","key":"1167_CR85","doi-asserted-by":"publisher","first-page":"167","DOI":"10.1006\/jmbi.1998.1700","volume":"278","author":"WW Wasserman","year":"1998","unstructured":"Wasserman WW, Fickett JW: Identification of regulatory regions which confer muscle-specific gene expression. Journal of Molecular Biology 1998, 278(1):167\u2013181. 10.1006\/jmbi.1998.1700","journal-title":"Journal of Molecular Biology"},{"issue":"12","key":"1167_CR86","doi-asserted-by":"publisher","first-page":"1556","DOI":"10.1093\/bioinformatics\/btg188","volume":"19","author":"S Whelan","year":"2003","unstructured":"Whelan S, de Bakker PI, Goldman N: Pandit: a database of protein and associated nucleotide domains with inferred trees. Bioinformatics 2003, 19(12):1556\u20131563. 10.1093\/bioinformatics\/btg188","journal-title":"Bioinformatics"},{"issue":"5","key":"1167_CR87","doi-asserted-by":"publisher","first-page":"691","DOI":"10.1093\/oxfordjournals.molbev.a003851","volume":"18","author":"S Whelan","year":"2001","unstructured":"Whelan S, Goldman N: A general empirical model of protein evolution derived from multiple protein families using a maximum-likelihood approach. Molecular Biology and Evolution 2001, 18(5):691\u2013699.","journal-title":"Molecular Biology and Evolution"},{"key":"1167_CR88","unstructured":"The xgram file format[http:\/\/biowiki.org\/XgramFormat]"},{"key":"1167_CR89","unstructured":"Information on xrate, xgram, xprot, xfold and related tools[http:\/\/biowiki.org\/XgramSoftware]"},{"key":"1167_CR90","first-page":"1396","volume":"10","author":"Z Yang","year":"1993","unstructured":"Yang Z: Maximum-likelihood estimation of phylogeny from DNA sequences when substitution rates differ over sites. Molecular Biology and Evolution 1993, 10: 1396\u20131401.","journal-title":"Molecular Biology and Evolution"},{"key":"1167_CR91","first-page":"105","volume":"39","author":"Z Yang","year":"1994","unstructured":"Yang Z: Estimating the pattern of nucleotide substitution. Journal of Molecular Evolution 1994, 39: 105\u2013111.","journal-title":"Journal of Molecular Evolution"},{"key":"1167_CR92","doi-asserted-by":"publisher","first-page":"306","DOI":"10.1007\/BF00160154","volume":"39","author":"Z Yang","year":"1994","unstructured":"Yang Z: Maximum likelihood phylogenetic estimation from DNA sequences with variable rates over sites: approximate methods. Journal of Molecular Evolution 1994, 39: 306\u2013314. 10.1007\/BF00160154","journal-title":"Journal of Molecular Evolution"},{"key":"1167_CR93","first-page":"432","volume":"155","author":"Z Yang","year":"2000","unstructured":"Yang Z, Nielsen R, Goldman N, Pedersen A-M: Codon-substitution models for heterogeneous selection pressure at amino acid sites. Genetics 2000, 155: 432\u2013449.","journal-title":"Genetics"},{"key":"1167_CR94","volume-title":"Statistical Methods in Molecular Evolution, chapter Estimating substitution matrices","author":"VB Yap","year":"2005","unstructured":"Yap VB, Speed TP: Statistical Methods in Molecular Evolution, chapter Estimating substitution matrices. Springer; 2005."}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-7-428.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,2,6]],"date-time":"2024-02-06T21:26:26Z","timestamp":1707254786000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-7-428"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2006,10,3]]},"references-count":94,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2006,12]]}},"alternative-id":["1167"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-7-428","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2006,10,3]]},"assertion":[{"value":"24 February 2006","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"3 October 2006","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"3 October 2006","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"428"}}