{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,18]],"date-time":"2026-03-18T04:47:01Z","timestamp":1773809221745,"version":"3.50.1"},"reference-count":16,"publisher":"Springer Science and Business Media LLC","issue":"S9","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2011,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Background<\/jats:title><jats:p>A large family of viruses that infect bacteria, called<jats:italic>phages<\/jats:italic>, is characterized by long tails used to inject DNA into their victims' cells. The<jats:italic>tape measure protein<\/jats:italic>got its name because the length of the corresponding gene is proportional to the length of the phage's tail: a fact shown by actually copying or splicing out parts of DNA in exemplar species. A natural question is whether there exist<jats:italic>units<\/jats:italic>for these tape measures, and if different tape measures have different units and lengths. Such units would allow us to retrace the evolution of tape measure proteins using their duplication\/loss history. The vast number of sequenced phages genomes allows us to attack this problem with a comparative genomics approach.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>Here we describe a subset of phages whose tape measure proteins contain variable numbers of an 11 amino acids sequence repeat, aligned with sequence similarity, structural properties, and simple arithmetics. This subset provides a unique opportunity for the combinatorial study of phage evolution, without the added uncertainties of multiple alignments, which are trivial in this case, or of protein functions, that are well established. We give a heuristic that reconstructs the duplication history of these sequences, using divergent strains to discriminate between mutations that occurred before and after speciation, or lineage divergence. The heuristic is based on an efficient algorithm that gives an exhaustive enumeration of all possible parsimonious reconstructions of the duplication\/speciation history of a single nucleotide. Finally, we present a method that allows, when possible, to discriminate between duplication and loss events.<\/jats:p><\/jats:sec><jats:sec><jats:title>Conclusions<\/jats:title><jats:p>Establishing the evolutionary history of viruses is difficult, in part due to extensive recombinations and gene transfers, and high mutation rates that often erase detectable similarity between homologous genes. In this paper, we introduce new tools to address this problem.<\/jats:p><\/jats:sec>","DOI":"10.1186\/1471-2105-12-s9-s10","type":"journal-article","created":{"date-parts":[[2011,10,5]],"date-time":"2011-10-05T18:47:56Z","timestamp":1317840476000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":32,"title":["The evolution of the tape measure protein: units, duplications and losses"],"prefix":"10.1186","volume":"12","author":[{"given":"Mahdi","family":"Belcaid","sequence":"first","affiliation":[]},{"given":"Anne","family":"Bergeron","sequence":"additional","affiliation":[]},{"given":"Guylaine","family":"Poisson","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2011,10,5]]},"reference":[{"key":"4817_CR1","doi-asserted-by":"publisher","first-page":"691","DOI":"10.1016\/0092-8674(84)90476-8","volume":"39","author":"I Katsura","year":"1984","unstructured":"Katsura I, Hendrix RW: Length determination in bacteriophage lambda tails. Cell 1984, 39: 691\u2013698. 10.1016\/0092-8674(84)90476-8","journal-title":"Cell"},{"key":"4817_CR2","doi-asserted-by":"publisher","first-page":"728","DOI":"10.1128\/JB.01363-08","volume":"191","author":"M Siponen","year":"2009","unstructured":"Siponen M, Sciara G, Villion M, Spinelli S, Lichi\u00e8re J, Cambillau C, Moineau S, Campanacci V: Crystal structure of ORF12 from Lactococcus lactis phage p2 identifies a tape measure protein chaperone. J. Bacteriol 2009, 191: 728\u2013734. 10.1128\/JB.01363-08","journal-title":"J. Bacteriol"},{"key":"4817_CR3","doi-asserted-by":"publisher","first-page":"13","DOI":"10.1016\/S0092-8674(01)00637-7","volume":"108","author":"H Brussow","year":"2002","unstructured":"Brussow H, Hendrix RW: Phage genomics: small is beautiful. Cell 2002, 108: 13\u201316. 10.1016\/S0092-8674(01)00637-7","journal-title":"Cell"},{"key":"4817_CR4","doi-asserted-by":"crossref","first-page":"205","DOI":"10.1093\/oso\/9780198566106.003.0008","volume-title":"Mathematics of Evolution and Phylogeny","author":"O Gascuel","year":"2005","unstructured":"Gascuel O, Bertrand D, Elemento O: Reconstructing the duplication history of tandemly repeated sequences. In Mathematics of Evolution and Phylogeny. Edited by: Gascuel O. Oxford Univ. Press; 2005:205\u2013235."},{"issue":"2","key":"4817_CR5","doi-asserted-by":"publisher","first-page":"225","DOI":"10.1142\/S012905410400239X","volume":"15","author":"E Rivals","year":"2004","unstructured":"Rivals E: A Survey on Algorithmic Aspects of Tandem Repeats Evolution. International J. of Foundations of Computer Science 2004, 15(2):225\u2013257. Special Issue \u201cCombinatorics on Words with Applications\u201d Special Issue \u201cCombinatorics on Words with Applications\u201d 10.1142\/S012905410400239X","journal-title":"International J. of Foundations of Computer Science"},{"key":"4817_CR6","doi-asserted-by":"crossref","first-page":"623","DOI":"10.1093\/genetics\/86.3.623","volume":"86","author":"WM Fitch","year":"1977","unstructured":"Fitch WM: Phylogenies constrained by the crossover process as illustrated by human hemoglobins and a thirteen-cycle, eleven-amino-acid repeat in human apolipoprotein A-I. Genetics 1977, 86: 623\u2013644.","journal-title":"Genetics"},{"key":"4817_CR7","doi-asserted-by":"publisher","first-page":"462","DOI":"10.1089\/cmb.2007.A007","volume":"14","author":"M Lajoie","year":"2007","unstructured":"Lajoie M, Bertrand D, El-Mabrouk N, Gascuel O: Duplication and inversion history of a tandemly repeated genes family. J. Comput. Biol 2007, 14: 462\u2013478. 10.1089\/cmb.2007.A007","journal-title":"J. Comput. Biol"},{"key":"4817_CR8","doi-asserted-by":"publisher","first-page":"1051","DOI":"10.1089\/cmb.2009.0040","volume":"16","author":"Y Zhang","year":"2009","unstructured":"Zhang Y, Song G, Vinar T, Green ED, Siepel A, Miller W: Evolutionary history reconstruction for Mammalian complex gene clusters. J. Comput. Biol 2009, 16: 1051\u20131070. 10.1089\/cmb.2009.0040","journal-title":"J. Comput. Biol"},{"key":"4817_CR9","first-page":"44","volume-title":"Proceedings of the Seventh International Conference on Intelligent Systems for Molecular Biology","author":"G Benson","year":"1999","unstructured":"Benson G, Dong L: Reconstructing the Duplication History of a Tandem Repeat. Proceedings of the Seventh International Conference on Intelligent Systems for Molecular Biology AAAI Press; 1999, 44\u201353. [http:\/\/portal.acm.org\/citation.cfm?id=645634.660817]"},{"key":"4817_CR10","doi-asserted-by":"publisher","first-page":"573","DOI":"10.1093\/nar\/27.2.573","volume":"27","author":"G Benson","year":"1999","unstructured":"Benson G: Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res 1999, 27: 573\u2013580. 10.1093\/nar\/27.2.573","journal-title":"Nucleic Acids Res"},{"key":"4817_CR11","doi-asserted-by":"publisher","first-page":"4633","DOI":"10.1093\/nar\/29.22.4633","volume":"29","author":"S Kurtz","year":"2001","unstructured":"Kurtz S, Choudhuri J, Ohlebusch E, Schleiermacher C, Stoye J, Giegerich R: REPuter: The Manifold Applications of Repeat Analysis on a Genomic Scale. Nucleic Acids Res 2001, 29: 4633\u20134642. 10.1093\/nar\/29.22.4633","journal-title":"Nucleic Acids Res"},{"key":"4817_CR12","doi-asserted-by":"publisher","first-page":"W369","DOI":"10.1093\/nar\/gkl198","volume":"34","author":"TL Bailey","year":"2006","unstructured":"Bailey TL, Williams N, Misleh C, Li WW: MEME: discovering and analyzing DNA and protein sequence motifs. Nucleic Acids Res 2006, 34: W369\u2013373. 10.1093\/nar\/gkl198","journal-title":"Nucleic Acids Res"},{"key":"4817_CR13","doi-asserted-by":"publisher","first-page":"406","DOI":"10.2307\/2412116","volume":"20","author":"WM Fitch","year":"1971","unstructured":"Fitch WM: Toward defining the course of evolution: Minimum change for a specified tree topology. Systematic Zoology 1971, 20: 406\u2013416. 10.2307\/2412116","journal-title":"Systematic Zoology"},{"key":"4817_CR14","doi-asserted-by":"publisher","first-page":"35","DOI":"10.1137\/0128004","volume":"28","author":"D Sankoff","year":"1975","unstructured":"Sankoff D: Minimal Mutation Trees of Sequences. SIAM Journal on Applied Mathematics 1975, 28: 35\u201342. 10.1137\/0128004","journal-title":"SIAM Journal on Applied Mathematics"},{"key":"4817_CR15","doi-asserted-by":"publisher","first-page":"447","DOI":"10.1016\/j.mib.2008.09.004","volume":"11","author":"GF Hatfull","year":"2008","unstructured":"Hatfull GF: Bacteriophage genomics. Curr. Opin. Microbiol 2008, 11: 447\u2013453. 10.1016\/j.mib.2008.09.004","journal-title":"Curr. Opin. Microbiol"},{"key":"4817_CR16","doi-asserted-by":"publisher","first-page":"1315","DOI":"10.1089\/cmb.2010.0108","volume":"17","author":"M Belcaid","year":"2010","unstructured":"Belcaid M, Bergeron A, Poisson G: Mosaic graphs and comparative genomics in phage communities. J. Comput. Biol 2010, 17: 1315\u20131326. 10.1089\/cmb.2010.0108","journal-title":"J. Comput. Biol"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-12-S9-S10.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,4,13]],"date-time":"2024-04-13T09:28:22Z","timestamp":1713000502000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-12-S9-S10"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,10,5]]},"references-count":16,"journal-issue":{"issue":"S9","published-print":{"date-parts":[[2011,12]]}},"alternative-id":["4817"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-12-s9-s10","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2011,10,5]]},"assertion":[{"value":"5 October 2011","order":1,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"S10"}}