{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,21]],"date-time":"2026-02-21T19:51:36Z","timestamp":1771703496792,"version":"3.50.1"},"reference-count":27,"publisher":"Springer Science and Business Media LLC","issue":"1","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2008,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>Each genome has a stable distribution of the combined frequency for each <jats:italic>k<\/jats:italic>-mer and its reverse complement measured in sequence fragments as short as 1000 bps across the whole genome, for 1&lt;k&lt;6. The collection of these <jats:italic>k<\/jats:italic>-mer frequency distributions is unique to each genome and termed the genome's <jats:italic>barcode<\/jats:italic>.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Results<\/jats:title>\n            <jats:p>We found that for each genome, the majority of its short sequence fragments have highly similar barcodes while sequence fragments with different barcodes typically correspond to genes that are horizontally transferred or highly expressed. This observation has led to new and more effective ways for addressing two challenging problems: metagenome binning problem and identification of horizontally transferred genes. Our barcode-based metagenome binning algorithm substantially improves the state of the art in terms of both binning accuracies and the scope of applicability. Other attractive properties of genomes barcodes include (a) the barcodes have different and identifiable characteristics for different classes of genomes like prokaryotes, eukaryotes, mitochondria and plastids, and (b) barcodes similarities are generally proportional to the genomes' phylogenetic closeness.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Conclusion<\/jats:title>\n            <jats:p>These and other properties of genomes barcodes make them a new and effective tool for studying numerous genome and metagenome analysis problems.<\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/1471-2105-9-546","type":"journal-article","created":{"date-parts":[[2008,12,17]],"date-time":"2008-12-17T19:15:12Z","timestamp":1229541312000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":76,"title":["Barcodes for genomes and applications"],"prefix":"10.1186","volume":"9","author":[{"given":"Fengfeng","family":"Zhou","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Victor","family":"Olman","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ying","family":"Xu","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2008,12,17]]},"reference":[{"issue":"5717","key":"2531_CR1","doi-asserted-by":"publisher","first-page":"1915","DOI":"10.1126\/science.1104816","volume":"307","author":"F Backhed","year":"2005","unstructured":"Backhed F, Ley RE, Sonnenburg JL, Peterson DA, Gordon JI: Host-bacterial mutualism in the human intestine. Science 2005, 307(5717):1915\u20131920. 10.1126\/science.1104816","journal-title":"Science"},{"issue":"7","key":"2531_CR2","doi-asserted-by":"publisher","first-page":"3801","DOI":"10.1073\/pnas.96.7.3801","volume":"96","author":"R Jain","year":"1999","unstructured":"Jain R, Rivera MC, Lake JA: Horizontal gene transfer among genomes: the complexity hypothesis. Proceedings of the National Academy of Sciences of the United States of America 1999, 96(7):3801\u20133806. 10.1073\/pnas.96.7.3801","journal-title":"Proceedings of the National Academy of Sciences of the United States of America"},{"issue":"2\u20133","key":"2531_CR3","doi-asserted-by":"publisher","first-page":"167","DOI":"10.1159\/000150543","volume":"40","author":"TK Frey","year":"1997","unstructured":"Frey TK: Neurological aspects of rubella virus infection. Intervirology 1997, 40(2\u20133):167\u2013175. 10.1159\/000150543","journal-title":"Intervirology"},{"issue":"5","key":"2531_CR4","doi-asserted-by":"publisher","first-page":"895","DOI":"10.1046\/j.1365-2958.1999.01533.x","volume":"33","author":"VN Rybchin","year":"1999","unstructured":"Rybchin VN, Svarchevsky AN: The plasmid prophage N15: a linear DNA with covalently closed ends. Mol Microbiol 1999, 33(5):895\u2013903. 10.1046\/j.1365-2958.1999.01533.x","journal-title":"Mol Microbiol"},{"issue":"1","key":"2531_CR5","doi-asserted-by":"publisher","first-page":"63","DOI":"10.1038\/nmeth976","volume":"4","author":"AC McHardy","year":"2007","unstructured":"McHardy AC, Martin HG, Tsirigos A, Hugenholtz P, Rigoutsos I: Accurate phylogenetic classification of variable-length DNA fragments. Nat Methods 2007, 4(1):63\u201372. 10.1038\/nmeth976","journal-title":"Nat Methods"},{"issue":"4","key":"2531_CR6","doi-asserted-by":"publisher","first-page":"406","DOI":"10.1360\/062004-96","volume":"48","author":"E Yang","year":"2005","unstructured":"Yang E, Bin W, Peng J, Zhang X, Wang J, Yang J, Dong J, Chu Y, Zhang J, Jin Q: Comparative genomics and phylogenetic analysis of S. dysenteriae subgroup. Sci China C Life Sci 2005, 48(4):406\u2013413. 10.1360\/062004-96","journal-title":"Sci China C Life Sci"},{"issue":"7","key":"2531_CR7","doi-asserted-by":"publisher","first-page":"3816","DOI":"10.1073\/pnas.77.7.3816","volume":"77","author":"EN Trifonov","year":"1980","unstructured":"Trifonov EN, Sussman JL: The pitch of chromatin DNA is reflected in its nucleotide sequence. Proceedings of the National Academy of Sciences of the United States of America 1980, 77(7):3816\u20133820. 10.1073\/pnas.77.7.3816","journal-title":"Proceedings of the National Academy of Sciences of the United States of America"},{"key":"2531_CR8","first-page":"826","volume":"20","author":"M Borodovsky","year":"1986","unstructured":"Borodovsky M, Sprizhitskii Y, Golovanov E, Aleksandrov A: Statistical patterns in primary structures of functional regions in the E. coli genome. I. Oligonucleotide frequencies analysis. Molecular Biology 1986, 20: 826\u2013833.","journal-title":"Molecular Biology"},{"key":"2531_CR9","first-page":"833","volume":"20","author":"M Borodovsky","year":"1986","unstructured":"Borodovsky M, Sprizhitskii Y, Golovanov E, Aleksandrov A: Statistical patterns in primary structures of functional regions in the E. coli genome. II. Non-homogeneous Markov models. Molecular Biology 1986, 20: 833\u2013840.","journal-title":"Molecular Biology"},{"key":"2531_CR10","first-page":"1145","volume":"20","author":"M Borodovsky","year":"1986","unstructured":"Borodovsky M, Sprizhitskii Y, Golovanov E, Aleksandrov A: Statistical patterns in primary structures of functional regions in the E. coli genome. III. Computer recognition of coding regions. Molecular Biology 1986, 20: 1145\u20131150.","journal-title":"Molecular Biology"},{"issue":"7","key":"2531_CR11","doi-asserted-by":"publisher","first-page":"283","DOI":"10.1016\/S0168-9525(00)89076-9","volume":"11","author":"S Karlin","year":"1995","unstructured":"Karlin S, Burge C: Dinucleotide relative abundance extremes: a genomic signature. Trends Genet 1995, 11(7):283\u2013290. 10.1016\/S0168-9525(00)89076-9","journal-title":"Trends Genet"},{"issue":"26","key":"2531_CR12","doi-asserted-by":"publisher","first-page":"14225","DOI":"10.1073\/pnas.94.26.14225","volume":"94","author":"S Karlin","year":"1997","unstructured":"Karlin S, Zhu ZY, Karlin KD: The extended environment of mononuclear metal centers in protein structures. Proceedings of the National Academy of Sciences of the United States of America 1997, 94(26):14225\u201314230. 10.1073\/pnas.94.26.14225","journal-title":"Proceedings of the National Academy of Sciences of the United States of America"},{"issue":"16","key":"2531_CR13","doi-asserted-by":"publisher","first-page":"9190","DOI":"10.1073\/pnas.96.16.9190","volume":"96","author":"S Karlin","year":"1999","unstructured":"Karlin S, Brocchieri L, Mrazek J, Campbell AM, Spormann AM: A chimeric prokaryotic ancestry of mitochondria and primitive eukaryotes. Proceedings of the National Academy of Sciences of the United States of America 1999, 96(16):9190\u20139195. 10.1073\/pnas.96.16.9190","journal-title":"Proceedings of the National Academy of Sciences of the United States of America"},{"key":"2531_CR14","unstructured":"Computed_barcodes[http:\/\/csbl.bmb.uga.edu\/~ffzhou\/BoDB\/]"},{"key":"2531_CR15","unstructured":"Supplementary_material[http:\/\/csbl.bmb.uga.edu\/~ffzhou\/BoDB\/supp\/]"},{"issue":"7","key":"2531_CR16","doi-asserted-by":"publisher","first-page":"1590","DOI":"10.1093\/nar\/29.7.1590","volume":"29","author":"J Mrazek","year":"2001","unstructured":"Mrazek J, Bhaya D, Grossman AR, Karlin S: Highly expressed and alien genes of the Synechocystis genome. Nucleic Acids Res 2001, 29(7):1590\u20131601. 10.1093\/nar\/29.7.1590","journal-title":"Nucleic Acids Res"},{"issue":"18","key":"2531_CR17","doi-asserted-by":"publisher","first-page":"5238","DOI":"10.1128\/JB.182.18.5238-5250.2000","volume":"182","author":"S Karlin","year":"2000","unstructured":"Karlin S, Mrazek J: Predicted highly expressed genes of diverse prokaryotic genomes. J Bacteriol 2000, 182(18):5238\u20135250. 10.1128\/JB.182.18.5238-5250.2000","journal-title":"J Bacteriol"},{"key":"2531_CR18","volume-title":"Bioinformatics","author":"G Lima-Mendez","year":"2008","unstructured":"Lima-Mendez G, Helden JV, Toussaint A, Leplae R: Prophinder: a computational tool for prophage prediction in pro-karyotic genomes. Bioinformatics 2008."},{"issue":"6784","key":"2531_CR19","doi-asserted-by":"publisher","first-page":"299","DOI":"10.1038\/35012500","volume":"405","author":"H Ochman","year":"2000","unstructured":"Ochman H, Lawrence JG, Groisman EA: Lateral gene transfer and the nature of bacterial innovation. Nature 2000, 405(6784):299\u2013304. 10.1038\/35012500","journal-title":"Nature"},{"issue":"4","key":"2531_CR20","doi-asserted-by":"publisher","first-page":"383","DOI":"10.1007\/PL00006158","volume":"44","author":"JG Lawrence","year":"1997","unstructured":"Lawrence JG, Ochman H: Amelioration of bacterial genomes: rates of change and exchange. J Mol Evol 1997, 44(4):383\u2013397. 10.1007\/PL00006158","journal-title":"J Mol Evol"},{"key":"2531_CR21","doi-asserted-by":"crossref","unstructured":"Liolios K, Mavromatis K, Tavernarakis N, Kyrpides NC: The Genomes On Line Database (GOLD) in 2007: status of genomic and metagenomic projects and their associated metadata. Nucleic Acids Res 2008, (36 Database):D475\u2013479.","DOI":"10.1093\/nar\/gkm884"},{"issue":"5","key":"2531_CR22","doi-asserted-by":"publisher","first-page":"499","DOI":"10.1016\/j.mib.2007.08.004","volume":"10","author":"AC McHardy","year":"2007","unstructured":"McHardy AC, Rigoutsos I: What's in the mix: phylogenetic classification of metagenome sequence samples. Current opinion in microbiology 2007, 10(5):499\u2013503.","journal-title":"Current opinion in microbiology"},{"issue":"20","key":"2531_CR23","doi-asserted-by":"publisher","first-page":"7303","DOI":"10.1073\/pnas.0502313102","volume":"102","author":"S Karlin","year":"2005","unstructured":"Karlin S, Mrazek J, Ma J, Brocchieri L: Predicted highly expressed genes in archaeal genomes. Proceedings of the National Academy of Sciences of the United States of America 2005, 102(20):7303\u20137308. 10.1073\/pnas.0502313102","journal-title":"Proceedings of the National Academy of Sciences of the United States of America"},{"key":"2531_CR24","doi-asserted-by":"publisher","first-page":"314","DOI":"10.1111\/j.1749-6632.1999.tb08893.x","volume":"870","author":"J Mrazek","year":"1999","unstructured":"Mrazek J, Karlin S: Detecting alien genes in bacterial genomes. Ann N Y Acad Sci 1999, 870: 314\u2013329. 10.1111\/j.1749-6632.1999.tb08893.x","journal-title":"Ann N Y Acad Sci"},{"key":"2531_CR25","volume-title":"IEEE\/ACM Transactions on Computational Biology and Bioinformatics","author":"V Olman","year":"2007","unstructured":"Olman V, Mao F, Wu H, Xu Y: Parallel Clustering Algorithm for Large Data Sets with applications in Bioinformatics. IEEE\/ACM Transactions on Computational Biology and Bioinformatics 2007, in press."},{"key":"2531_CR26","doi-asserted-by":"crossref","unstructured":"DeSantis TZ Jr, Hugenholtz P, Keller K, Brodie EL, Larsen N, Piceno YM, Phan R, Andersen GL: NAST: a multiple sequence alignment server for comparative analysis of 16S rRNA genes. Nucleic Acids Res 2006, (34 Web Server):W394\u2013399. 10.1093\/nar\/gkl244","DOI":"10.1093\/nar\/gkl244"},{"key":"2531_CR27","volume-title":"Introduction to Algorithms","author":"TH Cormen","year":"2001","unstructured":"Cormen TH, Leiserson CE, Rivest RL, Stein C: Introduction to Algorithms. Second edition. Cambridge, MA The MIT Press; 2001.","edition":"Second"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-9-546.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,1]],"date-time":"2021-09-01T11:13:33Z","timestamp":1630494813000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-9-546"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2008,12]]},"references-count":27,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2008,12]]}},"alternative-id":["2531"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-9-546","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2008,12]]},"assertion":[{"value":"16 June 2008","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"17 December 2008","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"17 December 2008","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"546"}}