{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,10]],"date-time":"2026-02-10T19:55:56Z","timestamp":1770753356554,"version":"3.50.0"},"reference-count":13,"publisher":"Oxford University Press (OUP)","issue":"7","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2013,4,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Analyses in comparative genomics often require non-redundant genome datasets. Eliminating redundancy is not as simple as keeping one strain for each named species because genomes might be redundant at a higher taxonomic level than that of species for some analyses; some strains with different species names can be as similar as most strains sharing a species name, whereas some strains sharing a species name can be so different that they should be put into different groups; and some genomes lack a species name.<\/jats:p>\n               <jats:p>Results: We have implemented a method and Web server that clusters a genome dataset into groups of redundant genomes at different thresholds based on a few phylogenomic distance measures.<\/jats:p>\n               <jats:p>Availability: The Web interface, similarity and distance data and R-scripts can be accessed at http:\/\/microbiome.wlu.ca\/research\/redundancy\/.<\/jats:p>\n               <jats:p>Contact: \u00a0gmoreno@wlu.ca<\/jats:p>","DOI":"10.1093\/bioinformatics\/btt064","type":"journal-article","created":{"date-parts":[[2013,2,10]],"date-time":"2013-02-10T01:09:00Z","timestamp":1360458540000},"page":"947-949","source":"Crossref","is-referenced-by-count":36,"title":["Phylogenomic clustering for selecting non-redundant genomes for comparative genomics"],"prefix":"10.1093","volume":"29","author":[{"given":"Gabriel","family":"Moreno-Hagelsieb","sequence":"first","affiliation":[{"name":"1 Department of Biology and 2Department of Mathematics, Wilfrid Laurier University, Waterloo, ON N2L 3C5, Canada"}]},{"given":"Zilin","family":"Wang","sequence":"additional","affiliation":[{"name":"1 Department of Biology and 2Department of Mathematics, Wilfrid Laurier University, Waterloo, ON N2L 3C5, Canada"}]},{"given":"Stephanie","family":"Walsh","sequence":"additional","affiliation":[{"name":"1 Department of Biology and 2Department of Mathematics, Wilfrid Laurier University, Waterloo, ON N2L 3C5, Canada"}]},{"given":"Aisha","family":"ElSherbiny","sequence":"additional","affiliation":[{"name":"1 Department of Biology and 2Department of Mathematics, Wilfrid Laurier University, Waterloo, ON N2L 3C5, Canada"}]}],"member":"286","published-online":{"date-parts":[[2013,2,8]]},"reference":[{"key":"2023020303400365500_btt064-B1","doi-asserted-by":"crossref","first-page":"332","DOI":"10.1186\/1471-2164-11-332","article-title":"Understanding the evolutionary relationships and major traits of bacillus through comparative genomics","volume":"11","author":"Alcaraz","year":"2010","journal-title":"BMC Genomics"},{"key":"2023020303400365500_btt064-B2","doi-asserted-by":"crossref","first-page":"9184","DOI":"10.1073\/pnas.96.16.9184","article-title":"Genome signature comparisons among prokaryote, plasmid, and mitochondrial DNA","volume":"96","author":"Campbell","year":"1999","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023020303400365500_btt064-B3","doi-asserted-by":"crossref","DOI":"10.1007\/978-0-387-84858-7","volume-title":"The Elements of Statistical Learning: Data Mining, Inference, and Prediction","author":"Hastie","year":"2009","edition":"2nd"},{"key":"2023020303400365500_btt064-B4","doi-asserted-by":"crossref","first-page":"5392","DOI":"10.1093\/nar\/gkh882","article-title":"Conservation of adjacency as evidence of paralogous operons","volume":"32","author":"Janga","year":"2004","journal-title":"Nucleic Acids Res."},{"key":"2023020303400365500_btt064-B5","doi-asserted-by":"crossref","first-page":"616","DOI":"10.1093\/nar\/gki181","article-title":"Measuring genome conservation across taxa: divided strains and united kingdoms","volume":"33","author":"Kunin","year":"2005","journal-title":"Nucleic Acids Res."},{"key":"2023020303400365500_btt064-B6","doi-asserted-by":"crossref","first-page":"344","DOI":"10.1002\/prot.21564","article-title":"Operons and the effect of genome redundancy in deciphering functional relationships using phylogenetic profiles","volume":"70","author":"Moreno-Hagelsieb","year":"2008","journal-title":"Proteins"},{"key":"2023020303400365500_btt064-B7","doi-asserted-by":"crossref","first-page":"319","DOI":"10.1093\/bioinformatics\/btm585","article-title":"Choosing BLAST options for better detection of orthologs as reciprocal best hits","volume":"24","author":"Moreno-Hagelsieb","year":"2008","journal-title":"Bioinformatics"},{"key":"2023020303400365500_btt064-B8","doi-asserted-by":"crossref","first-page":"2896","DOI":"10.1073\/pnas.96.6.2896","article-title":"The use of gene clusters to infer functional coupling","volume":"96","author":"Overbeek","year":"1999","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023020303400365500_btt064-B9","doi-asserted-by":"crossref","first-page":"D61","DOI":"10.1093\/nar\/gkl842","article-title":"NCBI reference sequences (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins","volume":"35","author":"Pruitt","year":"2007","journal-title":"Nucleic Acids Res."},{"key":"2023020303400365500_btt064-B10","volume-title":"R: A Language and Environment for Statistical Computing","author":"R Development Core Team","year":"2012"},{"key":"2023020303400365500_btt064-B11","doi-asserted-by":"crossref","first-page":"84","DOI":"10.1186\/1471-2148-6-84","article-title":"The reach of the genome signature in prokaryotes","volume":"6","author":"van Passel","year":"2006","journal-title":"BMC Evol. Biol."},{"key":"2023020303400365500_btt064-B12","doi-asserted-by":"crossref","first-page":"1056","DOI":"10.1038\/nature08656","article-title":"A phylogeny-driven genomic encyclopaedia of Bacteria and Archaea","volume":"462","author":"Wu","year":"2009","journal-title":"Nature"},{"key":"2023020303400365500_btt064-B13","doi-asserted-by":"crossref","first-page":"243","DOI":"10.1186\/1471-2105-6-243","article-title":"Phylogenetic detection of conserved gene clusters in microbial genomes","volume":"6","author":"Zheng","year":"2005","journal-title":"BMC Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/29\/7\/947\/49060880\/bioinformatics_29_7_947.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/29\/7\/947\/49060880\/bioinformatics_29_7_947.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,2,3]],"date-time":"2023-02-03T03:40:18Z","timestamp":1675395618000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/29\/7\/947\/253529"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2013,2,8]]},"references-count":13,"journal-issue":{"issue":"7","published-print":{"date-parts":[[2013,4,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btt064","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2013,4,1]]},"published":{"date-parts":[[2013,2,8]]}}}