{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,9]],"date-time":"2026-01-09T20:24:56Z","timestamp":1767990296134,"version":"3.49.0"},"reference-count":37,"publisher":"Oxford University Press (OUP)","issue":"19","license":[{"start":{"date-parts":[[2019,2,19]],"date-time":"2019-02-19T00:00:00Z","timestamp":1550534400000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/4.0\/"}],"funder":[{"DOI":"10.13039\/100000104","name":"National Aeronautics and Space Administration","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100000104","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000104","name":"NASA","doi-asserted-by":"publisher","award":["NNX16AJ30G"],"award-info":[{"award-number":["NNX16AJ30G"]}],"id":[{"id":"10.13039\/100000104","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Michigan Space Consortium"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2019,10,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>The promise of higher phylogenetic stability through increased dataset sizes within tree of life (TOL) reconstructions has not been fulfilled. Among the many possible causes are changes in species composition (taxon sampling) that could influence phylogenetic accuracy of the methods by altering the relative weight of the evolutionary histories of each individual species. This effect would be stronger in clades that are represented by few lineages, which is common in many prokaryote phyla. Indeed, phyla with fewer taxa showed the most discordance among recent TOL studies. We implemented an approach to systematically test how the identity of taxa among a larger dataset and the number of taxa included affected the accuracy of phylogenetic reconstruction.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>Utilizing an empirical dataset within Terrabacteria we found that even within scenarios consisting of the same number of taxa, the species used strongly affected phylogenetic stability. Furthermore, we found that trees with fewer species were more dissimilar to the tree produced from the full dataset. These results hold even when the tree is composed by many phyla and only one of them is being altered. Thus, the effect of taxon sampling in one group does not seem to be buffered by the presence of many other clades, making this issue relevant even to very large datasets. Our results suggest that a systematic evaluation of phylogenetic stability through taxon resampling is advisable even for very large datasets.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>https:\/\/github.com\/BlabOaklandU\/PATS.git.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Supplementary information<\/jats:title>\n                  <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btz121","type":"journal-article","created":{"date-parts":[[2019,2,18]],"date-time":"2019-02-18T04:25:53Z","timestamp":1550463953000},"page":"3608-3616","source":"Crossref","is-referenced-by-count":4,"title":["Choice of species affects phylogenetic stability of deep nodes: an empirical example in Terrabacteria"],"prefix":"10.1093","volume":"35","author":[{"given":"Ashley A","family":"Superson","sequence":"first","affiliation":[{"name":"Department of Biological Sciences, Oakland University , Rochester, MI, USA"}]},{"given":"Doug","family":"Phelan","sequence":"additional","affiliation":[{"name":"Department of Biological Sciences, Oakland University , Rochester, MI, USA"}]},{"given":"Allyson","family":"Dekovich","sequence":"additional","affiliation":[{"name":"Department of Biological Sciences, Oakland University , Rochester, MI, USA"}]},{"given":"Fabia U","family":"Battistuzzi","sequence":"additional","affiliation":[{"name":"Department of Biological Sciences, Oakland University , Rochester, MI, USA"},{"name":"Center for Data Science and Big Data Analytics, Oakland University , Rochester, MI, USA"}]}],"member":"286","published-online":{"date-parts":[[2019,2,19]]},"reference":[{"key":"2023013108155220400_btz121-B1","doi-asserted-by":"crossref","first-page":"439","DOI":"10.1016\/j.tree.2006.05.009","article-title":"The use of genome-level characters for phylogenetic reconstruction","volume":"21","author":"Boore","year":"2006","journal-title":"Trends Ecol. Evol"},{"key":"2023013108155220400_btz121-B2","doi-asserted-by":"crossref","first-page":"20152802.","DOI":"10.1098\/rspb.2015.2802","article-title":"Untangling the early diversification of eukaryotes: a phylogenomic study of the evolutionary origins of Centrohelida, Haptophyta and Cryptista","volume":"283","author":"Burki","year":"2016","journal-title":"Proc. R. Soc. B"},{"key":"2023013108155220400_btz121-B3","doi-asserted-by":"crossref","first-page":"1164","DOI":"10.1093\/bioinformatics\/btr088","article-title":"ProtTest 3: fast selection of best-fit models of protein evolution","volume":"27","author":"Darriba","year":"2011","journal-title":"Bioinformatics"},{"key":"2023013108155220400_btz121-B4","doi-asserted-by":"crossref","first-page":"1080","DOI":"10.1101\/gr.187002","article-title":"A phylogenomic approach to bacterial phylogeny: evidence of a core of genes sharing a common history","volume":"12","author":"Daubin","year":"2002","journal-title":"Genome Res"},{"key":"2023013108155220400_btz121-B5","doi-asserted-by":"crossref","first-page":"361","DOI":"10.1038\/nrg1603","article-title":"Phylogenomics and the reconstruction of the tree of life","volume":"6","author":"Delsuc","year":"2005","journal-title":"Nat. Rev. Genet"},{"key":"2023013108155220400_btz121-B6","doi-asserted-by":"crossref","first-page":"1792","DOI":"10.1093\/nar\/gkh340","article-title":"MUSCLE: multiple sequence alignment with high accuracy and high throughput","volume":"32","author":"Edgar","year":"2004","journal-title":"Nucleic Acids Res"},{"key":"2023013108155220400_btz121-B7","doi-asserted-by":"crossref","first-page":"27","DOI":"10.2307\/2412810","article-title":"The number of evolutionary trees","volume":"27","author":"Felsenstein","year":"1978","journal-title":"Syst. Zool"},{"key":"2023013108155220400_btz121-B8","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1086\/284325","article-title":"Phylogenies and the comparative method","volume":"125","author":"Felsenstein","year":"1985","journal-title":"Am. Nat"},{"key":"2023013108155220400_btz121-B10","doi-asserted-by":"crossref","first-page":"355","DOI":"10.1080\/10635150701294733","article-title":"How many genes should a systematist sample? Conflicting insights from a phylogenomic matrix characterized by replicated incongruence","volume":"56","author":"Gatesy","year":"2007","journal-title":"Syst. Biol"},{"key":"2023013108155220400_btz121-B11","doi-asserted-by":"crossref","first-page":"1590","DOI":"10.1093\/molbev\/msw038","article-title":"Clustering genes of common evolutionary history","volume":"33","author":"Gori","year":"2016","journal-title":"Mol. Biol. Evol"},{"key":"2023013108155220400_btz121-B12","doi-asserted-by":"crossref","first-page":"357","DOI":"10.1007\/s11120-010-9553-9","article-title":"Molecular signatures for the main phyla of photosynthetic bacteria and their subgroups","volume":"104","author":"Gupta","year":"2010","journal-title":"Photosynth. Res"},{"key":"2023013108155220400_btz121-B13","doi-asserted-by":"crossref","first-page":"99","DOI":"10.1007\/s10482-012-9790-3","article-title":"Phylogenetic framework and molecular signatures for the class Chloroflexi and its different clades; proposal for division of the class Chloroflexi class. nov. into the suborder Chloroflexineae subord. nov., consisting of the emended family Oscillochloridaceae and the family Chloroflexaceae fam. nov., and the suborder Roseiflexineae subord. nov., containing the family Roseiflexaceae fam. nov","volume":"103","author":"Gupta","year":"2013","journal-title":"Antonie Van Leeuwenhoek"},{"key":"2023013108155220400_btz121-B14","first-page":"239","article-title":"Taxon sampling and the accuracy of phylogenetic analyses","volume":"46","author":"Heath","year":"2008","journal-title":"J. Syst. Evol"},{"key":"2023013108155220400_btz121-B15","doi-asserted-by":"crossref","first-page":"522","DOI":"10.1080\/10635150600697358","article-title":"Resolution of phylogenetic conflict in large data sets by increased taxon sampling","volume":"55","author":"Hedtke","year":"2006","journal-title":"Syst. Biol"},{"key":"2023013108155220400_btz121-B17","doi-asserted-by":"crossref","first-page":"227","DOI":"10.1126\/science.276.5310.227","article-title":"Phylogenetic methods come of age: testing hypotheses in an evolutionary context","volume":"276","author":"Huelsenbeck","year":"1997","journal-title":"Science"},{"key":"2023013108155220400_btz121-B18","doi-asserted-by":"crossref","first-page":"16048.","DOI":"10.1038\/nmicrobiol.2016.48","article-title":"A new view of the tree of life","volume":"1","author":"Hug","year":"2016","journal-title":"Nat. Microbiol"},{"key":"2023013108155220400_btz121-B19","doi-asserted-by":"crossref","first-page":"3801","DOI":"10.1073\/pnas.96.7.3801","article-title":"Horizontal gene transfer among genomes: the complexity hypothesis","volume":"96","author":"Jain","year":"1999","journal-title":"Proc. Natl. Acad. Sci"},{"key":"2023013108155220400_btz121-B20","doi-asserted-by":"crossref","first-page":"4276","DOI":"10.1099\/ijsem.0.000638","article-title":"Evolutionary relationships of completely sequenced Clostridia species and close relatives","volume":"65","author":"Kunisawa","year":"2015","journal-title":"Int. J. Syst. Evol. Microbiol"},{"key":"2023013108155220400_btz121-B21","doi-asserted-by":"crossref","first-page":"e62510.","DOI":"10.1371\/journal.pone.0062510","article-title":"Phylogeny of bacterial and archaeal genomes using conserved genes: supertrees and supermatrices","volume":"8","author":"Lang","year":"2013","journal-title":"PLoS One"},{"key":"2023013108155220400_btz121-B22","doi-asserted-by":"crossref","first-page":"124.","DOI":"10.1186\/1471-2105-12-124","article-title":"Proteinortho: detection of (Co-)orthologs in large-scale analysis","volume":"12","author":"Lechner","year":"2011","journal-title":"BMC Bioinform"},{"key":"2023013108155220400_btz121-B23","doi-asserted-by":"crossref","first-page":"33","DOI":"10.12688\/wellcomeopenres.14265.2","article-title":"Evaluation of phylogenetic reconstruction methods using bacterial whole genomes: a simulation based study","volume":"3","author":"Lees","year":"2018","journal-title":"Wellcome Open Res"},{"key":"2023013108155220400_btz121-B24","doi-asserted-by":"crossref","first-page":"84","DOI":"10.1016\/j.ympev.2013.09.010","article-title":"Increased sampling of both genes and taxa improves resolution of phylogenetic relationships within Magnoliidae, a large and early-diverging clade of angiosperms","volume":"70","author":"Massoni","year":"2014","journal-title":"Mol. Phylogenet. Evol"},{"key":"2023013108155220400_btz121-B25","doi-asserted-by":"crossref","first-page":"D12.","DOI":"10.1093\/nar\/gkw1071","article-title":"Database resources of the National Center for Biotechnology Information","volume":"45","author":"Ncbi","year":"2017","journal-title":"Nucleic Acids Res"},{"key":"2023013108155220400_btz121-B26","doi-asserted-by":"crossref","first-page":"268","DOI":"10.1093\/molbev\/msu300","article-title":"IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies","volume":"32","author":"Nguyen","year":"2015","journal-title":"Mol. Biol. Evol"},{"key":"2023013108155220400_btz121-B27","doi-asserted-by":"crossref","first-page":"1627","DOI":"10.1099\/ijsem.0.002720","article-title":"Proposal for a new classification of a deep branching bacterial phylogenetic lineage: transfer of Coprothermobacter proteolyticus and Coprothermobacter platensis to Coprothermobacteraceae fam. nov., within Coprothermobacterales ord. nov., Coprothermobacteria classis nov. and Coprothermobacterota phyl. nov. and emended description of the family Thermodesulfobiaceae","volume":"68","author":"Pavan","year":"2018","journal-title":"Int. J. Syst. Evol. Microbiol"},{"key":"2023013108155220400_btz121-B28","doi-asserted-by":"crossref","first-page":"e1000602.","DOI":"10.1371\/journal.pbio.1000602","article-title":"Resolving difficult phylogenetic questions: why more sequences are not enough","volume":"9","author":"Philippe","year":"2011","journal-title":"PLoS Biol"},{"key":"2023013108155220400_btz121-B29","doi-asserted-by":"crossref","first-page":"e9490.","DOI":"10.1371\/journal.pone.0009490","article-title":"FastTree 2\u2014approximately maximum-likelihood trees for large alignments","volume":"5","author":"Price","year":"2010","journal-title":"PLoS One"},{"key":"2023013108155220400_btz121-B30","doi-asserted-by":"crossref","first-page":"431","DOI":"10.1038\/nature12352","article-title":"Insights into the phylogeny and coding potential of microbial dark matter","volume":"499","author":"Rinke","year":"2013","journal-title":"Nature"},{"key":"2023013108155220400_btz121-B31","doi-asserted-by":"crossref","first-page":"13","DOI":"10.1016\/0025-5564(81)90043-2","article-title":"Comparison of phylogenetic trees","volume":"53","author":"Robinson","year":"1981","journal-title":"Math. Biosci"},{"key":"2023013108155220400_btz121-B32","doi-asserted-by":"crossref","first-page":"798","DOI":"10.1038\/nature02053","article-title":"Genome-scale approaches to resolving incongruence in molecular phylogenies","volume":"425","author":"Rokas","year":"2003","journal-title":"Nature"},{"key":"2023013108155220400_btz121-B33","doi-asserted-by":"crossref","first-page":"0126","DOI":"10.1038\/s41559-017-0126","article-title":"Contentious relationships in phylogenomic studies can be driven by a handful of genes","volume":"1","author":"Shen","year":"2017","journal-title":"Nat. Ecol. Evol"},{"key":"2023013108155220400_btz121-B34","doi-asserted-by":"crossref","first-page":"277","DOI":"10.1111\/syen.12275","article-title":"Taxon sampling to address an ancient rapid radiation: a supermatrix phylogeny of early brachyceran flies (Diptera)","volume":"43","author":"Shin","year":"2018","journal-title":"Syst. Entomol"},{"key":"2023013108155220400_btz121-B35","doi-asserted-by":"crossref","first-page":"536","DOI":"10.1093\/bib\/bbu015","article-title":"Causes, consequences and solutions of phylogenetic incongruence","volume":"16","author":"Som","year":"2015","journal-title":"Brief. Bioinform"},{"key":"2023013108155220400_btz121-B36","doi-asserted-by":"crossref","first-page":"241","DOI":"10.1016\/j.copbio.2006.05.005","article-title":"The Planctomycetes, Verrucomicrobia, Chlamydiae and sister phyla comprise a superphylum with biotechnological and medical relevance","volume":"17","author":"Wagner","year":"2006","journal-title":"Curr. Opin. Biotechnol"},{"key":"2023013108155220400_btz121-B37","doi-asserted-by":"crossref","first-page":"273","DOI":"10.1111\/1758-2229.12241","article-title":"Phylogenomic evaluation of members above the species level within the phylum Firmicutes based on conserved proteins","volume":"7","author":"Zhang","year":"2015","journal-title":"Environ. Microbiol. Rep"},{"key":"2023013108155220400_btz121-B38","doi-asserted-by":"crossref","first-page":"486","DOI":"10.1093\/molbev\/msx302","article-title":"Evaluating fast maximum likelihood-based phylogenetic programs using empirical phylogenomic data sets","volume":"35","author":"Zhou","year":"2017","journal-title":"MBE"},{"key":"2023013108155220400_btz121-B39","doi-asserted-by":"crossref","first-page":"588","DOI":"10.1080\/10635150290102339","article-title":"Increased taxon sampling greatly reduces phylogenetic error","volume":"51","author":"Zwickl","year":"2002","journal-title":"Syst. Biol"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/35\/19\/3608\/48977296\/bioinformatics_35_19_3608.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/35\/19\/3608\/48977296\/bioinformatics_35_19_3608.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,31]],"date-time":"2023-01-31T14:52:37Z","timestamp":1675176757000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/35\/19\/3608\/5341426"}},"subtitle":[],"editor":[{"given":"Russell","family":"Schwartz","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2019,2,19]]},"references-count":37,"journal-issue":{"issue":"19","published-print":{"date-parts":[[2019,10,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btz121","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2019,10,1]]},"published":{"date-parts":[[2019,2,19]]}}}