{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,16]],"date-time":"2026-04-16T06:59:38Z","timestamp":1776322778037,"version":"3.50.1"},"reference-count":35,"publisher":"Oxford University Press (OUP)","issue":"17","license":[{"start":{"date-parts":[[2016,10,2]],"date-time":"2016-10-02T00:00:00Z","timestamp":1475366400000},"content-version":"vor","delay-in-days":772,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/3.0"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2014,9,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Species trees provide insight into basic biology, including the mechanisms of evolution and how it modifies biomolecular function and structure, biodiversity and co-evolution between genes and species. Yet, gene trees often differ from species trees, creating challenges to species tree estimation. One of the most frequent causes for conflicting topologies between gene trees and species trees is incomplete lineage sorting (ILS), which is modelled by the multi-species coalescent. While many methods have been developed to estimate species trees from multiple genes, some which have statistical guarantees under the multi-species coalescent model, existing methods are too computationally intensive for use with genome-scale analyses or have been shown to have poor accuracy under some realistic conditions.<\/jats:p>\n               <jats:p>Results: We present ASTRAL, a fast method for estimating species trees from multiple genes. ASTRAL is statistically consistent, can run on datasets with thousands of genes and has outstanding accuracy\u2014improving on MP-EST and the population tree from BUCKy, two statistically consistent leading coalescent-based methods. ASTRAL is often more accurate than concatenation using maximum likelihood, except when ILS levels are low or there are too few gene trees.<\/jats:p>\n               <jats:p>Availability and implementation: ASTRAL is available in open source form at https:\/\/github.com\/smirarab\/ASTRAL\/. Datasets studied in this article are available at http:\/\/www.cs.utexas.edu\/users\/phylo\/datasets\/astral.<\/jats:p>\n               <jats:p>Contact: \u00a0warnow@illinois.edu<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btu462","type":"journal-article","created":{"date-parts":[[2014,8,26]],"date-time":"2014-08-26T11:23:57Z","timestamp":1409052237000},"page":"i541-i548","source":"Crossref","is-referenced-by-count":1256,"title":["ASTRAL: genome-scale coalescent-based species tree estimation"],"prefix":"10.1093","volume":"30","author":[{"given":"S.","family":"Mirarab","sequence":"first","affiliation":[{"name":"1 Department of Computer Science, The University of Texas at Austin, Austin, TX 78712, USA, 2Departement d'informatique, Ecole Normale Superieure, 45 Rue d'Ulm, F-75230 Paris Cedex 05, France and 3Department of Electrical Engineering, The University of Southern California, Los Angeles, CA 90089, USA"}]},{"given":"R.","family":"Reaz","sequence":"additional","affiliation":[{"name":"1 Department of Computer Science, The University of Texas at Austin, Austin, TX 78712, USA, 2Departement d'informatique, Ecole Normale Superieure, 45 Rue d'Ulm, F-75230 Paris Cedex 05, France and 3Department of Electrical Engineering, The University of Southern California, Los Angeles, CA 90089, USA"}]},{"given":"Md. S.","family":"Bayzid","sequence":"additional","affiliation":[{"name":"1 Department of Computer Science, The University of Texas at Austin, Austin, TX 78712, USA, 2Departement d'informatique, Ecole Normale Superieure, 45 Rue d'Ulm, F-75230 Paris Cedex 05, France and 3Department of Electrical Engineering, The University of Southern California, Los Angeles, CA 90089, USA"}]},{"given":"T.","family":"Zimmermann","sequence":"additional","affiliation":[{"name":"1 Department of Computer Science, The University of Texas at Austin, Austin, TX 78712, USA, 2Departement d'informatique, Ecole Normale Superieure, 45 Rue d'Ulm, F-75230 Paris Cedex 05, France and 3Department of Electrical Engineering, The University of Southern California, Los Angeles, CA 90089, USA"},{"name":"1 Department of Computer Science, The University of Texas at Austin, Austin, TX 78712, USA, 2Departement d'informatique, Ecole Normale Superieure, 45 Rue d'Ulm, F-75230 Paris Cedex 05, France and 3Department of Electrical Engineering, The University of Southern California, Los Angeles, CA 90089, USA"}]},{"given":"M. S.","family":"Swenson","sequence":"additional","affiliation":[{"name":"1 Department of Computer Science, The University of Texas at Austin, Austin, TX 78712, USA, 2Departement d'informatique, Ecole Normale Superieure, 45 Rue d'Ulm, F-75230 Paris Cedex 05, France and 3Department of Electrical Engineering, The University of Southern California, Los Angeles, CA 90089, USA"}]},{"given":"T.","family":"Warnow","sequence":"additional","affiliation":[{"name":"1 Department of Computer Science, The University of Texas at Austin, Austin, TX 78712, USA, 2Departement d'informatique, Ecole Normale Superieure, 45 Rue d'Ulm, F-75230 Paris Cedex 05, France and 3Department of Electrical Engineering, The University of Southern California, Los Angeles, CA 90089, USA"}]}],"member":"286","published-online":{"date-parts":[[2014,8,22]]},"reference":[{"key":"2023012711550339300_btu462-B1","doi-asserted-by":"crossref","first-page":"833","DOI":"10.1007\/s00285-010-0355-7","article-title":"Identifying the rooted species tree from the distribution of unrooted gene trees under the coalescent","volume":"62","author":"Allman","year":"2011","journal-title":"J. Math. Biol."},{"key":"2023012711550339300_btu462-B2","doi-asserted-by":"crossref","first-page":"2277","DOI":"10.1093\/bioinformatics\/btt394","article-title":"Naive binning improves phylogenomic analyses","volume":"29","author":"Bayzid","year":"2013","journal-title":"Bioinformatics"},{"key":"2023012711550339300_btu462-B3","doi-asserted-by":"crossref","first-page":"65","DOI":"10.1186\/1741-7007-10-65","article-title":"Phylogenomic analyses support the position of turtles as the sister group of birds and crocodiles (archosauria)","volume":"10","author":"Chiari","year":"2012","journal-title":"BMC Biol."},{"key":"2023012711550339300_btu462-B4","doi-asserted-by":"crossref","first-page":"552","DOI":"10.1093\/molbev\/msp250","article-title":"Fast and consistent estimation of species trees using supermatrix rooted triples","volume":"27","author":"DeGiorgio","year":"2010","journal-title":"Mol. Biol. Evol."},{"key":"2023012711550339300_btu462-B5","doi-asserted-by":"crossref","first-page":"66","DOI":"10.1093\/sysbio\/syt059","article-title":"Robustness to divergence time underestimation when inferring species trees from estimated gene trees","volume":"63","author":"DeGiorgio","year":"2014","journal-title":"Syst. Biol."},{"key":"2023012711550339300_btu462-B6","doi-asserted-by":"crossref","first-page":"574","DOI":"10.1093\/sysbio\/syt023","article-title":"Anomalous unrooted gene trees","volume":"62","author":"Degnan","year":"2013","journal-title":"Syst. Biol."},{"key":"2023012711550339300_btu462-B7","doi-asserted-by":"crossref","first-page":"e68","DOI":"10.1371\/journal.pgen.0020068","article-title":"Discordance of species trees with their most likely gene trees","volume":"2","author":"Degnan","year":"2006","journal-title":"PLoS Genet."},{"key":"2023012711550339300_btu462-B8","doi-asserted-by":"crossref","first-page":"332","DOI":"10.1016\/j.tree.2009.01.009","article-title":"Gene tree discordance, phylogenetic inference and the multispecies coalescent","volume":"26","author":"Degnan","year":"2009","journal-title":"Trends Ecol. Evol."},{"key":"2023012711550339300_btu462-B9","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1111\/j.1558-5646.2008.00549.x","article-title":"Is a new and general theory of molecular systematics emerging?","volume":"63","author":"Edwards","year":"2009","journal-title":"Evolution"},{"key":"2023012711550339300_btu462-B10","first-page":"138","article-title":"New algorithms for the duplication-loss model","volume-title":"Proceedings of the 4th Conference of Computational Molecular Biology (RECOMB\u201900)","author":"Hallett","year":"2000"},{"key":"2023012711550339300_btu462-B11","doi-asserted-by":"crossref","first-page":"570","DOI":"10.1093\/molbev\/msp274","article-title":"Bayesian inference of species trees from multilocus data","volume":"27","author":"Heled","year":"2010","journal-title":"Mol. Biol. Evol."},{"key":"2023012711550339300_btu462-B12","doi-asserted-by":"crossref","first-page":"543","DOI":"10.1080\/10635150701477825","article-title":"Calibration choice, rate smoothing, and the pattern of tetrapod diversification according to the long nuclear gene rag-1","volume":"56","author":"Hugall","year":"2007","journal-title":"Syst. Biol."},{"key":"2023012711550339300_btu462-B13","doi-asserted-by":"crossref","first-page":"1924","DOI":"10.1137\/S0097539799361683","article-title":"A polynomial-time approximation scheme for inferring evolutionary trees from quartet topologies and its applications","volume":"30","author":"Jiang","year":"2001","journal-title":"SIAM J. Comput."},{"key":"2023012711550339300_btu462-B14","doi-asserted-by":"crossref","first-page":"1021","DOI":"10.1016\/j.ympev.2013.05.029","article-title":"Identifying localized biases in large datasets: a case study using the avian tree of life","volume":"69","author":"Kimball","year":"2013","journal-title":"Mol. Phylogenet. Evol."},{"key":"2023012711550339300_btu462-B15","doi-asserted-by":"crossref","first-page":"235","DOI":"10.1016\/0304-4149(82)90011-4","article-title":"The coalescent","volume":"13","author":"Kingman","year":"1982","journal-title":"Stoch. Process. Appl."},{"key":"2023012711550339300_btu462-B16","doi-asserted-by":"crossref","first-page":"501","DOI":"10.1016\/j.ympev.2012.07.004","article-title":"Full modeling versus summarizing gene-tree uncertainty: method choice and species-tree accuracy","volume":"65","author":"Knowles","year":"2012","journal-title":"Mol. Phylogenet. Evol."},{"key":"2023012711550339300_btu462-B17","doi-asserted-by":"crossref","first-page":"17","DOI":"10.1080\/10635150601146041","article-title":"Inconsistency of phylogenetic estimates from concatenated data under coalescence","volume":"56","author":"Kubatko","year":"2007","journal-title":"Syst. Biol."},{"key":"2023012711550339300_btu462-B18","doi-asserted-by":"crossref","first-page":"2910","DOI":"10.1093\/bioinformatics\/btq539","article-title":"BUCKy: gene tree\/species tree reconciliation with the Bayesian concordance analysis","volume":"26","author":"Larget","year":"2010","journal-title":"Bioinfomatics"},{"key":"2023012711550339300_btu462-B19","doi-asserted-by":"crossref","first-page":"2542","DOI":"10.1093\/bioinformatics\/btn484","article-title":"BEST: Bayesian estimation of species trees under the coalescent model","volume":"24","author":"Liu","year":"2008","journal-title":"Bioinformatics"},{"key":"2023012711550339300_btu462-B20","doi-asserted-by":"crossref","first-page":"302","DOI":"10.1186\/1471-2148-10-302","article-title":"A maximum pseudo-likelihood approach for estimating species trees under the coalescent model","volume":"10","author":"Liu","year":"2010","journal-title":"BMC Evol. Biol."},{"key":"2023012711550339300_btu462-B21","doi-asserted-by":"crossref","first-page":"523","DOI":"10.1093\/sysbio\/46.3.523","article-title":"Gene trees in species trees","volume":"46","author":"Maddison","year":"1997","journal-title":"Syst. Biol."},{"key":"2023012711550339300_btu462-B22","doi-asserted-by":"crossref","first-page":"e54848","DOI":"10.1371\/journal.pone.0054848","article-title":"A phylogeny of birds based on over 1,500 loci collected by target enrichment and high-throughput sequencing","volume":"8","author":"McCormack","year":"2013","journal-title":"PLoS One"},{"key":"2023012711550339300_btu462-B23","doi-asserted-by":"crossref","first-page":"131","DOI":"10.1016\/0025-5564(81)90043-2","article-title":"Comparison of phylogenetic trees","volume":"53","author":"Robinson","year":"1981","journal-title":"Math. Biosci."},{"key":"2023012711550339300_btu462-B24","doi-asserted-by":"crossref","first-page":"247","DOI":"10.1093\/sysbio\/45.2.247","article-title":"Matrix representation of trees, redundancy, and weighting","volume":"45","author":"Ronquist","year":"1996","journal-title":"Syst. Biol."},{"key":"2023012711550339300_btu462-B25","doi-asserted-by":"crossref","first-page":"960","DOI":"10.1093\/molbev\/msn043","article-title":"Calculating bootstrap probabilities of phylogeny using multilocus sequence data","volume":"25","author":"Seo","year":"2008","journal-title":"Mol. Biol. Evol."},{"key":"2023012711550339300_btu462-B26","doi-asserted-by":"crossref","first-page":"83","DOI":"10.1093\/sysbio\/syt061","article-title":"Target capture and massively parallel sequencing of ultraconserved elements for comparative studies at shallow evolutionary time scales","volume":"63","author":"Smith","year":"2014","journal-title":"Syst. Biol."},{"key":"2023012711550339300_btu462-B27","doi-asserted-by":"crossref","first-page":"14942","DOI":"10.1073\/pnas.1211733109","article-title":"Resolving conflict in eutherian mammal phylogeny using phylogenomics and the multispecies coalescent model","volume":"109","author":"Song","year":"2012","journal-title":"Proc. Natl Acad.Sci. USA"},{"key":"2023012711550339300_btu462-B28","doi-asserted-by":"crossref","first-page":"267","DOI":"10.1016\/j.tplants.2014.02.012","article-title":"Land plant origins and coalescence confusion","volume":"19","author":"Springer","year":"2014","journal-title":"Trends Plant Sci."},{"key":"2023012711550339300_btu462-B29","doi-asserted-by":"crossref","first-page":"2688","DOI":"10.1093\/bioinformatics\/btl446","article-title":"RAxML-NI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models","volume":"22","author":"Stamatakis","year":"2006","journal-title":"Bioinformatics"},{"key":"2023012711550339300_btu462-B30","doi-asserted-by":"crossref","first-page":"157","DOI":"10.1093\/bioinformatics\/14.2.157","article-title":"Rose: generating sequence families","volume":"14","author":"Stoye","year":"1998","journal-title":"Bioinformatics"},{"key":"2023012711550339300_btu462-B31","doi-asserted-by":"crossref","first-page":"1569","DOI":"10.1093\/bioinformatics\/btq228","article-title":"Dendropy: a Python library for phylogenetic computing","volume":"26","author":"Sukumaran","year":"2010","journal-title":"Bioinformatics"},{"key":"2023012711550339300_btu462-B32","doi-asserted-by":"crossref","first-page":"S4","DOI":"10.1186\/1471-2105-12-S9-S4","article-title":"Fast and accurate methods for phylogenomic analyses","volume":"12","author":"Yang","year":"2011","journal-title":"BMC Bioinformatics"},{"key":"2023012711550339300_btu462-B33","first-page":"531","article-title":"Algorithms for MDC-based multi-locus phylogeny inference","volume-title":"Proceedings of the 15th Conference of Computational Molecular Biology (RECOMB\u201911)","author":"Yu","year":"2011"},{"key":"2023012711550339300_btu462-B34","doi-asserted-by":"crossref","first-page":"e64642","DOI":"10.1371\/journal.pone.0064642","article-title":"Phylogenomic analyses of nuclear genes reveal the evolutionary relationships within the bep clade and the evidence of positive selection in poaceae","volume":"8","author":"Zhao","year":"2013","journal-title":"PLoS One"},{"key":"2023012711550339300_btu462-B35","doi-asserted-by":"crossref","first-page":"492","DOI":"10.1016\/j.tplants.2013.04.009","article-title":"Origin of land plants using the multispecies coalescent model","volume":"18","author":"Zhong","year":"2013","journal-title":"Trends Plant Sci."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/30\/17\/i541\/48927941\/bioinformatics_30_17_i541.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/30\/17\/i541\/48927941\/bioinformatics_30_17_i541.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,27]],"date-time":"2023-01-27T12:34:16Z","timestamp":1674822856000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/30\/17\/i541\/200803"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2014,8,22]]},"references-count":35,"journal-issue":{"issue":"17","published-print":{"date-parts":[[2014,9,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btu462","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2014,9,1]]},"published":{"date-parts":[[2014,8,22]]}}}