{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,11]],"date-time":"2026-04-11T15:05:05Z","timestamp":1775919905077,"version":"3.50.1"},"reference-count":28,"publisher":"Oxford University Press (OUP)","issue":"12","license":[{"start":{"date-parts":[[2017,2,10]],"date-time":"2017-02-10T00:00:00Z","timestamp":1486684800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/about_us\/legal\/notices"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2017,6,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>It is well known that gene trees and species trees may have different topologies. One explanation is incomplete lineage sorting, which is commonly modeled by the coalescent process. In multispecies coalescent, a gene tree topology is observed with some probability (called the gene tree probability) for a given species tree. Gene tree probability is the main tool for the program STELLS, which finds the maximum likelihood estimate of the species tree from the given gene tree topologies. However, STELLS becomes slow when data size increases. Recently, several fast species tree inference methods have been developed, which can handle large data. However, these methods often do not fully utilize the information in the gene trees.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>In this paper, we present an algorithm (called STELLS2) for computing the gene tree probability more efficiently than the original STELLS. The key idea of STELLS2 is taking some \u2018shortcuts\u2019 during the computation and computing the gene tree probability approximately. We apply the STELLS2 algorithm in the species tree inference approach in the original STELLS, which leads to a new maximum likelihood species tree inference method (also called STELLS2). Through simulation we demonstrate that the gene tree probabilities computed by STELLS2 and STELLS have strong correlation. We show that STELLS2 is almost as accurate in species tree inference as STELLS. Also STELLS2 is usually more accurate than several existing methods when there is one allele per species, although STELLS2 is slower than these methods. STELLS2 outperforms these methods significantly when there are multiple alleles per species.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and Implementation<\/jats:title>\n                  <jats:p>The program STELLS2 is available for download at: https:\/\/github.com\/yufengwudcs\/STELLS2<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Supplementary information<\/jats:title>\n                  <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btx079","type":"journal-article","created":{"date-parts":[[2017,2,10]],"date-time":"2017-02-10T10:41:46Z","timestamp":1486723306000},"page":"1789-1797","source":"Crossref","is-referenced-by-count":16,"title":["STELLS2: fast and accurate coalescent-based maximum likelihood inference of species trees from gene tree topologies"],"prefix":"10.1093","volume":"33","author":[{"given":"Jingwen","family":"Pei","sequence":"first","affiliation":[{"name":"Department of Computer Science and Engineering, University of Connecticut, Storrs, CT, USA"}]},{"given":"Yufeng","family":"Wu","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Engineering, University of Connecticut, Storrs, CT, USA"}]}],"member":"286","published-online":{"date-parts":[[2017,2,10]]},"reference":[{"key":"2023020205322960700_btx079-B1","doi-asserted-by":"crossref","first-page":"35","DOI":"10.1016\/j.jtbi.2015.03.006","article-title":"Identifiability of the unrooted species tree topology under the coalescent model with time-reversible substitution processes, site-specific rate variation, and invariable sites","volume":"374","author":"Chifman","year":"2015","journal-title":"J. Theor. Biol"},{"key":"2023020205322960700_btx079-B2","doi-asserted-by":"crossref","first-page":"66","DOI":"10.1093\/sysbio\/syt059","article-title":"Robustness to divergence time underestimation when inferring species trees from estimated gene trees","volume":"63","author":"DeGiorgio","year":"2014","journal-title":"Syst. Biol"},{"key":"2023020205322960700_btx079-B3","doi-asserted-by":"crossref","first-page":"e68.","DOI":"10.1371\/journal.pgen.0020068","article-title":"Discordance of species trees with their most likely gene trees","volume":"2","author":"Degnan","year":"2006","journal-title":"PLOS Genet"},{"key":"2023020205322960700_btx079-B4","first-page":"24","article-title":"Gene tree distributions under the coalescent process","volume":"59","author":"Degnan","year":"2005","journal-title":"Evolution"},{"key":"2023020205322960700_btx079-B5","doi-asserted-by":"crossref","first-page":"918","DOI":"10.1089\/cmb.2015.0015","article-title":"Coalescent histories for lodgepole species trees","volume":"22","author":"Disanto","year":"2015","journal-title":"J. Comput. Biol"},{"key":"2023020205322960700_btx079-B6","doi-asserted-by":"crossref","first-page":"913","DOI":"10.1109\/TCBB.2015.2485217","article-title":"Asymptotic properties of the number of matching coalescent histories for caterpillar-like families of species trees","volume":"13","author":"Disanto","year":"2016","journal-title":"IEEE\/ACM Trans. Comput. Biol. Bioinf"},{"key":"2023020205322960700_btx079-B7","article-title":"Enumeration of ancestral configurations for matching gene trees and species trees","author":"Disanto","year":"2016","journal-title":"J. Comput. Biol"},{"key":"2023020205322960700_btx079-B8","doi-asserted-by":"crossref","first-page":"501","DOI":"10.1111\/evo.12280","article-title":"The influence of sampling design on speies tree inference: a new relationship for the new world chickadees (aves: Poecile)","volume":"68","author":"Harris","year":"2014","journal-title":"Evolution"},{"key":"2023020205322960700_btx079-B9","doi-asserted-by":"crossref","first-page":"160","DOI":"10.1007\/BF02101694","article-title":"Dating of the human-ape splitting by a molecular clock of mitochondrial DNA","volume":"22","author":"Hasegawa","year":"1985","journal-title":"J. Mol. Evol"},{"key":"2023020205322960700_btx079-B10","doi-asserted-by":"crossref","first-page":"570","DOI":"10.1093\/molbev\/msp274","article-title":"Bayesian inference of species trees from multilocus data","volume":"27","author":"Heled","year":"2010","journal-title":"Mol. Biol. Evol"},{"key":"2023020205322960700_btx079-B11","doi-asserted-by":"crossref","first-page":"337","DOI":"10.1093\/bioinformatics\/18.2.337","article-title":"Generating samples under the Wright\u2013Fisher neutral model of genetic variation","volume":"18","author":"Hudson","year":"2002","journal-title":"Bioinformatics"},{"key":"2023020205322960700_btx079-B12","doi-asserted-by":"crossref","first-page":"302.","DOI":"10.1186\/1471-2148-10-302","article-title":"A maximum pseudo-likelihood approach for estimating species trees under the coalescent model","volume":"10","author":"Liu","year":"2010","journal-title":"BMC Evol. Biol"},{"key":"2023020205322960700_btx079-B13","doi-asserted-by":"crossref","first-page":"468","DOI":"10.1093\/sysbio\/syp031","article-title":"Estimating species phylogenies using coalescence times among sequences","volume":"58","author":"Liu","year":"2009","journal-title":"Syst. Biol"},{"key":"2023020205322960700_btx079-B14","doi-asserted-by":"crossref","first-page":"523","DOI":"10.1093\/sysbio\/46.3.523","article-title":"Gene trees in species trees","volume":"46","author":"Maddison","year":"1997","journal-title":"Syst. Biol"},{"key":"2023020205322960700_btx079-B15","doi-asserted-by":"crossref","first-page":"i541","DOI":"10.1093\/bioinformatics\/btu462","article-title":"Astral: genome-scale coalescent-based species tree estimation","volume":"30","author":"Mirarab","year":"2014","journal-title":"Bioinformatics"},{"key":"2023020205322960700_btx079-B16","first-page":"235","article-title":"Seq-gen: an application for the monte carlo simulation of DNA sequence evolution along phylogenetic trees","volume":"13","author":"Rambaut","year":"1997","journal-title":"Comput. Appl. Biosci"},{"key":"2023020205322960700_btx079-B17","doi-asserted-by":"crossref","first-page":"225","DOI":"10.1006\/tpbi.2001.1568","article-title":"The probability of topological concordance of gene trees and species trees","volume":"61","author":"Rosenberg","year":"2002","journal-title":"Theor. Popul. Biol"},{"key":"2023020205322960700_btx079-B18","doi-asserted-by":"crossref","first-page":"1253","DOI":"10.1109\/TCBB.2013.123","article-title":"Coalescent histories for caterpillar-like families","volume":"10","author":"Rosenberg","year":"2013","journal-title":"IEEE\/ACM Trans. Comput. Biol. Bioinf"},{"key":"2023020205322960700_btx079-B19","doi-asserted-by":"crossref","first-page":"14942","DOI":"10.1073\/pnas.1211733109","article-title":"Resolving conflict in eutherian mammal phylogeny using phylogenomics and the multispecies coalescent model","volume":"109","author":"Song","year":"2012","journal-title":"Proc. Nat. Acad. Sci"},{"key":"2023020205322960700_btx079-B20","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.ympev.2015.07.018","article-title":"The gene tree delusion","volume":"94","author":"Springer","year":"2016","journal-title":"Mol. Phylogenet. Evol"},{"key":"2023020205322960700_btx079-B21","doi-asserted-by":"crossref","first-page":"1312","DOI":"10.1093\/bioinformatics\/btu033","article-title":"Raxml version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies","volume":"30","author":"Stamatakis","year":"2014","journal-title":"Bioinformatics"},{"key":"2023020205322960700_btx079-B22","doi-asserted-by":"crossref","first-page":"325","DOI":"10.1093\/genetics\/110.2.325","article-title":"Gene genealogy and variance of interpopulational nucleotide differences","volume":"110","author":"Takahata","year":"1985","journal-title":"Genetics"},{"key":"2023020205322960700_btx079-B23","doi-asserted-by":"crossref","first-page":"119","DOI":"10.1016\/0040-5809(84)90027-3","article-title":"Line-of-descent and genealogical processes, and their applications in population genetics models","volume":"26","author":"Tavar\u00e8","year":"1984","journal-title":"Theor. Popul. Biol"},{"key":"2023020205322960700_btx079-B24","volume-title":"Coalescent Theory: An Introduction","author":"Wakeley","year":"2008"},{"key":"2023020205322960700_btx079-B25","doi-asserted-by":"crossref","first-page":"77","DOI":"10.1016\/0040-5809(84)90025-X","article-title":"Lines of descent and the coalescent","volume":"26","author":"Watterson","year":"1984","journal-title":"Theor. Popul. Biol"},{"key":"2023020205322960700_btx079-B26","doi-asserted-by":"crossref","first-page":"763","DOI":"10.1111\/j.1558-5646.2011.01476.x","article-title":"Coalescent-based species tree inference from gene tree topologies under incomplete lineage sorting by maximum likelihood","volume":"66","author":"Wu","year":"2012","journal-title":"Evolution"},{"key":"2023020205322960700_btx079-B27","doi-asserted-by":"crossref","first-page":"691","DOI":"10.1093\/bioinformatics\/btu710","article-title":"A coalescent-based method for population tree inference with haplotypes","volume":"31","author":"Wu","year":"2015","journal-title":"Bioinformatics"},{"key":"2023020205322960700_btx079-B28","doi-asserted-by":"crossref","first-page":"i225","DOI":"10.1093\/bioinformatics\/btw261","article-title":"An algorithm for computing the gene tree probability under the multispecies coalescent and its application in the inference of population tree","volume":"32","author":"Wu","year":"2016","journal-title":"Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/33\/12\/1789\/49039972\/bioinformatics_33_12_1789.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/33\/12\/1789\/49039972\/bioinformatics_33_12_1789.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,2,2]],"date-time":"2023-02-02T05:36:13Z","timestamp":1675316173000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/33\/12\/1789\/2982054"}},"subtitle":[],"editor":[{"given":"Alfonso","family":"Valencia","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2017,2,10]]},"references-count":28,"journal-issue":{"issue":"12","published-print":{"date-parts":[[2017,6,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btx079","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2017,6,15]]},"published":{"date-parts":[[2017,2,10]]}}}