{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,9]],"date-time":"2026-03-09T10:28:57Z","timestamp":1773052137061,"version":"3.50.1"},"reference-count":40,"publisher":"Oxford University Press (OUP)","issue":"Supplement_1","license":[{"start":{"date-parts":[[2023,6,30]],"date-time":"2023-06-30T00:00:00Z","timestamp":1688083200000},"content-version":"vor","delay-in-days":29,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000001","name":"NSF","doi-asserted-by":"publisher","award":["1845967"],"award-info":[{"award-number":["1845967"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000001","name":"NSF","doi-asserted-by":"publisher","award":["1636933"],"award-info":[{"award-number":["1636933"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000001","name":"NSF","doi-asserted-by":"publisher","award":["1920920"],"award-info":[{"award-number":["1920920"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]},{"name":"National Institute of Health","award":["1R35GM142725"],"award-info":[{"award-number":["1R35GM142725"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2023,6,30]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>Branch lengths and topology of a species tree are essential in most downstream analyses, including estimation of diversification dates, characterization of selection, understanding adaptation, and comparative genomics. Modern phylogenomic analyses often use methods that account for the heterogeneity of evolutionary histories across the genome due to processes such as incomplete lineage sorting. However, these methods typically do not generate branch lengths in units that are usable by downstream applications, forcing phylogenomic analyses to resort to alternative shortcuts such as estimating branch lengths by concatenating gene alignments into a supermatrix. Yet, concatenation and other available approaches for estimating branch lengths fail to address heterogeneity across the genome.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>In this article, we derive expected values of gene tree branch lengths in substitution units under an extension of the multispecies coalescent (MSC) model that allows substitutions with varying rates across the species tree. We present CASTLES, a new technique for estimating branch lengths on the species tree from estimated gene trees that uses these expected values, and our study shows that CASTLES improves on the most accurate prior methods with respect to both speed and accuracy.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>CASTLES is available at https:\/\/github.com\/ytabatabaee\/CASTLES.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btad221","type":"journal-article","created":{"date-parts":[[2023,6,30]],"date-time":"2023-06-30T08:17:14Z","timestamp":1688113034000},"page":"i185-i193","source":"Crossref","is-referenced-by-count":33,"title":["Phylogenomic branch length estimation using quartets"],"prefix":"10.1093","volume":"39","author":[{"given":"Yasamin","family":"Tabatabaee","sequence":"first","affiliation":[{"name":"Department of Computer Science, University of Illinois at Urbana-Champaign , Urbana, IL 61801, United States"}]},{"given":"Chao","family":"Zhang","sequence":"additional","affiliation":[{"name":"Department of Integrative Biology, University of California at Berkeley , Berkeley, CA 94720, United States"}]},{"given":"Tandy","family":"Warnow","sequence":"additional","affiliation":[{"name":"Department of Computer Science, University of Illinois at Urbana-Champaign , Urbana, IL 61801, United States"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5410-1518","authenticated-orcid":false,"given":"Siavash","family":"Mirarab","sequence":"additional","affiliation":[{"name":"Department of Electrical and Computer Engineering, University of California, San Diego , La Jolla, CA 92093, United States"}]}],"member":"286","published-online":{"date-parts":[[2023,6,30]]},"reference":[{"key":"2023063008151866000_btad221-B1","doi-asserted-by":"crossref","first-page":"23","DOI":"10.1186\/s12859-015-0821-8","article-title":"Fast and accurate branch lengths estimation for phylogenomic trees","volume":"17","author":"Binet","year":"2016","journal-title":"BMC Bioinformatics"},{"key":"2023063008151866000_btad221-B2","doi-asserted-by":"crossref","first-page":"216","DOI":"10.1038\/nrg1020","article-title":"The modern molecular clock","volume":"4","author":"Bromham","year":"2003","journal-title":"Nat Rev Genet"},{"key":"2023063008151866000_btad221-B3","doi-asserted-by":"crossref","first-page":"248","DOI":"10.1046\/j.1523-1739.2002.00503.x","article-title":"Quantifying biodiversity: a phylogenetic perspective","volume":"16","author":"Faith","year":"2002","journal-title":"Conserv Biol"},{"key":"2023063008151866000_btad221-B4","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1086\/284325","article-title":"Phylogenies and the comparative method","volume":"125","author":"Felsenstein","year":"1985","journal-title":"Am Nat"},{"key":"2023063008151866000_btad221-B5","doi-asserted-by":"crossref","first-page":"1153","DOI":"10.1101\/gr.3567505","article-title":"Estimating the tempo and mode of gene family evolution from comparative genomic data","volume":"15","author":"Hahn","year":"2005","journal-title":"Genome Res"},{"key":"2023063008151866000_btad221-B6","doi-asserted-by":"crossref","first-page":"1208","DOI":"10.1093\/molbev\/msi105","article-title":"Not so different after all: a comparison of methods for detecting amino acid sites under selection","volume":"22","author":"Kosakovsky Pond","year":"2005","journal-title":"Mol Biol Evol"},{"key":"2023063008151866000_btad221-B7","doi-asserted-by":"crossref","DOI":"10.1515\/9780691245157","volume-title":"Species Tree Inference: A Guide to Methods and Applications","author":"Kubatko","year":"2023"},{"key":"2023063008151866000_btad221-B8","doi-asserted-by":"crossref","first-page":"654","DOI":"10.1038\/nrg1659","article-title":"Molecular clocks: four decades of evolution","volume":"6","author":"Kumar","year":"2005","journal-title":"Nat Rev Genet"},{"key":"2023063008151866000_btad221-B9","doi-asserted-by":"crossref","first-page":"495","DOI":"10.1016\/j.tree.2010.06.007","article-title":"Watching the clock: studying variation in rates of molecular evolution between species","volume":"25","author":"Lanfear","year":"2010","journal-title":"Trends Ecol Evol"},{"key":"2023063008151866000_btad221-B10","doi-asserted-by":"crossref","first-page":"2798","DOI":"10.1093\/molbev\/msv150","article-title":"FastME 2.0: a comprehensive, accurate, and fast distance-based phylogeny inference program","volume":"32","author":"Lefort","year":"2015","journal-title":"Mol Biol Evol"},{"key":"2023063008151866000_btad221-B11","doi-asserted-by":"crossref","first-page":"365","DOI":"10.1016\/j.ympev.2012.01.013","article-title":"Phylogenetics of chondrichthyes and the problem of rooting phylogenies with distant outgroups","volume":"63","author":"Li","year":"2012","journal-title":"Mol Phylogenet Evol"},{"key":"2023063008151866000_btad221-B12","doi-asserted-by":"crossref","first-page":"302","DOI":"10.1186\/1471-2148-10-302","article-title":"A maximum pseudo-likelihood approach for estimating species trees under the coalescent model","volume":"10","author":"Liu","year":"2010","journal-title":"BMC Evol Biol"},{"key":"2023063008151866000_btad221-B13","doi-asserted-by":"crossref","first-page":"8228","DOI":"10.1128\/AEM.71.12.8228-8235.2005","article-title":"UniFrac : a new phylogenetic method for comparing microbial communities","volume":"71","author":"Lozupone","year":"2005","journal-title":"Appl Environ Microbiol"},{"key":"2023063008151866000_btad221-B14","doi-asserted-by":"crossref","first-page":"1532","DOI":"10.1093\/bioinformatics\/btab875","article-title":"Completing gene trees without species trees in sub-quadratic time","volume":"38","author":"Mai","year":"2022","journal-title":"Bioinformatics"},{"key":"2023063008151866000_btad221-B15","doi-asserted-by":"crossref","first-page":"e0182238","DOI":"10.1371\/journal.pone.0182238","article-title":"Minimum variance rooting of phylogenetic trees and implications for species tree reconstruction","volume":"12","author":"Mai","year":"2017","journal-title":"PLoS One"},{"key":"2023063008151866000_btad221-B16","doi-asserted-by":"crossref","first-page":"334","DOI":"10.1093\/sysbio\/syv082","article-title":"SimPhy : phylogenomic simulation of gene, locus, and species trees","volume":"65","author":"Mallo","year":"2016","journal-title":"Syst Biol"},{"key":"2023063008151866000_btad221-B17","doi-asserted-by":"crossref","first-page":"i541","DOI":"10.1093\/bioinformatics\/btu462","article-title":"ASTRAL: genome-scale coalescent-based species tree estimation","volume":"30","author":"Mirarab","year":"2014","journal-title":"Bioinformatics"},{"key":"2023063008151866000_btad221-B18","doi-asserted-by":"crossref","first-page":"e66695","DOI":"10.7554\/eLife.66695","article-title":"An estimate of the deepest branches of the tree of life from ancient vertically evolving genes","volume":"11","author":"Moody","year":"2022","journal-title":"eLife"},{"key":"2023063008151866000_btad221-B19","doi-asserted-by":"crossref","first-page":"475","DOI":"10.1093\/sysbio\/syx088","article-title":"A two-state model of tree evolution and its applications to Alu retrotransposition","volume":"67","author":"Moshiri","year":"2018","journal-title":"Syst Biol"},{"key":"2023063008151866000_btad221-B20","doi-asserted-by":"crossref","first-page":"166","DOI":"10.1109\/TCBB.2008.66","article-title":"Incomplete lineage sorting: consistent phylogeny estimation from multiple loci","volume":"7","author":"Mossel","year":"2010","journal-title":"IEEE\/ACM Trans Comput Biol Bioinform"},{"key":"2023063008151866000_btad221-B21","doi-asserted-by":"crossref","first-page":"2101","DOI":"10.1093\/molbev\/msx126","article-title":"StarBEAST2 brings faster species tree inference and accurate estimates of substitution rates","volume":"34","author":"Ogilvie","year":"2017","journal-title":"Mol Biol Evol"},{"key":"2023063008151866000_btad221-B22","doi-asserted-by":"crossref","first-page":"267","DOI":"10.1146\/annurev-ecolsys-110411-160331","article-title":"Evolutionary inferences from phylogenies: a review of methods","volume":"43","author":"O\u2019Meara","year":"2012","journal-title":"Ann Rev Ecol Evol Syst"},{"key":"2023063008151866000_btad221-B23","first-page":"568","article-title":"Relationships between gene trees and species trees","volume":"5","author":"Pamilo","year":"1988","journal-title":"Mol Biol Evol"},{"key":"2023063008151866000_btad221-B24","doi-asserted-by":"crossref","first-page":"e9490","DOI":"10.1371\/journal.pone.0009490","article-title":"FastTree-2\u2014approximately maximum-likelihood trees for large alignments","volume":"5","author":"Price","year":"2010","journal-title":"PLoS One"},{"key":"2023063008151866000_btad221-B25","doi-asserted-by":"crossref","first-page":"846","DOI":"10.1093\/czoolo\/61.5.846","article-title":"The art and science of species delimitation","volume":"61","author":"Rannala","year":"2015","journal-title":"Curr Zool"},{"key":"2023063008151866000_btad221-B26","doi-asserted-by":"crossref","first-page":"131","DOI":"10.1016\/0025-5564(81)90043-2","article-title":"Comparison of phylogenetic trees","volume":"53","author":"Robinson","year":"1981","journal-title":"Math Biosci"},{"key":"2023063008151866000_btad221-B27","doi-asserted-by":"crossref","first-page":"56","DOI":"10.1016\/j.tpb.2014.12.005","article-title":"Likelihood-based tree reconstruction on a concatenation of aligned sequence data sets can be statistically inconsistent","volume":"100","author":"Roch","year":"2015","journal-title":"Theor Popul Biol"},{"key":"2023063008151866000_btad221-B28","doi-asserted-by":"crossref","first-page":"798","DOI":"10.1038\/nature02053","article-title":"Genome-scale approaches to resolving incongruence in molecular phylogenies","volume":"425","author":"Rokas","year":"2003","journal-title":"Nature"},{"key":"2023063008151866000_btad221-B29","doi-asserted-by":"crossref","first-page":"1654","DOI":"10.1093\/molbev\/msw079","article-title":"Fast coalescent-based computation of local branch support from quartet frequencies","volume":"33","author":"Sayyari","year":"2016","journal-title":"Mol Biol Evol"},{"key":"2023063008151866000_btad221-B30","doi-asserted-by":"crossref","first-page":"14942","DOI":"10.1073\/pnas.1211733109","article-title":"Resolving conflict in eutherian mammal phylogeny using phylogenomics and the multispecies coalescent model","volume":"109","author":"Song","year":"2012","journal-title":"Proc Natl Acad Sci USA"},{"key":"2023063008151866000_btad221-B31","doi-asserted-by":"crossref","first-page":"1312","DOI":"10.1093\/bioinformatics\/btu033","article-title":"RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies","volume":"30","author":"Stamatakis","year":"2014","journal-title":"Bioinformatics"},{"key":"2023063008151866000_btad221-B32","doi-asserted-by":"crossref","first-page":"1569","DOI":"10.1093\/bioinformatics\/btq228","article-title":"DendroPy: a Python library for phylogenetic computing","volume":"26","author":"Sukumaran","year":"2010","journal-title":"Bioinformatics"},{"key":"2023063008151866000_btad221-B33","author":"Tabatabaee"},{"key":"2023063008151866000_btad221-B34","doi-asserted-by":"crossref","first-page":"i109","DOI":"10.1093\/bioinformatics\/btac224","article-title":"Quintet rooting: rooting species trees under the multi-species coalescent model","volume":"38","author":"Tabatabaee","year":"2022","journal-title":"Bioinformatics"},{"key":"2023063008151866000_btad221-B35","doi-asserted-by":"crossref","first-page":"e1002947","DOI":"10.1371\/journal.pcbi.1002947","article-title":"Viral phylodynamics","volume":"9","author":"Volz","year":"2013","journal-title":"PLoS Comput Biol"},{"key":"2023063008151866000_btad221-B36","doi-asserted-by":"crossref","first-page":"610","DOI":"10.1093\/sysbio\/syab070","article-title":"DISCO: species tree inference using multicopy gene family tree decomposition","volume":"71","author":"Willson","year":"2022","journal-title":"Syst Biol"},{"key":"2023063008151866000_btad221-B37","doi-asserted-by":"crossref","first-page":"msac215","DOI":"10.1093\/molbev\/msac215","article-title":"Weighting by gene tree uncertainty improves accuracy of quartet-based species trees","volume":"39","author":"Zhang","year":"2022","journal-title":"Mol Biol Evol"},{"key":"2023063008151866000_btad221-B38","doi-asserted-by":"crossref","first-page":"153","DOI":"10.1186\/s12859-018-2129-y","article-title":"ASTRAL-III: polynomial time species tree reconstruction from partially resolved gene trees","volume":"19","author":"Zhang","year":"2018","journal-title":"BMC Bioinformatics"},{"key":"2023063008151866000_btad221-B39","doi-asserted-by":"crossref","first-page":"5477","DOI":"10.1038\/s41467-019-13443-4","article-title":"Phylogenomics of 10,575 genomes reveals evolutionary proximity between domains bacteria and archaea","volume":"10","author":"Zhu","year":"2019","journal-title":"Nat Commun"},{"key":"2023063008151866000_btad221-B40","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/1471-2164-15-S6-S11","article-title":"BBCA: improving the scalability of *BEAST using random binning","volume":"15","author":"Zimmermann","year":"2014","journal-title":"BMC Genomics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/39\/Supplement_1\/i185\/50741530\/btad221.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/39\/Supplement_1\/i185\/50741530\/btad221.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,6,30]],"date-time":"2023-06-30T08:18:13Z","timestamp":1688113093000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/39\/Supplement_1\/i185\/7210452"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,6,1]]},"references-count":40,"journal-issue":{"issue":"Supplement_1","published-print":{"date-parts":[[2023,6,30]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btad221","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2023,6,1]]},"published":{"date-parts":[[2023,6,1]]}}}