{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,8]],"date-time":"2026-06-08T21:40:51Z","timestamp":1780954851865,"version":"3.54.1"},"reference-count":49,"publisher":"Oxford University Press (OUP)","issue":"8","license":[{"start":{"date-parts":[[2016,10,2]],"date-time":"2016-10-02T00:00:00Z","timestamp":1475366400000},"content-version":"vor","delay-in-days":536,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2015,4,15]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: The accurate inference of gene trees is a necessary step in many evolutionary studies. Although the problem of accurate gene tree inference has received considerable attention, most existing methods are only applicable to gene families unaffected by horizontal gene transfer. As a result, the accurate inference of gene trees affected by horizontal gene transfer remains a largely unaddressed problem.<\/jats:p><jats:p>Results: In this study, we introduce a new and highly effective method for gene tree error correction in the presence of horizontal gene transfer. Our method efficiently models horizontal gene transfers, gene duplications and losses, and uses a statistical hypothesis testing framework [Shimodaira\u2013Hasegawa (SH) test] to balance sequence likelihood with topological information from a known species tree. Using a thorough simulation study, we show that existing phylogenetic methods yield inaccurate gene trees when applied to horizontally transferred gene families and that our method dramatically improves gene tree accuracy. 
We apply our method to a dataset of 11 cyanobacterial species and demonstrate the large impact of gene tree accuracy on downstream evolutionary analyses.<\/jats:p><jats:p>Availability and implementation: An implementation of our method is available at http:\/\/compbio.mit.edu\/treefix-dtl\/<\/jats:p><jats:p>Contact: mukul@engr.uconn.edu or manoli@mit.edu<\/jats:p><jats:p>Supplementary information: Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btu806","type":"journal-article","created":{"date-parts":[[2014,12,7]],"date-time":"2014-12-07T01:08:27Z","timestamp":1417914507000},"page":"1211-1218","source":"Crossref","is-referenced-by-count":56,"title":["Improved gene tree error correction in the presence of horizontal gene transfer"],"prefix":"10.1093","volume":"31","author":[{"given":"Mukul S.","family":"Bansal","sequence":"first","affiliation":[{"name":"1 Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, USA, 2Department of Computer Science and Engineering, University of Connecticut, Storrs, CT, USA and 3Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge and 4Broad Institute, Cambridge, MA, USA"},{"name":"1 Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, USA, 2Department of Computer Science and Engineering, University of Connecticut, Storrs, CT, USA and 3Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge and 4Broad Institute, Cambridge, MA, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Yi-Chieh","family":"Wu","sequence":"additional","affiliation":[{"name":"1 Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, USA, 2Department of Computer Science and Engineering, University of Connecticut, Storrs, CT, USA and 3Department of Biological 
Engineering, Massachusetts Institute of Technology, Cambridge and 4Broad Institute, Cambridge, MA, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Eric J.","family":"Alm","sequence":"additional","affiliation":[{"name":"1 Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, USA, 2Department of Computer Science and Engineering, University of Connecticut, Storrs, CT, USA and 3Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge and 4Broad Institute, Cambridge, MA, USA"},{"name":"1 Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, USA, 2Department of Computer Science and Engineering, University of Connecticut, Storrs, CT, USA and 3Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge and 4Broad Institute, Cambridge, MA, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Manolis","family":"Kellis","sequence":"additional","affiliation":[{"name":"1 Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, USA, 2Department of Computer Science and Engineering, University of Connecticut, Storrs, CT, USA and 3Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge and 4Broad Institute, Cambridge, MA, USA"},{"name":"1 Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, USA, 2Department of Computer Science and Engineering, University of Connecticut, Storrs, CT, USA and 3Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge and 4Broad Institute, Cambridge, MA, 
USA"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"286","published-online":{"date-parts":[[2015,12,5]]},"reference":[{"key":"2023051309024760300_btu806-B1","doi-asserted-by":"crossref","first-page":"5714","DOI":"10.1073\/pnas.0806251106","article-title":"Simultaneous bayesian gene tree reconstruction and reconciliation analysis","volume":"106","author":"\u00c5kerborg","year":"2009","journal-title":"Proc. Natl. Acad. Sci."},{"key":"2023051309024760300_btu806-B2","doi-asserted-by":"crossref","first-page":"i283","DOI":"10.1093\/bioinformatics\/bts225","article-title":"Efficient algorithms for the reconciliation problem with gene duplication, horizontal transfer and loss","volume":"28","author":"Bansal","year":"2012","journal-title":"Bioinformatics"},{"key":"2023051309024760300_btu806-B3","doi-asserted-by":"crossref","first-page":"406","DOI":"10.1016\/j.tim.2004.07.002","article-title":"Phylogenetic reconstruction and lateral gene transfer","volume":"12","author":"Bapteste","year":"2004","journal-title":"Trends Microbiol."},{"key":"2023051309024760300_btu806-B4","doi-asserted-by":"crossref","first-page":"117","DOI":"10.1093\/sysbio\/syq072","article-title":"Genome-scale phylogenetics: Inferring the plant tree of life from 18,896 gene trees","volume":"60","author":"Burleigh","year":"2011","journal-title":"Syst. Biol."},{"key":"2023051309024760300_btu806-B5","doi-asserted-by":"crossref","first-page":"96","DOI":"10.1145\/332306.332351","article-title":"Notung: dating gene duplications using gene family trees","volume-title":"RECOMB","author":"Chen","year":"2000"},{"key":"2023051309024760300_btu806-B6","doi-asserted-by":"crossref","first-page":"3309","DOI":"10.1093\/molbev\/mss138","article-title":"Replacing and additive horizontal gene transfer in streptococcus","volume":"29","author":"Choi","year":"2012","journal-title":"Mol. Biol. 
Evol."},{"key":"2023051309024760300_btu806-B7","doi-asserted-by":"crossref","first-page":"93","DOI":"10.1038\/nature09649","article-title":"Rapid evolutionary innovation during an archaean genetic expansion","volume":"469","author":"David","year":"2011","journal-title":"Nature"},{"key":"2023051309024760300_btu806-B8","first-page":"93","article-title":"An efficient algorithm for gene\/species trees parsimonious reconciliation with losses, duplications and transfers","volume-title":"RECOMB-CG, volume 6398 of LNCS","author":"Doyon","year":"2010"},{"key":"2023051309024760300_btu806-B9","doi-asserted-by":"crossref","first-page":"320","DOI":"10.1089\/cmb.2006.13.320","article-title":"A hybrid micro-macroevolutionary approach to gene tree reconstruction","volume":"13","author":"Durand","year":"2006","journal-title":"J. Comput. Biol."},{"key":"2023051309024760300_btu806-B10","volume-title":"Inferring Phylogenies","author":"Felsenstein","year":"2004"},{"key":"2023051309024760300_btu806-B11","doi-asserted-by":"crossref","first-page":"361","DOI":"10.1038\/nrg1603","article-title":"Phylogenomics and the reconstruction of the tree of life","volume":"6","author":"Delsuc","year":"2005","journal-title":"Nat. Rev. Genet."},{"key":"2023051309024760300_btu806-B12","doi-asserted-by":"crossref","first-page":"132","DOI":"10.2307\/2412519","article-title":"Fitting the gene lineage into its species lineage. a parsimony strategy illustrated by cladograms constructed from globin sequences","volume":"28","author":"Goodman","year":"1979","journal-title":"Syst. 
Zool."},{"key":"2023051309024760300_btu806-B13","first-page":"148","article-title":"A linear time algorithm for error-corrected reconciliation of unrooted gene trees","volume-title":"ISBRA, volume 6674 of LNCS","author":"G\u00f3recki","year":"2011"},{"key":"2023051309024760300_btu806-B14","doi-asserted-by":"crossref","first-page":"307","DOI":"10.1093\/sysbio\/syq010","article-title":"New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0","volume":"59","author":"Guindon","year":"2010","journal-title":"Syst. Biol."},{"key":"2023051309024760300_btu806-B15","doi-asserted-by":"crossref","first-page":"149","DOI":"10.1145\/369133.369188","article-title":"Efficient algorithms for lateral gene transfer problems","volume-title":"RECOMB","author":"Hallett","year":"2001"},{"key":"2023051309024760300_btu806-B16","doi-asserted-by":"crossref","first-page":"44","DOI":"10.2307\/1426329","article-title":"The probabilities of rooted tree-shapes generated by random bifurcation","volume":"3","author":"Harding","year":"1971","journal-title":"Adv. Appl. Prob."},{"key":"2023051309024760300_btu806-B17","doi-asserted-by":"crossref","first-page":"309","DOI":"10.1146\/annurev.genet.39.073003.114725","article-title":"Orthologs, paralogs, and evolutionary genomics","volume":"39","author":"Koonin","year":"2005","journal-title":"Annu. Rev. 
Genet."},{"key":"2023051309024760300_btu806-B18","doi-asserted-by":"crossref","first-page":"e19","DOI":"10.1371\/journal.pbio.0000019","article-title":"From gene trees to organismal phylogeny in prokaryotes: the case of the \u03b3-proteobacteria","volume":"1","author":"Lerat","year":"2003","journal-title":"PLoS Biol."},{"key":"2023051309024760300_btu806-B19","doi-asserted-by":"crossref","first-page":"e130","DOI":"10.1371\/journal.pbio.0030130","article-title":"Evolutionary origins of genomic repertoires in bacteria","volume":"3","author":"Lerat","year":"2005","journal-title":"PLoS Biol."},{"key":"2023051309024760300_btu806-B20","doi-asserted-by":"crossref","first-page":"D572","DOI":"10.1093\/nar\/gkj118","article-title":"Treefam: a curated database of phylogenetic trees of animal gene families","volume":"34","author":"Li","year":"2006","journal-title":"Nucleic Acids Res."},{"key":"2023051309024760300_btu806-B21","doi-asserted-by":"crossref","first-page":"105","DOI":"10.1089\/cmb.2008.0084","article-title":"On the computational complexity of the reticulate cophylogeny reconstruction problem","volume":"16","author":"Libeskind-Hadas","year":"2009","journal-title":"J. Comput. Biol."},{"key":"2023051309024760300_btu806-B22","doi-asserted-by":"crossref","first-page":"1561","DOI":"10.1126\/science.1171243","article-title":"Rapid and accurate large-scale coestimation of sequence alignments and phylogenetic trees","volume":"324","author":"Liu","year":"2009","journal-title":"Science"},{"key":"2023051309024760300_btu806-B23","doi-asserted-by":"crossref","first-page":"1007","DOI":"10.1089\/cmb.2008.0069","article-title":"Dupcar: Reconstructing contiguous ancestral regions with duplications","volume":"15","author":"Ma","year":"2008","journal-title":"J. Comput. 
Biol."},{"key":"2023051309024760300_btu806-B24","first-page":"123","article-title":"Accounting for gene tree uncertainties improves gene trees and reconciliation inference","volume-title":"WABI, volume 7534 of LNCS","author":"Nguyen","year":"2012"},{"key":"2023051309024760300_btu806-B25","doi-asserted-by":"crossref","first-page":"253","DOI":"10.1002\/9780470619902.ch14","article-title":"Phylogenomic approach to the evolutionary dynamics of gene duplication in birds","volume-title":"Evolution after Gene Duplication","author":"Organ","year":"2010"},{"key":"2023051309024760300_btu806-B26","doi-asserted-by":"crossref","first-page":"59","DOI":"10.1089\/cmb.2009.0240","article-title":"The cophylogeny reconstruction problem is np-complete","volume":"18","author":"Ovadia","year":"2011","journal-title":"J. Comput. Biol."},{"key":"2023051309024760300_btu806-B27","first-page":"58","article-title":"Maps between trees and cladistic analysis of historical associations among genes, organisms, and areas","volume":"43","author":"Page","year":"1994","journal-title":"Syst. Biol."},{"key":"2023051309024760300_btu806-B28","first-page":"235","article-title":"Seq-Gen: An application for the Monte Carlo simulation of DNA sequence evolution along phylogenetic trees","volume":"13","author":"Rambaut","year":"1997","journal-title":"Comput. Appl. Biosci."},{"key":"2023051309024760300_btu806-B29","doi-asserted-by":"crossref","first-page":"273","DOI":"10.1093\/molbev\/msq189","article-title":"A bayesian approach for fast and accurate gene tree reconstruction","volume":"28","author":"Rasmussen","year":"2011","journal-title":"Mol. Biol. Evol."},{"key":"2023051309024760300_btu806-B30","doi-asserted-by":"crossref","first-page":"35","DOI":"10.1111\/j.1366-9516.2006.00210.x","article-title":"Molecular dating of phylogenetic trees: A brief review of current methods that estimate divergence times","volume":"12","author":"Rutschmann","year":"2006","journal-title":"Divers. 
Distrib."},{"key":"2023051309024760300_btu806-B31","first-page":"406","article-title":"The neighbor-joining method: a new method for reconstructing phylogenetic trees","volume":"4","author":"Saitou","year":"1987","journal-title":"Mol. Biol. Evol."},{"key":"2023051309024760300_btu806-B32","doi-asserted-by":"crossref","first-page":"970","DOI":"10.1080\/106351501753462902","article-title":"Complexity of the likelihood surface for a large dna dataset","volume":"50","author":"Salter","year":"2001","journal-title":"Syst. Biol."},{"key":"2023051309024760300_btu806-B33","doi-asserted-by":"crossref","first-page":"448","DOI":"10.1126\/science.1206357","article-title":"Terraces in phylogenetic tree space","volume":"333","author":"Sanderson","year":"2011","journal-title":"Science"},{"key":"2023051309024760300_btu806-B34","doi-asserted-by":"crossref","first-page":"411","DOI":"10.1093\/sysbio\/syp046","article-title":"Probabilistic orthology analysis","volume":"58","author":"Sennblad","year":"2009","journal-title":"Syst. Biol."},{"key":"2023051309024760300_btu806-B35","doi-asserted-by":"crossref","first-page":"1114","DOI":"10.1093\/oxfordjournals.molbev.a026201","article-title":"Multiple comparisons of log-likelihoods with applications to phylogenetic inference","volume":"16","author":"Shimodaira","year":"1999","journal-title":"Mol. Biol. 
Evol."},{"key":"2023051309024760300_btu806-B36","doi-asserted-by":"crossref","first-page":"2688","DOI":"10.1093\/bioinformatics\/btl446","article-title":"RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models","volume":"22","author":"Stamatakis","year":"2006","journal-title":"Bioinformatics"},{"key":"2023051309024760300_btu806-B37","doi-asserted-by":"crossref","first-page":"409","DOI":"10.1093\/bioinformatics\/bts386","article-title":"Inferring duplications, losses, transfers and incomplete lineage sorting with nonbinary species trees","volume":"28","author":"Stolzer","year":"2012","journal-title":"Bioinformatics"},{"key":"2023051309024760300_btu806-B38","doi-asserted-by":"crossref","first-page":"92","DOI":"10.1093\/bioinformatics\/18.1.92","article-title":"Automated ortholog inference from phylogenetic trees and calculation of orthology reliability","volume":"18","author":"Storm","year":"2002","journal-title":"Bioinformatics"},{"key":"2023051309024760300_btu806-B39","doi-asserted-by":"crossref","first-page":"366","DOI":"10.1038\/nrg1324","article-title":"Resurrecting ancient genes: experimental analysis of extinct molecules","volume":"5","author":"Thornton","year":"2004","journal-title":"Nat. Rev. Genet."},{"key":"2023051309024760300_btu806-B40","article-title":"Using trees to capture reticulate evolution: lateral gene transfers and cancer progression","author":"Tofigh","year":"2009"},{"key":"2023051309024760300_btu806-B41","doi-asserted-by":"crossref","first-page":"517","DOI":"10.1109\/TCBB.2010.14","article-title":"Simultaneous identification of duplications and lateral gene transfers","volume":"8","author":"Tofigh","year":"2011","journal-title":"IEEE\/ACM Trans. Comput. Biol. 
Bioinform."},{"key":"2023051309024760300_btu806-B42","doi-asserted-by":"crossref","first-page":"327","DOI":"10.1101\/gr.073585.107","article-title":"Ensemblcompara genetrees: Complete, duplication-aware phylogenetic trees in vertebrates","volume":"19","author":"Vilella","year":"2009","journal-title":"Genome Res."},{"key":"2023051309024760300_btu806-B43","doi-asserted-by":"crossref","first-page":"54","DOI":"10.1038\/nature06107","article-title":"Natural history and evolutionary principles of gene duplication in fungi","volume":"449","author":"Wapinski","year":"2007","journal-title":"Nature"},{"key":"2023051309024760300_btu806-B44","doi-asserted-by":"crossref","first-page":"449","DOI":"10.1093\/molbev\/msq215","article-title":"Phylogenetic substitution models for detecting heterotachy during plastid evolution","volume":"28","author":"Whelan","year":"2011","journal-title":"Mol. Biol. Evol."},{"key":"2023051309024760300_btu806-B45","doi-asserted-by":"crossref","first-page":"110","DOI":"10.1093\/sysbio\/sys076","article-title":"Treefix: statistically informed gene tree error correction using species trees","volume":"62","author":"Wu","year":"2013","journal-title":"Syst. Biol."},{"key":"2023051309024760300_btu806-B46","doi-asserted-by":"crossref","first-page":"1586","DOI":"10.1093\/molbev\/msm088","article-title":"PAML 4: phylogenetic analysis by maximum likelihood","volume":"24","author":"Yang","year":"2007","journal-title":"Mol. Biol. Evol."},{"key":"2023051309024760300_btu806-B47","doi-asserted-by":"crossref","first-page":"21","DOI":"10.1098\/rstb.1925.0002","article-title":"A mathematical theory of evolution, based on the conclusions of Dr. J. C. Willis, F.R.S","volume":"213","author":"Yule","year":"1925","journal-title":"Philos. Trans. R Soc Lond B Biol. 
Char."},{"key":"2023051309024760300_btu806-B48","doi-asserted-by":"crossref","first-page":"195","DOI":"10.1007\/978-1-60327-853-9_11","article-title":"Detection and quantitative assessment of horizontal gene transfer","volume":"532","author":"Zhaxybayeva","year":"2009","journal-title":"Methods Mol. Biol."},{"key":"2023051309024760300_btu806-B49","doi-asserted-by":"crossref","first-page":"1099","DOI":"10.1101\/gr.5322306","article-title":"Phylogenetic analyses of cyanobacterial genomes: Quantification of horizontal gene transfer events","volume":"16","author":"Zhaxybayeva","year":"2006","journal-title":"Genome Res."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/31\/8\/1211\/50306077\/bioinformatics_31_8_1211.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/31\/8\/1211\/50306077\/bioinformatics_31_8_1211.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,6,5]],"date-time":"2024-06-05T17:42:37Z","timestamp":1717609357000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/31\/8\/1211\/212713"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2015,4,15]]},"references-count":49,"journal-issue":{"issue":"8","published-print":{"date-parts":[[2015,4,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btu806","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2015,4,15]]},"published":{"date-parts":[[2015,4,15]]}}}