{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,4]],"date-time":"2025-11-04T15:53:04Z","timestamp":1762271584735},"reference-count":53,"publisher":"Oxford University Press (OUP)","issue":"1","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2012,1,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Recent large-scale studies of individuals within a population have demonstrated that there is widespread variation in copy number in many gene families. In addition, there is increasing evidence that the variation in gene copy number can give rise to substantial phenotypic effects. In some cases, these variations have been shown to be adaptive. These observations show that a full understanding of the evolution of biological function requires an understanding of gene gain and gene loss. Accurate, robust evolutionary models of gain and loss events are, therefore, required.<\/jats:p>\n               <jats:p>Results: We have developed weighted parsimony and maximum likelihood methods for inferring gain and loss events. To test these methods, we have used Markov models of gain and loss to simulate data with known properties. We examine three models: a simple birth\u2013death model, a single rate model and a birth\u2013death innovation model with parameters estimated from Drosophila genome data. We find that for all simulations maximum likelihood-based methods are very accurate for reconstructing the number of duplication events on the phylogenetic tree, and that maximum likelihood and weighted parsimony have similar accuracy for reconstructing the ancestral state. Our implementations are robust to different model parameters and provide accurate inferences of ancestral states and the number of gain and loss events. For ancestral reconstruction, we recommend weighted parsimony because it has similar accuracy to maximum likelihood, but is much faster. For inferring the number of individual gene loss or gain events, maximum likelihood is noticeably more accurate, albeit at greater computational cost.<\/jats:p>\n               <jats:p>Availability: \u00a0www.bioinf.manchester.ac.uk\/dupliphy<\/jats:p>\n               <jats:p>Contact: \u00a0simon.lovell@manchester.ac.uk; simon.whelan@manchester.ac.uk<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btr592","type":"journal-article","created":{"date-parts":[[2011,10,29]],"date-time":"2011-10-29T02:04:25Z","timestamp":1319853865000},"page":"48-55","source":"Crossref","is-referenced-by-count":41,"title":["Determining the evolutionary history of gene families"],"prefix":"10.1093","volume":"28","author":[{"given":"Ryan M.","family":"Ames","sequence":"first","affiliation":[{"name":"Faculty of Life Sciences, University of Manchester, Oxford Road, Manchester M13 9PL, UK"}]},{"given":"Daniel","family":"Money","sequence":"additional","affiliation":[{"name":"Faculty of Life Sciences, University of Manchester, Oxford Road, Manchester M13 9PL, UK"}]},{"given":"Vikramsinh P.","family":"Ghatge","sequence":"additional","affiliation":[{"name":"Faculty of Life Sciences, University of Manchester, Oxford Road, Manchester M13 9PL, UK"}]},{"given":"Simon","family":"Whelan","sequence":"additional","affiliation":[{"name":"Faculty of Life Sciences, University of Manchester, Oxford Road, Manchester M13 9PL, UK"}]},{"given":"Simon C.","family":"Lovell","sequence":"additional","affiliation":[{"name":"Faculty of Life Sciences, University of Manchester, Oxford Road, Manchester M13 9PL, UK"}]}],"member":"286","published-online":{"date-parts":[[2011,10,28]]},"reference":[{"key":"2023061011452364400_B1","doi-asserted-by":"crossref","first-page":"2185","DOI":"10.1126\/science.287.5461.2185","article-title":"The Genome Sequence of Drosophila melanogaster","volume":"287","author":"Adams","year":"2000","journal-title":"Science"},{"key":"2023061011452364400_B2","doi-asserted-by":"crossref","first-page":"5714","DOI":"10.1073\/pnas.0806251106","article-title":"Simultaneous Bayesian gene tree reconstruction and reconciliation analysis","volume":"106","author":"Akerborg","year":"2009","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023061011452364400_B3","doi-asserted-by":"crossref","first-page":"403","DOI":"10.1016\/S0022-2836(05)80360-2","article-title":"Basic local alignment search tool","volume":"215","author":"Altschul","year":"1990","journal-title":"J. Mol. Biol."},{"key":"2023061011452364400_B4","doi-asserted-by":"crossref","first-page":"591","DOI":"10.1093\/gbe\/evq043","article-title":"Gene duplication and environmental adaptation within yeast populations","volume":"2","author":"Ames","year":"2010","journal-title":"Genome Biol. Evol."},{"key":"2023061011452364400_B5","doi-asserted-by":"crossref","first-page":"487","DOI":"10.1534\/genetics.107.084491","article-title":"De novo origination of a new protein-coding gene in Saccharomyces cerevisiae","volume":"179","author":"Cai","year":"2008","journal-title":"Genetics"},{"key":"2023061011452364400_B6","doi-asserted-by":"crossref","first-page":"429","DOI":"10.1089\/106652700750050871","article-title":"NOTUNG: a program for dating gene duplications and optimizing gene family trees","volume":"7","author":"Chen","year":"2000","journal-title":"J. Comput. Biol."},{"key":"2023061011452364400_B7","doi-asserted-by":"crossref","first-page":"203","DOI":"10.1038\/nature06341","article-title":"Evolution of genes and genomes on the Drosophila phylogeny","volume":"450","author":"Clark","year":"2007","journal-title":"Nature"},{"key":"2023061011452364400_B8","doi-asserted-by":"crossref","first-page":"3378","DOI":"10.1093\/nar\/gkf449","article-title":"GenomeHistory: a software tool and its application to fully sequenced genomes","volume":"30","author":"Conant","year":"2002","journal-title":"Nucleic Acids Res."},{"key":"2023061011452364400_B9","doi-asserted-by":"crossref","first-page":"72","DOI":"10.1007\/978-3-540-87989-3_6","article-title":"Ancestral reconstruction by asymmetric wagner parsimony over continuous characters and squared parsimony over distributions","volume-title":"Comparative Genomics: International Workshop, RECOMB-CG 2008, Paris, France, October 13-15, 2008, Proceedings.","author":"Csuros","year":"2008"},{"key":"2023061011452364400_B10","doi-asserted-by":"crossref","first-page":"1910","DOI":"10.1093\/bioinformatics\/btq315","article-title":"Count: evolutionary analysis of phylogenetic profiles with parsimony and likelihood","volume":"26","author":"Csuros","year":"2010","journal-title":"Bioinformatics"},{"key":"2023061011452364400_B11","doi-asserted-by":"crossref","first-page":"206","DOI":"10.1007\/11732990_18","article-title":"A probabilistic model for gene content evolution with duplication, loss, and horizontal transfer","volume":"3909","author":"Csuros","year":"2006","journal-title":"Lect. Notes Comput. Sci."},{"key":"2023061011452364400_B12","doi-asserted-by":"crossref","first-page":"1269","DOI":"10.1093\/bioinformatics\/btl097","article-title":"CAFE: a computational tool for the study of gene family evolution","volume":"22","author":"De Bie","year":"2006","journal-title":"Bioinformatics"},{"key":"2023061011452364400_B13","doi-asserted-by":"crossref","first-page":"19920","DOI":"10.1073\/pnas.0709888104","article-title":"A portrait of copy-number polymorphism in Drosophila melanogaster","volume":"104","author":"Dopman","year":"2007","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023061011452364400_B14","doi-asserted-by":"crossref","first-page":"1384","DOI":"10.1038\/ng.2007.19","article-title":"Recurrent DNA copy number variation in the laboratory mouse","volume":"39","author":"Egan","year":"2007","journal-title":"Nat. Genet."},{"key":"2023061011452364400_B15","doi-asserted-by":"crossref","first-page":"1629","DOI":"10.1126\/science.1158078","article-title":"Natural selection shapes genome-wide patterns of copy-number polymorphism in Drosophila melanogaster","volume":"320","author":"Emerson","year":"2008","journal-title":"Science"},{"key":"2023061011452364400_B16","doi-asserted-by":"crossref","first-page":"401","DOI":"10.1093\/sysbio\/27.4.401","article-title":"Cases in which parsimony or compatibility methods will be positively misleading","volume":"27","author":"Felsenstein","year":"1978","journal-title":"Syst. Biol."},{"key":"2023061011452364400_B17","first-page":"159","article-title":"Phylogenies from restriction sites: a maximum-likelihood approach","volume":"46","author":"Felsenstein","year":"1992","journal-title":"Evolution"},{"key":"2023061011452364400_B18","volume-title":"Inferring Phytogenies.","author":"Felsenstein","year":"2004"},{"key":"2023061011452364400_B19","doi-asserted-by":"crossref","first-page":"569","DOI":"10.1038\/nature07953","article-title":"Autism genome-wide copy number variation reveals ubiquitin and neuronal genes","volume":"459","author":"Glessner","year":"2009","journal-title":"Nature"},{"key":"2023061011452364400_B20","doi-asserted-by":"crossref","first-page":"R141","DOI":"10.1186\/gb-2007-8-7-r141","article-title":"Bias in phylogenetic tree reconciliation methods: implications for vertebrate genome evolution","volume":"8","author":"Hahn","year":"2007","journal-title":"Genome Biol."},{"key":"2023061011452364400_B21","doi-asserted-by":"crossref","first-page":"1153","DOI":"10.1101\/gr.3567505","article-title":"Estimating the tempo and mode of gene family evolution from comparative genomic data","volume":"15","author":"Hahn","year":"2005","journal-title":"Genome Res."},{"key":"2023061011452364400_B22","doi-asserted-by":"crossref","first-page":"e197","DOI":"10.1371\/journal.pgen.0030197","article-title":"Gene family evolution across 12 Drosophila genomes","volume":"3","author":"Hahn","year":"2007","journal-title":"PLoS Genet."},{"key":"2023061011452364400_B23","doi-asserted-by":"crossref","first-page":"R209","DOI":"10.1186\/gb-2007-8-10-r209","article-title":"All duplicates are not equal: the difference between small-scale and genome duplication","volume":"8","author":"Hakes","year":"2007","journal-title":"Genome Biol."},{"key":"2023061011452364400_B24","doi-asserted-by":"crossref","first-page":"1837","DOI":"10.1101\/gr.6249707","article-title":"Evolutionary rate analyses of orthologs and paralogs from 12 Drosophila genomes","volume":"17","author":"Heger","year":"2007","journal-title":"Genome Res."},{"key":"2023061011452364400_B25","doi-asserted-by":"crossref","first-page":"i230","DOI":"10.1093\/bioinformatics\/btm165","article-title":"Reconstruction of highly heterogeneous gene-content evolution across the three domains of life","volume":"23","author":"Iwasaki","year":"2007","journal-title":"Bioinformatics"},{"key":"2023061011452364400_B26","doi-asserted-by":"crossref","first-page":"18","DOI":"10.1186\/1471-2148-2-18","article-title":"Birth and death of protein domains: a simple model of evolution explains power law behavior","volume":"2","author":"Karev","year":"2002","journal-title":"BMC Evol. Biol."},{"key":"2023061011452364400_B27","doi-asserted-by":"crossref","first-page":"1752","DOI":"10.1101\/gr.095026.109","article-title":"Recent de novo origin of human protein-coding genes","volume":"19","author":"Knowles","year":"2009","journal-title":"Genome Res."},{"key":"2023061011452364400_B28","doi-asserted-by":"crossref","first-page":"366","DOI":"10.1016\/j.sbi.2008.02.005","article-title":"The current excitement about copy-number variation: how it relates to gene duplications and protein families","volume":"18","author":"Korbel","year":"2008","journal-title":"Curr. Opin. Struct. Biol."},{"key":"2023061011452364400_B29","doi-asserted-by":"crossref","first-page":"337","DOI":"10.1038\/nature07743","article-title":"Population genomics of domestic and wild yeasts","volume":"458","author":"Liti","year":"2009","journal-title":"Nature"},{"key":"2023061011452364400_B30","doi-asserted-by":"crossref","first-page":"S43","DOI":"10.1038\/ng2084","article-title":"Genomic rearrangements and sporadic disease","volume":"39","author":"Lupski","year":"2007","journal-title":"Nat. Genet."},{"key":"2023061011452364400_B31","doi-asserted-by":"crossref","first-page":"1151","DOI":"10.1126\/science.290.5494.1151","article-title":"The evolutionary fate and consequences of duplicate genes","volume":"290","author":"Lynch","year":"2000","journal-title":"Science"},{"key":"2023061011452364400_B32","doi-asserted-by":"crossref","first-page":"35","DOI":"10.1023\/A:1022696612931","article-title":"The evolutionary demography of duplicate genes","volume":"3","author":"Lynch","year":"2003","journal-title":"J. Struct. Funct. Genomics"},{"key":"2023061011452364400_B33","doi-asserted-by":"crossref","first-page":"132","DOI":"10.1093\/sysbio\/syp083","article-title":"Probabilistic models of chromosome number evolution and the inference of polyploidy","volume":"59","author":"Mayrose","year":"2010","journal-title":"Syst. Biol."},{"key":"2023061011452364400_B34","doi-asserted-by":"crossref","first-page":"59","DOI":"10.1038\/nature09708","article-title":"Mapping copy number variation by population-scale genome sequencing","volume":"470","author":"Mills","year":"2011","journal-title":"Nature"},{"key":"2023061011452364400_B35","doi-asserted-by":"crossref","first-page":"e1000243","DOI":"10.1371\/journal.pgen.1000243","article-title":"Adaptive copy number evolution in malaria parasites","volume":"4","author":"Nair","year":"2008","journal-title":"PLoS Genet."},{"key":"2023061011452364400_B36","doi-asserted-by":"crossref","DOI":"10.1007\/978-3-642-86659-3","volume-title":"Evolution by Gene Duplication.","author":"Ohno","year":"1970"},{"key":"2023061011452364400_B37","doi-asserted-by":"crossref","first-page":"819","DOI":"10.1093\/bioinformatics\/14.9.819","article-title":"GeneTree: comparing gene and species phylogenies using reconciled trees","volume":"14","author":"Page","year":"1998","journal-title":"Bioinformatics"},{"key":"2023061011452364400_B38","doi-asserted-by":"crossref","first-page":"1256","DOI":"10.1038\/ng2123","article-title":"Diet and the evolution of human amylase gene copy number variation","volume":"39","author":"Perry","year":"2007","journal-title":"Nat. Genet."},{"key":"2023061011452364400_B39","doi-asserted-by":"crossref","first-page":"1634","DOI":"10.1371\/journal.pgen.0020173","article-title":"Widespread discordance of gene trees with species tree in Drosophila: evidence for incomplete lineage sorting","volume":"2","author":"Pollard","year":"2006","journal-title":"PLoS Genet."},{"key":"2023061011452364400_B40","doi-asserted-by":"crossref","first-page":"890","DOI":"10.1093\/oxfordjournals.molbev.a026369","article-title":"A fast algorithm for joint reconstruction of ancestral amino acid sequences","volume":"17","author":"Pupko","year":"2000","journal-title":"Mol. Biol. Evol."},{"key":"2023061011452364400_B41","doi-asserted-by":"crossref","first-page":"1116","DOI":"10.1093\/bioinformatics\/18.8.1116","article-title":"A branch-and-bound algorithm for the inference of ancestral amino-acid sequences when the replacement rate varies among sites: application to the evolution of five gene families","volume":"18","author":"Pupko","year":"2002","journal-title":"Bioinformatics"},{"key":"2023061011452364400_B42","doi-asserted-by":"crossref","first-page":"444","DOI":"10.1038\/nature05329","article-title":"Global variation in copy number in the human genome","volume":"444","author":"Redon","year":"2006","journal-title":"Nature"},{"key":"2023061011452364400_B43","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1101\/gr.3059305","article-title":"Comparative genome sequencing of Drosophila pseudoobscura: chromosomal, gene, and cis-element evolution","volume":"15","author":"Richards","year":"2005","journal-title":"Genome Res."},{"key":"2023061011452364400_B44","first-page":"183","article-title":"TreeBASE: a prototype database of phylogenetic analyses and an interactive tool for browsing the phylogeny of life","volume":"81","author":"Sanderson","year":"1994","journal-title":"Am. J. Botany"},{"key":"2023061011452364400_B45","doi-asserted-by":"crossref","first-page":"35","DOI":"10.1137\/0128004","article-title":"Minimal mutation trees of sequences","volume":"28","author":"Sankoff","year":"1975","journal-title":"SIAM J. Appl. Math."},{"key":"2023061011452364400_B46","doi-asserted-by":"crossref","first-page":"240","DOI":"10.1007\/BF01681346","article-title":"Locating the vertices of a steiner tree in an arbitrary metric space","volume":"9","author":"Sankoff","year":"1975","journal-title":"Math. Program."},{"key":"2023061011452364400_B47","doi-asserted-by":"crossref","first-page":"157","DOI":"10.1177\/117693430600200029","article-title":"Modelling prokaryote gene content","volume":"2","author":"Spencer","year":"2006","journal-title":"Evol. Bioinform Online"},{"key":"2023061011452364400_B48","doi-asserted-by":"crossref","first-page":"641","DOI":"10.1126\/science.1197005","article-title":"Diversity of human copy number variation and multicopy genes","volume":"330","author":"Sudmant","year":"2010","journal-title":"Science"},{"key":"2023061011452364400_B49","doi-asserted-by":"crossref","first-page":"517","DOI":"10.1109\/TCBB.2010.14","article-title":"Simultaneous identification of duplications and lateral gene transfers","volume":"8","author":"Tofigh","year":"2010","journal-title":"IEEE IEEE\/ACM Trans. Comput. Biol. Bioinformatics"},{"key":"2023061011452364400_B50","first-page":"90","author":"Turner","year":"2007","journal-title":"Germline rates of de novo meiotic deletions and duplications causing s everal genomic disorders."},{"key":"2023061011452364400_B51","first-page":"1396","article-title":"Maximum-likelihood estimation of phylogeny from dna sequences when substitution rates differ over sites","volume":"10","author":"Yang","year":"1993","journal-title":"Mol. Biol. Evol."},{"key":"2023061011452364400_B52","doi-asserted-by":"crossref","DOI":"10.1093\/acprof:oso\/9780198567028.001.0001","volume-title":"Computational Molecular Evolution.","author":"Yang","year":"2006"},{"key":"2023061011452364400_B53","doi-asserted-by":"crossref","first-page":"1446","DOI":"10.1101\/gr.076588.108","article-title":"On the origin of new genes in Drosophila","volume":"18","author":"Zhou","year":"2008","journal-title":"Genome Res."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/28\/1\/48\/50568502\/bioinformatics_28_1_48.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/28\/1\/48\/50568502\/bioinformatics_28_1_48.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,6,10]],"date-time":"2023-06-10T11:46:51Z","timestamp":1686397611000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/28\/1\/48\/218268"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,10,28]]},"references-count":53,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2012,1,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btr592","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2012,1,1]]},"published":{"date-parts":[[2011,10,28]]}}}