{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,23]],"date-time":"2026-01-23T05:52:14Z","timestamp":1769147534975,"version":"3.49.0"},"reference-count":28,"publisher":"Oxford University Press (OUP)","issue":"5","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2008,3,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Gene duplications and losses (GDLs) are important events in genome evolution. They result in expansion or contraction of gene families, with a likely role in phenotypic evolution. As more genomes become available and their annotations are improved, software programs capable of rapidly and accurately identifying the content of ancestral genomes and the timings of GDLs become necessary to understand the unique evolution of each lineage.<\/jats:p>\n               <jats:p>Results: We report EvolMAP, a new algorithm and software that utilizes a species tree-based gene clustering method to join all-to-all symmetrical similarity comparisons of multiple gene sets in order to infer the gene composition of multiple ancestral genomes. The algorithm further uses Dollo parsimony-based comparison of the inferred ancestral genes to pinpoint the timings of GDLs onto evolutionary intervals marked by speciation events. Using EvolMAP, first we analyzed the expansion of four families of G-protein coupled receptors (GPCRs) within animal lineages. Additional to demonstrating the unique expansion tree for each family, results also show that the ancestral eumetazoan genome contained many fewer GPCRs than modern animals, and these families expanded through concurrent lineage-specific duplications. Second, we analyzed the history of GDLs in mammalian genomes by comparing seven proteomes. 
In agreement with previous studies, we report that the mammalian gene family sizes have changed drastically through their evolution. Interestingly, although we identified a potential source of duplication for 75% of the gained genes, remaining 25% did not have clear-cut sources, revealing thousands of genes that have likely gained their distinct sequence identities within the descent of mammals.<\/jats:p>\n               <jats:p>Availability: Query server, source code and executable are available at http:\/\/kosik-web.mcdb.ucsb.edu\/evolmap\/index.htm<\/jats:p>\n               <jats:p>Contact: kosik@lifesci.ucsb.edu, oakley@lifesci.ucsb.edu<\/jats:p>\n               <jats:p>Supplementary information: Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btn005","type":"journal-article","created":{"date-parts":[[2008,1,10]],"date-time":"2008-01-10T01:34:38Z","timestamp":1199928878000},"page":"606-612","source":"Crossref","is-referenced-by-count":18,"title":["Reconstructing ancestral genome content based on symmetrical best alignments and Dollo parsimony"],"prefix":"10.1093","volume":"24","author":[{"given":"Onur","family":"Sakarya","sequence":"first","affiliation":[{"name":"1 Department of Computer Science, 2Neuroscience Research Institute, 3Department of Molecular, Cellular and Developmental Biology and 4Department of Ecology, Evolution and Marine Biology, University of California, Santa Barbara, CA 93106, USA"}]},{"given":"Kenneth S.","family":"Kosik","sequence":"additional","affiliation":[{"name":"1 Department of Computer Science, 2Neuroscience Research Institute, 3Department of Molecular, Cellular and Developmental Biology and 4Department of Ecology, Evolution 
and Marine Biology, University of California, Santa Barbara, CA 93106, USA"}]},{"given":"Todd H.","family":"Oakley","sequence":"additional","affiliation":[{"name":"1 Department of Computer Science, 2Neuroscience Research Institute, 3Department of Molecular, Cellular and Developmental Biology and 4Department of Ecology, Evolution and Marine Biology, University of California, Santa Barbara, CA 93106, USA"}]}],"member":"286","published-online":{"date-parts":[[2008,1,9]]},"reference":[{"key":"2023020210104844100_B1","doi-asserted-by":"crossref","first-page":"e9","DOI":"10.1093\/bioinformatics\/btl213","article-title":"Automatic clustering of orthologs and inparalogs shared by multiple proteomes","volume":"22","author":"Alexeyenko","year":"2006","journal-title":"Bioinformatics"},{"key":"2023020210104844100_B2","doi-asserted-by":"crossref","first-page":"3389","DOI":"10.1093\/nar\/25.17.3389","article-title":"Gapped BLAST and PSI-BLAST: a new generation of protein database search programs","volume":"25","author":"Altschul","year":"1997","journal-title":"Nucleic Acids Res"},{"key":"2023020210104844100_B3","doi-asserted-by":"crossref","first-page":"276","DOI":"10.1093\/nar\/30.1.276","article-title":"The Pfam protein families database","volume":"30","author":"Bateman","year":"2002","journal-title":"Nucleic Acids Res"},{"key":"2023020210104844100_B4","doi-asserted-by":"crossref","first-page":"699","DOI":"10.1093\/bioinformatics\/btk040","article-title":"OrthologID: automation of genome-scale ortholog identification within a parsimony 
framework","volume":"22","author":"Chiu","year":"2006","journal-title":"Bioinformatics"},{"key":"2023020210104844100_B5","doi-asserted-by":"crossref","first-page":"1269","DOI":"10.1093\/bioinformatics\/btl097","article-title":"CAFE: a computational tool for the study of gene family evolution","volume":"22","author":"De Bie","year":"2006","journal-title":"Bioinformatics"},{"key":"2023020210104844100_B6","doi-asserted-by":"crossref","first-page":"2044","DOI":"10.1093\/bioinformatics\/btl286","article-title":"Roundup: a multi-genome repository of orthologs and evolutionary distances","volume":"22","author":"Deluca","year":"2006","journal-title":"Bioinformatics"},{"key":"2023020210104844100_B7","doi-asserted-by":"crossref","first-page":"e85","DOI":"10.1371\/journal.pone.0000085","article-title":"The evolution of Mammalian gene families","volume":"1","author":"Demuth","year":"2006","journal-title":"PLoS ONE"},{"key":"2023020210104844100_B8","doi-asserted-by":"crossref","first-page":"320","DOI":"10.1089\/cmb.2006.13.320","article-title":"A hybrid micro-macroevolutionary approach to gene tree reconstruction","volume":"13","author":"Durand","year":"2006","journal-title":"J. Comput. Biol"},{"key":"2023020210104844100_B9","doi-asserted-by":"crossref","first-page":"77","DOI":"10.2307\/2412867","article-title":"Phylogenetic analysis under Dollo's law","volume":"26","author":"Farris","year":"1977","journal-title":"Syst. Zool"},{"key":"2023020210104844100_B10","doi-asserted-by":"crossref","first-page":"99","DOI":"10.2307\/2412448","article-title":"Distinguishing homologous from analogous proteins","volume":"19","author":"Fitch","year":"1970","journal-title":"Syst. Zool"},{"key":"2023020210104844100_B11","doi-asserted-by":"crossref","first-page":"10915","DOI":"10.1073\/pnas.89.22.10915","article-title":"Amino acid substitution matrices from protein blocks","volume":"89","author":"Henikoff","year":"1992","journal-title":"Proc. Natl Acad. Sci. 
USA"},{"key":"2023020210104844100_B12","doi-asserted-by":"crossref","first-page":"704","DOI":"10.1038\/189704a0","article-title":"Gene evolution and the haemoglobins","volume":"189","author":"Ingram","year":"1961","journal-title":"Nature"},{"key":"2023020210104844100_B13","doi-asserted-by":"crossref","first-page":"309","DOI":"10.1146\/annurev.genet.39.073003.114725","article-title":"Orthologs, paralogs, and evolutionary genomics","volume":"39","author":"Koonin","year":"2005","journal-title":"Annu. Rev. Genet"},{"key":"2023020210104844100_B14","doi-asserted-by":"crossref","first-page":"D572","DOI":"10.1093\/nar\/gkj118","article-title":"TreeFam: a curated database of phylogenetic trees of animal gene families","volume":"34","author":"Li","year":"2006","journal-title":"Nucleic Acids Res"},{"key":"2023020210104844100_B15","doi-asserted-by":"crossref","first-page":"2178","DOI":"10.1101\/gr.1224503","article-title":"OrthoMCL: identification of ortholog groups for eukaryotic genomes","volume":"13","author":"Li","year":"2003","journal-title":"Genome Res"},{"key":"2023020210104844100_B16","doi-asserted-by":"crossref","first-page":"443","DOI":"10.1016\/0022-2836(70)90057-4","article-title":"A general method applicable to the search for similarities in the amino acid sequence of two proteins","volume":"48","author":"Needleman","year":"1970","journal-title":"J. Mol. 
Biol"},{"key":"2023020210104844100_B17","doi-asserted-by":"crossref","first-page":"D476","DOI":"10.1093\/nar\/gki107","article-title":"Inparanoid: a comprehensive database of eukaryotic orthologs","volume":"33","author":"O\u2019Brien","year":"2005","journal-title":"Nucleic Acids Res"},{"key":"2023020210104844100_B18","doi-asserted-by":"crossref","DOI":"10.1007\/978-3-642-86659-3","volume-title":"Evolution by Gene Duplication.","author":"Ohno","year":"1970"},{"key":"2023020210104844100_B19","doi-asserted-by":"crossref","first-page":"759","DOI":"10.1093\/icb\/icm050","article-title":"Key transitions during the evolution of animal phototransduction: novelty, \u2018tree-thinking,\u2019 co-option, and co-duplication","volume":"47","author":"Plachetzki","year":"2007","journal-title":"Integrative and Comparative Biology"},{"key":"2023020210104844100_B20","doi-asserted-by":"crossref","first-page":"86","DOI":"10.1126\/science.1139158","article-title":"Sea anemone genome reveals ancestral eumetazoan gene repertoire and genomic organization","volume":"317","author":"Putnam","year":"2007","journal-title":"Science"},{"key":"2023020210104844100_B21","doi-asserted-by":"crossref","first-page":"1041","DOI":"10.1006\/jmbi.2000.5197","article-title":"Automatic clustering of orthologs and in-paralogs from pairwise species comparisons","volume":"314","author":"Remm","year":"2001","journal-title":"J. Mol. 
Biol"},{"key":"2023020210104844100_B22","doi-asserted-by":"crossref","first-page":"619","DOI":"10.1016\/S0168-9525(02)02793-2","article-title":"Orthology, paralogy and proposed classification for paralog subtypes","volume":"18","author":"Sonnhammer","year":"2002","journal-title":"Trends Genet"},{"key":"2023020210104844100_B23","doi-asserted-by":"crossref","first-page":"92","DOI":"10.1093\/bioinformatics\/18.1.92","article-title":"Automated ortholog inference from phylogenetic trees and calculation of orthology reliability","volume":"18","author":"Storm","year":"2002","journal-title":"Bioinformatics"},{"key":"2023020210104844100_B24","doi-asserted-by":"crossref","first-page":"41","DOI":"10.1186\/1471-2105-4-41","article-title":"The COG database: an updated version includes eukaryotes","volume":"4","author":"Tatusov","year":"2003","journal-title":"BMC Bioinformatics"},{"key":"2023020210104844100_B25","doi-asserted-by":"crossref","first-page":"1710","DOI":"10.1093\/bioinformatics\/btg213","article-title":"Detecting putative orthologs","volume":"19","author":"Wall","year":"2003","journal-title":"Bioinformatics"},{"key":"2023020210104844100_B26","doi-asserted-by":"crossref","first-page":"i549","DOI":"10.1093\/bioinformatics\/btm193","article-title":"Automatic genome-wide reconstruction of phylogenetic gene trees","volume":"23","author":"Wapinski","year":"2007","journal-title":"Bioinformatics"},{"key":"2023020210104844100_B27","doi-asserted-by":"crossref","first-page":"509","DOI":"10.1110\/ps.051745906","article-title":"A general model of G protein-coupled receptor sequences and its application to detect remote homologs","volume":"15","author":"Wistrand","year":"2006","journal-title":"Protein Sci"},{"key":"2023020210104844100_B28","doi-asserted-by":"crossref","first-page":"14","DOI":"10.1186\/1471-2105-3-14","article-title":"RIO: analyzing proteomes by automated phylogenomics using resampled inference of 
orthologs","volume":"3","author":"Zmasek","year":"2002","journal-title":"BMC Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/24\/5\/606\/49050930\/bioinformatics_24_5_606.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/24\/5\/606\/49050930\/bioinformatics_24_5_606.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,2,2]],"date-time":"2023-02-02T11:47:03Z","timestamp":1675338423000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/24\/5\/606\/202110"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2008,1,9]]},"references-count":28,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2008,3,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btn005","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2008,3,1]]},"published":{"date-parts":[[2008,1,9]]}}}