{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,27]],"date-time":"2025-10-27T10:12:42Z","timestamp":1761559962344},"reference-count":22,"publisher":"Springer Science and Business Media LLC","issue":"1","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2006,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>We present here the PhIGs database, a phylogenomic resource for sequenced genomes. Although many methods exist for clustering gene families, very few attempt to create truly orthologous clusters sharing descent from a single ancestral gene across a range of evolutionary depths. Although these non-phylogenetic gene family clusters have been used broadly for gene annotation, errors are known to be introduced by the artifactual association of slowly evolving paralogs and lack of annotation for those more rapidly evolving. A full phylogenetic framework is necessary for accurate inference of function and for many studies that address pattern and mechanism of the evolution of the genome. The automated generation of evolutionary gene clusters, creation of gene trees, determination of orthology and paralogy relationships, and the correlation of this information with gene annotations, expression information, and genomic context is an important resource to the scientific community.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Discussion<\/jats:title>\n            <jats:p>The PhIGs database currently contains 23 completely sequenced genomes of fungi and metazoans, containing 409,653 genes that have been grouped into 42,645 gene clusters. Each gene cluster is built such that the gene sequence distances are consistent with the known organismal relationships and in so doing, maximizing the likelihood for the clusters to represent truly orthologous genes. The PhIGs website contains tools that allow the study of genes within their phylogenetic framework through keyword searches on annotations, such as GO and InterPro assignments, and sequence similarity searches by BLAST and HMM. In addition to displaying the evolutionary relationships of the genes in each cluster, the website also allows users to view the relative physical positions of homologous genes in specified sets of genomes.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Summary<\/jats:title>\n            <jats:p>Accurate analyses of genes and genomes can only be done within their full phylogenetic context. The PhIGs database and corresponding website <jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" xlink:href=\"http:\/\/phigs.org\" ext-link-type=\"uri\">http:\/\/phigs.org<\/jats:ext-link> address this problem for the scientific community. Our goal is to expand the content as more genomes are sequenced and use this framework to incorporate more analyses.<\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/1471-2105-7-201","type":"journal-article","created":{"date-parts":[[2006,4,20]],"date-time":"2006-04-20T14:37:42Z","timestamp":1145543862000},"update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":65,"title":["A phylogenomic gene cluster resource: the Phylogenetically Inferred Groups (PhIGs) database"],"prefix":"10.1186","volume":"7","author":[{"given":"Paramvir S","family":"Dehal","sequence":"first","affiliation":[]},{"given":"Jeffrey L","family":"Boore","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2006,4,11]]},"reference":[{"key":"940_CR1","doi-asserted-by":"publisher","first-page":"163","DOI":"10.1101\/gr.8.3.163","volume":"8","author":"JA Eisen","year":"1998","unstructured":"Eisen JA: Phylogenomics: improving functional predictions for uncharacterized genes by evolutionary analysis. Genome Res 1998, 8: 163\u2013167.","journal-title":"Genome Res"},{"key":"940_CR2","doi-asserted-by":"publisher","first-page":"99","DOI":"10.2307\/2412448","volume":"19","author":"WM Fitch","year":"1970","unstructured":"Fitch WM: Distinguishing homologous from analogous proteins. Syst Zool 1970, 19: 99\u2013113.","journal-title":"Syst Zool"},{"key":"940_CR3","doi-asserted-by":"crossref","first-page":"1531","DOI":"10.1093\/genetics\/151.4.1531","volume":"151","author":"A Force","year":"1999","unstructured":"Force A, Lynch M, Pickett FB, Amores A, Yan YL, Postlethwait J: Preservation of duplicate genes by complementary, degenerative mutations. Genetics 1999, 151: 1531\u20131545.","journal-title":"Genetics"},{"key":"940_CR4","doi-asserted-by":"crossref","first-page":"459","DOI":"10.1093\/genetics\/154.1.459","volume":"154","author":"M Lynch","year":"2000","unstructured":"Lynch M, Force A: The probability of duplicate gene preservation by subfunctionalization. Genetics 2000, 154: 459\u2013473.","journal-title":"Genetics"},{"key":"940_CR5","doi-asserted-by":"publisher","first-page":"315","DOI":"10.1016\/S0968-0004(02)02094-7","volume":"27","author":"EA Gaucher","year":"2002","unstructured":"Gaucher EA, Gu X, Miyamoto MM, Benner SA: Predicting functional divergence in protein evolution by site-specific rate shifts. Trends Biochem Sci 2002, 27: 315\u2013321. 10.1016\/S0968-0004(02)02094-7","journal-title":"Trends Biochem Sci"},{"key":"940_CR6","doi-asserted-by":"publisher","first-page":"287","DOI":"10.1146\/annurev.bi.64.070195.001443","volume":"64","author":"RF Doolittle","year":"1995","unstructured":"Doolittle RF: The multiplicity of domains in proteins. Annu Rev Biochem 1995, 64: 287\u2013314. 10.1146\/annurev.bi.64.070195.001443","journal-title":"Annu Rev Biochem"},{"key":"940_CR7","doi-asserted-by":"publisher","first-page":"41","DOI":"10.1186\/1471-2105-4-41","volume":"4","author":"RL Tatusov","year":"2003","unstructured":"Tatusov RL, Fedorova ND, Jackson JD, Jacobs AR, Kiryutin B, Koonin EV, Krylov DM, Mazumder R, Mekhedov SL, Nikolskaya AN, Rao BS, Smirnov S, Sverdlov AV, Vasudevan S, Wolf YI, Yin JJ, Natale DA: The COG database: an updated version includes eukaryotes. BMC Bioinformatics 2003, 4: 41. 10.1186\/1471-2105-4-41","journal-title":"BMC Bioinformatics"},{"key":"940_CR8","doi-asserted-by":"publisher","first-page":"1575","DOI":"10.1093\/nar\/30.7.1575","volume":"30","author":"AJ Enright","year":"2002","unstructured":"Enright AJ, Van Dongen S, Ouzounis CA: An efficient algorithm for large-scale detection of protein families. Nucleic Acids Res 2002, 30: 1575\u20131584. 10.1093\/nar\/30.7.1575","journal-title":"Nucleic Acids Res"},{"key":"940_CR9","doi-asserted-by":"publisher","first-page":"5","DOI":"10.1016\/S1367-5931(02)00003-0","volume":"7","author":"J Liu","year":"2003","unstructured":"Liu J, Rost B: Domains, motifs and clusters in the protein universe. Curr Opin Chem Biol 2003, 7: 5\u201311. 10.1016\/S1367-5931(02)00003-0","journal-title":"Curr Opin Chem Biol"},{"key":"940_CR10","first-page":"D476","volume":"33 Database Iss","author":"KP O'Brien","year":"2005","unstructured":"O'Brien KP, Remm M, Sonnhammer EL: Inparanoid: a comprehensive database of eukaryotic orthologs. Nucleic Acids Res 2005, 33 Database Issue: D476\u201380.","journal-title":"Nucleic Acids Res"},{"key":"940_CR11","doi-asserted-by":"publisher","first-page":"2178","DOI":"10.1101\/gr.1224503","volume":"13","author":"L Li","year":"2003","unstructured":"Li L, Stoeckert CJJ, Roos DS: OrthoMCL: identification of ortholog groups for eukaryotic genomes. Genome Res 2003, 13: 2178\u20132189. 10.1101\/gr.1224503","journal-title":"Genome Res"},{"key":"940_CR12","doi-asserted-by":"publisher","first-page":"493","DOI":"10.1101\/gr.212002","volume":"12","author":"Y Lee","year":"2002","unstructured":"Lee Y, Sultana R, Pertea G, Cho J, Karamycheva S, Tsai J, Parvizi B, Cheung F, Antonescu V, White J, Holt I, Liang F, Quackenbush J: Cross-referencing eukaryotic genomes: TIGR Orthologous Gene Alignments (TOGA). Genome Res 2002, 12: 493\u2013502. 10.1101\/gr.212002","journal-title":"Genome Res"},{"key":"940_CR13","doi-asserted-by":"publisher","first-page":"37","DOI":"10.1093\/nar\/29.1.37","volume":"29","author":"R Apweiler","year":"2001","unstructured":"Apweiler R, Attwood TK, Bairoch A, Bateman A, Birney E, Biswas M, Bucher P, Cerutti L, Corpet F, Croning MD, Durbin R, Falquet L, Fleischmann W, Gouzy J, Hermjakob H, Hulo N, Jonassen I, Kahn D, Kanapin A, Karavidopoulou Y, Lopez R, Marx B, Mulder NJ, Oinn TM, Pagni M, Servant F, Sigrist CJ, Zdobnov EM: The InterPro database, an integrated documentation resource for protein families, domains and functional sites. Nucleic Acids Res 2001, 29: 37\u201340. 10.1093\/nar\/29.1.37","journal-title":"Nucleic Acids Res"},{"key":"940_CR14","doi-asserted-by":"publisher","first-page":"25","DOI":"10.1038\/75556","volume":"25","author":"M Ashburner","year":"2000","unstructured":"Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, Harris MA, Hill DP, Issel-Tarver L, Kasarskis A, Lewis S, Matese JC, Richardson JE, Ringwald M, Rubin GM, Sherlock G: Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet 2000, 25: 25\u201329. 10.1038\/75556","journal-title":"Nat Genet"},{"key":"940_CR15","doi-asserted-by":"publisher","first-page":"3389","DOI":"10.1093\/nar\/25.17.3389","volume":"25","author":"SF Altschul","year":"1997","unstructured":"Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 1997, 25: 3389\u20133402. 10.1093\/nar\/25.17.3389","journal-title":"Nucleic Acids Res"},{"key":"940_CR16","doi-asserted-by":"publisher","first-page":"4673","DOI":"10.1093\/nar\/22.22.4673","volume":"22","author":"JD Thompson","year":"1994","unstructured":"Thompson JD, Higgins DG, Gibson TJ: CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res 1994, 22: 4673\u20134680.","journal-title":"Nucleic Acids Res"},{"key":"940_CR17","volume-title":"PHYLIP (Phylogeny Inference Package) version 3.6","author":"J Felsenstein","year":"2004","unstructured":"Felsenstein J: PHYLIP (Phylogeny Inference Package) version 3.6. Department of Genome Sciences, University of Washington, Seattle, Distributed by the author; 2004."},{"key":"940_CR18","doi-asserted-by":"publisher","first-page":"502","DOI":"10.1093\/bioinformatics\/18.3.502","volume":"18","author":"HA Schmidt","year":"2002","unstructured":"Schmidt HA, Strimmer K, Vingron M, von Haeseler A: TREE-PUZZLE: maximum likelihood phylogenetic analysis using quartets and parallel computing. Bioinformatics 2002, 18: 502\u2013504. 10.1093\/bioinformatics\/18.3.502","journal-title":"Bioinformatics"},{"key":"940_CR19","doi-asserted-by":"publisher","first-page":"426","DOI":"10.1093\/bioinformatics\/btg430","volume":"20","author":"M Clamp","year":"2004","unstructured":"Clamp M, Cuff J, Searle SM, Barton GJ: The Jalview Java alignment editor. Bioinformatics 2004, 20: 426\u2013427. 10.1093\/bioinformatics\/btg430","journal-title":"Bioinformatics"},{"key":"940_CR20","doi-asserted-by":"publisher","first-page":"847","DOI":"10.1093\/bioinformatics\/17.9.847","volume":"17","author":"EM Zdobnov","year":"2001","unstructured":"Zdobnov EM, Apweiler R: InterProScan--an integration platform for the signature-recognition methods in InterPro. Bioinformatics 2001, 17: 847\u2013848. 10.1093\/bioinformatics\/17.9.847","journal-title":"Bioinformatics"},{"key":"940_CR21","unstructured":"Eddy S: http:\/\/hmmer.wustl.edu."},{"key":"940_CR22","doi-asserted-by":"publisher","first-page":"e314","DOI":"10.1371\/journal.pbio.0030314","volume":"3","author":"P Dehal","year":"2005","unstructured":"Dehal P, Boore JL: Two Rounds of Whole Genome Duplication in the Ancestral Vertebrate. PLoS Biology 2005, 3: e314. 10.1371\/journal.pbio.0030314","journal-title":"PLoS Biology"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-7-201.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,1]],"date-time":"2021-09-01T11:00:56Z","timestamp":1630494056000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-7-201"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2006,4,11]]},"references-count":22,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2006,12]]}},"alternative-id":["940"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-7-201","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2006,4,11]]},"assertion":[{"value":"20 April 2005","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"11 April 2006","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"11 April 2006","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"201"}}