{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,7]],"date-time":"2026-02-07T22:38:08Z","timestamp":1770503888872,"version":"3.49.0"},"reference-count":37,"publisher":"Springer Science and Business Media LLC","issue":"1","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2006,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Background<\/jats:title><jats:p>The rapidly increasing speed with which genome sequence data can be generated will be accompanied by an exponential increase in the number of sequenced eukaryotes. With the increasing number of sequenced eukaryotic genomes comes a need for bioinformatic techniques to aid in functional annotation. Ideally, genome context based techniques such as proximity, fusion, and phylogenetic profiling, which have been so successful in prokaryotes, could be utilized in eukaryotes. Here we explore the application of phylogenetic profiling, a method that exploits the evolutionary co-occurrence of genes in the assignment of functional linkages, to eukaryotic genomes.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>In order to evaluate the performance of phylogenetic profiling in eukaryotes, we assessed the relative performance of commonly used profile construction techniques and genome compositions in predicting functional linkages in both prokaryotic and eukaryotic organisms. When predicting linkages in<jats:italic>E. coli<\/jats:italic>with a prokaryotic profile, the use of continuous values constructed from transformed BLAST bit-scores performed better than profiles composed of discretized E-values; the use of discretized E-values resulted in more accurate linkages when using<jats:italic>S. cerevisiae<\/jats:italic>as the query organism. Extending this analysis by incorporating several eukaryotic genomes in profiles containing a majority of prokaryotes resulted in similar overall accuracy, but with a surprising reduction in pathway diversity among the most significant linkages. Furthermore, the application of phylogenetic profiling using profiles composed of only eukaryotes resulted in the loss of the strong correlation between common KEGG pathway membership and profile similarity score. Profile construction methods, orthology definitions, ontology and domain complexity were explored as possible sources of the poor performance of eukaryotic profiles, but with no improvement in results.<\/jats:p><\/jats:sec><jats:sec><jats:title>Conclusion<\/jats:title><jats:p>Given the current set of completely sequenced eukaryotic organisms, phylogenetic profiling using profiles generated from any of the commonly used techniques was found to yield extremely poor results. These findings imply genome-specific requirements for constructing functionally relevant phylogenetic profiles, and suggest that differences in the evolutionary history between different kingdoms might generally limit the usefulness of phylogenetic profiling in eukaryotes.<\/jats:p><\/jats:sec>","DOI":"10.1186\/1471-2105-7-420","type":"journal-article","created":{"date-parts":[[2006,9,27]],"date-time":"2006-09-27T18:25:47Z","timestamp":1159381547000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":41,"title":["Comparative assessment of performance and genome dependence among phylogenetic profiling methods"],"prefix":"10.1186","volume":"7","author":[{"given":"Evan S","family":"Snitkin","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Adam M","family":"Gustafson","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Joseph","family":"Mellor","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jie","family":"Wu","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Charles","family":"DeLisi","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2006,9,27]]},"reference":[{"key":"1159_CR1","doi-asserted-by":"publisher","first-page":"324","DOI":"10.1016\/S0968-0004(98)01274-2","volume":"23","author":"T Dandekar","year":"1998","unstructured":"Dandekar T, Snel B, Huynen M, Bork P: Conservation of gene order: a fingerprint of proteins that physically interact. Trends Biochem Sci 1998, 23: 324\u2013328. 10.1016\/S0968-0004(98)01274-2","journal-title":"Trends Biochem Sci"},{"key":"1159_CR2","doi-asserted-by":"publisher","first-page":"86","DOI":"10.1038\/47056","volume":"402","author":"AJ Enright","year":"1999","unstructured":"Enright AJ, Iliopoulos I, Kyrpides NC, Ouzounis CA: Protein interaction maps for complete genomes based on gene fusion events. Nature 1999, 402: 86\u201390. 10.1038\/47056","journal-title":"Nature"},{"key":"1159_CR3","doi-asserted-by":"publisher","first-page":"7940","DOI":"10.1073\/pnas.141236298","volume":"98","author":"I Yanai","year":"2001","unstructured":"Yanai I, Derti A, DeLisi C: Genes linked by fusion events are generally of the same functional category: a systematic analysis of 30 microbial genomes. Proc Natl Acad Sci U S A 2001, 98: 7940\u20137945. 10.1073\/pnas.141236298","journal-title":"Proc Natl Acad Sci U S A"},{"key":"1159_CR4","doi-asserted-by":"publisher","first-page":"176","DOI":"10.1016\/S0168-9525(01)02621-X","volume":"18","author":"I Yanai","year":"2002","unstructured":"Yanai I, Mellor JC, DeLisi C: Identifying functional links between genes using conserved chromosomal proximity. Trends Genet 2002, 18: 176\u2013179. 10.1016\/S0168-9525(01)02621-X","journal-title":"Trends Genet"},{"key":"1159_CR5","doi-asserted-by":"publisher","first-page":"306","DOI":"10.1093\/nar\/30.1.306","volume":"30","author":"JC Mellor","year":"2002","unstructured":"Mellor JC, Yanai I, Clodfelter KH, Mintseris J, DeLisi C: Predictome: a database of putative functional links between proteins. Nucleic Acids Res 2002, 30: 306\u2013309. 10.1093\/nar\/30.1.306","journal-title":"Nucleic Acids Res"},{"key":"1159_CR6","doi-asserted-by":"publisher","first-page":"923","DOI":"10.1093\/bioinformatics\/btg118","volume":"19","author":"SK Ng","year":"2003","unstructured":"Ng SK, Zhang Z, Tan SH: Integrative approach for computationally inferring protein domain interactions. Bioinformatics 2003, 19: 923\u2013929. 10.1093\/bioinformatics\/btg118","journal-title":"Bioinformatics"},{"key":"1159_CR7","doi-asserted-by":"publisher","first-page":"199","DOI":"10.1089\/omi.1.1998.3.177","volume":"3","author":"T Gaasterland","year":"1998","unstructured":"Gaasterland T, Ragan MA: Microbial genescapes: phyletic and functional patterns of ORF distribution among prokaryotes. Microb Comp Genomics 1998, 3: 199\u2013217.","journal-title":"Microb Comp Genomics"},{"key":"1159_CR8","doi-asserted-by":"publisher","first-page":"4285","DOI":"10.1073\/pnas.96.8.4285","volume":"96","author":"M Pellegrini","year":"1999","unstructured":"Pellegrini M, Marcotte EM, Thompson MJ, Eisenberg D, Yeates TO: Assigning protein functions by comparative genome analysis: protein phylogenetic profiles. Proc Natl Acad Sci U S A 1999, 96: 4285\u20134288. 10.1073\/pnas.96.8.4285","journal-title":"Proc Natl Acad Sci U S A"},{"key":"1159_CR9","doi-asserted-by":"publisher","first-page":"1055","DOI":"10.1038\/nbt861","volume":"21","author":"SV Date","year":"2003","unstructured":"Date SV, Marcotte EM: Discovery of uncharacterized cellular systems by genome-wide analysis of functional linkages. Nat Biotechnol 2003, 21: 1055\u20131062. 10.1038\/nbt861","journal-title":"Nat Biotechnol"},{"key":"1159_CR10","doi-asserted-by":"publisher","first-page":"1524","DOI":"10.1093\/bioinformatics\/btg187","volume":"19","author":"J Wu","year":"2003","unstructured":"Wu J, Kasif S, DeLisi C: Identification of functional links between genes using phylogenetic profiles. Bioinformatics 2003, 19: 1524\u20131530. 10.1093\/bioinformatics\/btg187","journal-title":"Bioinformatics"},{"key":"1159_CR11","doi-asserted-by":"publisher","first-page":"3409","DOI":"10.1093\/bioinformatics\/bti532","volume":"21","author":"J Sun","year":"2005","unstructured":"Sun J, Xu J, Liu Z, Liu Q, Zhao A, Shi T, Li Y: Refined phylogenetic profiles method for predicting protein-protein interactions. Bioinformatics 2005, 21: 3409\u20133415. 10.1093\/bioinformatics\/bti532","journal-title":"Bioinformatics"},{"key":"1159_CR12","doi-asserted-by":"publisher","first-page":"12115","DOI":"10.1073\/pnas.220399497","volume":"97","author":"EM Marcotte","year":"2000","unstructured":"Marcotte EM, Xenarios I, van Der Bliek AM, Eisenberg D: Localizing proteins in the cell from their phylogenetic profiles. Proc Natl Acad Sci U S A 2000, 97: 12115\u201312120. 10.1073\/pnas.220399497","journal-title":"Proc Natl Acad Sci U S A"},{"key":"1159_CR13","doi-asserted-by":"publisher","first-page":"i105","DOI":"10.1093\/bioinformatics\/btg1013","volume":"19 Suppl 1","author":"F Enault","year":"2003","unstructured":"Enault F, Suhre K, Abergel C, Poirot O, Claverie JM: Annotation of bacterial genomes using improved phylogenomic profiles. Bioinformatics 2003, 19 Suppl 1: i105\u20137. 10.1093\/bioinformatics\/btg1013","journal-title":"Bioinformatics"},{"key":"1159_CR14","doi-asserted-by":"publisher","first-page":"299","DOI":"10.1038\/nrg1319","volume":"5","author":"LD Hurst","year":"2004","unstructured":"Hurst LD, Pal C, Lercher MJ: The evolutionary dynamics of eukaryotic gene order. Nat Rev Genet 2004, 5: 299\u2013310. 10.1038\/nrg1319","journal-title":"Nat Rev Genet"},{"key":"1159_CR15","first-page":"93","volume":"1","author":"CJ Marcotte","year":"2002","unstructured":"Marcotte CJ, Marcotte EM: Predicting functional linkages from gene fusions with confidence. Appl Bioinformatics 2002, 1: 93\u2013100.","journal-title":"Appl Bioinformatics"},{"key":"1159_CR16","doi-asserted-by":"publisher","first-page":"41","DOI":"10.1186\/1471-2105-4-41","volume":"4","author":"RL Tatusov","year":"2003","unstructured":"Tatusov RL, Fedorova ND, Jackson JD, Jacobs AR, Kiryutin B, Koonin EV, Krylov DM, Mazumder R, Mekhedov SL, Nikolskaya AN, Rao BS, Smirnov S, Sverdlov AV, Vasudevan S, Wolf YI, Yin JJ, Natale DA: The COG database: an updated version includes eukaryotes. BMC Bioinformatics 2003, 4: 41. 10.1186\/1471-2105-4-41","journal-title":"BMC Bioinformatics"},{"key":"1159_CR17","doi-asserted-by":"publisher","first-page":"403","DOI":"10.1016\/S0022-2836(05)80360-2","volume":"215","author":"SF Altschul","year":"1990","unstructured":"Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ: Basic local alignment search tool. J Mol Biol 1990, 215: 403\u2013410. 10.1006\/jmbi.1990.9999","journal-title":"J Mol Biol"},{"key":"1159_CR18","doi-asserted-by":"publisher","first-page":"D258","DOI":"10.1093\/nar\/gkh066","volume":"32","author":"MA Harris","year":"2004","unstructured":"Harris MA, Clark J, Ireland A, Lomax J, Ashburner M, Foulger R, Eilbeck K, Lewis S, Marshall B, Mungall C, Richter J, Rubin GM, Blake JA, Bult C, Dolan M, Drabkin H, Eppig JT, Hill DP, Ni L, Ringwald M, Balakrishnan R, Cherry JM, Christie KR, Costanzo MC, Dwight SS, Engel S, Fisk DG, Hirschman JE, Hong EL, Nash RS, Sethuraman A, Theesfeld CL, Botstein D, Dolinski K, Feierbach B, Berardini T, Mundodi S, Rhee SY, Apweiler R, Barrell D, Camon E, Dimmer E, Lee V, Chisholm R, Gaudet P, Kibbe W, Kishore R, Schwarz EM, Sternberg P, Gwinn M, Hannick L, Wortman J, Berriman M, Wood V, de la Cruz N, Tonellato P, Jaiswal P, Seigfried T, White R: The Gene Ontology (GO) database and informatics resource. Nucleic Acids Res 2004, 32: D258\u201361. 10.1093\/nar\/gkh066","journal-title":"Nucleic Acids Res"},{"key":"1159_CR19","doi-asserted-by":"publisher","first-page":"1046","DOI":"10.1038\/35082561","volume":"411","author":"AE Hirsh","year":"2001","unstructured":"Hirsh AE, Fraser HB: Protein dispensability and rate of evolution. Nature 2001, 411: 1046\u20131049. 10.1038\/35082561","journal-title":"Nature"},{"key":"1159_CR20","doi-asserted-by":"publisher","first-page":"962","DOI":"10.1101\/gr.87702. Article published online before print in May 2002","volume":"12","author":"IK Jordan","year":"2002","unstructured":"Jordan IK, Rogozin IB, Wolf YI, Koonin EV: Essential genes are more evolutionarily conserved than are nonessential genes in bacteria. Genome Res 2002, 12: 962\u2013968. 10.1101\/gr.87702. Article published online before print in May 2002","journal-title":"Genome Res"},{"key":"1159_CR21","doi-asserted-by":"publisher","first-page":"D138","DOI":"10.1093\/nar\/gkh121","volume":"32","author":"A Bateman","year":"2004","unstructured":"Bateman A, Coin L, Durbin R, Finn RD, Hollich V, Griffiths-Jones S, Khanna A, Marshall M, Moxon S, Sonnhammer EL, Studholme DJ, Yeats C, Eddy SR: The Pfam protein families database. Nucleic Acids Res 2004, 32: D138\u201341. 10.1093\/nar\/gkh121","journal-title":"Nucleic Acids Res"},{"key":"1159_CR22","doi-asserted-by":"publisher","first-page":"1710","DOI":"10.1093\/bioinformatics\/btg213","volume":"19","author":"DP Wall","year":"2003","unstructured":"Wall DP, Fraser HB, Hirsh AE: Detecting putative orthologs. Bioinformatics 2003, 19: 1710\u20131711. 10.1093\/bioinformatics\/btg213","journal-title":"Bioinformatics"},{"key":"1159_CR23","doi-asserted-by":"crossref","first-page":"550","DOI":"10.1101\/gr.9.6.550","volume":"9","author":"F Tekaia","year":"1999","unstructured":"Tekaia F, Lazcano A, Dujon B: The genomic tree as revealed from whole proteome comparisons. Genome Res 1999, 9: 550\u2013557.","journal-title":"Genome Res"},{"key":"1159_CR24","doi-asserted-by":"publisher","first-page":"108","DOI":"10.1038\/5052","volume":"21","author":"B Snel","year":"1999","unstructured":"Snel B, Bork P, Huynen MA: Genome phylogeny based on gene content. Nat Genet 1999, 21: 108\u2013110. 10.1038\/5052","journal-title":"Nat Genet"},{"key":"1159_CR25","doi-asserted-by":"publisher","first-page":"4218","DOI":"10.1093\/nar\/27.21.4218","volume":"27","author":"ST Fitz-Gibbon","year":"1999","unstructured":"Fitz-Gibbon ST, House CH: Whole genome-based phylogenetic analysis of free-living microorganisms. Nucleic Acids Res 1999, 27: 4218\u20134222. 10.1093\/nar\/27.21.4218","journal-title":"Nucleic Acids Res"},{"key":"1159_CR26","doi-asserted-by":"publisher","first-page":"e3","DOI":"10.1371\/journal.pcbi.0010003","volume":"1","author":"D Barker","year":"2005","unstructured":"Barker D, Pagel M: Predicting functional gene links from phylogenetic-statistical analyses of whole genomes. PLoS Comput Biol 2005, 1: e3. 10.1371\/journal.pcbi.0010003","journal-title":"PLoS Comput Biol"},{"key":"1159_CR27","doi-asserted-by":"publisher","first-page":"S83","DOI":"10.1093\/bioinformatics\/17.suppl_1.S83","volume":"17 Suppl 1","author":"G Apic","year":"2001","unstructured":"Apic G, Gough J, Teichmann SA: An insight into domain combinations. Bioinformatics 2001, 17 Suppl 1: S83\u20139.","journal-title":"Bioinformatics"},{"key":"1159_CR28","doi-asserted-by":"publisher","first-page":"1331","DOI":"10.1016\/j.jmb.2004.10.019","volume":"344","author":"P Pagel","year":"2004","unstructured":"Pagel P, Wong P, Frishman D: A domain interaction map based on phylogenetic profiling. J Mol Biol 2004, 344: 1331\u20131346. 10.1016\/j.jmb.2004.10.019","journal-title":"J Mol Biol"},{"key":"1159_CR29","doi-asserted-by":"publisher","first-page":"561","DOI":"10.1111\/j.1745-7270.2005.00075.x","volume":"37","author":"SY Shi","year":"2005","unstructured":"Shi SY, Cai XH, Ding DF: Identification and categorization of horizontally transferred genes in prokaryotic genomes. Acta Biochim Biophys Sin (Shanghai) 2005, 37: 561\u2013566. 10.1111\/j.1745-7270.2005.00075.x","journal-title":"Acta Biochim Biophys Sin (Shanghai)"},{"key":"1159_CR30","doi-asserted-by":"publisher","first-page":"475","DOI":"10.1086\/423903","volume":"75","author":"AP Chiang","year":"2004","unstructured":"Chiang AP, Nishimura D, Searby C, Elbedour K, Carmi R, Ferguson AL, Secrist J, Braun T, Casavant T, Stone EM, Sheffield VC: Comparative genomic analysis identifies an ADP-ribosylation factor-like gene as the cause of Bardet-Biedl syndrome (BBS3). Am J Hum Genet 2004, 75: 475\u2013484. 10.1086\/423903","journal-title":"Am J Hum Genet"},{"key":"1159_CR31","doi-asserted-by":"publisher","first-page":"541","DOI":"10.1016\/S0092-8674(04)00450-7","volume":"117","author":"JB Li","year":"2004","unstructured":"Li JB, Gerdes JM, Haycraft CJ, Fan Y, Teslovich TM, May-Simera H, Li H, Blacque OE, Li L, Leitch CC, Lewis RA, Green JS, Parfrey PS, Leroux MR, Davidson WS, Beales PL, Guay-Woodford LM, Yoder BK, Stormo GD, Katsanis N, Dutcher SK: Comparative genomics identifies a flagellar and basal body proteome that includes the BBS5 human disease gene. Cell 2004, 117: 541\u2013552. 10.1016\/S0092-8674(04)00450-7","journal-title":"Cell"},{"key":"1159_CR32","doi-asserted-by":"publisher","first-page":"D277","DOI":"10.1093\/nar\/gkh063","volume":"32","author":"M Kanehisa","year":"2004","unstructured":"Kanehisa M, Goto S, Kawashima S, Okuno Y, Hattori M: The KEGG resource for deciphering the genome. Nucleic Acids Res 2004, 32: D277\u201380. 10.1093\/nar\/gkh063","journal-title":"Nucleic Acids Res"},{"key":"1159_CR33","unstructured":"Tetrahymena thermophila Genome Project[http:\/\/www.tigr.org\/tdb\/e2k1\/ttg\/]"},{"key":"1159_CR34","unstructured":"Plasmodium Vivax Genome Project[http:\/\/www.tigr.org\/tdb\/e2k1\/pva1\/]"},{"key":"1159_CR35","doi-asserted-by":"publisher","first-page":"859","DOI":"10.1038\/nature01554","volume":"422","author":"JE Galagan","year":"2003","unstructured":"Galagan JE, Calvo SE, Borkovich KA, Selker EU, Read ND, Jaffe D, FitzHugh W, Ma LJ, Smirnov S, Purcell S, Rehman B, Elkins T, Engels R, Wang S, Nielsen CB, Butler J, Endrizzi M, Qui D, Ianakiev P, Bell-Pedersen D, Nelson MA, Werner-Washburne M, Selitrennikoff CP, Kinsey JA, Braun EL, Zelter A, Schulte U, Kothe GO, Jedd G, Mewes W, Staben C, Marcotte E, Greenberg D, Roy A, Foley K, Naylor J, Stange-Thomann N, Barrett R, Gnerre S, Kamal M, Kamvysselis M, Mauceli E, Bielke C, Rudd S, Frishman D, Krystofova S, Rasmussen C, Metzenberg RL, Perkins DD, Kroken S, Cogoni C, Macino G, Catcheside D, Li W, Pratt RJ, Osmani SA, DeSouza CP, Glass L, Orbach MJ, Berglund JA, Voelker R, Yarden O, Plamann M, Seiler S, Dunlap J, Radford A, Aramayo R, Natvig DO, Alex LA, Mannhaupt G, Ebbole DJ, Freitag M, Paulsen I, Sachs MS, Lander ES, Nusbaum C, Birren B: The genome sequence of the filamentous fungus Neurospora crassa. Nature 2003, 422: 859\u2013868. 10.1038\/nature01554","journal-title":"Nature"},{"key":"1159_CR36","doi-asserted-by":"publisher","first-page":"271","DOI":"10.1111\/j.1574-6968.2000.tb09242.x","volume":"189","author":"AG McArthur","year":"2000","unstructured":"McArthur AG, Morrison HG, Nixon JE, Passamaneck NQ, Kim U, Hinkle G, Crocker MK, Holder ME, Farr R, Reich CI, Olsen GE, Aley SB, Adam RD, Gillin FD, Sogin ML: The Giardia genome project database. FEMS Microbiol Lett 2000, 189: 271\u2013273. 10.1111\/j.1574-6968.2000.tb09242.x","journal-title":"FEMS Microbiol Lett"},{"key":"1159_CR37","unstructured":"Supplemental Data[http:\/\/biowulf.bu.edu\/2006_optimized_profile_supplement\/]"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-7-420.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,5,9]],"date-time":"2023-05-09T00:24:40Z","timestamp":1683591880000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-7-420"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2006,9,27]]},"references-count":37,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2006,12]]}},"alternative-id":["1159"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-7-420","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2006,9,27]]},"assertion":[{"value":"8 May 2006","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"27 September 2006","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"27 September 2006","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"420"}}