{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,8,3]],"date-time":"2024-08-03T15:38:26Z","timestamp":1722699506484},"reference-count":18,"publisher":"Springer Science and Business Media LLC","issue":"1","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2008,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>Modern proteomes evolved by modification of pre-existing ones. It is extremely important to comparative biology that related proteins be identified as members of the same cognate group, since a characterized putative homolog could be used to find clues about the function of uncharacterized proteins from the same group. Typically, databases of related proteins focus on those from completely-sequenced genomes. Unfortunately, relatively few organisms have had their genomes fully sequenced; accordingly, many proteins are ignored by the currently available databases of cognate proteins, despite the high amount of important genes that are functionally described only for these incomplete proteomes.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Results<\/jats:title>\n            <jats:p>We have developed a method to cluster cognate proteins from multiple organisms beginning with only one sequence, through connectivity saturation with that Seed sequence. We show that the generated clusters are in agreement with some other approaches based on full genome comparison.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Conclusion<\/jats:title>\n            <jats:p>The method produced results that are as reliable as those produced by conventional clustering approaches. Generating clusters based only on individual proteins of interest is less time consuming than generating clusters for whole proteomes.<\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/1471-2105-9-141","type":"journal-article","created":{"date-parts":[[2008,3,5]],"date-time":"2008-03-05T21:24:22Z","timestamp":1204752262000},"update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":7,"title":["Clustering of cognate proteins among distinct proteomes derived from multiple links to a single seed sequence"],"prefix":"10.1186","volume":"9","author":[{"given":"Adriano","family":"Barbosa-Silva","sequence":"first","affiliation":[]},{"given":"Venkata P","family":"Satagopam","sequence":"additional","affiliation":[]},{"given":"Reinhard","family":"Schneider","sequence":"additional","affiliation":[]},{"given":"J Miguel","family":"Ortega","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2008,3,5]]},"reference":[{"issue":"12","key":"2126_CR1","doi-asserted-by":"publisher","first-page":"619","DOI":"10.1016\/S0168-9525(02)02793-2","volume":"18","author":"EL Sonnhammer","year":"2002","unstructured":"Sonnhammer EL, Koonin EV: Orthology, paralogy and proposed classification for paralog subtypes. Trends Genet 2002, 18(12):619\u2013620. 10.1016\/S0168-9525(02)02793-2","journal-title":"Trends Genet"},{"issue":"1","key":"2126_CR2","doi-asserted-by":"publisher","first-page":"22","DOI":"10.1093\/nar\/29.1.22","volume":"29","author":"RL Tatusov","year":"2001","unstructured":"Tatusov RL, Natale DA, Garkavtsev IV, Tatusova TA, Shankavaram UT, Rao BS, Kiryutin B, Galperin MY, Fedorova ND, Koonin EV: The COG database: new developments in phylogenetic classification of proteins from complete genomes. Nucleic Acids Res 2001, 29(1):22\u201328. 10.1093\/nar\/29.1.22","journal-title":"Nucleic Acids Res"},{"issue":"5","key":"2126_CR3","doi-asserted-by":"publisher","first-page":"1041","DOI":"10.1006\/jmbi.2000.5197","volume":"314","author":"M Remm","year":"2001","unstructured":"Remm M, Storm CE, Sonnhammer EL: Automatic clustering of orthologs and in-paralogs from pairwise species comparisons. J Mol Biol 2001, 314(5):1041\u20131052. 10.1006\/jmbi.2000.5197","journal-title":"J Mol Biol"},{"issue":"14","key":"2126_CR4","doi-asserted-by":"publisher","first-page":"e9","DOI":"10.1093\/bioinformatics\/btl213","volume":"22","author":"A Alexeyenko","year":"2006","unstructured":"Alexeyenko A, Tamas I, Liu G, Sonnhammer EL: Automatic clustering of orthologs and inparalogs shared by multiple proteomes. Bioinformatics 2006, 22(14):e9\u201315. 10.1093\/bioinformatics\/btl213","journal-title":"Bioinformatics"},{"issue":"Database issue","key":"2126_CR5","doi-asserted-by":"publisher","first-page":"D363","DOI":"10.1093\/nar\/gkj123","volume":"34","author":"F Chen","year":"2006","unstructured":"Chen F, Mackey AJ, Stoeckert CJ Jr., Roos DS: OrthoMCL-DB: querying a comprehensive multi-species collection of ortholog groups. Nucleic Acids Res 2006, 34(Database issue):D363\u20138. 10.1093\/nar\/gkj123","journal-title":"Nucleic Acids Res"},{"key":"2126_CR6","doi-asserted-by":"publisher","first-page":"41","DOI":"10.1186\/1471-2105-4-41","volume":"4","author":"RL Tatusov","year":"2003","unstructured":"Tatusov RL, Fedorova ND, Jackson JD, Jacobs AR, Kiryutin B, Koonin EV, Krylov DM, Mazumder R, Mekhedov SL, Nikolskaya AN, Rao BS, Smirnov S, Sverdlov AV, Vasudevan S, Wolf YI, Yin JJ, Natale DA: The COG database: an updated version includes eukaryotes. BMC Bioinformatics 2003, 4: 41. 10.1186\/1471-2105-4-41","journal-title":"BMC Bioinformatics"},{"key":"2126_CR7","unstructured":"MySQL[http:\/\/www.mysql.com]"},{"issue":"17","key":"2126_CR8","doi-asserted-by":"publisher","first-page":"3389","DOI":"10.1093\/nar\/25.17.3389","volume":"25","author":"SF Altschul","year":"1997","unstructured":"Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 1997, 25(17):3389\u20133402. 10.1093\/nar\/25.17.3389","journal-title":"Nucleic Acids Res"},{"key":"2126_CR9","unstructured":"PHP[http:\/\/www.php.net]"},{"issue":"11","key":"2126_CR10","doi-asserted-by":"publisher","first-page":"1679","DOI":"10.1101\/gr.GR-1491R","volume":"10","author":"M Remm","year":"2000","unstructured":"Remm M, Sonnhammer E: Classification of transmembrane protein families in the Caenorhabditis elegans genome and identification of human orthologs. Genome Research 2000, 10(11):1679\u20131689. 10.1101\/gr.GR-1491R","journal-title":"Genome Research"},{"key":"2126_CR11","doi-asserted-by":"publisher","first-page":"D476","DOI":"10.1093\/nar\/gki107","volume":"33","author":"KP O'Brien","year":"2005","unstructured":"O'Brien KP, Remm M, Sonnhammer ELL: Inparanoid: a comprehensive database of eukaryotic orthologs. Nucleic Acids Research 2005, 33: D476-D480. 10.1093\/nar\/gki107","journal-title":"Nucleic Acids Research"},{"key":"2126_CR12","doi-asserted-by":"publisher","first-page":"D115","DOI":"10.1093\/nar\/gkh131","volume":"32","author":"R Apweiler","year":"2004","unstructured":"Apweiler R, Bairoch A, Wu CH, Barker WC, Boeckmann B, Ferro S, Gasteiger E, Huang HZ, Lopez R, Magrane M, Martin MJ, Natale DA, O'Donovan C, Redaschi N, Yeh LSL: UniProt: the Universal Protein knowledgebase. Nucleic Acids Research 2004, 32: D115-D119. 10.1093\/nar\/gkh131","journal-title":"Nucleic Acids Research"},{"issue":"1","key":"2126_CR13","first-page":"5","volume":"4","author":"E Camon","year":"2004","unstructured":"Camon E, Barrell D, Lee V, Dimmer E, Apweiler R: The Gene Ontology Annotation (GOA) Database--an integrated resource of GO annotations to the UniProt Knowledgebase. In Silico Biol 2004, 4(1):5\u20136.","journal-title":"In Silico Biol"},{"issue":"9","key":"2126_CR14","doi-asserted-by":"publisher","first-page":"2178","DOI":"10.1101\/gr.1224503","volume":"13","author":"L Li","year":"2003","unstructured":"Li L, Stoeckert CJ Jr., Roos DS: OrthoMCL: identification of ortholog groups for eukaryotic genomes. Genome Res 2003, 13(9):2178\u20132189. 10.1101\/gr.1224503","journal-title":"Genome Res"},{"issue":"19","key":"2126_CR15","doi-asserted-by":"publisher","first-page":"3787","DOI":"10.1093\/bioinformatics\/bti430","volume":"21","author":"X Mao","year":"2005","unstructured":"Mao X, Cai T, Olyarchuk JG, Wei L: Automated genome annotation and pathway identification using the KEGG Orthology (KO) as a controlled vocabulary. Bioinformatics 2005, 21(19):3787\u20133793. 10.1093\/bioinformatics\/bti430","journal-title":"Bioinformatics"},{"issue":"3","key":"2126_CR16","doi-asserted-by":"publisher","first-page":"493","DOI":"10.1101\/gr.212002","volume":"12","author":"Y Lee","year":"2002","unstructured":"Lee Y, Sultana R, Pertea G, Cho J, Karamycheva S, Tsai J, Parvizi B, Cheung F, Antonescu V, White J, Holt I, Liang F, Quackenbush J: Cross-referencing eukaryotic genomes: TIGR Orthologous Gene Alignments (TOGA). Genome Res 2002, 12(3):493\u2013502. 10.1101\/gr.212002","journal-title":"Genome Res"},{"issue":"2","key":"2126_CR17","first-page":"32","volume":"16","author":"O Miotto","year":"2005","unstructured":"Miotto O, Tan TW, Brusic V: Supporting the curation of biological databases with reusable text mining. Genome Inform 2005, 16(2):32\u201344.","journal-title":"Genome Inform"},{"issue":"20","key":"2126_CR18","doi-asserted-by":"publisher","first-page":"4339","DOI":"10.1093\/nar\/gkf564","volume":"30","author":"N Sankar","year":"2002","unstructured":"Sankar N, Machado J, Abdulla P, Hilliker AJ, Coe IR: Comparative genomic analysis of equilibrative nucleoside transporters suggests conserved protein structure despite limited sequence identity. Nucleic Acids Res 2002, 30(20):4339\u20134350. 10.1093\/nar\/gkf564","journal-title":"Nucleic Acids Res"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-9-141.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,1]],"date-time":"2021-09-01T03:25:47Z","timestamp":1630466747000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-9-141"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2008,3,5]]},"references-count":18,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2008,12]]}},"alternative-id":["2126"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-9-141","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2008,3,5]]},"assertion":[{"value":"31 October 2007","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"5 March 2008","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"5 March 2008","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"141"}}