{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2023,2,14]],"date-time":"2023-02-14T16:34:16Z","timestamp":1676392456038},"reference-count":16,"publisher":"Springer Science and Business Media LLC","issue":"1","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2010,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>We propose a method for deriving enzymatic signatures from short read metagenomic data of unknown species. The short read data are converted to six pseudo-peptide candidates. We search for occurrences of Specific Peptides (SPs) on the latter. SPs are peptides that are indicative of enzymatic function as defined by the Enzyme Commission (EC) nomenclature. The number of SP hits on an ensemble of short reads is counted and then converted to estimates of numbers of enzymatic genes associated with different EC categories in the studied metagenome. Relative amounts of different EC categories define the enzymatic spectrum, without the need to perform genomic assemblies of short reads.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Results<\/jats:title>\n            <jats:p>The method is developed and tested on 22 bacteria for which there exist many EC annotations in Uniprot. Enzymatic signatures are derived for 3 metagenomes, and their functional profiles are explored.<\/jats:p>\n            <jats:p>We extend the SP methodology to taxon-specific SPs (TSPs), allowing us to estimate taxonomic features of metagenomic data from short reads. Using recent Swiss-Prot data we obtain TSPs for different phyla of bacteria, and different classes of proteobacteria. These allow us to analyze the major taxonomic content of 4 different metagenomic data-sets.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Conclusions<\/jats:title>\n            <jats:p>The SP methodology can be successfully extended to applications on short read genomic and metagenomic data. This leads to direct derivation of enzymatic signatures from raw short reads. Furthermore, by employing TSPs, one obtains valuable taxonomic information.<\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/1471-2105-11-390","type":"journal-article","created":{"date-parts":[[2010,7,22]],"date-time":"2010-07-22T18:16:39Z","timestamp":1279822599000},"update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":6,"title":["Deriving enzymatic and taxonomic signatures of metagenomes from short read data"],"prefix":"10.1186","volume":"11","author":[{"given":"Uri","family":"Weingart","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Erez","family":"Persi","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Uri","family":"Gophna","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"David","family":"Horn","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2010,7,22]]},"reference":[{"key":"3847_CR1","doi-asserted-by":"publisher","first-page":"554","DOI":"10.1126\/science.1107851","volume":"308","author":"SG Tringe","year":"2005","unstructured":"Tringe SG, Von Mering C, Kobayashi A, Salamov AA, Chen K, Chang HW, Podar M, Short JM, Mathur EJ, Detter JC, Bork P, Hugenholtz P, Rubin EM: Comparative metagenomics of microbial communities. Science 2005, 308: 554\u2013557. 10.1126\/science.1107851","journal-title":"Science"},{"issue":"2","key":"3847_CR2","doi-asserted-by":"publisher","first-page":"e1000667","DOI":"10.1371\/journal.pcbi.1000667","volume":"6","author":"JC Wooley","year":"2010","unstructured":"Wooley JC, Godzik A, Friedberg I: A Primer on Metagenomics. PLOS Comp Bio 2010, 6(2):e1000667. 10.1371\/journal.pcbi.1000667","journal-title":"PLOS Comp Bio"},{"key":"3847_CR3","doi-asserted-by":"publisher","first-page":"629","DOI":"10.1038\/nature06810","volume":"452","author":"EA Dinsdale","year":"2008","unstructured":"Dinsdale EA, Edwards RA, Hall D, Angly F, Breitbart M, Brulc JM, Furlan M, Desnues C, Haynes M, Li L, McDaniel L, Moran MA, Nelson KE, Nilsson C, Olson R, Paul J, Brito BR, Ruan Y, Swan BK, Stevens R, Valentine DL, Thurber RV, Wegley L, White BA, Rohwer F: Functional metagenomic profiling of nine biomes. Nature 2008, 452: 629\u2013632. 10.1038\/nature06810","journal-title":"Nature"},{"key":"3847_CR4","doi-asserted-by":"publisher","first-page":"525","DOI":"10.1146\/annurev.genet.38.072902.091216","volume":"38","author":"CS Riesenfeld","year":"2004","unstructured":"Riesenfeld CS, Schloss PD, Handelsman J: Metagenomics: genomic analysis of microbial communities. Annu Rev Genet 2004, 38: 525\u201352. 10.1146\/annurev.genet.38.072902.091216","journal-title":"Annu Rev Genet"},{"key":"3847_CR5","doi-asserted-by":"publisher","first-page":"693","DOI":"10.1038\/nrmicro1935","volume":"6","author":"J Raes","year":"2008","unstructured":"Raes J, Bork P: Molecular eco-systems biology: towards an understanding of community function. Nature Reviews Microbiology 2008, 6: 693\u2013699. 10.1038\/nrmicro1935","journal-title":"Nature Reviews Microbiology"},{"key":"3847_CR6","doi-asserted-by":"crossref","unstructured":"Cole JR, Wang Q, Cardenas E, Fish J, Chai B, Farris RJ, Kulam-Syed-Mohideen AS, McGarrell DM, Marsh T, Garrity GM, Tiedje JM: The Ribosomal Database Project: improved alignments and new tools for rRNA analysis. Nucleic Acids Res 2009, (37 Database):D141-D145. 10.1093\/nar\/gkn879","DOI":"10.1093\/nar\/gkn879"},{"key":"3847_CR7","doi-asserted-by":"publisher","first-page":"6773","DOI":"10.1128\/AEM.00474-06","volume":"72","author":"PD Schloss","year":"2006","unstructured":"Schloss PD, Handelsman J: Introducing SONS, a Tool for Operational Taxonomic Unit-Based Comparisons of Microbial Community Memberships and Structures. App Env Microb 2006, 72: 6773\u20136779. 10.1128\/AEM.00474-06","journal-title":"App Env Microb"},{"key":"3847_CR8","doi-asserted-by":"publisher","first-page":"2230","DOI":"10.1093\/nar\/gkn038","volume":"36","author":"L Krause","year":"2008","unstructured":"Krause L, Diaz NN, Goesmann A, Kelley S, Nattkemper TW, Rohwer F, Edwards RA, Stoye J: Phylogenetic classification of short environmental DNA fragments. Nucleic Acids Research 2008, 36: 2230\u20132239. 10.1093\/nar\/gkn038","journal-title":"Nucleic Acids Research"},{"issue":"8","key":"3847_CR9","doi-asserted-by":"publisher","first-page":"e167","DOI":"10.1371\/journal.pcbi.0030167","volume":"3","author":"V Kunik","year":"2007","unstructured":"Kunik V, Meroz Y, Solan Z, Sandbank B, Weingart U, Ruppin E, Horn D: Functional representation of enzymes by specific peptides. PLOS Comp Biol 2007, 3(8):e167. 10.1371\/journal.pcbi.0030167","journal-title":"PLOS Comp Biol"},{"key":"3847_CR10","doi-asserted-by":"publisher","first-page":"366","DOI":"10.1016\/S0959-440X(96)80057-1","volume":"6","author":"P Bork","year":"1996","unstructured":"Bork P, Koonin EV: Protein sequence motifs. Curr Op Structural Biology 1996, 6: 366\u2013376. 10.1016\/S0959-440X(96)80057-1","journal-title":"Curr Op Structural Biology"},{"key":"3847_CR11","doi-asserted-by":"publisher","first-page":"217","DOI":"10.1093\/nar\/25.1.217","volume":"25","author":"A Bairoch","year":"1997","unstructured":"Bairoch A, Bucher P, Hofmann K: Prosite. Nuc Acids Res 1997, 25: 217\u2013221. 10.1093\/nar\/25.1.217","journal-title":"Nuc Acids Res"},{"key":"3847_CR12","doi-asserted-by":"publisher","first-page":"11629","DOI":"10.1073\/pnas.0409746102","volume":"102","author":"Z Solan","year":"2005","unstructured":"Solan Z, Horn D, Ruppin E, Edelman S: Unsupervised learning of natural languages. Proc Natl Acad Sci USA 2005, 102: 11629\u201311634. 10.1073\/pnas.0409746102","journal-title":"Proc Natl Acad Sci USA"},{"key":"3847_CR13","doi-asserted-by":"publisher","first-page":"446","DOI":"10.1186\/1471-2105-10-446","volume":"10","author":"U Weingart","year":"2009","unstructured":"Weingart U, Lavi Y, Horn D: Data Mining of Enzymes using Specific Peptides. BMC Bioinfomratics 2009, 10: 446. 10.1186\/1471-2105-10-446","journal-title":"BMC Bioinfomratics"},{"key":"3847_CR14","doi-asserted-by":"publisher","first-page":"57","DOI":"10.1186\/1471-2164-7-57","volume":"7","author":"RA Edwards","year":"2006","unstructured":"Edwards RA, Rodrigues-Brito B, Wegley L, Haynes M, Breitbart M, Peterson D, Saar M, Alexander S, Alexander EC, Rohwer F: Using pyrosequencing to shed light on deep mine microbial ecology. BMC Genomics 2006, 7: 57\u201370. 10.1186\/1471-2164-7-57","journal-title":"BMC Genomics"},{"key":"3847_CR15","doi-asserted-by":"publisher","first-page":"496","DOI":"10.1126\/science.1120250","volume":"311","author":"EF DeLong","year":"2006","unstructured":"DeLong EF, Preston CM, Mincer T, Rich V, Hallam SJ, Frigaard N-U, Martinez A, Sullivan MB, Edwards R, Brito BR, Chisholm SW, Karl DM: Community Genomics Among Stratified Microbial Assemblages in the Ocean's Interior. Science 2006, 311: 496\u2013503. 10.1126\/science.1120250","journal-title":"Science"},{"key":"3847_CR16","volume-title":"Trends in Genetics","author":"P Lapierre","year":"2009","unstructured":"Lapierre P, Gogarten JP: Estimating the size of the bacterial pangenome. Trends in Genetics 2009."}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-11-390.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,1]],"date-time":"2021-09-01T05:19:40Z","timestamp":1630473580000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-11-390"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2010,7,22]]},"references-count":16,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2010,12]]}},"alternative-id":["3847"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-11-390","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2010,7,22]]},"assertion":[{"value":"3 March 2010","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"22 July 2010","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"22 July 2010","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"390"}}