{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,11]],"date-time":"2026-03-11T23:45:02Z","timestamp":1773272702620,"version":"3.50.1"},"reference-count":44,"publisher":"Oxford University Press (OUP)","issue":"7","license":[{"start":{"date-parts":[[2016,10,28]],"date-time":"2016-10-28T00:00:00Z","timestamp":1477612800000},"content-version":"vor","delay-in-days":426,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2016,4,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Metagenomics research has accelerated the studies of microbial organisms, providing insights into the composition and potential functionality of various microbial communities. Metatranscriptomics (studies of the transcripts from a mixture of microbial species) and other meta-omics approaches hold even greater promise for providing additional insights into functional and regulatory characteristics of the microbial communities. Current metatranscriptomics projects are often carried out without matched metagenomic datasets (of the same microbial communities). For the projects that produce both metatranscriptomic and metagenomic datasets, their analyses are often not integrated. Metagenome assemblies are far from perfect, partially explaining why metagenome assemblies are not used for the analysis of metatranscriptomic datasets.<\/jats:p>\n               <jats:p>Results: Here, we report a reads mapping algorithm for mapping of short reads onto a de Bruijn graph of assemblies. A hash table of junction k -mers ( k -mers spanning branching structures in the de Bruijn graph) is used to facilitate fast mapping of reads to the graph. We developed an application of this mapping algorithm: a reference-based approach to metatranscriptome assembly using graphs of metagenome assembly as the reference. Our results show that this new approach (called TAG) helps to assemble substantially more transcripts that otherwise would have been missed or truncated because of the fragmented nature of the reference metagenome.<\/jats:p>\n               <jats:p>Availability and implementation: TAG was implemented in C++ and has been tested extensively on the Linux platform. It is available for download as open source at http:\/\/omics.informatics.indiana.edu\/TAG .<\/jats:p>\n               <jats:p>Contact: \u00a0yye@indiana.edu<\/jats:p>","DOI":"10.1093\/bioinformatics\/btv510","type":"journal-article","created":{"date-parts":[[2015,8,30]],"date-time":"2015-08-30T00:08:51Z","timestamp":1440893331000},"page":"1001-1008","source":"Crossref","is-referenced-by-count":39,"title":["Utilizing de Bruijn graph of metagenome assembly for metatranscriptome analysis"],"prefix":"10.1093","volume":"32","author":[{"given":"Yuzhen","family":"Ye","sequence":"first","affiliation":[{"name":"School of Informatics and Computing, Indiana University, Bloomington, IN 47405, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Haixu","family":"Tang","sequence":"additional","affiliation":[{"name":"School of Informatics and Computing, Indiana University, Bloomington, IN 47405, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2015,8,29]]},"reference":[{"key":"2023020111573249000_btv510-B1","doi-asserted-by":"crossref","first-page":"e1002358","DOI":"10.1371\/journal.pcbi.1002358","article-title":"Metabolic reconstruction for metagenomic data and its application to the human microbiome","volume":"8","author":"Abubucker","year":"2012","journal-title":"PLoS Comput. Biol."},{"key":"2023020111573249000_btv510-B2","doi-asserted-by":"crossref","DOI":"10.1007\/978-3-319-07566-2_10","article-title":"From indexing data structures to de bruijn graphs","volume-title":"Combinatorial Pattern Matching","author":"Cazaux","year":"2014"},{"key":"2023020111573249000_btv510-B3","doi-asserted-by":"crossref","first-page":"39","DOI":"10.1186\/2049-2618-2-39","article-title":"Comparison of assembly algorithms for improving rate of metatranscriptomic functional annotation","volume":"2","author":"Celaj","year":"2014","journal-title":"Microbiome"},{"key":"2023020111573249000_btv510-B4","doi-asserted-by":"crossref","first-page":"30","DOI":"10.1186\/s13059-015-0596-2","article-title":"Bridger: a new framework for de novo transcriptome assembly using RNA-seq data","volume":"16","author":"Chang","year":"2015","journal-title":"Genome Biol."},{"key":"2023020111573249000_btv510-B5","doi-asserted-by":"crossref","first-page":"2577","DOI":"10.1111\/j.1462-2920.2012.02781.x","article-title":"Comparative metatranscriptomics reveals widespread community responses during phenanthrene degradation in soil","volume":"14","author":"de Menezes","year":"2012","journal-title":"Environ. Microbiol."},{"key":"2023020111573249000_btv510-B6","doi-asserted-by":"crossref","first-page":"1204","DOI":"10.4161\/rna.24972","article-title":"Mapping the RNA-Seq trash bin: unusual transcripts in prokaryotic transcriptome sequencing data","volume":"10","author":"Doose","year":"2013","journal-title":"RNA Biol."},{"key":"2023020111573249000_btv510-B7","doi-asserted-by":"crossref","first-page":"57","DOI":"10.1038\/nature11247","article-title":"An integrated encyclopedia of DNA elements in the human genome","volume":"489","author":"Dunham","year":"2012","journal-title":"Nature"},{"key":"2023020111573249000_btv510-B8","doi-asserted-by":"crossref","first-page":"E2329","DOI":"10.1073\/pnas.1319284111","article-title":"Relating the metatranscriptome and metagenome of the human gut","volume":"111","author":"Franzosa","year":"2014","journal-title":"Proc. Natl Acad. Sci. U. S. A."},{"key":"2023020111573249000_btv510-B9","doi-asserted-by":"crossref","first-page":"R23","DOI":"10.1186\/gb-2012-13-3-r23","article-title":"Efficient and robust RNA-seq process for cultured bacteria and complex community transcriptomes","volume":"13","author":"Giannoukos","year":"2012","journal-title":"Genome Biol."},{"key":"2023020111573249000_btv510-B10","doi-asserted-by":"crossref","first-page":"1513","DOI":"10.1073\/pnas.1017351108","article-title":"High-quality draft assemblies of mammalian genomes from massively parallel sequence data","volume":"108","author":"Gnerre","year":"2011","journal-title":"Proc. Natl Acad. Sci. U. S. A."},{"key":"2023020111573249000_btv510-B11","doi-asserted-by":"crossref","first-page":"e17447","DOI":"10.1371\/journal.pone.0017447","article-title":"Metatranscriptomic approach to analyze the functional human gut microbiota","volume":"6","author":"Gosalbes","year":"2011","journal-title":"PLoS One"},{"key":"2023020111573249000_btv510-B12","doi-asserted-by":"crossref","first-page":"644","DOI":"10.1038\/nbt.1883","article-title":"Trinity: reconstructing a full-length transcriptome without a genome from RNA-Seq data","volume":"29","author":"Grabherr","year":"2011","journal-title":"Nat. Biotechnol."},{"key":"2023020111573249000_btv510-B13","doi-asserted-by":"crossref","first-page":"1552","DOI":"10.1101\/gr.120618.111","article-title":"Integrative analysis of environmental sequences using MEGAN4","volume":"21","author":"Huson","year":"2011","journal-title":"Genome Res."},{"key":"2023020111573249000_btv510-B14","doi-asserted-by":"crossref","first-page":"207","DOI":"10.1038\/nature11234","article-title":"Structure, function and diversity of the healthy human microbiome","volume":"486","author":"Huttenhower","year":"2012","journal-title":"Nature"},{"key":"2023020111573249000_btv510-B15","doi-asserted-by":"crossref","first-page":"e75448","DOI":"10.1371\/journal.pone.0075448","article-title":"NeSSM: a Next-generation sequencing simulator for metagenomics","volume":"8","author":"Jia","year":"2013","journal-title":"PLoS One"},{"key":"2023020111573249000_btv510-B16","doi-asserted-by":"crossref","first-page":"e01012","DOI":"10.1128\/mBio.01012-14","article-title":"Metatranscriptomics of the human oral microbiome during health and disease","volume":"5","author":"Jorth","year":"2014","journal-title":"MBio"},{"key":"2023020111573249000_btv510-B17","doi-asserted-by":"crossref","first-page":"R86","DOI":"10.1186\/gb-2014-15-6-r86","article-title":"IVT-seq reveals extreme bias in RNA sequencing","volume":"15","author":"Lahens","year":"2014","journal-title":"Genome Biol."},{"key":"2023020111573249000_btv510-B18","doi-asserted-by":"crossref","first-page":"357","DOI":"10.1038\/nmeth.1923","article-title":"Fast gapped-read alignment with Bowtie 2","volume":"9","author":"Langmead","year":"2012","journal-title":"Nat. Methods"},{"key":"2023020111573249000_btv510-B19","doi-asserted-by":"crossref","first-page":"530","DOI":"10.1186\/1471-2164-14-530","article-title":"A comprehensive metatranscriptome analysis pipeline and its validation using human small intestine microbiota datasets","volume":"14","author":"Leimena","year":"2013","journal-title":"BMC Genomics"},{"key":"2023020111573249000_btv510-B20","doi-asserted-by":"crossref","first-page":"540","DOI":"10.1089\/cmb.2013.0042","article-title":"IDBA-MT: de novo assembler for metatranscriptomic data generated from next-generation sequencing technology","volume":"20","author":"Leung","year":"2013","journal-title":"J. Comput. Biol."},{"key":"2023020111573249000_btv510-B21","doi-asserted-by":"crossref","first-page":"160","DOI":"10.1007\/978-3-319-05269-4_12","article-title":"IDBA-MTP: A hybrid metatranscriptomic assembler based on protein information","volume":"8394","author":"Leung","year":"2014","journal-title":"Res. Comput. Mol. Biol.. Lect. Notes Comput. Sci."},{"key":"2023020111573249000_btv510-B22","doi-asserted-by":"crossref","first-page":"1754","DOI":"10.1093\/bioinformatics\/btp324","article-title":"Fast and accurate short read alignment with Burrows-Wheeler transform","volume":"25","author":"Li","year":"2009","journal-title":"Bioinformatics"},{"key":"2023020111573249000_btv510-B23","doi-asserted-by":"crossref","first-page":"1658","DOI":"10.1093\/bioinformatics\/btl158","article-title":"Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences","volume":"22","author":"Li","year":"2006","journal-title":"Bioinformatics"},{"key":"2023020111573249000_btv510-B24","doi-asserted-by":"crossref","first-page":"265","DOI":"10.1101\/gr.097261.109","article-title":"De\u00a0novo assembly of human genomes with massively parallel short read sequencing","volume":"20","author":"Li","year":"2010","journal-title":"Genome Res."},{"key":"2023020111573249000_btv510-B25","doi-asserted-by":"crossref","first-page":"18","DOI":"10.1186\/2047-217X-1-18","article-title":"SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler","volume":"1","author":"Luo","year":"2012","journal-title":"Gigascience"},{"key":"2023020111573249000_btv510-B26","doi-asserted-by":"crossref","first-page":"39","DOI":"10.1016\/j.cell.2012.10.052","article-title":"Xenobiotics shape the physiology and gene expression of the active human gut microbiome","volume":"152","author":"Maurice","year":"2013","journal-title":"Cell"},{"key":"2023020111573249000_btv510-B27","doi-asserted-by":"crossref","first-page":"386","DOI":"10.1186\/1471-2105-9-386","article-title":"The metagenomics RAST server\u2014a public resource for the automatic phylogenetic and functional analysis of metagenomes","volume":"9","author":"Meyer","year":"2008","journal-title":"BMC Bioinformatics"},{"key":"2023020111573249000_btv510-B28","doi-asserted-by":"crossref","first-page":"237","DOI":"10.1038\/ismej.2012.94","article-title":"Sizing up metatranscriptomics","volume":"7","author":"Moran","year":"2013","journal-title":"ISME J."},{"key":"2023020111573249000_btv510-B29","doi-asserted-by":"crossref","first-page":"897","DOI":"10.1089\/cmb.2009.0005","article-title":"Parametric complexity of sequence assembly: theory and applications to next generation sequencing","volume":"16","author":"Nagarajan","year":"2009","journal-title":"J. Comput. Biol."},{"key":"2023020111573249000_btv510-B30","doi-asserted-by":"crossref","first-page":"e155","DOI":"10.1093\/nar\/gks678","article-title":"MetaVelvet: an extension of Velvet assembler to de novo metagenome assembly from short sequence reads","volume":"40","author":"Namiki","year":"2012","journal-title":"Nucleic Acids Res."},{"key":"2023020111573249000_btv510-B31","doi-asserted-by":"crossref","first-page":"2826","DOI":"10.1093\/bioinformatics\/btt502","article-title":"Exploring variation-aware contig graphs for (comparative) metagenomics using MaryGold","volume":"29","author":"Nijkamp","year":"2013","journal-title":"Bioinformatics"},{"key":"2023020111573249000_btv510-B32","article-title":"Models for transcript quantification from rna-seq","author":"Pachter","year":"2011","journal-title":"arXiv preprint arXiv:1104.3889"},{"key":"2023020111573249000_btv510-B33","doi-asserted-by":"crossref","first-page":"1420","DOI":"10.1093\/bioinformatics\/bts174","article-title":"IDBA-UD: a de novo assembler for single-cell and metagenomic sequencing data with highly uneven depth","volume":"28","author":"Peng","year":"2012","journal-title":"Bioinformatics"},{"key":"2023020111573249000_btv510-B34","doi-asserted-by":"crossref","first-page":"9748","DOI":"10.1073\/pnas.171285098","article-title":"An Eulerian path approach to DNA fragment assembly","volume":"98","author":"Pevzner","year":"2001","journal-title":"Proc. Natl Acad. Sci. U. S. A."},{"key":"2023020111573249000_btv510-B35","doi-asserted-by":"crossref","first-page":"59","DOI":"10.1038\/nature08821","article-title":"A human gut microbial gene catalogue established by metagenomic sequencing","volume":"464","author":"Qin","year":"2010","journal-title":"Nature"},{"key":"2023020111573249000_btv510-B36","doi-asserted-by":"crossref","first-page":"1787","DOI":"10.1126\/science.1198374","article-title":"Identification of functional elements and regulatory circuits by Drosophila modENCODE","volume":"330","author":"Roy","year":"2010","journal-title":"Science"},{"key":"2023020111573249000_btv510-B37","doi-asserted-by":"crossref","first-page":"1086","DOI":"10.1093\/bioinformatics\/bts094","article-title":"Oases: robust de novo RNA-seq assembly across the dynamic range of expression levels","volume":"28","author":"Schulz","year":"2012","journal-title":"Bioinformatics"},{"key":"2023020111573249000_btv510-B38","doi-asserted-by":"crossref","first-page":"1086","DOI":"10.1093\/bioinformatics\/bts094","article-title":"Oases: robust de novo rna-seq assembly across the dynamic range of expression levels","volume":"28","author":"Schulz","year":"2012","journal-title":"Bioinformatics"},{"key":"2023020111573249000_btv510-B39","doi-asserted-by":"crossref","first-page":"191","DOI":"10.1111\/j.1462-2920.2011.02598.x","article-title":"Transcriptional responses of surface water marine microbial assemblages to deep-sea water amendment","volume":"14","author":"Shi","year":"2012","journal-title":"Environ. Microbiol."},{"key":"2023020111573249000_btv510-B40","doi-asserted-by":"crossref","first-page":"e00889","DOI":"10.1128\/mBio.00889-14","article-title":"Revealing the bacterial butyrate synthesis pathways by analyzing (meta)genomic data","volume":"5","author":"Vital","year":"2014","journal-title":"MBio"},{"key":"2023020111573249000_btv510-B41","doi-asserted-by":"crossref","first-page":"814","DOI":"10.1089\/cmb.2012.0058","article-title":"A de Bruijn graph approach to the quantification of closely-related genomes in a microbial community","volume":"19","author":"Wang","year":"2012","journal-title":"J. Comput. Biol."},{"key":"2023020111573249000_btv510-B42","doi-asserted-by":"crossref","first-page":"5288","DOI":"10.1128\/AEM.00564-12","article-title":"Oral spirochetes implicated in dental diseases are widespread in normal human subjects and carry extremely diverse integron gene cassettes","volume":"78","author":"Wu","year":"2012","journal-title":"Appl. Environ. Microbiol."},{"key":"2023020111573249000_btv510-B43","doi-asserted-by":"crossref","first-page":"i363","DOI":"10.1093\/bioinformatics\/bts388","article-title":"Stitching gene fragments with a network matching algorithm improves gene assembly for metagenomics","volume":"28","author":"Wu","year":"2012","journal-title":"Bioinformatics"},{"key":"2023020111573249000_btv510-B44","doi-asserted-by":"crossref","first-page":"821","DOI":"10.1101\/gr.074492.107","article-title":"Velvet: algorithms for de novo short read assembly using de Bruijn graphs","volume":"18","author":"Zerbino","year":"2008","journal-title":"Genome Res."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/32\/7\/1001\/49018669\/bioinformatics_32_7_1001.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/32\/7\/1001\/49018669\/bioinformatics_32_7_1001.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,2,1]],"date-time":"2023-02-01T22:23:09Z","timestamp":1675290189000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/32\/7\/1001\/2288363"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2015,8,29]]},"references-count":44,"journal-issue":{"issue":"7","published-print":{"date-parts":[[2016,4,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btv510","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2016,4,1]]},"published":{"date-parts":[[2015,8,29]]}}}