{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,5,18]],"date-time":"2024-05-18T16:10:07Z","timestamp":1716048607607},"reference-count":21,"publisher":"Oxford University Press (OUP)","issue":"5","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2014,3,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: The majority of next-generation sequencing technologies effectively sample small amounts of DNA or RNA that are amplified (i.e. copied) before sequencing. The amplification process is not perfect, leading to extreme bias in sequenced read counts. We present a novel procedure to account for amplification bias and demonstrate its effectiveness in mitigating gene length dependence when estimating true gene expression.<\/jats:p><jats:p>Results: We tested the proposed method on simulated and real data. Simulations indicated that our method captures true gene expression more effectively than classic censoring-based approaches and leads to power gains in differential expression testing, particularly for shorter genes with high transcription rates. We applied our method to an unreplicated Arabidopsis RNA-seq dataset resulting in disparate gene ontologies arising from gene set enrichment analyses.<\/jats:p><jats:p>Availability and implementation: R code to perform the RASTA procedures is freely available on the web at www.stat.purdue.edu\/\u223cdoerge\/.<\/jats:p><jats:p>Contact: \u00a0doerge@purdue.edu<\/jats:p>","DOI":"10.1093\/bioinformatics\/btt575","type":"journal-article","created":{"date-parts":[[2013,10,11]],"date-time":"2013-10-11T11:24:02Z","timestamp":1381490642000},"page":"601-605","source":"Crossref","is-referenced-by-count":1,"title":["Robust adjustment of sequence tag abundance"],"prefix":"10.1093","volume":"30","author":[{"given":"Douglas D.","family":"Baumann","sequence":"first","affiliation":[{"name":"Department of Statistics, Purdue University, West Lafayette, IN 47907, USA"}]},{"given":"Rebecca W.","family":"Doerge","sequence":"additional","affiliation":[{"name":"Department of Statistics, Purdue University, West Lafayette, IN 47907, USA"}]}],"member":"286","published-online":{"date-parts":[[2013,10,9]]},"reference":[{"key":"2023012710432740600_btt575-B1","doi-asserted-by":"crossref","first-page":"26","DOI":"10.2202\/1544-6115.1627","article-title":"A two-stage poisson model for testing RNA-Seq data","volume":"10","author":"Auer","year":"2011","journal-title":"Stat. Appl. Genet. Mol. Biol."},{"key":"2023012710432740600_btt575-B2","doi-asserted-by":"crossref","first-page":"289","DOI":"10.1111\/j.2517-6161.1995.tb02031.x","article-title":"Controlling the false discovery rate: a practical and powerful approach to multiple testing","volume":"57","author":"Benjamini","year":"1995","journal-title":"J. R. Stat. Soc. Series B Stat. Methodol."},{"key":"2023012710432740600_btt575-B3","doi-asserted-by":"crossref","first-page":"1165","DOI":"10.1214\/aos\/1013699998","article-title":"The control of the false discovery rate in multiple testing under dependency","volume":"29","author":"Benjamini","year":"2001","journal-title":"Ann. Stat."},{"key":"2023012710432740600_btt575-B4","doi-asserted-by":"crossref","first-page":"433","DOI":"10.1517\/14622416.5.4.433","article-title":"Solexa ltd","volume":"5","author":"Bennet","year":"2004","journal-title":"Pharmacogenomics"},{"key":"2023012710432740600_btt575-B5","doi-asserted-by":"crossref","first-page":"S9","DOI":"10.1186\/1471-2105-10-S11-S9","article-title":"Comparing gene annotation enrichment tools for functional modeling of agricultural microarray data","volume":"10","author":"Berg","year":"2009","journal-title":"BMC Bioinformatics"},{"key":"2023012710432740600_btt575-B6","doi-asserted-by":"crossref","first-page":"e106","DOI":"10.1093\/nar\/gkp507","article-title":"Detection of single nucleotide variations in expressed exons of the human genome using RNA-Seq","volume":"37","author":"Chepelev","year":"2009","journal-title":"Nucleic Acids Res."},{"key":"2023012710432740600_btt575-B7","doi-asserted-by":"crossref","first-page":"W64","DOI":"10.1093\/nar\/gkq310","article-title":"agriGO: a GO analysis toolkit for the agricultural community","volume":"38","author":"Du","year":"2010","journal-title":"Nucleic Acids Res."},{"key":"2023012710432740600_btt575-B8","doi-asserted-by":"crossref","first-page":"60","DOI":"10.1093\/comjnl\/9.1.60","article-title":"Computer programs for hierarchical polythetic classification (\u201csimilarity analysis\u201d)","volume":"9","author":"Lance","year":"1966","journal-title":"Comput. J."},{"key":"2023012710432740600_btt575-B9","doi-asserted-by":"crossref","first-page":"523","DOI":"10.1016\/j.cell.2008.03.029","article-title":"Highly integrated single-base resolution maps of the epigenome in Arabidopsis","volume":"133","author":"Lister","year":"2008","journal-title":"Cell"},{"key":"2023012710432740600_btt575-B10","doi-asserted-by":"crossref","first-page":"387","DOI":"10.1146\/annurev.genom.9.081307.164359","article-title":"Next-generation DNA sequencing methods","volume":"9","author":"Mardis","year":"2008","journal-title":"Ann. Rev. Genomics Hum. Genet."},{"key":"2023012710432740600_btt575-B11","doi-asserted-by":"crossref","first-page":"376","DOI":"10.1038\/nature03959","article-title":"Genome sequencing in microfabricated high-density picolitre reactor","volume":"437","author":"Margulies","year":"2005","journal-title":"Nature"},{"key":"2023012710432740600_btt575-B12","doi-asserted-by":"crossref","first-page":"1509","DOI":"10.1101\/gr.079558.108","article-title":"RNA-Seq: an assessment of technical reproducibility and comparison with gene expression arrays","volume":"18","author":"Marioni","year":"2008","journal-title":"Genome Res."},{"key":"2023012710432740600_btt575-B13","doi-asserted-by":"crossref","first-page":"519","DOI":"10.1101\/gr.142232.112","article-title":"iReckon: simultaneous isoform discovery and abundance estimation from RNA-seq data","volume":"23","author":"Mezlini","year":"2013","journal-title":"Genome Res."},{"key":"2023012710432740600_btt575-B14","volume-title":"R: A Language and Environment for Statistical Computing","author":"R Core Development Team","year":"2011"},{"key":"2023012710432740600_btt575-B15","doi-asserted-by":"crossref","first-page":"9","DOI":"10.1159\/000130315","article-title":"X inactivation, differentiation, and DNA methylation","volume":"14","author":"Riggs","year":"1975","journal-title":"Cytogenet. Cell Genet."},{"key":"2023012710432740600_btt575-B16","doi-asserted-by":"crossref","first-page":"597","DOI":"10.1038\/nrg1655","article-title":"DNA methylation and human disease","volume":"6","author":"Robertson","year":"2005","journal-title":"Nat. Rev. Genet."},{"key":"2023012710432740600_btt575-B17","doi-asserted-by":"crossref","first-page":"321","DOI":"10.1093\/biostatistics\/kxm030","article-title":"Small-sample estimation of negative binomial dispersion, with applications to SAGE data","volume":"9","author":"Robinson","year":"2008","journal-title":"Biostatistics"},{"key":"2023012710432740600_btt575-B18","doi-asserted-by":"crossref","first-page":"85","DOI":"10.2174\/156652407779940413","article-title":"DNA methylation in health, disease, and cancer","volume":"7","author":"Shames","year":"2007","journal-title":"Curr. Mol. Med."},{"key":"2023012710432740600_btt575-B19","doi-asserted-by":"crossref","first-page":"1347","DOI":"10.1073\/pnas.1118018109","article-title":"Digital RNA sequencing minimizes sequence-dependent bias and amplification noise with optimized single-molecule barcodes","volume":"109","author":"Shiroguchi","year":"2012","journal-title":"Proc. Natl Acad. Sci."},{"key":"2023012710432740600_btt575-B20","doi-asserted-by":"crossref","first-page":"1009","DOI":"10.1093\/nar\/gkm965","article-title":"The Arabidopsis Information Resource (TAIR): gene structure and function annotation","volume":"36","author":"Swarbreck","year":"2008","journal-title":"Nucleic Acids Res."},{"key":"2023012710432740600_btt575-B21","doi-asserted-by":"crossref","first-page":"1","DOI":"10.18637\/jss.v032.i10","article-title":"The VGAM package for categorical data analysis","volume":"32","author":"Yee","year":"2010","journal-title":"J. Stat. Softw."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/30\/5\/601\/48919142\/bioinformatics_30_5_601.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/30\/5\/601\/48919142\/bioinformatics_30_5_601.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,5,18]],"date-time":"2024-05-18T15:52:04Z","timestamp":1716047524000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/30\/5\/601\/245172"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2013,10,9]]},"references-count":21,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2014,3,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btt575","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2014,3,1]]},"published":{"date-parts":[[2013,10,9]]}}}