{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,11]],"date-time":"2026-06-11T04:47:50Z","timestamp":1781153270158,"version":"3.54.1"},"reference-count":28,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2016,10,21]],"date-time":"2016-10-21T00:00:00Z","timestamp":1477008000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2016,10,21]],"date-time":"2016-10-21T00:00:00Z","timestamp":1477008000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/100008349","name":"Boehringer Ingelheim","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100008349","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Background<\/jats:title>\n                    <jats:p>PCR clonal artefacts originating from NGS library preparation can affect both genomic as well as RNA-Seq applications when protocols are pushed to their limits. In RNA-Seq however the artifactual reads are not easy to tell apart from normal read duplication due to natural over-sequencing of highly expressed genes. Especially when working with little input material or single cells assessing the fraction of duplicate reads is an important quality control step for NGS data sets. Up to now there are only tools to calculate the global duplication rates that do not take into account the effect of gene expression levels which leaves them of limited use for RNA-Seq data.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>Here we present the tool dupRadar, which provides an easy means to distinguish the fraction of reads originating in natural duplication due to high expression from the fraction induced by artefacts. dupRadar assesses the fraction of duplicate reads per gene dependent on the expression level. Apart from the Bioconductor package dupRadar we provide shell scripts for easy integration into processing pipelines.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Conclusions<\/jats:title>\n                    <jats:p>The Bioconductor package dupRadar offers straight-forward methods to assess RNA-Seq datasets for quality issues with PCR duplicates. It is aimed towards simple integration into standard analysis pipelines as a default QC metric that is especially useful for low-input and single cell RNA-Seq data sets.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1186\/s12859-016-1276-2","type":"journal-article","created":{"date-parts":[[2016,10,21]],"date-time":"2016-10-21T01:52:05Z","timestamp":1477014725000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":187,"title":["dupRadar: a Bioconductor package for the assessment of PCR artifacts in RNA-Seq data"],"prefix":"10.1186","volume":"17","author":[{"given":"Sergi","family":"Sayols","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Denise","family":"Scherzinger","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Holger","family":"Klein","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"297","published-online":{"date-parts":[[2016,10,21]]},"reference":[{"key":"1276_CR1","unstructured":"siddharth.sethi5: \u201cQuestion: Should We Remove Duplicated Reads In Rna-Seq?\u201d, 2012, https:\/\/www.biostars.org\/p\/55648\/, Accessed 19 Oct 2015"},{"key":"1276_CR2","doi-asserted-by":"crossref","first-page":"137","DOI":"10.1007\/978-1-4939-2291-8_8","volume-title":"RNA Bioinformatics, Methods in Molecular Biology","author":"X Li","year":"2015","unstructured":"Li X, Nair A, Wang S, Wang L. Quality Control of RNA-Seq Experiments. In: Picardi E, editor. RNA Bioinformatics, Methods in Molecular Biology, vol. Volume 1269. 2015. p. 137\u201346."},{"issue":"8","key":"1276_CR3","doi-asserted-by":"publisher","first-page":"e1004393","DOI":"10.1371\/journal.pcbi.1004393","volume":"11","author":"M Griffith","year":"2015","unstructured":"Griffith M, Walker JR, Spies NC, Ainscough BJ, Griffith OL. Informatics for RNA Sequencing: A Web Resource for Analysis on the Cloud. PLoS Comput Biol. 2015;11(8):e1004393.","journal-title":"PLoS Comput Biol"},{"issue":"October","key":"1276_CR4","doi-asserted-by":"publisher","first-page":"11.13.1","DOI":"10.1002\/0471142905.hg1113s83","volume":"83","author":"AG Williams","year":"2014","unstructured":"Williams AG, Thomas S, Wyman SK, Holloway AK. RNA-seq Data: Challenges in and Recommendations for Experimental Design and Analysis. Curr Protoc Hum Genet. 2014;83(October):11.13.1\u201311.13.20.","journal-title":"Curr Protoc Hum Genet"},{"issue":"12","key":"1276_CR5","doi-asserted-by":"publisher","first-page":"2213","DOI":"10.1101\/gr.124321.111","volume":"21","author":"S Tarazona","year":"2011","unstructured":"Tarazona S, Garc\u00eda-Alcalde F, Dopazo J, Ferrer A, Conesa A. Differential expression in RNA-seq: A matter of depth. Genome Res. 2011;21(12):2213\u201323.","journal-title":"Genome Res"},{"issue":"10","key":"1276_CR6","doi-asserted-by":"publisher","first-page":"12","DOI":"10.1016\/j.yexcr.2014.01.008","volume":"322","author":"EL van Dijk","year":"2014","unstructured":"van Dijk EL, Jaszczyszyn Y, Thermes C. Library preparation methods for next-generation sequencing: Tone down the bias. Exp Cell Res. 2014;322(10):12\u201320.","journal-title":"Exp Cell Res"},{"key":"1276_CR7","unstructured":"Picard tools, A set of tools (in Java) for working with next generation sequencing data, https:\/\/github.com\/broadinstitute\/picard, Accessed 19 Oct 2015"},{"key":"1276_CR8","unstructured":"BamUtil Dedup, a repository that contains several programs that perform operations on SAM\/BAM files, http:\/\/genome.sph.umich.edu\/wiki\/BamUtil, Accessed 19 Oct 2015"},{"key":"1276_CR9","doi-asserted-by":"publisher","first-page":"13","DOI":"10.1186\/1751-0473-9-13","volume":"9","author":"G Tischler","year":"2014","unstructured":"Tischler G, Leonard S. Biobambam: tools for read pair collation based algorithms on BAM files. Source Code Biol Med. 2014;9:13.","journal-title":"Source Code Biol Med"},{"key":"1276_CR10","doi-asserted-by":"publisher","first-page":"71","DOI":"10.1038\/nmeth.2251","volume":"10","author":"A Roberts","year":"2013","unstructured":"Roberts A, Pachter L. Streaming fragment assignment for real-time analysis of sequencing experiments. Nat Methods. 2013;10:71\u20133.","journal-title":"Nat Methods"},{"key":"1276_CR11","unstructured":"FastQC, A Quality Control tool for High Throughput Sequence Data, http:\/\/www.bioinformatics.bbsrc.ac.uk\/projects\/fastqc, Accessed 19 Oct 2015"},{"key":"1276_CR12","unstructured":"FASTX toolkit, http:\/\/hannonlab.cshl.edu\/fastx_toolkit\/, 2010, Accessed 19 Oct 2015"},{"key":"1276_CR13","doi-asserted-by":"publisher","first-page":"863","DOI":"10.1093\/bioinformatics\/btr026","volume":"27","author":"R Schmieder","year":"2011","unstructured":"Schmieder R, Edwards R. Quality control and preprocessing of metagenomic datasets. Bioinformatics. 2011;27:863\u20134.","journal-title":"Bioinformatics"},{"issue":"16","key":"1276_CR14","doi-asserted-by":"publisher","first-page":"2184","DOI":"10.1093\/bioinformatics\/bts356","volume":"28","author":"L Wang","year":"2012","unstructured":"Wang L, Wang S, Li W. RSeQC: quality control of RNA-seq experiments. Bioinformatics. 2012;28(16):2184\u20135.","journal-title":"Bioinformatics"},{"issue":"11","key":"1276_CR15","doi-asserted-by":"publisher","first-page":"1530","DOI":"10.1093\/bioinformatics\/bts196","volume":"28","author":"DS DeLuca","year":"2012","unstructured":"DeLuca DS, Levin JZ, Sivachenko A, Fennell T, Nazaire MD, Williams C, Reich M, Winckler W, Getz G. RNA-SeQC: RNA-seq metrics for quality control and process optimization. Bioinformatics. 2012;28(11):1530\u20132.","journal-title":"Bioinformatics"},{"issue":"20","key":"1276_CR16","doi-asserted-by":"publisher","first-page":"2678","DOI":"10.1093\/bioinformatics\/bts503","volume":"28","author":"F Garc\u00eda-Alcalde","year":"2012","unstructured":"Garc\u00eda-Alcalde F, Okonechnikov K, Carbonell J, Cruz LM, G\u00f6tz S, Tarazona S, Dopazo J, Meyer TF, Conesa A. Qualimap: evaluating next-generation sequencing alignment data. Bioinformatics. 2012;28(20):2678\u20139.","journal-title":"Bioinformatics"},{"key":"1276_CR17","doi-asserted-by":"publisher","first-page":"e108","DOI":"10.1093\/nar\/gkt214","volume":"41","author":"Y Liao","year":"2013","unstructured":"Liao Y, Smyth GK, Shi W. The Subread aligner: fast, accurate and scalable read mapping by seed-and-vote. Nucleic Acids Res. 2013;41:e108.","journal-title":"Nucleic Acids Res"},{"key":"1276_CR18","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1093\/bioinformatics\/btp616","volume":"26","author":"MD Robinson","year":"2010","unstructured":"Robinson MD, McCarthy DJ, Smyth GK. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26:1.","journal-title":"Bioinformatics"},{"key":"1276_CR19","doi-asserted-by":"publisher","first-page":"550","DOI":"10.1186\/s13059-014-0550-8","volume":"15","author":"M Love","year":"2014","unstructured":"Love M, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15:550.","journal-title":"Genome Biol"},{"issue":"7","key":"1276_CR20","doi-asserted-by":"publisher","first-page":"e47","DOI":"10.1093\/nar\/gkv007","volume":"43","author":"ME Ritchie","year":"2015","unstructured":"Ritchie ME, Phipson B, Wu D, Hu Y, Law CW, Shi W, Smyth GK. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 2015;43(7):e47.","journal-title":"Nucleic Acids Res"},{"issue":"6","key":"1276_CR21","doi-asserted-by":"publisher","first-page":"e99625","DOI":"10.1371\/journal.pone.0099625","volume":"9","author":"BE Himes","year":"2014","unstructured":"Himes BE, Jiang X, Wagner P, Hu R, Wang Q, Klanderman B, Whitaker RM, Duan Q, Lasky-Su J, Nikolos C, Jester W, Johnson M, Panettieri Jr R, Tantisira KG, Weiss ST, Lu Q. RNA-Seq Transcriptome Profiling Identifies CRISPLD2 as a Glucocorticoid Responsive Gene that Modulates Cytokine Function in Airway Smooth Muscle Cells. PLoS One. 2014;9(6):e99625. GEO:GSE52778.","journal-title":"PLoS One"},{"key":"1276_CR22","doi-asserted-by":"publisher","first-page":"139","DOI":"10.1093\/bioinformatics\/btp616","volume":"26","author":"MD Robinson","year":"2010","unstructured":"Robinson MD, McCarthy DJ, Smyth GK. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26:139\u201340.","journal-title":"Bioinformatics"},{"key":"1276_CR23","doi-asserted-by":"publisher","first-page":"496","DOI":"10.1101\/gr.161034.113","volume":"24","author":"G Marinov","year":"2014","unstructured":"Marinov G, Williams B, McCue K, Schroth G, Gertz J, Myers R, Wold B. From single cell to cell-pool transcriptomes: Stochasticity in gene expression and RNA splicing. Genome Res. 2014;24:496\u2013510.","journal-title":"Genome Res"},{"issue":"1","key":"1276_CR24","doi-asserted-by":"publisher","first-page":"15","DOI":"10.1093\/bioinformatics\/bts635","volume":"29","author":"A Dobin","year":"2012","unstructured":"Dobin A, Davis C, Schlesinger F, Drenkow J, Zaleski C, Jha S, Batut P, Chaisson M, Gingeras T. STAR: ultrafast universal RNA-seq aligner. Bioinformatics. 2012;29(1):15\u201321.","journal-title":"Bioinformatics"},{"key":"1276_CR25","unstructured":"Illumina iGenomes, Ready-To-Use Reference Sequences and Annotations. https:\/\/support.illumina.com\/sequencing\/sequencing_software\/igenome.html. Accessed 19 Oct 2015."},{"key":"1276_CR26","doi-asserted-by":"publisher","first-page":"163","DOI":"10.1038\/nmeth.2772","volume":"11","author":"S Islam","year":"2014","unstructured":"Islam S, Zeisel A, Joost S, La Manno G, Zajac P, Kasper M, L\u00f6nnerberg P, Linnarsson S. Quantitative single-cell RNA-seq with unique molecular identifiers. Nat Methods. 2014;11:163\u20136.","journal-title":"Nat Methods"},{"key":"1276_CR27","doi-asserted-by":"publisher","first-page":"1185","DOI":"10.1038\/nmeth.2722","volume":"10","author":"PA Engstr\u00f6m","year":"2013","unstructured":"Engstr\u00f6m PA, Steijger T, Sipos B, Grant GR, Kahles A, The RGASP Consortium, R\u00e4tsch G, Goldman N, Hubbard TJ, Harrow J, Guig\u00f3 R, Bertone P. Systematic evaluation of spliced alignment programs for RNA-seq data. Nat Methods. 2013;10:1185\u201391.","journal-title":"Nat Methods"},{"key":"1276_CR28","doi-asserted-by":"publisher","first-page":"97","DOI":"10.1186\/s12864-015-1308-8","volume":"16","author":"S Zhao","year":"2015","unstructured":"Zhao S, Zhang B. A comprehensive evaluation of ensembl, RefSeq, and UCSC annotations in the context of RNA-seq read mapping and gene quantification. BMC Genomics. 2015;16:97.","journal-title":"BMC Genomics"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-016-1276-2.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s12859-016-1276-2\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-016-1276-2.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,2,1]],"date-time":"2024-02-01T13:07:05Z","timestamp":1706792825000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-016-1276-2"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2016,10,21]]},"references-count":28,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2016,12]]}},"alternative-id":["1276"],"URL":"https:\/\/doi.org\/10.1186\/s12859-016-1276-2","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/046243","asserted-by":"object"}]},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2016,10,21]]},"assertion":[{"value":"16 January 2016","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"21 September 2016","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"21 October 2016","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"428"}}