{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,12]],"date-time":"2026-03-12T01:13:20Z","timestamp":1773278000554,"version":"3.50.1"},"reference-count":19,"publisher":"Oxford University Press (OUP)","issue":"20","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2012,10,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Further advancement of RNA-Seq technology and its application call for the development of effective normalization methods for RNA-Seq data. Currently, different normalization methods are compared and validated by their correlations with a certain gold standard. Gene expression measurements generated by a different technology or platform such as Real-time reverse transcription polymerase chain reaction (qRT\u2013PCR) or Microarray are usually used as the gold standard. Although the current approach is intuitive and easy to implement, it becomes statistically inadequate when the gold standard is also subject to measurement error (ME). Furthermore, the current approach is not informative, because the correlation of a normalization method with a certain gold standard does not provide much information about the exact quality of the normalized RNA-Seq measurements.<\/jats:p>\n               <jats:p>Results: We propose to use the system of ME models based on qRT\u2013PCR, Microarray and RNA-Seq gene expression data to compare and validate RNA-Seq normalization methods. This approach does not assume the existence of a gold standard. The performance of a normalization method can be characterized by a group of parameters of the system, which are referred to as the performance parameters, and these performance parameters can be consistently estimated. Different normalization methods can thus be compared by comparing their corresponding estimated performance parameters. We applied the proposed approach to compare five existing RNA-Seq normalization methods using the gene expression data of two RNA samples from the microArray Quality Control and Sequencing Quality Control projects and gained much insight about the pros and cons of these methods.<\/jats:p>\n               <jats:p>Contact: \u00a0sunz@purdue.edu; yuzhu@purdue.edu<\/jats:p>","DOI":"10.1093\/bioinformatics\/bts497","type":"journal-article","created":{"date-parts":[[2012,8,23]],"date-time":"2012-08-23T00:15:38Z","timestamp":1345680938000},"page":"2584-2591","source":"Crossref","is-referenced-by-count":26,"title":["Systematic comparison of RNA-Seq normalization methods using measurement error models"],"prefix":"10.1093","volume":"28","author":[{"given":"Zhaonan","family":"Sun","sequence":"first","affiliation":[{"name":"Department of Statistics, Purdue University, 250N University Street, West Lafayette, IN 47906, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yu","family":"Zhu","sequence":"additional","affiliation":[{"name":"Department of Statistics, Purdue University, 250N University Street, West Lafayette, IN 47906, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2012,8,22]]},"reference":[{"key":"2023012513142405500_bts497-B1","doi-asserted-by":"crossref","first-page":"383","DOI":"10.1186\/1471-2164-11-383","article-title":"Comparison and calibration of transcriptome data from RNA-Seq and tiling arrays","volume":"11","author":"Agarwal","year":"2010","journal-title":"BMC Genomics"},{"key":"2023012513142405500_bts497-B2","doi-asserted-by":"crossref","first-page":"129","DOI":"10.2307\/2528684","article-title":"Simultaneous pairwise linear structural relationships","volume":"25","author":"Barnett","year":"1969","journal-title":"Biometrics"},{"key":"2023012513142405500_bts497-B3","doi-asserted-by":"crossref","first-page":"3235","DOI":"10.1093\/nar\/25.16.3235","article-title":"The elimination of primer-dimer accumulation in PCR","volume":"25","author":"Brownie","year":"1997","journal-title":"Nucleic Acids Res."},{"key":"2023012513142405500_bts497-B4","doi-asserted-by":"crossref","first-page":"94","DOI":"10.1186\/1471-2105-11-94","article-title":"Evaluation of statistical methods for normalization and differential expression in mRNA-Seq experiments","volume":"11","author":"Bullard","year":"2010","journal-title":"BMC Bioinformatics"},{"key":"2023012513142405500_bts497-B5","doi-asserted-by":"crossref","first-page":"613","DOI":"10.1038\/nmeth.1223","article-title":"Stem cell transcriptome profiling via massive-scale mRNA sequencing","volume":"5","author":"Cloonan","year":"2008","journal-title":"Nature Methods"},{"key":"2023012513142405500_bts497-B6","doi-asserted-by":"crossref","first-page":"3207","DOI":"10.1093\/bioinformatics\/btp579","article-title":"Effect of read-mapping biases on detecting allele-specific expression from RNA-Sequencing data","volume":"25","author":"Degner","year":"2009","journal-title":"Bioinformatics"},{"key":"2023012513142405500_bts497-B7","doi-asserted-by":"crossref","first-page":"e131","DOI":"10.1093\/nar\/gkq224","article-title":"Biases in Illumina transcriptome sequencing caused by random hexamer priming","volume":"38","author":"Hansen","year":"2010","journal-title":"Nucleic Acids Res."},{"key":"2023012513142405500_bts497-B8","doi-asserted-by":"crossref","first-page":"345","DOI":"10.1038\/nmeth756","article-title":"Multiple-laboratory comparison of microarray platforms","volume":"2","author":"Irizarry","year":"2005","journal-title":"Nature Methods"},{"key":"2023012513142405500_bts497-B9","doi-asserted-by":"crossref","first-page":"R25","DOI":"10.1186\/gb-2009-10-3-r25","article-title":"Ultrafast and memory-efficient alignment of short DNA sequences to the human genome","volume":"10","author":"Langmead","year":"2009","journal-title":"Genome Biol."},{"key":"2023012513142405500_bts497-B10","volume-title":"mseq: Modeling non-uniformity in short-read rates in RNA-Seq data","author":"Li","year":"2011"},{"key":"2023012513142405500_bts497-B11","doi-asserted-by":"crossref","first-page":"R50","DOI":"10.1186\/gb-2010-11-5-r50","article-title":"Modeling non-uniformity in short-read rates in RNA-Seq data","volume":"11","author":"Li","year":"2010","journal-title":"Genome Biol."},{"key":"2023012513142405500_bts497-B12","doi-asserted-by":"crossref","first-page":"1509","DOI":"10.1101\/gr.079558.108","article-title":"RNA-Seq: an assessment of technical reproducibility and comparison with gene expression arrays","volume":"18","author":"Marioni","year":"2008","journal-title":"Genome Res."},{"key":"2023012513142405500_bts497-B13","doi-asserted-by":"crossref","first-page":"621","DOI":"10.1038\/nmeth.1226","article-title":"Mapping and quantifying mammalian transcriptomes by RNA-Seq","volume":"5","author":"Mortazavi","year":"2008","journal-title":"Nature Methods"},{"key":"2023012513142405500_bts497-B14","doi-asserted-by":"crossref","first-page":"1344","DOI":"10.1126\/science.1158441","article-title":"The transcriptional landscape of the yeast genome defined by RNA sequencing","volume":"320","author":"Nagalakshmi","year":"2008","journal-title":"Science"},{"key":"2023012513142405500_bts497-B15","doi-asserted-by":"crossref","first-page":"1413","DOI":"10.1038\/ng.259","article-title":"Deep surveying of alternative splicing complexity in the human transcriptome by high-throughput sequencing","volume":"40","author":"Pan","year":"2008","journal-title":"Nat. Genet."},{"key":"2023012513142405500_bts497-B16","doi-asserted-by":"crossref","first-page":"480","DOI":"10.1186\/1471-2105-12-480","article-title":"GC-content normalization for RNA-Seq data","volume":"12","author":"Risso","year":"2011","journal-title":"BMC Bioinformatics"},{"key":"2023012513142405500_bts497-B17","doi-asserted-by":"crossref","first-page":"e170","DOI":"10.1093\/nar\/gkq670","article-title":"A two-parameter generalized Poisson model to improve the analysis of RNA-Seq data","volume":"38","author":"Srivastava","year":"2010","journal-title":"Nucleic Acids Res."},{"key":"2023012513142405500_bts497-B18","volume-title":"GPseq: using the generalized Poisson distribution to model sequence read counts from high throughput sequencing experiments","author":"Srivastava","year":"2011"},{"key":"2023012513142405500_bts497-B19","article-title":"Pm-seq: using finite poisson mixture models for rna-seq data analysis and transcript expression level quantification","author":"Wu","year":"2012","journal-title":"Stat. Biosci. doi:10.1007\/s12561-012-9070-9"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/28\/20\/2584\/48876816\/bioinformatics_28_20_2584.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/28\/20\/2584\/48876816\/bioinformatics_28_20_2584.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,25]],"date-time":"2023-01-25T19:15:59Z","timestamp":1674674159000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/28\/20\/2584\/203544"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2012,8,22]]},"references-count":19,"journal-issue":{"issue":"20","published-print":{"date-parts":[[2012,10,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/bts497","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2012,10,15]]},"published":{"date-parts":[[2012,8,22]]}}}