{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,1]],"date-time":"2026-04-01T06:17:52Z","timestamp":1775024272423,"version":"3.50.1"},"reference-count":31,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2022,11,16]],"date-time":"2022-11-16T00:00:00Z","timestamp":1668556800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2022,11,16]],"date-time":"2022-11-16T00:00:00Z","timestamp":1668556800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"name":"Rossiiskiy Nauchnii Fond","award":["22-14-00057"],"award-info":[{"award-number":["22-14-00057"]}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"abstract":"<jats:title>Abstract<\/jats:title><jats:sec>\n                <jats:title>Background<\/jats:title>\n                <jats:p>RNA-seq has become a standard technology to quantify mRNA. The measured values usually vary by several orders of magnitude, and while the detection of differences at high values is statistically well grounded, the significance of the differences for rare mRNAs can be weakened by the presence of biological and technical noise.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Results<\/jats:title>\n                <jats:p>We have developed a method for cleaning RNA-seq data, which improves the detection of differentially expressed genes and specifically genes with low to moderate transcription. Using a data modeling approach, parameters of randomly distributed mRNA counts are identified and reads, most probably originating from technical noise, are removed. We demonstrate that the removal of this random component leads to the significant increase in the number of detected differentially expressed genes, more significant <jats:italic>p<\/jats:italic>values and no bias towards low-count genes.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Conclusion<\/jats:title>\n                <jats:p>Application of RNAdeNoise to our RNA-seq data on polysome profiling and several published RNA-seq datasets reveals its suitability for different organisms and sequencing technologies such as Illumina and BGI, shows improved detection of differentially expressed genes, and excludes the subjective setting of thresholds for minimal RNA counts. The program, RNA-seq data, resulted gene lists and examples of use are in the supplementary data and at <jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" ext-link-type=\"uri\" xlink:href=\"https:\/\/github.com\/Deyneko\/RNAdeNoise\">https:\/\/github.com\/Deyneko\/RNAdeNoise<\/jats:ext-link>.<\/jats:p>\n              <\/jats:sec>","DOI":"10.1186\/s12859-022-05023-z","type":"journal-article","created":{"date-parts":[[2022,11,16]],"date-time":"2022-11-16T22:02:41Z","timestamp":1668636161000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":21,"title":["Modeling and cleaning RNA-seq data significantly improve detection of differentially expressed genes"],"prefix":"10.1186","volume":"23","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-3173-9604","authenticated-orcid":false,"given":"Igor V.","family":"Deyneko","sequence":"first","affiliation":[]},{"given":"Orkhan N.","family":"Mustafaev","sequence":"additional","affiliation":[]},{"given":"Alexander \u0410.","family":"Tyurin","sequence":"additional","affiliation":[]},{"given":"Ksenya V.","family":"Zhukova","sequence":"additional","affiliation":[]},{"given":"Alexander","family":"Varzari","sequence":"additional","affiliation":[]},{"given":"Irina V.","family":"Goldenkova-Pavlova","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2022,11,16]]},"reference":[{"key":"5023_CR1","doi-asserted-by":"crossref","unstructured":"Goldenkova-Pavlova IV, Pavlenko OS, Mustafaev ON, Deyneko IV, Kabardaeva KV, Tyurin AA. Computational and experimental tools to monitor the changes in translation efficiency of plant mrna on a genome-wide scale: advantages, limitations, and solutions. Int J Mol Sci 2018, 20(1).","DOI":"10.3390\/ijms20010033"},{"issue":"1","key":"5023_CR2","doi-asserted-by":"publisher","first-page":"139","DOI":"10.1093\/bioinformatics\/btp616","volume":"26","author":"MD Robinson","year":"2010","unstructured":"Robinson MD, McCarthy DJ, Smyth GK. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26(1):139\u201340.","journal-title":"Bioinformatics"},{"issue":"12","key":"5023_CR3","doi-asserted-by":"publisher","first-page":"550","DOI":"10.1186\/s13059-014-0550-8","volume":"15","author":"MI Love","year":"2014","unstructured":"Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15(12):550.","journal-title":"Genome Biol"},{"issue":"1","key":"5023_CR4","doi-asserted-by":"publisher","first-page":"136","DOI":"10.1093\/bioinformatics\/btp612","volume":"26","author":"L Wang","year":"2010","unstructured":"Wang L, Feng Z, Wang X, Wang X, Zhang X. DEGseq: an R package for identifying differentially expressed genes from RNA-seq data. Bioinformatics. 2010;26(1):136\u20138.","journal-title":"Bioinformatics"},{"issue":"12","key":"5023_CR5","doi-asserted-by":"publisher","first-page":"e70","DOI":"10.1093\/nar\/gkz223","volume":"47","author":"C Oertlin","year":"2019","unstructured":"Oertlin C, Lorent J, Murie C, Furic L, Topisirovic I, Larsson O. Generally applicable transcriptome-wide analysis of translation using anota2seq. Nucleic Acids Res. 2019;47(12):e70.","journal-title":"Nucleic Acids Res"},{"issue":"1","key":"5023_CR6","doi-asserted-by":"publisher","first-page":"139","DOI":"10.1093\/bioinformatics\/btw585","volume":"33","author":"Y Zhong","year":"2017","unstructured":"Zhong Y, Karaletsos T, Drewe P, Sreedharan VT, Kuo D, Singh K, Wendel HG, Ratsch G. RiboDiff: detecting changes of mRNA translation efficiency from ribosome footprints. Bioinformatics. 2017;33(1):139\u201341.","journal-title":"Bioinformatics"},{"issue":"1","key":"5023_CR7","doi-asserted-by":"publisher","first-page":"19737","DOI":"10.1038\/s41598-020-76881-x","volume":"10","author":"LA Corchete","year":"2020","unstructured":"Corchete LA, Rojas EA, Alonso-Lopez D, De Las Rivas J, Gutierrez NC, Burguillo FJ. Systematic comparison and assessment of RNA-seq procedures for gene expression quantitative analysis. Sci Rep. 2020;10(1):19737.","journal-title":"Sci Rep"},{"issue":"7","key":"5023_CR8","first-page":"410","volume":"15","author":"NM Davidson","year":"2014","unstructured":"Davidson NM, Oshlack A. Corset: enabling differential gene expression analysis for de novo assembled transcriptomes. Genome Biol. 2014;15(7):410.","journal-title":"Genome Biol"},{"issue":"Suppl 1","key":"5023_CR9","doi-asserted-by":"publisher","first-page":"6","DOI":"10.1186\/1471-2164-11-S1-S6","volume":"11","author":"W Koh","year":"2010","unstructured":"Koh W, Sheng CT, Tan B, Lee QY, Kuznetsov V, Kiang LS, Tanavde V. Analysis of deep sequencing microRNA expression profile from human embryonic stem cells derived mesenchymal stem cells reveals possible role of let-7 microRNA family in downstream targeting of hepatic nuclear factor 4 alpha. BMC Genomics. 2010;11(Suppl 1):6.","journal-title":"BMC Genomics"},{"issue":"13","key":"5023_CR10","doi-asserted-by":"publisher","first-page":"i383","DOI":"10.1093\/bioinformatics\/btr247","volume":"27","author":"PP \u0141abaj","year":"2011","unstructured":"\u0141abaj PP, Leparc GG, Linggi BE, Markillie LM, Wiley HS, Kreil DP. Characterization and improvement of RNA-Seq precision in quantitative transcript expression profiling. Bioinformatics. 2011;27(13):i383\u201391.","journal-title":"Bioinformatics"},{"issue":"3","key":"5023_CR11","doi-asserted-by":"publisher","first-page":"e17305","DOI":"10.1371\/journal.pone.0017305","volume":"6","author":"LT Sam","year":"2011","unstructured":"Sam LT, Lipson D, Raz T, Cao X, Thompson J, Milos PM, Robinson D, Chinnaiyan AM, Kumar-Sinha C, Maher CA. A comparison of single molecule and amplification based sequencing of cancer transcriptomes. PLoS ONE. 2011;6(3):e17305.","journal-title":"PLoS ONE"},{"issue":"5891","key":"5023_CR12","doi-asserted-by":"publisher","first-page":"956","DOI":"10.1126\/science.1160342","volume":"321","author":"M Sultan","year":"2008","unstructured":"Sultan M, Schulz MH, Richard H, Magen A, Klingenhoff A, Scherf M, Seifert M, Borodina T, Soldatov A, Parkhomchuk D, et al. A global view of gene activity and alternative splicing by deep sequencing of the human transcriptome. Science. 2008;321(5891):956\u201360.","journal-title":"Science"},{"issue":"3","key":"5023_CR13","doi-asserted-by":"publisher","first-page":"e17820","DOI":"10.1371\/journal.pone.0017820","volume":"6","author":"D Bottomly","year":"2011","unstructured":"Bottomly D, Walter NA, Hunter JE, Darakjian P, Kawane S, Buck KJ, Searles RP, Mooney M, McWeeney SK, Hitzemann R. Evaluating gene expression in C57BL\/6J and DBA\/2J mouse striatum using RNA-Seq and microarrays. PLoS ONE. 2011;6(3):e17820.","journal-title":"PLoS ONE"},{"issue":"5","key":"5023_CR14","doi-asserted-by":"publisher","first-page":"525","DOI":"10.1038\/nbt.3519","volume":"34","author":"NL Bray","year":"2016","unstructured":"Bray NL, Pimentel H, Melsted P, Pachter L. Near-optimal probabilistic RNA-seq quantification. Nat Biotechnol. 2016;34(5):525\u20137.","journal-title":"Nat Biotechnol"},{"issue":"21","key":"5023_CR15","doi-asserted-by":"publisher","first-page":"9546","DOI":"10.1073\/pnas.0914005107","volume":"107","author":"R Bourgon","year":"2010","unstructured":"Bourgon R, Gentleman R, Huber W. Independent filtering increases detection power for high-throughput experiments. Proc Natl Acad Sci U S A. 2010;107(21):9546\u201351.","journal-title":"Proc Natl Acad Sci U S A"},{"issue":"17","key":"5023_CR16","doi-asserted-by":"publisher","first-page":"2146","DOI":"10.1093\/bioinformatics\/btt350","volume":"29","author":"A Rau","year":"2013","unstructured":"Rau A, Gallopin M, Celeux G, Jaffrezic F. Data-based filtering for replicated high-throughput transcriptome sequencing experiments. Bioinformatics. 2013;29(17):2146\u201352.","journal-title":"Bioinformatics"},{"issue":"9","key":"5023_CR17","doi-asserted-by":"publisher","first-page":"1509","DOI":"10.1101\/gr.079558.108","volume":"18","author":"JC Marioni","year":"2008","unstructured":"Marioni JC, Mason CE, Mane SM, Stephens M, Gilad Y. RNA-seq: an assessment of technical reproducibility and comparison with gene expression arrays. Genome Res. 2008;18(9):1509\u201317.","journal-title":"Genome Res"},{"issue":"10","key":"5023_CR18","doi-asserted-by":"publisher","first-page":"4288","DOI":"10.1093\/nar\/gks042","volume":"40","author":"DJ McCarthy","year":"2012","unstructured":"McCarthy DJ, Chen Y, Smyth GK. Differential expression analysis of multifactor RNA-Seq experiments with respect to biological variation. Nucleic Acids Res. 2012;40(10):4288\u201397.","journal-title":"Nucleic Acids Res"},{"issue":"10","key":"5023_CR19","doi-asserted-by":"publisher","first-page":"R106","DOI":"10.1186\/gb-2010-11-10-r106","volume":"11","author":"S Anders","year":"2010","unstructured":"Anders S, Huber W. Differential expression analysis for sequence count data. Genome Biol. 2010;11(10):R106.","journal-title":"Genome Biol"},{"key":"5023_CR20","unstructured":"Team RC. R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna. https:\/\/www.R-project.org 2018."},{"issue":"1","key":"5023_CR21","doi-asserted-by":"publisher","first-page":"269","DOI":"10.1186\/s12967-021-02936-w","volume":"19","author":"Y Zhao","year":"2021","unstructured":"Zhao Y, Li MC, Konate MM, Chen L, Das B, Karlovich C, Williams PM, Evrard YA, Doroshow JH, McShane LM. TPM, FPKM, or Normalized Counts? A Comparative study of quantification measures for the analysis of RNA-seq data from the NCI patient-derived models repository. J Transl Med. 2021;19(1):269.","journal-title":"J Transl Med"},{"issue":"13","key":"5023_CR22","doi-asserted-by":"publisher","first-page":"1805","DOI":"10.1093\/bioinformatics\/bts251","volume":"28","author":"X Jiao","year":"2012","unstructured":"Jiao X, Sherman BT, Huang da W, Stephens R, Baseler MW, Lane HC, Lempicki RA. DAVID-WS: a stateful web service to facilitate gene\/protein list analysis. Bioinformatics. 2012;28(13):1805\u20136.","journal-title":"Bioinformatics"},{"issue":"8","key":"5023_CR23","doi-asserted-by":"publisher","first-page":"e0237907","DOI":"10.1371\/journal.pone.0237907","volume":"15","author":"B Dufek","year":"2020","unstructured":"Dufek B, Meehan DT, Delimont D, Wilhelm K, Samuelson G, Coenen R, Madison J, Doyle E, Smyth B, Phillips G, et al. RNA-seq analysis of gene expression profiles in isolated stria vascularis from wild-type and Alport mice reveals key pathways underling Alport strial pathogenesis. PLoS ONE. 2020;15(8):e0237907.","journal-title":"PLoS ONE"},{"issue":"W1","key":"5023_CR24","doi-asserted-by":"publisher","first-page":"W537","DOI":"10.1093\/nar\/gky379","volume":"46","author":"E Afgan","year":"2018","unstructured":"Afgan E, Baker D, Batut B, van den Beek M, Bouvier D, Cech M, Chilton J, Clements D, Coraor N, Gruning BA, et al. The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2018 update. Nucleic Acids Res. 2018;46(W1):W537\u201344.","journal-title":"Nucleic Acids Res"},{"issue":"12","key":"5023_CR25","doi-asserted-by":"publisher","first-page":"e0242986","DOI":"10.1371\/journal.pone.0242986","volume":"15","author":"W Mhiri","year":"2020","unstructured":"Mhiri W, Ceylan M, Turgut-Kara N, Nalbanto\u011flu B, \u00c7ak\u0131r \u00d6. Transcriptomic analysis reveals responses to Cycloastragenol in Arabidopsis thaliana. PLoS ONE. 2020;15(12):e0242986.","journal-title":"PLoS ONE"},{"issue":"10","key":"5023_CR26","doi-asserted-by":"publisher","first-page":"986","DOI":"10.1101\/gr.7.10.986","volume":"7","author":"S Audic","year":"1997","unstructured":"Audic S, Claverie JM. The significance of digital gene expression profiles. Genome Res. 1997;7(10):986\u201395.","journal-title":"Genome Res"},{"issue":"7","key":"5023_CR27","doi-asserted-by":"publisher","first-page":"2164","DOI":"10.1093\/plcell\/koab113","volume":"33","author":"T Bonnot","year":"2021","unstructured":"Bonnot T, Nagel DH. Time of the day prioritizes the pool of translating mRNAs in response to heat stress. Plant Cell. 2021;33(7):2164\u201382.","journal-title":"Plant Cell"},{"issue":"2","key":"5023_CR28","doi-asserted-by":"publisher","first-page":"366","DOI":"10.1111\/j.1365-313X.2004.02051.x","volume":"38","author":"T Czechowski","year":"2004","unstructured":"Czechowski T, Bari RP, Stitt M, Scheible WR, Udvardi MK. Real-time RT-PCR profiling of over 1400 Arabidopsis transcription factors: unprecedented sensitivity reveals novel root- and shoot-specific genes. Plant J. 2004;38(2):366\u201379.","journal-title":"Plant J"},{"issue":"5","key":"5023_CR29","doi-asserted-by":"publisher","first-page":"501","DOI":"10.1038\/ng1543","volume":"37","author":"M Schmid","year":"2005","unstructured":"Schmid M, Davison TS, Henz SR, Pape UJ, Demar M, Vingron M, Scholkopf B, Weigel D, Lohmann JU. A gene expression map of Arabidopsis thaliana development. Nat Genet. 2005;37(5):501\u20136.","journal-title":"Nat Genet"},{"key":"5023_CR30","doi-asserted-by":"publisher","first-page":"e9952","DOI":"10.7717\/peerj.9952","volume":"8","author":"W Cai","year":"2020","unstructured":"Cai W, Zhou W, Han Z, Lei J, Zhuang J, Zhu P, Wu X, Yuan W. Master regulator genes and their impact on major diseases. PeerJ. 2020;8:e9952.","journal-title":"PeerJ"},{"issue":"3","key":"5023_CR31","doi-asserted-by":"publisher","first-page":"1537","DOI":"10.1093\/nar\/gkv014","volume":"43","author":"BH Yang","year":"2015","unstructured":"Yang BH, Floess S, Hagemann S, Deyneko IV, Groebe L, Pezoldt J, Sparwasser T, Lochner M, Huehn J. Development of a unique epigenetic signature during in vivo Th17 differentiation. Nucleic Acids Res. 2015;43(3):1537\u201348.","journal-title":"Nucleic Acids Res"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-022-05023-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s12859-022-05023-z\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-022-05023-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,11,16]],"date-time":"2022-11-16T22:04:03Z","timestamp":1668636243000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-022-05023-z"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,11,16]]},"references-count":31,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2022,12]]}},"alternative-id":["5023"],"URL":"https:\/\/doi.org\/10.1186\/s12859-022-05023-z","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,11,16]]},"assertion":[{"value":"5 July 2022","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"29 October 2022","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"16 November 2022","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no competing interests.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"488"}}