{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,28]],"date-time":"2026-03-28T09:03:48Z","timestamp":1774688628989,"version":"3.50.1"},"update-to":[{"DOI":"10.1371\/journal.pcbi.1009954","type":"new_version","label":"New version","source":"publisher","updated":{"date-parts":[[2022,4,14]],"date-time":"2022-04-14T00:00:00Z","timestamp":1649894400000}}],"reference-count":35,"publisher":"Public Library of Science (PLoS)","issue":"3","license":[{"start":{"date-parts":[[2022,3,30]],"date-time":"2022-03-30T00:00:00Z","timestamp":1648598400000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/100000057","name":"National Institute of General Medical Sciences","doi-asserted-by":"publisher","award":["R01GM121459"],"award-info":[{"award-number":["R01GM121459"]}],"id":[{"id":"10.13039\/100000057","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000051","name":"National Human Genome Research Institute","doi-asserted-by":"publisher","award":["R00HG009007"],"award-info":[{"award-number":["R00HG009007"]}],"id":[{"id":"10.13039\/100000051","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000923","name":"Silicon Valley Community Foundation","doi-asserted-by":"publisher","award":["CZF2019-002443"],"award-info":[{"award-number":["CZF2019-002443"]}],"id":[{"id":"10.13039\/100000923","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000923","name":"Silicon Valley Community Foundation","doi-asserted-by":"publisher","award":["CZF2018-183446"],"award-info":[{"award-number":["CZF2018-183446"]}],"id":[{"id":"10.13039\/100000923","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["www.ploscompbiol.org"],"crossmark-restriction":false},"short-container-title":["PLoS Comput Biol"],"abstract":"<jats:p>Estimates of correlation between pairs of genes in co-expression analysis are commonly used to construct networks among genes using gene expression data. As previously noted, the distribution of such correlations depends on the observed expression level of the involved genes, which we refer to this as a<jats:italic>mean-correlation relationship<\/jats:italic>in RNA-seq data, both bulk and single-cell. This dependence introduces an unwanted technical bias in co-expression analysis whereby highly expressed genes are more likely to be highly correlated. Such a relationship is not observed in protein-protein interaction data, suggesting that it is not reflecting biology. Ignoring this bias can lead to missing potentially biologically relevant pairs of genes that are lowly expressed, such as transcription factors. To address this problem, we introduce spatial quantile normalization (SpQN), a method for normalizing local distributions in a correlation matrix. We show that spatial quantile normalization removes the mean-correlation relationship and corrects the expression bias in network reconstruction.<\/jats:p>","DOI":"10.1371\/journal.pcbi.1009954","type":"journal-article","created":{"date-parts":[[2022,3,30]],"date-time":"2022-03-30T17:38:54Z","timestamp":1648661934000},"page":"e1009954","update-policy":"https:\/\/doi.org\/10.1371\/journal.pcbi.corrections_policy","source":"Crossref","is-referenced-by-count":22,"title":["Addressing the mean-correlation relationship in co-expression analysis"],"prefix":"10.1371","volume":"18","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-6639-0054","authenticated-orcid":true,"given":"Yi","family":"Wang","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7858-0231","authenticated-orcid":true,"given":"Stephanie C.","family":"Hicks","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0086-0687","authenticated-orcid":true,"given":"Kasper D.","family":"Hansen","sequence":"additional","affiliation":[]}],"member":"340","published-online":{"date-parts":[[2022,3,30]]},"reference":[{"issue":"4","key":"pcbi.1009954.ref001","first-page":"575","article-title":"Gene co-expression analysis for functional classification and gene-disease predictions","volume":"19","author":"S van Dam","year":"2018","journal-title":"Brief Bioinform"},{"issue":"3","key":"pcbi.1009954.ref002","doi-asserted-by":"crossref","first-page":"447","DOI":"10.1111\/tpj.13502","article-title":"Phylogenomic analysis of gene co-expression networks reveals the evolution of functional modules","volume":"90","author":"C Ruprecht","year":"2017","journal-title":"Plant J"},{"key":"pcbi.1009954.ref003","doi-asserted-by":"crossref","first-page":"559","DOI":"10.1186\/1471-2105-9-559","article-title":"WGCNA: an R package for weighted correlation network analysis","volume":"9","author":"P Langfelder","year":"2008","journal-title":"BMC Bioinformatics"},{"issue":"3","key":"pcbi.1009954.ref004","doi-asserted-by":"crossref","first-page":"432","DOI":"10.1093\/biostatistics\/kxm045","article-title":"Sparse inverse covariance estimation with the graphical lasso","volume":"9","author":"J Friedman","year":"2008","journal-title":"Biostatistics"},{"issue":"8","key":"pcbi.1009954.ref005","doi-asserted-by":"crossref","first-page":"e130","DOI":"10.1371\/journal.pgen.0020130","article-title":"Integrating genetic and network analysis to characterize genes related to mouse weight","volume":"2","author":"A Ghazalpour","year":"2006","journal-title":"PLOS Genetics"},{"issue":"11","key":"pcbi.1009954.ref006","doi-asserted-by":"crossref","first-page":"1271","DOI":"10.1038\/nn.2207","article-title":"Functional organization of the transcriptome in human brain","volume":"11","author":"MC Oldham","year":"2008","journal-title":"Nature Neuroscience"},{"issue":"5","key":"pcbi.1009954.ref007","doi-asserted-by":"crossref","first-page":"997","DOI":"10.1016\/j.cell.2013.10.020","article-title":"Coexpression networks implicate human midfetal deep cortical projection neurons in the pathogenesis of autism","volume":"155","author":"AJ Willsey","year":"2013","journal-title":"Cell"},{"issue":"11","key":"pcbi.1009954.ref008","doi-asserted-by":"crossref","first-page":"1843","DOI":"10.1101\/gr.216721.116","article-title":"Co-expression networks reveal the tissue-specific regulation of transcription and splicing","volume":"27","author":"A Saha","year":"2017","journal-title":"Genome Research"},{"issue":"4","key":"pcbi.1009954.ref009","doi-asserted-by":"crossref","first-page":"532","DOI":"10.1101\/gr.239442.118","article-title":"Coexpression patterns define epigenetic regulators associated with neurological dysfunction","volume":"29","author":"L Boukas","year":"2019","journal-title":"Genome Research"},{"key":"pcbi.1009954.ref010","doi-asserted-by":"crossref","first-page":"309","DOI":"10.1186\/s12859-015-0745-3","article-title":"Systematic noise degrades gene co-expression signals but can be corrected","volume":"16","author":"S Freytag","year":"2015","journal-title":"BMC Bioinformatics"},{"issue":"1","key":"pcbi.1009954.ref011","doi-asserted-by":"crossref","first-page":"94","DOI":"10.1186\/s13059-019-1700-9","article-title":"Addressing confounding artifacts in reconstruction of gene co-expression networks","volume":"20","author":"P Parsana","year":"2019","journal-title":"Genome Biology"},{"key":"pcbi.1009954.ref012","article-title":"The effect of tissue composition on gene co-expression","author":"Y Zhang","year":"2019","journal-title":"Briefings in Bioinformatics"},{"key":"pcbi.1009954.ref013","doi-asserted-by":"crossref","first-page":"101","DOI":"10.1186\/s13059-016-0964-6","article-title":"Exploiting single-cell expression to characterize co-expression replicability","volume":"17","author":"M Crow","year":"2016","journal-title":"Genome Biology"},{"issue":"1","key":"pcbi.1009954.ref014","doi-asserted-by":"crossref","first-page":"55","DOI":"10.1093\/bioinformatics\/bty538","article-title":"Differential coexpression in human tissues and the confounding effect of mean expression levels","volume":"35","author":"M Farahbod","year":"2019","journal-title":"Bioinformatics"},{"issue":"7675","key":"pcbi.1009954.ref015","doi-asserted-by":"crossref","first-page":"204","DOI":"10.1038\/nature24277","article-title":"Genetic effects on gene expression across human tissues","volume":"550","author":"GTEx Consortium","year":"2017","journal-title":"Nature"},{"issue":"6167","key":"pcbi.1009954.ref016","doi-asserted-by":"crossref","first-page":"193","DOI":"10.1126\/science.1245316","article-title":"Single-cell RNA-seq reveals dynamic, random monoallelic gene expression in mammalian cells","volume":"343","author":"Q Deng","year":"2014","journal-title":"Science"},{"key":"pcbi.1009954.ref017","doi-asserted-by":"crossref","first-page":"449","DOI":"10.1186\/1471-2105-12-449","article-title":"ReCount: a multi-experiment resource of analysis-ready RNA-seq gene count datasets","volume":"12","author":"AC Frazee","year":"2011","journal-title":"BMC Bioinformatics"},{"key":"pcbi.1009954.ref018","first-page":"605451","article-title":"A reference map of the human protein interactome","author":"K Luck","year":"2019","journal-title":"bioRxiv"},{"issue":"3","key":"pcbi.1009954.ref019","doi-asserted-by":"crossref","first-page":"432","DOI":"10.1093\/biostatistics\/kxm045","article-title":"Sparse inverse covariance estimation with the graphical lasso","volume":"9","author":"J Friedman","year":"2008","journal-title":"Biostatistics"},{"key":"pcbi.1009954.ref020","first-page":"2330","volume-title":"Advances in Neural Information Processing Systems","author":"CJ Hsieh","year":"2011"},{"issue":"6","key":"pcbi.1009954.ref021","doi-asserted-by":"crossref","first-page":"882","DOI":"10.1093\/bioinformatics\/bts034","article-title":"The sva package for removing batch effects and other unwanted variation in high-throughput experiments","volume":"28","author":"JT Leek","year":"2012","journal-title":"Bioinformatics"},{"issue":"9","key":"pcbi.1009954.ref022","first-page":"1724","article-title":"Capturing heterogeneity in gene expression studies by surrogate variable analysis","volume":"3","author":"JT Leek","year":"2007","journal-title":"PLOS Genetics"},{"issue":"48","key":"pcbi.1009954.ref023","doi-asserted-by":"crossref","first-page":"18718","DOI":"10.1073\/pnas.0808709105","article-title":"A general framework for multiple testing dependence","volume":"105","author":"JT Leek","year":"2008","journal-title":"Proceedings of the National Academy of Sciences of the United States of America"},{"issue":"1","key":"pcbi.1009954.ref024","doi-asserted-by":"crossref","first-page":"139","DOI":"10.1093\/bioinformatics\/btp616","article-title":"edgeR: a Bioconductor package for differential expression analysis of digital gene expression data","volume":"26","author":"MD Robinson","year":"2010","journal-title":"Bioinformatics"},{"issue":"12","key":"pcbi.1009954.ref025","doi-asserted-by":"crossref","first-page":"550","DOI":"10.1186\/s13059-014-0550-8","article-title":"Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2","volume":"15","author":"MI Love","year":"2014","journal-title":"Genome Biol"},{"issue":"9","key":"pcbi.1009954.ref026","doi-asserted-by":"crossref","first-page":"1509","DOI":"10.1101\/gr.079558.108","article-title":"RNA-seq: an assessment of technical reproducibility and comparison with gene expression arrays","volume":"18","author":"JC Marioni","year":"2008","journal-title":"Genome Research"},{"key":"pcbi.1009954.ref027","doi-asserted-by":"crossref","first-page":"94","DOI":"10.1186\/1471-2105-11-94","article-title":"Evaluation of statistical methods for normalization and differential expression in mRNA-Seq experiments","volume":"11","author":"JH Bullard","year":"2010","journal-title":"BMC Bioinformatics"},{"issue":"456","key":"pcbi.1009954.ref028","doi-asserted-by":"crossref","first-page":"1161","DOI":"10.1198\/016214501753381814","article-title":"Analysis of Data from Viral DNA Microchips","volume":"96","author":"D Amaratunga","year":"2001","journal-title":"Journal of American Statistical Association"},{"issue":"9","key":"pcbi.1009954.ref029","doi-asserted-by":"crossref","first-page":"research0048","DOI":"10.1186\/gb-2002-3-9-research0048","article-title":"A new non-linear normalization method for reducing variability in DNA microarray experiments","volume":"3","author":"C Workman","year":"2002","journal-title":"Genome Biology"},{"issue":"2","key":"pcbi.1009954.ref030","doi-asserted-by":"crossref","first-page":"185","DOI":"10.1093\/bioinformatics\/19.2.185","article-title":"A comparison of normalization methods for high density oligonucleotide array data based on variance and bias","volume":"19","author":"B Bolstad","year":"2003","journal-title":"Bioinformatics"},{"issue":"4","key":"pcbi.1009954.ref031","doi-asserted-by":"crossref","first-page":"252","DOI":"10.1038\/nrg2538","article-title":"A census of human transcription factors: function, expression and evolution","volume":"10","author":"JM Vaquerizas","year":"2009","journal-title":"Nature Reviews Genetics"},{"issue":"6280","key":"pcbi.1009954.ref032","doi-asserted-by":"crossref","first-page":"1450","DOI":"10.1126\/science.aad2257","article-title":"Survey of variation in human transcription factors reveals prevalent DNA binding changes","volume":"351","author":"LA Barrera","year":"2016","journal-title":"Science"},{"issue":"83","key":"pcbi.1009954.ref033","first-page":"2911","article-title":"QUIC: Quadratic Approximation for Sparse Inverse Covariance Estimation","volume":"15","author":"CJ Hsieh","year":"2014","journal-title":"J Mach Learn Res"},{"issue":"1","key":"pcbi.1009954.ref034","doi-asserted-by":"crossref","first-page":"118","DOI":"10.1093\/biostatistics\/kxj037","article-title":"Adjusting batch effects in microarray expression data using empirical Bayes methods","volume":"8","author":"WE Johnson","year":"2007","journal-title":"Biostatistics"},{"key":"pcbi.1009954.ref035","doi-asserted-by":"crossref","first-page":"180061","DOI":"10.1038\/sdata.2018.61","article-title":"Unifying cancer and normal RNA sequencing data from different sources","volume":"5","author":"Q Wang","year":"2018","journal-title":"Sci Data"}],"updated-by":[{"DOI":"10.1371\/journal.pcbi.1009954","type":"new_version","label":"New version","source":"publisher","updated":{"date-parts":[[2022,4,14]],"date-time":"2022-04-14T00:00:00Z","timestamp":1649894400000}}],"container-title":["PLOS Computational Biology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1009954","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,30]],"date-time":"2023-01-30T23:58:16Z","timestamp":1675123096000},"score":1,"resource":{"primary":{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1009954"}},"subtitle":[],"editor":[{"given":"Michael","family":"Hawrylycz","sequence":"first","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2022,3,30]]},"references-count":35,"journal-issue":{"issue":"3","published-online":{"date-parts":[[2022,3,30]]}},"URL":"https:\/\/doi.org\/10.1371\/journal.pcbi.1009954","relation":{"new_version":[{"id-type":"doi","id":"10.1371\/journal.pcbi.1009954","asserted-by":"object"}]},"ISSN":["1553-7358"],"issn-type":[{"value":"1553-7358","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,3,30]]}}}