{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,14]],"date-time":"2026-03-14T21:38:45Z","timestamp":1773524325173,"version":"3.50.1"},"reference-count":23,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2005,5,16]],"date-time":"2005-05-16T00:00:00Z","timestamp":1116201600000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/2.0\/"},{"start":{"date-parts":[[2005,5,16]],"date-time":"2005-05-16T00:00:00Z","timestamp":1116201600000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/2.0\/"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"abstract":"<jats:title>Abstract<\/jats:title><jats:sec>\n                        <jats:title>Background<\/jats:title>\n                        <jats:p>Stochastic dependence between gene expression levels in microarray data is of critical importance for the methods of statistical inference that resort to pooling test-statistics across genes. It is frequently assumed that dependence between genes (or tests) is suffciently weak to justify the proposed methods of testing for differentially expressed genes. A potential impact of between-gene correlations on the performance of such methods has yet to be explored.<\/jats:p>\n                     <\/jats:sec><jats:sec>\n                        <jats:title>Results<\/jats:title>\n                        <jats:p>The paper presents a systematic study of correlation between the <jats:italic>t<\/jats:italic>-statistics associated with different genes. We report the effects of four different normalization methods using a large set of microarray data on childhood leukemia in addition to several sets of simulated data. Our findings help decipher the correlation structure of microarray data before and after the application of normalization procedures.<\/jats:p>\n                     <\/jats:sec><jats:sec>\n                        <jats:title>Conclusion<\/jats:title>\n                        <jats:p>A long-range correlation in microarray data manifests itself in thousands of genes that are heavily correlated with a given gene in terms of the associated <jats:italic>t<\/jats:italic>-statistics. By using normalization methods it is possible to significantly reduce correlation between the <jats:italic>t<\/jats:italic>-statistics computed for different genes. Normalization procedures affect both the true correlation, stemming from gene interactions, and the spurious correlation induced by random noise. When analyzing real world biological data sets, normalization procedures are unable to completely remove correlation between the test statistics. The long-range correlation structure also persists in normalized data.<\/jats:p>\n                     <\/jats:sec>","DOI":"10.1186\/1471-2105-6-120","type":"journal-article","created":{"date-parts":[[2005,5,17]],"date-time":"2005-05-17T06:14:11Z","timestamp":1116310451000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":76,"title":["The effects of normalization on the correlation structure of microarray data"],"prefix":"10.1186","volume":"6","author":[{"given":"Xing","family":"Qiu","sequence":"first","affiliation":[]},{"given":"Andrew I","family":"Brooks","sequence":"additional","affiliation":[]},{"given":"Lev","family":"Klebanov","sequence":"additional","affiliation":[]},{"given":"Andrei","family":"Yakovlev","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2005,5,16]]},"reference":[{"key":"445_CR1","doi-asserted-by":"publisher","first-page":"1151","DOI":"10.1198\/016214501753382129","volume":"96","author":"B Efron","year":"2001","unstructured":"Efron B, Tibshirani R, Storey JD, Tusher V: Empirical Bayes analysis of a microarray experiment. J Amer Statist Assoc 2001, 96: 1151\u20131160. 10.1198\/016214501753382129","journal-title":"J Amer Statist Assoc"},{"key":"445_CR2","doi-asserted-by":"publisher","first-page":"366","DOI":"10.1214\/aos\/1051027871","volume":"31","author":"B Efron","year":"2003","unstructured":"Efron B: Robbins, empirical Bayes and microarrays. Ann Statist 2003, 31: 366\u2013378. 10.1214\/aos\/1051027871","journal-title":"Ann Statist"},{"key":"445_CR3","doi-asserted-by":"publisher","first-page":"96","DOI":"10.1198\/016214504000000089","volume":"99","author":"B Efron","year":"2004","unstructured":"Efron B: Large-scale simultaneous hypothesis testing: The choice of a null hypothesis. J Amer Statist Assoc 2004, 99: 96\u2013104.","journal-title":"J Amer Statist Assoc"},{"issue":"1","key":"445_CR4","doi-asserted-by":"publisher","first-page":"37","DOI":"10.1089\/106652701300099074","volume":"8","author":"MA Newton","year":"2000","unstructured":"Newton MA, Kendziorski CM, Richmond CS, Blattner FR, Tsui KW: On differential variability of expression ratios: Improving statistical inference about gene expression changes from microarray data. J of Comput Biol 2000, 8(1):37\u201352. 10.1089\/106652701300099074","journal-title":"J of Comput Biol"},{"key":"445_CR5","doi-asserted-by":"publisher","first-page":"254","DOI":"10.1007\/0-387-21679-0_11","volume-title":"The Analysis of Gene Expression Data","author":"MA Newton","year":"2003","unstructured":"Newton MA, Kendziorski CM: Parametric empirical Bayes methods for microarrays. In The Analysis of Gene Expression Data. Edited by: Parmigiani G, Garrett ES, Irizarry RA, Zeger SL. Springer, New York; 2003:254\u2013271."},{"key":"445_CR6","doi-asserted-by":"publisher","DOI":"10.1002\/047172842X","volume-title":"Analyzing Microarray Gene Expression Data","author":"GJ McLachlan","year":"2004","unstructured":"McLachlan GJ, Do K-A, Ambroise C: Analyzing Microarray Gene Expression Data. Wiley, New York; 2004."},{"key":"445_CR7","volume-title":"Bioinformatics","author":"C Dalmasso","year":"2004","unstructured":"Dalmasso C, Broet P, Moreau T: A simple procedure for estimating the false discovery rate. Bioinformatics 2004."},{"key":"445_CR8","first-page":"2562","volume-title":"Bioinfor-matics","author":"P Broet","year":"2004","unstructured":"Broet P, Lewin A, Richardson S, Dalmasso C, Magdalenat H: A mixture model-based strategy for selecting genes in multiclass response microarray experiments. Bioinfor-matics 2004, 2562\u20132571. 10.1093\/bioinformatics\/bth285"},{"issue":"6","key":"445_CR9","doi-asserted-by":"publisher","first-page":"805","DOI":"10.1089\/10665270050514945","volume":"7","author":"T Ideker","year":"2000","unstructured":"Ideker T, Thorsson V, Seigel AF, Hood LE: Testing for differentially expressed genes by maximum likelihood analysis of microarray data. J Comput Biol 2000, 7(6):805\u2013817. 10.1089\/10665270050514945","journal-title":"J Comput Biol"},{"key":"445_CR10","doi-asserted-by":"publisher","first-page":"i264","DOI":"10.1093\/bioinformatics\/btg1037","volume":"19","author":"E Segal","year":"2003","unstructured":"Segal E, Wang H, Koller D: Discovering molecular pathways from protein interactions and gene expression data. Bioinformatics 2003, 19: i264-i272. 10.1093\/bioinformatics\/btg1037","journal-title":"Bioinformatics"},{"key":"445_CR11","doi-asserted-by":"publisher","first-page":"1071","DOI":"10.1111\/j.0006-341X.2003.00123.x","volume":"59","author":"C-A Tsai","year":"2003","unstructured":"Tsai C-A, Hsueh H-M, Chen JJ: Estimation of false discovery rates in multiple testing: application to gene microarray data. Biometrics 2003, 59: 1071\u20131081. 10.1111\/j.0006-341X.2003.00123.x","journal-title":"Biometrics"},{"key":"445_CR12","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1007\/BF02595811","volume":"12","author":"JD Storey","year":"2003","unstructured":"Storey JD: Comment on 'Resampling-based multiple testing for DNA microarray data analysis' by Ge, Dudoit, and Speed. Test 2003, 12: 1\u201377.","journal-title":"Test"},{"key":"445_CR13","doi-asserted-by":"publisher","first-page":"187","DOI":"10.1111\/j.1467-9868.2004.00439.x","volume":"66","author":"JD Storey","year":"2004","unstructured":"Storey JD, Taylor JE, Siegmund D: Strong control, conservative point estimation and simultaneous conservative consistency of false discovery rates: a unified approach. J R Statist Soc B 2004, 66: 187\u2013205. 10.1111\/j.1467-9868.2004.00439.x","journal-title":"J R Statist Soc B"},{"issue":"1","key":"445_CR14","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1007\/BF02595811","volume":"12","author":"Y Ge","year":"2003","unstructured":"Ge Y, Dudoit S, Speed TP: Resampling-based multiple testing for DNA microarray data analysis. TEST 2003, 12(1):1\u201344.","journal-title":"TEST"},{"key":"445_CR15","doi-asserted-by":"publisher","first-page":"707","DOI":"10.1093\/bioinformatics\/16.8.707","volume":"16","author":"P D'haeseller","year":"2000","unstructured":"D'haeseller P, Liang S, Somogyi R: Genetic network inference: from co-expression clustering to reverse engineering. Bioinformatics 2000, 16: 707\u2013726. 10.1093\/bioinformatics\/16.8.707","journal-title":"Bioinformatics"},{"key":"445_CR16","doi-asserted-by":"publisher","first-page":"33","DOI":"10.1186\/1471-2105-4-33","volume":"4","author":"T Park","year":"2003","unstructured":"Park T, Yi S-G, Kang S-H, Lee S-Y, Lee Y-S, Simon R: Evaluation of normalization methods for microarray data. BMC Bioinformatics 2003, 4: 33. 10.1186\/1471-2105-4-33","journal-title":"BMC Bioinformatics"},{"key":"445_CR17","doi-asserted-by":"publisher","first-page":"630","DOI":"10.1126\/science.306.5696.630","volume":"306","author":"E Marshall","year":"2004","unstructured":"Marshall E: Getting the noise out of gene arrays. Science 2004, 306: 630\u2013631. 10.1126\/science.306.5696.630","journal-title":"Science"},{"key":"445_CR18","unstructured":"St. Jude Children's Research Hospital (SJCRH) Database on childhood leukemia[http:\/\/www.stjuderesearch.org\/data\/ALL1\/]"},{"issue":"2","key":"445_CR19","doi-asserted-by":"publisher","first-page":"251","DOI":"10.1093\/bioinformatics\/18.2.251","volume":"18","author":"A Tsodikov","year":"2002","unstructured":"Tsodikov A, Szabo A, Jones D: Adjustments and measures of differential expression for microarray data. Bioinformatics 2002, 18(2):251\u2013260. 10.1093\/bioinformatics\/18.2.251","journal-title":"Bioinformatics"},{"key":"445_CR20","doi-asserted-by":"publisher","first-page":"71","DOI":"10.1016\/S0025-5564(01)00103-1","volume":"176","author":"A Szabo","year":"2002","unstructured":"Szabo A, Boucher K, Carroll W, Klebanov L, Tsodikov A, Yakovlev A: Variable selection and pattern recognition with gene expression data generated by the microarray technology. Mathematical Biosciences 2002, 176: 71\u201398. 10.1016\/S0025-5564(01)00103-1","journal-title":"Mathematical Biosciences"},{"issue":"2","key":"445_CR21","doi-asserted-by":"publisher","first-page":"185","DOI":"10.1093\/bioinformatics\/19.2.185","volume":"19","author":"BM Bolstad","year":"2003","unstructured":"Bolstad BM, Irizarry RA, Astrand M, Speed TP: A comparison of normalization methods for high density oligonucleotide array data based on variance and bias. Bioinfor-matics 2003, 19(2):185\u2013193. 10.1093\/bioinformatics\/19.2.185","journal-title":"Bioinfor-matics"},{"key":"445_CR22","volume-title":"Design and Analysis of DNA Microarray Investigations","author":"RM Simon","year":"2003","unstructured":"Simon RM, Korn EL, McShane LM, Radmacher MD, Wright GW, Zhao Y: . In Design and Analysis of DNA Microarray Investigations. Springer, New York; 2003."},{"key":"445_CR23","doi-asserted-by":"publisher","first-page":"102","DOI":"10.1007\/0-387-21679-0_4","volume-title":"The Analysis of Gene Expression Data","author":"RA Irizarry","year":"2003","unstructured":"Irizarry RA, Gautier L, Cope LM: An R package for analyses of Affymetrix oligonu-cleotide arrays. In The Analysis of Gene Expression Data. Edited by: Parmigiani G, Garrett ES, Irizarry RA, Zeger SL. Springer, New York; 2003:102\u2013119."}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-6-120.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/1471-2105-6-120\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-6-120.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,10,7]],"date-time":"2024-10-07T12:11:59Z","timestamp":1728303119000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-6-120"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2005,5,16]]},"references-count":23,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2005,12]]}},"alternative-id":["445"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-6-120","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2005,5,16]]},"assertion":[{"value":"16 December 2004","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"16 May 2005","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"16 May 2005","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"120"}}