{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,12]],"date-time":"2026-03-12T00:26:36Z","timestamp":1773275196571,"version":"3.50.1"},"reference-count":30,"publisher":"Springer Science and Business Media LLC","issue":"1","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2006,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Background<\/jats:title><jats:p>The number of genes declared differentially expressed is a random variable and its variability can be assessed by resampling techniques. Another important stability indicator is the frequency with which a given gene is selected across subsamples. We have conducted studies to assess stability and some other properties of several gene selection procedures with biological and simulated data.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>Using resampling techniques we have found that some genes are selected much less frequently (across sub-samples) than other genes with the same adjusted<jats:italic>p<\/jats:italic>-values. The extent to which this type of instability manifests itself can be assessed by a method introduced in this paper. The effect of correlation between gene expression levels on the performance of multiple testing procedures is studied by computer simulations.<\/jats:p><\/jats:sec><jats:sec><jats:title>Conclusion<\/jats:title><jats:p>Resampling represents a tool for reducing the set of initially selected genes to those with a sufficiently high selection frequency. Using resampling techniques it is also possible to assess variability of different performance indicators. Stability properties of several multiple testing procedures are described at length in the present paper.<\/jats:p><\/jats:sec>","DOI":"10.1186\/1471-2105-7-50","type":"journal-article","created":{"date-parts":[[2006,2,4]],"date-time":"2006-02-04T19:16:50Z","timestamp":1139080610000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":60,"title":["Assessing stability of gene selection in microarray data analysis"],"prefix":"10.1186","volume":"7","author":[{"given":"Xing","family":"Qiu","sequence":"first","affiliation":[]},{"given":"Yuanhui","family":"Xiao","sequence":"additional","affiliation":[]},{"given":"Alexander","family":"Gordon","sequence":"additional","affiliation":[]},{"given":"Andrei","family":"Yakovlev","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2006,2,1]]},"reference":[{"key":"789_CR1","doi-asserted-by":"publisher","first-page":"120","DOI":"10.1186\/1471-2105-6-120","volume":"6","author":"X Qiu","year":"2005","unstructured":"Qiu X, Brooks AI, Klebanov L, Yakovlev A: The effects of normalization on the correlation structure of microarray data. BMC Bioinformatics 2005, 6: 120. 10.1186\/1471-2105-6-120","journal-title":"BMC Bioinformatics"},{"key":"789_CR2","doi-asserted-by":"publisher","first-page":"2093","DOI":"10.1002\/sim.4780111607","volume":"11","author":"W Sauerbrei","year":"1993","unstructured":"Sauerbrei W, Schumacher M: A bootstrapping resampling procedure for model building: application to the Cox regression model. Statistics in Medicine 1993, 11: 2093\u20132109.","journal-title":"Statistics in Medicine"},{"key":"789_CR3","doi-asserted-by":"publisher","first-page":"1620","DOI":"10.1093\/bioinformatics\/btg227","volume":"19","author":"P Pavlidis","year":"2003","unstructured":"Pavlidis P, Li Q, Noble WS: The effect of replication on gene expression microarray experiments. Bioinformatics 2003, 19: 1620\u20131627. 10.1093\/bioinformatics\/btg227","journal-title":"Bioinformatics"},{"key":"789_CR4","doi-asserted-by":"publisher","first-page":"370","DOI":"10.1016\/S0959-440X(03)00078-2","volume":"13","author":"G Stolovitzky","year":"2003","unstructured":"Stolovitzky G: Gene selection in microarray data: the elephant, the blind men and our algorithms. Current Opinion in Structural Biology 2003, 13: 370\u2013376. 10.1016\/S0959-440X(03)00078-2","journal-title":"Current Opinion in Structural Biology"},{"key":"789_CR5","doi-asserted-by":"publisher","first-page":"2031","DOI":"10.1214\/aos\/1176325770","volume":"22","author":"DN Politis","year":"1994","unstructured":"Politis DN, Romano JP: Large sample confidence regions based on subsamples under minimal assumptions. The Annals of Statistics 1994, 22: 2031\u20132050.","journal-title":"The Annals of Statistics"},{"key":"789_CR6","volume-title":"Resampling-Based Multiple Testing","author":"PH Westfall","year":"1993","unstructured":"Westfall PH, Young S: Resampling-Based Multiple Testing. Wiley, New York; 1993."},{"key":"789_CR7","doi-asserted-by":"publisher","first-page":"1151","DOI":"10.1198\/016214501753382129","volume":"96","author":"B Efron","year":"2001","unstructured":"Efron B, Tibshirani R, Storey JD, Tusher V: Empirical Bayes analysis of a microarray experiment. J Amer Statist Assoc 2001, 96: 1151\u20131160. 10.1198\/016214501753382129","journal-title":"J Amer Statist Assoc"},{"key":"789_CR8","doi-asserted-by":"publisher","first-page":"366","DOI":"10.1214\/aos\/1051027871","volume":"31","author":"B Efron","year":"2003","unstructured":"Efron B: Robbins, empirical Bayes and microarrays. Ann Statist 2003, 31: 366\u2013378. 10.1214\/aos\/1051027871","journal-title":"Ann Statist"},{"key":"789_CR9","doi-asserted-by":"publisher","first-page":"96","DOI":"10.1198\/016214504000000089","volume":"99","author":"B Efron","year":"2004","unstructured":"Efron B: Large-scale simultaneous hypothesis testing: The choice of a null hypothesis. J Amer Statist Assoc 2004, 99: 96\u2013104.","journal-title":"J Amer Statist Assoc"},{"key":"789_CR10","doi-asserted-by":"crossref","first-page":"289","DOI":"10.1111\/j.2517-6161.1995.tb02031.x","volume":"57","author":"Y Benjamini","year":"1995","unstructured":"Benjamini Y, Hochberg Y: Controlling the false discovery rate: A practical and powerful approach to multiple testing. J Roy Statist Soc Ser B 1995, 57: 289\u2013300.","journal-title":"J Roy Statist Soc Ser B"},{"key":"789_CR11","doi-asserted-by":"publisher","first-page":"1165","DOI":"10.1214\/aos\/1013699998","volume":"29","author":"Y Benjamini","year":"2001","unstructured":"Benjamini Y, Yekutieli D: The control of the false discovery rate in multiple testing under dependency. Ann Statist 2001, 29: 1165\u20131188. 10.1214\/aos\/1013699998","journal-title":"Ann Statist"},{"key":"789_CR12","unstructured":"St. Jude Children's Research Hospital (SJCRH) Database on childhood leukemia[http:\/\/www.stjuderesearch.org\/data\/ALL1\/]"},{"issue":"2","key":"789_CR13","doi-asserted-by":"publisher","first-page":"185","DOI":"10.1093\/bioinformatics\/19.2.185","volume":"19","author":"BM Bolstad","year":"2003","unstructured":"Bolstad BM, Irizarry RA, Astrand M, Speed TP: A comparison of normalization methods for high density oligonucleotide array data based on variance and bias. Bioinformatics 2003, 19(2):185\u2013193. 10.1093\/bioinformatics\/19.2.185","journal-title":"Bioinformatics"},{"key":"789_CR14","doi-asserted-by":"publisher","first-page":"102","DOI":"10.1007\/0-387-21679-0_4","volume-title":"The Analysis of Gene Expression Data","author":"RA Irizarry","year":"2003","unstructured":"Irizarry RA, Gautier L, Cope LM: An R package for analyses of Affymetrix oligonucleotide arrays. In The Analysis of Gene Expression Data. Edited by: Parmigiani G, Garrett ES, Irizarry RA, Zeger SL. Springer, New York; 2003:102\u2013119."},{"key":"789_CR15","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4899-4541-9","volume-title":"An Introduction to the Bootstrap","author":"B Efron","year":"1993","unstructured":"Efron B, Tibshirani R: An Introduction to the Bootstrap. Chapman & Hall\/CRC, New York; 1993."},{"key":"789_CR16","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4612-0795-5","volume-title":"The Jackknife and Bootstrap","author":"J Shao","year":"1995","unstructured":"Shao J, Tu D: The Jackknife and Bootstrap. Springer Series in Statistics, Springer, New York; 1995."},{"key":"789_CR17","volume-title":"Practical Nonparametric Statistics","author":"WJ Conover","year":"1999","unstructured":"Conover WJ: Practical Nonparametric Statistics. 3rd edition. Wiley, New York; 1999.","edition":"3"},{"key":"789_CR18","doi-asserted-by":"publisher","first-page":"193","DOI":"10.1214\/aoms\/1177729437","volume":"23","author":"TW Anderson","year":"1952","unstructured":"Anderson TW, Darling DA: Asymptotic theory of certain \"goodness of fit\" criterion based on stochastic processes. The Annals of Mathematical Statistics 1952, 23: 193\u2013212.","journal-title":"The Annals of Mathematical Statistics"},{"key":"789_CR19","doi-asserted-by":"publisher","first-page":"617","DOI":"10.1214\/aoms\/1177729341","volume":"23","author":"M Rosenblatt","year":"1952","unstructured":"Rosenblatt M: Limit theorems associated with variants of the von Mises statistic. The Annals of Mathematical Statistics 1952, 23: 617\u2013623.","journal-title":"The Annals of Mathematical Statistics"},{"key":"789_CR20","doi-asserted-by":"publisher","first-page":"1148","DOI":"10.1214\/aoms\/1177704477","volume":"33","author":"TW Anderson","year":"1962","unstructured":"Anderson TW: (1962) On the distribution of the two-sample Cram\u00e9r-von Mises criterion. The Annals of Mathematical Statistics 1962, 33: 1148\u20131159.","journal-title":"The Annals of Mathematical Statistics"},{"key":"789_CR21","doi-asserted-by":"crossref","first-page":"221","DOI":"10.1111\/j.2517-6161.1996.tb02077.x","volume":"58","author":"S Csorgo","year":"1996","unstructured":"Csorgo S, Faraway JJ: (1996) The exact and asymptotic distributions of Cram\u00e9e-von Mises statistics. Journal of the Royal Statistical Society. Series B (Methodological) 1996, 58: 221\u2013234.","journal-title":"Journal of the Royal Statistical Society. Series B (Methodological)"},{"key":"789_CR22","doi-asserted-by":"publisher","first-page":"95","DOI":"10.1214\/aoms\/1177704245","volume":"34","author":"EJ Burr","year":"1963","unstructured":"Burr EJ: Small-sample distribution of the two-sample Cram\u00e9r-von Mises criterion for small equal samples. The Annals of Mathematical Statistics 1963, 34: 95\u2013101.","journal-title":"The Annals of Mathematical Statistics"},{"key":"789_CR23","unstructured":"Klebanov L, Gordon A, Xiao Y, Land H, Yakovlev A: A permutation test motivated by microarray data analysis. Comp Stat Data Anal, in press."},{"key":"789_CR24","doi-asserted-by":"publisher","first-page":"9440","DOI":"10.1073\/pnas.1530509100","volume":"100","author":"JD Storey","year":"2003","unstructured":"Storey JD, Tibshirani R: Statistical significance for genomewide studies. Proc Natl Acad Sci USA 2003, 100: 9440\u20139445. 10.1073\/pnas.1530509100","journal-title":"Proc Natl Acad Sci USA"},{"key":"789_CR25","doi-asserted-by":"publisher","first-page":"368","DOI":"10.1093\/bioinformatics\/btf877","volume":"19","author":"A Reiner","year":"2003","unstructured":"Reiner A, Yekutieli D, Benjamini Y: Identifying differentially expressed genes using false discovery rate controlling procedures. Bioinformatics 2003, 19: 368\u2013375. 10.1093\/bioinformatics\/btf877","journal-title":"Bioinformatics"},{"key":"789_CR26","doi-asserted-by":"publisher","first-page":"2013","DOI":"10.1214\/aos\/1074290335","volume":"31","author":"JD Storey","year":"2004","unstructured":"Storey JD: The positive false discovery rate: a Bayesian interpretation and the q -value. Ann Statist 2004, 31: 2013\u20132035. 10.1214\/aos\/1074290335","journal-title":"Ann Statist"},{"key":"789_CR27","doi-asserted-by":"publisher","first-page":"71","DOI":"10.1214\/ss\/1056397487","volume":"18","author":"S Dudoit","year":"2003","unstructured":"Dudoit S, Shaffer JP, Boldrick JC: Multiple hypothesis testing in microarray experiments. Statistical Science 2003, 18: 71\u2013103. 10.1214\/ss\/1056397487","journal-title":"Statistical Science"},{"key":"789_CR28","first-page":"Article 34","volume-title":"Correlation between gene expression levels and limitations of the empirical Bayes methodology for finding differentially expressed genes, Statistical Applications in Genetics and Molecular Biology","author":"X Qiu","year":"2005","unstructured":"Qiu X, Klebanov L, Yakovlev A: Correlation between gene expression levels and limitations of the empirical Bayes methodology for finding differentially expressed genes, Statistical Applications in Genetics and Molecular Biology. 2005, 4: Article 34."},{"key":"789_CR29","doi-asserted-by":"publisher","first-page":"2067","DOI":"10.1093\/bioinformatics\/bti270","volume":"21","author":"GK Smyth","year":"2005","unstructured":"Smyth GK, Michaud L, Scott HS: Use of within-array replicate spots for assessing differential expression in microarray experiments. Bioinformatics 2005, 21: 2067\u20132075. 10.1093\/bioinformatics\/bti270","journal-title":"Bioinformatics"},{"key":"789_CR30","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4757-3522-2","volume-title":"Fundamentals of Modern Statistical Methods","author":"RR Wilcox","year":"2001","unstructured":"Wilcox RR: Fundamentals of Modern Statistical Methods. Springer, New York; 2001."}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-7-50.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,1,7]],"date-time":"2025-01-07T16:22:38Z","timestamp":1736266958000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-7-50"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2006,2,1]]},"references-count":30,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2006,12]]}},"alternative-id":["789"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-7-50","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2006,2,1]]},"assertion":[{"value":"17 August 2005","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"1 February 2006","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"1 February 2006","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"50"}}