{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,1,9]],"date-time":"2025-01-09T05:10:21Z","timestamp":1736399421835,"version":"3.32.0"},"reference-count":24,"publisher":"Oxford University Press (OUP)","issue":"12","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2006,6,15]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: The parametric F-test has been widely used in the analysis of factorial microarray experiments to assess treatment effects. However, the normality assumption is often untenable for microarray experiments with small replications. Therefore, permutation-based methods are called for help to assess the statistical significance. The distribution of the F-statistics across all the genes on the array can be regarded as a mixture distribution with a proportion of statistics generated from the null distribution of no differential gene expression whereas the other proportion of statistics generated from the alternative distribution of genes differentially expressed. This results in the fact that the permutation distribution of the F-statistics may not approximate well to the true null distribution of the F-statistics. Therefore, the construction of a proper null statistic to better approximate the null distribution of F-statistic is of great importance to the permutation-based multiple testing in microarray data analysis.<\/jats:p><jats:p>Results: In this paper, we extend the ideas of constructing null statistics based on pairwise differences to neglect the treatment effects from the two-sample comparison problem to the multifactorial balanced or unbalanced microarray experiments. A null statistic based on a subpartition method is proposed and its distribution is employed to approximate the null distribution of the F-statistic. 
The proposed null statistic is able to accommodate unbalance in the design and is also corrected for the undue correlation between its numerator and denominator. In the simulation studies and real biological data analysis, the number of true positives and the false discovery rate (FDR) of the proposed null statistic are compared with those of the permutated version of the F-statistic. It has been shown that our proposed method has a better control of the FDRs and a higher power than the standard permutation method to detect differentially expressed genes because of the better approximated tail probabilities.<\/jats:p><jats:p>Availability: R codes available upon request<\/jats:p><jats:p>Contact: \u00a0xingao@mathstat.yorku.ca<\/jats:p>","DOI":"10.1093\/bioinformatics\/btl109","type":"journal-article","created":{"date-parts":[[2006,3,31]],"date-time":"2006-03-31T01:29:31Z","timestamp":1143768571000},"page":"1486-1494","source":"Crossref","is-referenced-by-count":14,"title":["Construction of null statistics in permutation-based multiple testing for multi-factorial microarray experiments"],"prefix":"10.1093","volume":"22","author":[{"given":"Xin","family":"Gao","sequence":"first","affiliation":[{"name":"Department of Mathematics and Statistics, York University \u00a0 4700 Keele Street, Toronto, ON M3J 1P3, Canada"}]}],"member":"286","published-online":{"date-parts":[[2006,3,30]]},"reference":[{"key":"2023012408401202700_b1","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/S0167-9473(01)00046-9","article-title":"A mixture model approach for the analysis of microarray gene expression data","volume":"39","author":"Allison","year":"2002","journal-title":"Comput. Stat. Data. 
Anal."},{"key":"2023012408401202700_b2","doi-asserted-by":"crossref","first-page":"289","DOI":"10.1111\/j.2517-6161.1995.tb02031.x","article-title":"Controlling the false discovery rate: a practical and powerful approach to multiple testing","volume":"57","author":"Benjamini","year":"1995","journal-title":"J. R. Stat. Soc. Ser. B"},{"key":"2023012408401202700_b3","doi-asserted-by":"crossref","first-page":"660","DOI":"10.1093\/bioinformatics\/bti063","article-title":"A simple procedure for estimating the false discovery rate","volume":"21","author":"Dalmasso","year":"2005","journal-title":"Bioinformatics"},{"key":"2023012408401202700_b4","doi-asserted-by":"crossref","first-page":"1151","DOI":"10.1198\/016214501753382129","article-title":"Empirical Bayes analysis of a microarray experiment","volume":"96","author":"Efron","year":"2001","journal-title":"J. Am. Stat. Assoc."},{"key":"2023012408401202700_b5","doi-asserted-by":"crossref","first-page":"186","DOI":"10.1186\/1471-2105-6-186","article-title":"Nonparametric tests for differential gene expression and interaction effects in multifactorial microarray experiments.","volume":"6","author":"Gao","year":"2005","journal-title":"BMC Bioinformatics"},{"key":"2023012408401202700_b6","article-title":"\u2018Model-based approach to FDR estimation\u2019","volume-title":"Research Report 2004-016","author":"Guan","year":"2004"},{"key":"2023012408401202700_b7","doi-asserted-by":"crossref","first-page":"3264","DOI":"10.1093\/bioinformatics\/bti519","article-title":"Practical FDR-based sample size calculations in microarray experiments","volume":"21","author":"Hu","year":"2005","journal-title":"Bioinformatics"},{"key":"2023012408401202700_b8","doi-asserted-by":"crossref","first-page":"S115","DOI":"10.1093\/bioinformatics\/17.suppl_1.S115","article-title":"GEST: a gene expression search tool based on a novel Bayesian similarity 
metric","volume":"17","author":"Hunter","year":"2001","journal-title":"Bioinformatics"},{"key":"2023012408401202700_b9","doi-asserted-by":"crossref","first-page":"389","DOI":"10.1038\/ng766","article-title":"The contributions of sex, genotype and age to transcriptional variance in Drosophila melanogaster","volume":"29","author":"Jin","year":"2001","journal-title":"Nat. Genet."},{"key":"2023012408401202700_b10","doi-asserted-by":"crossref","first-page":"819","DOI":"10.1089\/10665270050514954","article-title":"Analysis of variance for gene expression microarray data","volume":"7","author":"Kerr","year":"2000","journal-title":"J. Comput. Biol."},{"key":"2023012408401202700_b11","doi-asserted-by":"crossref","first-page":"1333","DOI":"10.1093\/bioinformatics\/btg167","article-title":"On the use of permutation in and the performance of a class of nonparametric methods to detect differential gene expression","volume":"19","author":"Pan","year":"2003","journal-title":"Bioinformatics"},{"key":"2023012408401202700_b12","doi-asserted-by":"crossref","first-page":"117","DOI":"10.1007\/s10142-003-0085-7","article-title":"A mixture model approach to detecting differentially expressed genes with microarray data","volume":"3","author":"Pan","year":"2003","journal-title":"Funct. Integr. Genomics"},{"key":"2023012408401202700_b13","doi-asserted-by":"crossref","first-page":"282","DOI":"10.1016\/S1046-2023(03)00157-9","article-title":"Using ANOVA for gene selection from microarray studies of the nervous system","volume":"31","author":"Pavlidis","year":"2003","journal-title":"Methods"},{"key":"2023012408401202700_b14","doi-asserted-by":"crossref","first-page":"85","DOI":"10.1016\/j.jspi.2003.07.019","article-title":"Choice of a null distribution in resampling-based multiple testing","volume":"125","author":"Pollard","year":"2004","journal-title":"J. Stat. Plan. 
Infer."},{"key":"2023012408401202700_b15","article-title":"Test statistics null distributions in multiple testing: simulation studies and applications to genomics","volume-title":"Working Paper Series, Working Paper 184","author":"Pollard","year":"2005"},{"key":"2023012408401202700_b16","doi-asserted-by":"crossref","first-page":"1236","DOI":"10.1093\/bioinformatics\/btg148","article-title":"Estimating the occurrence of false positives and false negatives in microarray studies by approximating and partitioning the empirical distribution of P-values","volume":"19","author":"Pounds","year":"2003","journal-title":"Bioinformatics"},{"key":"2023012408401202700_b17","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1093\/bioinformatics\/bth160","article-title":"Improving false discovery rate estimation","volume":"20","author":"Pounds","year":"2004","journal-title":"Bioinformatics"},{"key":"2023012408401202700_b18","doi-asserted-by":"crossref","first-page":"368","DOI":"10.1093\/bioinformatics\/btf877","article-title":"Identifying differentially expressed genes using false discovery rate controlling procedures","volume":"19","author":"Reiner","year":"2003","journal-title":"Bioinformatics"},{"volume-title":"Linear Models for Unbalanced Data","year":"1987","author":"Searle","key":"2023012408401202700_b19"},{"key":"2023012408401202700_b20","doi-asserted-by":"crossref","first-page":"9440","DOI":"10.1073\/pnas.1530509100","article-title":"Statistical significance for genomewide studies","volume":"100","author":"Storey","year":"2003","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012408401202700_b21","doi-asserted-by":"crossref","first-page":"5116","DOI":"10.1073\/pnas.091062498","article-title":"Significance analysis of microarrays applied to the ionizing radiation response","volume":"98","author":"Tusher","year":"2001","journal-title":"Proc. Natl Acad. Sci. 
USA"},{"key":"2023012408401202700_b22","article-title":"Parametric and nonparametric FDR estimation","volume-title":"Revisited Research Report 2004-015","author":"Wu","year":"2004"},{"key":"2023012408401202700_b23","doi-asserted-by":"crossref","first-page":"4280","DOI":"10.1093\/bioinformatics\/bti685","article-title":"A note on using permutation based false discovery rate estimate to compare different analysis methods for microarray data","volume":"21","author":"Xie","year":"2005","journal-title":"Bioinformatics"},{"key":"2023012408401202700_b24","doi-asserted-by":"crossref","first-page":"1046","DOI":"10.1093\/bioinformatics\/btf879","article-title":"Modified nonparametric approaches to detecting differentially expressed genes in replicated microarray experiments","volume":"19","author":"Zhao","year":"2003","journal-title":"Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/22\/12\/1486\/48838111\/bioinformatics_22_12_1486.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/22\/12\/1486\/48838111\/bioinformatics_22_12_1486.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,1,8]],"date-time":"2025-01-08T05:49:44Z","timestamp":1736315384000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/22\/12\/1486\/207064"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2006,3,30]]},"references-count":24,"journal-issue":{"issue":"12","published-print":{"date-parts":[[2006,6,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btl109","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"type":"electronic","value":"1367-4811"},{"type":"print","value":"1367-4803"}],"subject":[],"published-other":{"date-par
ts":[[2006,6,15]]},"published":{"date-parts":[[2006,3,30]]}}}