{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,8]],"date-time":"2025-10-08T22:03:36Z","timestamp":1759961016289,"version":"3.32.0"},"reference-count":34,"publisher":"Oxford University Press (OUP)","issue":"20","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2006,10,15]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: High-throughput microarray technology can be used to examine thousands of features, such as all the genes of an organism, and measure their expression. Two important issues of microarray bioinformatics are first, how to combine the significance values for each feature across experiments with high statistical power, and second, how to control the proportion of false positives. Existing methods address these issues separately, in spite of their linked usage.<\/jats:p><jats:p>Results: We present a novel method (ESP) to address the two requirements in an interdependent way. It generalizes the truncated product method of Zaykin et al. to combine only those significance values which clear their respective experiment-specific false discovery restrictive thresholds, thus allowing us to control the false discovery rate (FDR) for the final combined result. Further, we introduce several concepts that together offer FDR control, high power, quality control and speed-up in meta-analysis as done by our algorithm. 
Computational and statistical methods of research synthesis like the one described here will be increasingly important as additional genome-wide datasets accumulate in databases.<\/jats:p><jats:p>We apply our method to combine three well-known ChIP-chip transcription factor binding datasets for budding yeast to identify significant intergenic regulatory sequences for nine cell cycle regulating transcription factors, both with high power and controlled FDR.<\/jats:p><jats:p>Contact: \u00a0spyne@cs.sunysb.edu<\/jats:p><jats:p>Supplementary Materials and Appendices: \u00a0<\/jats:p>","DOI":"10.1093\/bioinformatics\/btl439","type":"journal-article","created":{"date-parts":[[2006,8,15]],"date-time":"2006-08-15T00:13:59Z","timestamp":1155600839000},"page":"2516-2522","source":"Crossref","is-referenced-by-count":16,"title":["Meta-analysis based on control of false discovery rate: combining yeast ChIP-chip datasets"],"prefix":"10.1093","volume":"22","author":[{"given":"Saumyadipta","family":"Pyne","sequence":"first","affiliation":[{"name":"Department of Computer Science, Stony Brook University, NY 11794, USA"}]},{"given":"Bruce","family":"Futcher","sequence":"additional","affiliation":[{"name":"Department of Molecular Genetics and Microbiology, Stony Brook University, NY 11794, USA"}]},{"given":"Steve","family":"Skiena","sequence":"additional","affiliation":[{"name":"Department of Computer Science, Stony Brook University, NY 11794, USA"}]}],"member":"286","published-online":{"date-parts":[[2006,8,14]]},"reference":[{"key":"2023012409225004700_b1","doi-asserted-by":"crossref","first-page":"211","DOI":"10.1089\/cmb.1998.5.211","article-title":"Methods and statistics for combining motif match scores","volume":"5","author":"Bailey","year":"1998","journal-title":"J. Comput. 
Biol."},{"key":"2023012409225004700_b2","first-page":"215","article-title":"Combining significance levels","volume-title":"The Handbook of Research Synthesis","author":"Becker","year":"1994"},{"key":"2023012409225004700_b3","doi-asserted-by":"crossref","first-page":"289","DOI":"10.1111\/j.2517-6161.1995.tb02031.x","article-title":"Controlling the false discovery rate: a practical and powerful approach to multiple testing","volume":"57","author":"Benjamini","year":"1995","journal-title":"J. R. Stat. Soc. B"},{"key":"2023012409225004700_b4","doi-asserted-by":"crossref","first-page":"37","DOI":"10.1016\/0167-7152(92)90282-A","article-title":"On the distribution of the weighted combination of independent probabilities","volume":"15","author":"Bhoj","year":"1992","journal-title":"Stat. Prob. Lett."},{"volume-title":"Statistical Inference","year":"1990","author":"Casella","key":"2023012409225004700_b5"},{"key":"2023012409225004700_b6","doi-asserted-by":"crossref","first-page":"i84","DOI":"10.1093\/bioinformatics\/btg1010","article-title":"Combining multiple microarray studies and modeling interstudy variation","volume":"19","author":"Choi","year":"2003","journal-title":"Bioinformatics"},{"key":"2023012409225004700_b7","doi-asserted-by":"crossref","first-page":"496","DOI":"10.1037\/1082-989X.5.4.496","article-title":"Combining independent p values: extensions of the Stouffer and binomial methods","volume":"5","author":"Darlington","year":"2000","journal-title":"Psychol. Methods"},{"key":"2023012409225004700_b8","doi-asserted-by":"crossref","first-page":"69","DOI":"10.1046\/j.1529-8817.2003.00061.x","article-title":"Meta-analysis of linkage studies for complex diseases: an overview of methods and a simulation study","volume":"68","author":"Dempfle","year":"2004","journal-title":"Ann. Hum. 
Genet."},{"key":"2023012409225004700_b9","doi-asserted-by":"crossref","first-page":"360","DOI":"10.1002\/gepi.10264","article-title":"Rank truncated product of P-values, with application to genomewide association scans","volume":"25","author":"Dudbridge","year":"2003","journal-title":"Genet. Epidemiol."},{"key":"2023012409225004700_b10","doi-asserted-by":"crossref","first-page":"71","DOI":"10.1214\/ss\/1056397487","article-title":"Multiple hypothesis testing in microarray experiments","volume":"18","author":"Dudoit","year":"2003","journal-title":"Statistical Science"},{"volume-title":"Statistical Methods For Research Workers","year":"1932","author":"Fisher","key":"2023012409225004700_b11"},{"key":"2023012409225004700_b12","doi-asserted-by":"crossref","first-page":"499","DOI":"10.1111\/1467-9868.00347","article-title":"Operating characteristics and extensions of the FDR procedure","volume":"64","author":"Genovese","year":"2002","journal-title":"J. R. Stat. Soc. B"},{"key":"2023012409225004700_b13","doi-asserted-by":"crossref","first-page":"1035","DOI":"10.1214\/009053604000000283","article-title":"A stochastic process approach to false discovery control","volume":"32","author":"Genovese","year":"2004","journal-title":"Ann. Stat."},{"key":"2023012409225004700_b14","doi-asserted-by":"crossref","first-page":"264","DOI":"10.1111\/j.2517-6161.1955.tb00201.x","article-title":"On the weighted combination of significance tests","volume":"17","author":"Good","year":"1955","journal-title":"J. R. Stat. 
Soc."},{"key":"2023012409225004700_b15","doi-asserted-by":"crossref","first-page":"5079","DOI":"10.1038\/sj.onc.1208696","article-title":"Meta-analysis of microarray data on pancreatic cancer defines a set of commonly dysregulated genes","volume":"24","author":"Grutzmann","year":"2005","journal-title":"Oncogene"},{"key":"2023012409225004700_b16","doi-asserted-by":"crossref","first-page":"99","DOI":"10.1038\/nature02800","article-title":"Transcriptional regulatory code of a eukaryotic genome","volume":"431","author":"Harbison","year":"2004","journal-title":"Nature"},{"volume-title":"Statistical Methods for Meta-Analysis","year":"1985","author":"Hedges","key":"2023012409225004700_b17"},{"key":"2023012409225004700_b18","doi-asserted-by":"crossref","first-page":"533","DOI":"10.1038\/35054095","article-title":"Genomic binding sites of the yeast cell-cycle transcription factors SBF and MBF","volume":"409","author":"Iyer","year":"2001","journal-title":"Nature"},{"key":"2023012409225004700_b19","doi-asserted-by":"crossref","first-page":"379","DOI":"10.1016\/S0378-3758(03)00211-8","article-title":"Controlling the number of false discoveries: application to high-dimensional genomic data","volume":"124","author":"Korn","year":"2004","journal-title":"J. Stat. 
Plan Inference"},{"key":"2023012409225004700_b20","doi-asserted-by":"crossref","first-page":"799","DOI":"10.1126\/science.1075090","article-title":"Transcriptional Regulatory Networks in Saccharomyces cerevisiae","volume":"298","author":"Lee","year":"2002","journal-title":"Science"},{"key":"2023012409225004700_b21","doi-asserted-by":"crossref","first-page":"1138","DOI":"10.1214\/009053605000000084","article-title":"Generalizations of the familywise error rate","volume":"33","author":"Lehmann","year":"2005","journal-title":"Ann. Stat."},{"key":"2023012409225004700_b22","doi-asserted-by":"crossref","first-page":"570","DOI":"10.1016\/j.tig.2003.08.006","article-title":"Comparison and meta-analysis of microarray data: from the bench to the computer desk","volume":"19","author":"Moreau","year":"2003","journal-title":"Trends Genet."},{"key":"2023012409225004700_b23","doi-asserted-by":"crossref","first-page":"1239","DOI":"10.1371\/journal.pbio.0030225","article-title":"The cell cycle-regulated genes of Schizosaccharomyces pombe","volume":"3","author":"Oliva","year":"2005","journal-title":"PLoS Biol."},{"key":"2023012409225004700_b24","doi-asserted-by":"crossref","first-page":"267","DOI":"10.1177\/096228020101000403","article-title":"Approximations for trimmed Fisher procedures in research synthesis","volume":"10","author":"Olkin","year":"2001","journal-title":"Stat. Methods Med. 
Res."},{"key":"2023012409225004700_b25","first-page":"4427","article-title":"Meta-analysis of microarrays: interstudy validation of gene expression profiles reveals pathway dysregulation in prostate cancer","volume":"62","author":"Rhodes","year":"2002","journal-title":"Cancer Res."},{"key":"2023012409225004700_b26","doi-asserted-by":"crossref","first-page":"9309","DOI":"10.1073\/pnas.0401994101","article-title":"Large-scale meta-analysis of cancer microarray data identifies common transcriptional profiles of neoplastic transformation and progression","volume":"101","author":"Rhodes","year":"2004","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012409225004700_b27","doi-asserted-by":"crossref","DOI":"10.1214\/009053606000000461","article-title":"Stepup procedures for control of generalizations of the familywise error rate","volume":"34","author":"Romano","year":"2006","journal-title":"Ann. Stat."},{"key":"2023012409225004700_b28","doi-asserted-by":"crossref","DOI":"10.4135\/9781412984997","volume-title":"Meta-Analytic Procedures for Social Research","author":"Rosenthal","year":"1991"},{"key":"2023012409225004700_b29","first-page":"626","article-title":"Rectangular confidence regions for the means of the multivariate normal distributions","volume":"62","author":"Sidak","year":"1967","journal-title":"J. Am. Stat. Assoc."},{"key":"2023012409225004700_b30","doi-asserted-by":"crossref","first-page":"697","DOI":"10.1016\/S0092-8674(01)00494-9","article-title":"Serial regulation of transcriptional regulators in the yeast cell cycle","volume":"106","author":"Simon","year":"2001","journal-title":"Cell"},{"key":"2023012409225004700_b31","doi-asserted-by":"crossref","first-page":"3273","DOI":"10.1091\/mbc.9.12.3273","article-title":"Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization","volume":"9","author":"Spellman","year":"1998","journal-title":"Mol. Biol. 
Cell"},{"key":"2023012409225004700_b32","doi-asserted-by":"crossref","first-page":"479","DOI":"10.1111\/1467-9868.00346","article-title":"A direct approach to false discovery rates","volume":"64","author":"Storey","year":"2002","journal-title":"J. R. Stat. Soc. B"},{"key":"2023012409225004700_b33","doi-asserted-by":"crossref","first-page":"9440","DOI":"10.1073\/pnas.1530509100","article-title":"Statistical significance for genome-wide studies","volume":"100","author":"Storey","year":"2003","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012409225004700_b34","doi-asserted-by":"crossref","first-page":"170","DOI":"10.1002\/gepi.0042","article-title":"Truncated product method for combining P-values","volume":"22","author":"Zaykin","year":"2002","journal-title":"Genet. Epidemiol."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/22\/20\/2516\/48840102\/bioinformatics_22_20_2516.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/22\/20\/2516\/48840102\/bioinformatics_22_20_2516.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,1,10]],"date-time":"2025-01-10T12:53:38Z","timestamp":1736513618000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/22\/20\/2516\/218866"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2006,8,14]]},"references-count":34,"journal-issue":{"issue":"20","published-print":{"date-parts":[[2006,10,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btl439","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"type":"electronic","value":"1367-4811"},{"type":"print","value":"1367-4803"}],"subject":[],"published-other":{"date-parts":[[2006,10,15]]},"published":{"date-parts":[[2006,8,14]]}}}