{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,9]],"date-time":"2026-03-09T19:24:15Z","timestamp":1773084255152,"version":"3.50.1"},"reference-count":14,"publisher":"Oxford University Press (OUP)","issue":"24","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2006,12,15]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: Wide-scale correlations between genes are commonly observed in gene expression data, due to both biological and technical reasons. These correlations increase the variability of the standard estimate of the false discovery rate (FDR). We highlight the false discovery proportion (FDP, instead of the FDR) as the suitable quantity for assessing differential expression in microarray data, demonstrate the deleterious effects of correlation on FDP estimation and propose an improved estimation method that accounts for the correlations.<\/jats:p><jats:p>Methods: We analyse the variation pattern of the distribution of test statistics under permutation using the singular value decomposition. The results suggest a latent FDR model that accounts for the effects of correlation, and is statistically closer to the FDP. We develop a procedure for estimating the latent FDR (ELF) based on a Poisson regression model.<\/jats:p><jats:p>Results: For simulated data based on the correlation structure of real datasets, we find that ELF performs substantially better than the standard FDR approach in estimating the FDP. 
We illustrate the use of ELF in the analysis of breast cancer and lymphoma data.<\/jats:p><jats:p>Availability: R code to perform ELF is available in .<\/jats:p><jats:p>Contact: yudi.pawitan@ki.se<\/jats:p><jats:p>Supplementary information: Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btl527","type":"journal-article","created":{"date-parts":[[2006,10,18]],"date-time":"2006-10-18T03:08:10Z","timestamp":1161140890000},"page":"3025-3031","source":"Crossref","is-referenced-by-count":33,"title":["Estimation of false discovery proportion under general dependence"],"prefix":"10.1093","volume":"22","author":[{"given":"Yudi","family":"Pawitan","sequence":"first","affiliation":[{"name":"Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden"}]},{"given":"Stefano","family":"Calza","sequence":"additional","affiliation":[{"name":"Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden"},{"name":"Department of Biomedical Sciences and Biotechnology, Brescia, Italy"}]},{"given":"Alexander","family":"Ploner","sequence":"additional","affiliation":[{"name":"Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden"}]}],"member":"286","published-online":{"date-parts":[[2006,10,17]]},"reference":[{"key":"2023012408513394400_b1","doi-asserted-by":"crossref","first-page":"1151","DOI":"10.1198\/016214501753382129","article-title":"Empirical Bayes analysis of a microarray experiment","volume":"96","author":"Efron","year":"2001","journal-title":"J. Am. Stat. 
Soc."},{"key":"2023012408513394400_b2","doi-asserted-by":"crossref","first-page":"499","DOI":"10.1111\/1467-9868.00347","article-title":"Operating characteristics and extensions of the false discovery rate procedure","volume":"64","author":"Genovese","year":"2002","journal-title":"J. R. Statist. Soc. B"},{"key":"2023012408513394400_b3","doi-asserted-by":"crossref","first-page":"539","DOI":"10.1056\/NEJM200102223440801","article-title":"Gene-expression profiles in hereditary breast cancer","volume":"344","author":"Hedenfalk","year":"2001","journal-title":"N Engl. J. Med."},{"key":"2023012408513394400_b4","doi-asserted-by":"crossref","DOI":"10.2202\/1544-6115.1185","article-title":"Treating expression levels of different genes as a sample in microarray data analysis: is it worth a risk?","volume":"5","author":"Klebanov","year":"2006","journal-title":"Stat. Appl Genet. Mol. Biol."},{"key":"2023012408513394400_b5","doi-asserted-by":"crossref","first-page":"227","DOI":"10.1111\/j.1467-9469.2005.00488.x","article-title":"False discovery control for multiple tests of association under general dependence","volume":"33","author":"Meinshausen","year":"2006","journal-title":"Scand. J. 
Stat."},{"key":"2023012408513394400_b6","doi-asserted-by":"crossref","DOI":"10.1093\/oso\/9780198507659.001.0001","volume-title":"In All Likelihood: Statistical Modelling and Inference Using Likelihood","author":"Pawitan","year":"2001"},{"key":"2023012408513394400_b7","doi-asserted-by":"crossref","first-page":"3017","DOI":"10.1093\/bioinformatics\/bti448","article-title":"False discovery rate, sensitivity and sample size for microarray studies","volume":"21","author":"Pawitan","year":"2005","journal-title":"Bioinformatics"},{"key":"2023012408513394400_b8","doi-asserted-by":"crossref","first-page":"3865","DOI":"10.1093\/bioinformatics\/bti626","article-title":"Bias in the estimation of false discovery rate in microarray studies","volume":"21","author":"Pawitan","year":"2005","journal-title":"Bioinformatics"},{"key":"2023012408513394400_b9","doi-asserted-by":"crossref","first-page":"80","DOI":"10.1186\/1471-2105-6-80","article-title":"Using correlations to evaluate low-level analysis procedures for high-density oligonucleotide microarray data","volume":"6","author":"Ploner","year":"2005","journal-title":"BMC Bioinformatics"},{"key":"2023012408513394400_b10","doi-asserted-by":"crossref","first-page":"1737","DOI":"10.1093\/bioinformatics\/bth160","article-title":"Improving false discovery rate estimation","volume":"20","author":"Pounds","year":"2004","journal-title":"Bioinformatics"},{"key":"2023012408513394400_b11","doi-asserted-by":"crossref","first-page":"1937","DOI":"10.1056\/NEJMoa012914","article-title":"The use of molecular profiling to predict survival after chemotherapy for diffuse large-B-cell lymphoma","volume":"346","author":"Rosenwald","year":"2002","journal-title":"N Engl. J. 
Med."},{"key":"2023012408513394400_b12","doi-asserted-by":"crossref","DOI":"10.1186\/1471-2105-7-50","article-title":"Assessing stability of gene selection in microarray data analysis","volume":"7","author":"Qiu","year":"2006","journal-title":"BMC Bioinformatics"},{"key":"2023012408513394400_b13","doi-asserted-by":"crossref","first-page":"9440","DOI":"10.1073\/pnas.1530509100","article-title":"Statistical significance for genomewide studies","volume":"100","author":"Storey","year":"2003","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012408513394400_b14","unstructured":"Vallon-Christersson J. Functional and molecular characterization of BRCA1 and BRCA2 associated breast cancer 2005 Sweden Faculty of Medicine, Lund University PhD thesis"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/22\/24\/3025\/48840224\/bioinformatics_22_24_3025.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/22\/24\/3025\/48840224\/bioinformatics_22_24_3025.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,2,7]],"date-time":"2024-02-07T17:12:27Z","timestamp":1707325947000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/22\/24\/3025\/209100"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2006,10,17]]},"references-count":14,"journal-issue":{"issue":"24","published-print":{"date-parts":[[2006,12,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btl527","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2006,12,15]]},"published":{"date-parts":[[2006,10,17]]}}}