{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,2,17]],"date-time":"2025-02-17T05:09:21Z","timestamp":1739768961654,"version":"3.37.1"},"reference-count":42,"publisher":"Oxford University Press (OUP)","issue":"5","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2010,3,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: The proportion of non-differentially expressed genes (\u03c00) is an important quantity in microarray data analysis. Although many statistical methods have been proposed for its estimation, it is still necessary to develop more efficient methods.<\/jats:p><jats:p>Methods: Our approach for improving \u03c00 estimation is to modify an existing simple method by introducing artificial censoring to P-values. In a comprehensive simulation study and the applications to experimental datasets, we compare our method with eight existing estimation methods.<\/jats:p><jats:p>Results: The simulation study confirms that our method can clearly improve the estimation performance. Compared with the existing methods, our method can generally provide a relatively accurate estimate with relatively small variance. Using experimental microarray datasets, we also demonstrate that our method can generally provide satisfactory estimates in practice.<\/jats:p><jats:p>Availability: The R code is freely available at http:\/\/home.gwu.edu\/~ylai\/research\/CBpi0\/.<\/jats:p><jats:p>Contact: \u00a0ylai@gwu.edu<\/jats:p><jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btq001","type":"journal-article","created":{"date-parts":[[2010,1,16]],"date-time":"2010-01-16T01:14:00Z","timestamp":1263604440000},"page":"640-646","source":"Crossref","is-referenced-by-count":22,"title":["A censored beta mixture model for the estimation of the proportion of non-differentially expressed genes"],"prefix":"10.1093","volume":"26","author":[{"given":"Anastasios","family":"Markitsis","sequence":"first","affiliation":[{"name":"1 Department of Statistics and 2 Department of Statistics and Biostatistics Center, The George Washington University, Washington D.C. 20052, USA"}]},{"given":"Yinglei","family":"Lai","sequence":"additional","affiliation":[{"name":"1 Department of Statistics and 2 Department of Statistics and Biostatistics Center, The George Washington University, Washington D.C. 20052, USA"}]}],"member":"286","published-online":{"date-parts":[[2010,1,15]]},"reference":[{"key":"2023012511004905800_B1","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/S0167-9473(01)00046-9","article-title":"A mixture model approach for the analysis of microarray gene expression data","volume":"39","author":"Allison","year":"2002","journal-title":"Comput. Stat. Data Anal."},{"key":"2023012511004905800_B2","doi-asserted-by":"crossref","first-page":"D760","DOI":"10.1093\/nar\/gkl887","article-title":"NCBI GEO: mining tens of millions of expression profiles\u2013database and tools update","volume":"35","author":"Barrett","year":"2007","journal-title":"Nucleic Acids Res."},{"key":"2023012511004905800_B3","doi-asserted-by":"crossref","first-page":"289","DOI":"10.1111\/j.2517-6161.1995.tb02031.x","article-title":"Controlling the false discovery rate: a practical and powerful approach to multiple testing","volume":"57","author":"Benjamini","year":"1995","journal-title":"J. R. Stat. Soc. Ser. B"},{"key":"2023012511004905800_B4","doi-asserted-by":"crossref","first-page":"1165","DOI":"10.1214\/aos\/1013699998","article-title":"The control of the false discovery rate in multiple testing under dependency","volume":"29","author":"Benjamini","year":"2001","journal-title":"Ann. Stat."},{"key":"2023012511004905800_B5","doi-asserted-by":"crossref","first-page":"199","DOI":"10.1186\/1471-2105-6-199","article-title":"A comparative review of estimates of the proportion unchanged genes and the false discovery rate","volume":"6","author":"Broberg","year":"2005","journal-title":"BMC Bioinformatics"},{"key":"2023012511004905800_B6","doi-asserted-by":"crossref","first-page":"1061","DOI":"10.1038\/nature07385","article-title":"Comprehensive genomic characterization defines human glioblastoma genes and core pathways","volume":"455","author":"Cancer Genome Atlas Research Network","year":"2008","journal-title":"Nature"},{"key":"2023012511004905800_B7","doi-asserted-by":"crossref","first-page":"1387","DOI":"10.1084\/jem.20051409","article-title":"Where CD4 + CD25 + T reg cells impinge on autoimmune diabetes","volume":"202","author":"Chen","year":"2005","journal-title":"J. Exp. Med."},{"key":"2023012511004905800_B8","doi-asserted-by":"crossref","first-page":"210","DOI":"10.1186\/gb-2003-4-4-210","article-title":"Statistical tests for differential expression in cDNA microarray experiments","volume":"4","author":"Cui","year":"2003","journal-title":"Genome Biol."},{"key":"2023012511004905800_B9","doi-asserted-by":"crossref","first-page":"660","DOI":"10.1093\/bioinformatics\/bti063","article-title":"A simple procedure for estimating the false discovery rate","volume":"21","author":"Dalmasso","year":"2005","journal-title":"Bioinformatics"},{"key":"2023012511004905800_B10","doi-asserted-by":"crossref","first-page":"71","DOI":"10.1214\/ss\/1056397487","article-title":"Multiple hypothesis testing in microarray experiments","volume":"18","author":"Dudoit","year":"2003","journal-title":"Stat. Sci."},{"key":"2023012511004905800_B11","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1214\/aos\/1176344552","article-title":"Bootstrap methods: another look at the jackknife","volume":"7","author":"Efron","year":"1979","journal-title":"Ann. Stat."},{"key":"2023012511004905800_B12","doi-asserted-by":"crossref","first-page":"93","DOI":"10.1198\/016214506000001211","article-title":"Correlation and large-scale simultaneous significance testing","volume":"102","author":"Efron","year":"2007","journal-title":"J. Am. Stat. Assoc."},{"key":"2023012511004905800_B13","doi-asserted-by":"crossref","first-page":"1776","DOI":"10.1111\/j.1600-6143.2004.00627.x","article-title":"De novo kidney transplantation without use of calcineurin inhibitors preserves renal structure and function at two years","volume":"4","author":"Flechner","year":"2004","journal-title":"Am. J. Transplant."},{"key":"2023012511004905800_B14","first-page":"905","article-title":"Nonparametric estimator of false discovery rate based on Bernstein polynomials","volume":"18","author":"Guan","year":"2008","journal-title":"Stat. Sin."},{"key":"2023012511004905800_B15","doi-asserted-by":"crossref","first-page":"39","DOI":"10.1007\/s00109-006-0103-z","article-title":"Modification of gene expression of the small airway epithelium in response to cigarette smoking","volume":"85","author":"Harvey","year":"2007","journal-title":"J. Mol. Med."},{"key":"2023012511004905800_B16","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1177\/117693510800600001","article-title":"Estimating the proportion of true null hypotheses for multiple comparisons","volume":"6","author":"Jiang","year":"2008","journal-title":"Cancer Inform."},{"key":"2023012511004905800_B17","doi-asserted-by":"crossref","first-page":"2118","DOI":"10.1093\/bioinformatics\/bti318","article-title":"Applications of beta-mixture models in bioinformatics","volume":"21","author":"Ji","year":"2005","journal-title":"Bioinformatics"},{"key":"2023012511004905800_B18","doi-asserted-by":"crossref","first-page":"3097","DOI":"10.1093\/bioinformatics\/bti456","article-title":"Sample size for FDR-control in microarray data analysis","volume":"21","author":"Jung","year":"2005","journal-title":"Bioinformatics"},{"key":"2023012511004905800_B19","doi-asserted-by":"crossref","first-page":"744","DOI":"10.1093\/biostatistics\/kxm002","article-title":"A moment-based method for estimating the proportion of true null hypotheses and its application to microarray gene expression data","volume":"8","author":"Lai","year":"2007","journal-title":"Biostatistics"},{"key":"2023012511004905800_B20","doi-asserted-by":"crossref","first-page":"555","DOI":"10.1111\/j.1467-9868.2005.00515.x","article-title":"Estimating the proportion of true null hypotheses, with application to DNA microarray data","volume":"67","author":"Langaas","year":"2005","journal-title":"J. R. Stat. Soc. Ser. B"},{"key":"2023012511004905800_B21","doi-asserted-by":"crossref","first-page":"2694","DOI":"10.1093\/bioinformatics\/bth310","article-title":"A mixture model for estimating the local false discovery rate in DNA microarray analysis","volume":"20","author":"Liao","year":"2004","journal-title":"Bioinformatics"},{"key":"2023012511004905800_B22","doi-asserted-by":"crossref","first-page":"1675","DOI":"10.1038\/nbt1296-1675","article-title":"Expression monitoring by hybridization to high-density oligonucleotide arrays","volume":"14","author":"Lockhart","year":"1996","journal-title":"Nat. Biotechnol."},{"key":"2023012511004905800_B23","doi-asserted-by":"crossref","first-page":"157","DOI":"10.1186\/1471-2105-8-157","article-title":"Re-sampling strategy to improve the estimation of number of null hypotheses in FDR control under strong correlation structures","volume":"18","author":"Lu","year":"2007","journal-title":"BMC Bioinformatics"},{"key":"2023012511004905800_B24","doi-asserted-by":"crossref","first-page":"1608","DOI":"10.1093\/bioinformatics\/btl148","article-title":"A simple implementation of a normal mixture approach to differential gene expression in multiclass microarrays","volume":"22","author":"McLachlan","year":"2006","journal-title":"Bioinformatics"},{"key":"2023012511004905800_B25","doi-asserted-by":"crossref","first-page":"18","DOI":"10.1002\/9780470191613","volume-title":"The EM algorithm and extensions","author":"McLachlan","year":"2008","edition":"2"},{"key":"2023012511004905800_B26","doi-asserted-by":"crossref","first-page":"267","DOI":"10.1038\/ng1180","article-title":"PGC-1\u03b1-response genes involved in oxidative phos-phorylation are coordinately downregulated in human diabetes","volume":"34","author":"Mootha","year":"2003","journal-title":"Nat. Genet."},{"key":"2023012511004905800_B27","doi-asserted-by":"crossref","first-page":"1683","DOI":"10.1093\/genetics\/157.4.1683","article-title":"A whole genome scan for quantitative trait loci affecting milk protein percentage in Israeli-Holstein cattle, by means of selective milk DNA pooling in a daughter design, using an adjusted false discovery rate criterion","volume":"157","author":"Mosig","year":"2001","journal-title":"Genetics"},{"key":"2023012511004905800_B28","doi-asserted-by":"crossref","first-page":"1344","DOI":"10.1126\/science.1158441","article-title":"The transcriptional landscape of the yeast genome defined by RNA sequencing","volume":"320","author":"Nagalakshmi","year":"2008","journal-title":"Science"},{"key":"2023012511004905800_B29","doi-asserted-by":"crossref","first-page":"337","DOI":"10.1198\/108571106X129135","article-title":"Estimating the number of true null hypotheses from a histogram of p values","volume":"11","author":"Nettleton","year":"2006","journal-title":"J. Agric. Biol. Environ. Stat."},{"key":"2023012511004905800_B30","doi-asserted-by":"crossref","first-page":"1236","DOI":"10.1093\/bioinformatics\/btg148","article-title":"Estimating the occurrence of false positives and false negatives in microarray studies by approximating and partitioning the empirical distribution of p-values","volume":"19","author":"Pounds","year":"2003","journal-title":"Bioinformatics"},{"key":"2023012511004905800_B31","doi-asserted-by":"crossref","first-page":"1737","DOI":"10.1093\/bioinformatics\/bth160","article-title":"Improving false discovery rate estimation","volume":"20","author":"Pounds","year":"2004","journal-title":"Bioinformatics"},{"key":"2023012511004905800_B32","doi-asserted-by":"crossref","first-page":"4263","DOI":"10.1093\/bioinformatics\/bti699","article-title":"Sample size determination for the false discovery rate","volume":"21","author":"Pounds","year":"2005","journal-title":"Bioinformatics"},{"key":"2023012511004905800_B33","doi-asserted-by":"crossref","first-page":"1979","DOI":"10.1093\/bioinformatics\/btl328","article-title":"Robust estimation of the false discovery rate","volume":"22","author":"Pounds","year":"2006","journal-title":"Bioinformatics"},{"key":"2023012511004905800_B34","doi-asserted-by":"crossref","first-page":"9427","DOI":"10.1073\/pnas.0803602105","article-title":"Detrimental effects of Bartonella henselae are counteracted by L-arginine and nitric oxide in human endothelial progenitor cells","volume":"105","author":"Salvatore","year":"2008","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012511004905800_B35","doi-asserted-by":"crossref","first-page":"98","DOI":"10.1109\/TCBB.2004.24","article-title":"A stochastic downhill search algorithm for estimating the local false discovery rate","volume":"1","author":"Scheid","year":"2004","journal-title":"IEEE Trans. Comput. Biol. Bioinform."},{"key":"2023012511004905800_B36","doi-asserted-by":"crossref","first-page":"467","DOI":"10.1126\/science.270.5235.467","article-title":"Quantitative monitoring of gene expression patterns with a complementary DNA microarray","volume":"270","author":"Schena","year":"1995","journal-title":"Science"},{"key":"2023012511004905800_B37","doi-asserted-by":"crossref","first-page":"203","DOI":"10.1016\/S1535-6108(02)00030-2","article-title":"Gene expression correlates of clinical prostate cancer behavior","volume":"1","author":"Singh","year":"2002","journal-title":"Cancer Cell"},{"key":"2023012511004905800_B38","doi-asserted-by":"crossref","first-page":"479","DOI":"10.1111\/1467-9868.00346","article-title":"A direct approach to false discovery rates","volume":"64","author":"Storey","year":"2002","journal-title":"J. R. Stat. Soc. B"},{"key":"2023012511004905800_B39","doi-asserted-by":"crossref","first-page":"9440","DOI":"10.1073\/pnas.1530509100","article-title":"Statistical significance for genomewide studies","volume":"100","author":"Storey","year":"2003","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012511004905800_B40","doi-asserted-by":"crossref","first-page":"1071","DOI":"10.1111\/j.0006-341X.2003.00123.x","article-title":"Estimation of false discovery rates in multiple testing: application to gene microarray data","volume":"59","author":"Tsai","year":"2003","journal-title":"Biometrics"},{"key":"2023012511004905800_B41","doi-asserted-by":"crossref","first-page":"714","DOI":"10.1089\/cmb.2004.11.714","article-title":"Sample size for identifying differentially expressed genes in microarray experiments","volume":"11","author":"Wang","year":"2004","journal-title":"J. Comput. Biol."},{"key":"2023012511004905800_B42","doi-asserted-by":"crossref","first-page":"1239","DOI":"10.1038\/nature07002","article-title":"Dynamic repertoire of a eukaryotic transcriptome surveyed at single-nucleotide resolution","volume":"453","author":"Wilhelm","year":"2008","journal-title":"Nature"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/26\/5\/640\/48860817\/bioinformatics_26_5_640.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/26\/5\/640\/48860817\/bioinformatics_26_5_640.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,2,16]],"date-time":"2025-02-16T20:10:24Z","timestamp":1739736624000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/26\/5\/640\/211929"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2010,1,15]]},"references-count":42,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2010,3,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btq001","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"type":"electronic","value":"1367-4811"},{"type":"print","value":"1367-4803"}],"subject":[],"published-other":{"date-parts":[[2010,3,1]]},"published":{"date-parts":[[2010,1,15]]}}}