{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,31]],"date-time":"2026-03-31T06:25:58Z","timestamp":1774938358161,"version":"3.50.1"},"reference-count":34,"publisher":"Oxford University Press (OUP)","issue":"4","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2007,2,15]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: A number of available program packages determine the significant enrichments and\/or depletions of GO categories among a class of genes of interest. Whereas a correct formulation of the problem leads to a single exact null distribution, these GO tools use a large variety of statistical tests whose denominations often do not clarify the underlying P-value computations.<\/jats:p><jats:p>Summary: We review the different formulations of the problem and the tests they lead to: the binomial, \u03c7\u00b2, equality of two probabilities, Fisher's exact and hypergeometric tests. We clarify the relationships existing between these tests, in particular the equivalence between the hypergeometric test and Fisher's exact test. We recall that the other tests are valid only for large samples, the test of equality of two probabilities and the \u03c7\u00b2-test being equivalent. 
We discuss the appropriateness of one- and two-sided P-values, as well as some discreteness and conservatism issues.<\/jats:p><jats:p>Contact: isabelle.rivals@espci.fr<\/jats:p><jats:p>Supplementary information: Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btl633","type":"journal-article","created":{"date-parts":[[2006,12,21]],"date-time":"2006-12-21T01:34:06Z","timestamp":1166664846000},"page":"401-407","source":"Crossref","is-referenced-by-count":593,"title":["Enrichment or depletion of a GO category within a class of genes: which test?"],"prefix":"10.1093","volume":"23","author":[{"given":"Isabelle","family":"Rivals","sequence":"first","affiliation":[]},{"given":"L\u00e9on","family":"Personnaz","sequence":"additional","affiliation":[]},{"given":"Lieng","family":"Taing","sequence":"additional","affiliation":[{"name":"Laboratoire de Neurobiologie et Diversit\u00e9 Cellulaire, \u00c9cole Sup\u00e9rieure de Physique et de Chimie Industrielles (ESPCI), 10 rue Vauquelin, 75005 Paris, France"}]},{"given":"Marie-Claude","family":"Potier","sequence":"additional","affiliation":[{"name":"Laboratoire de Neurobiologie et Diversit\u00e9 Cellulaire, \u00c9cole Sup\u00e9rieure de Physique et de Chimie Industrielles (ESPCI), 10 rue Vauquelin, 75005 Paris, France"}]}],"member":"286","published-online":{"date-parts":[[2006,12,20]]},"reference":[{"key":"2023041109271120000_","first-page":"131","article-title":"A survey of exact inference for contingency tables","volume":"7","author":"Agresti","year":"1992","journal-title":"Stat. 
Sci."},{"key":"2023041109271120000_","doi-asserted-by":"crossref","first-page":"963","DOI":"10.1111\/j.0006-341X.2001.00963.x","article-title":"On small-sample confidence intervals for parameters in discrete distributions","volume":"57","author":"Agresti","year":"2001","journal-title":"Biometrics"},{"key":"2023041109271120000_","doi-asserted-by":"crossref","DOI":"10.1002\/0471249688","volume-title":"Categorical Data Analysis","author":"Agresti","year":"2002","edition":"2nd edn"},{"key":"2023041109271120000_","doi-asserted-by":"crossref","DOI":"10.1007\/978-3-7908-1709-6_19","article-title":"Reducing conservatism of exact small-sample methods of inference for discrete data","author":"Agresti","year":"2006"},{"key":"2023041109271120000_","doi-asserted-by":"crossref","first-page":"578","DOI":"10.1093\/bioinformatics\/btg455","article-title":"FatiGO: A web tool for finding significant associations of Gene Ontology terms with groups of genes","volume":"20","author":"Al-Shahrour","year":"2004","journal-title":"Bioinformatics"},{"key":"2023041109271120000_","doi-asserted-by":"crossref","first-page":"1464","DOI":"10.1093\/bioinformatics\/bth088","article-title":"GOstat: find statistically overrepresented Gene Ontologies within a group of genes","volume":"20","author":"Bei\u00dfbarth","year":"2004","journal-title":"Bioinformatics"},{"key":"2023041109271120000_","first-page":"3710","article-title":"GO: TermFinder\u2013open source software for accessing Gene Ontology information and finding significantly enriched Gene Ontology terms associated with a list of genes","volume-title":"Bioinformatics","author":"Boyle","year":"2004"},{"key":"2023041109271120000_","doi-asserted-by":"crossref","first-page":"891","DOI":"10.1093\/bioinformatics\/btg114","article-title":"GeneMerge\u2013post-genomics analysis, data mining, and hypothesis 
testing","volume":"19","author":"Castillo-Davis","year":"2003","journal-title":"Bioinformatics"},{"key":"2023041109271120000_","doi-asserted-by":"crossref","first-page":"1462","DOI":"10.1093\/bioinformatics\/bth087","article-title":"NetAffx Gene Ontology Mining Tool: a visual approach for microarray data analysis","volume":"20","author":"Cheng","year":"2004","journal-title":"Bioinformatics"},{"key":"2023041109271120000_","doi-asserted-by":"crossref","first-page":"R60","DOI":"10.1186\/gb-2003-4-9-r60","article-title":"DAVID: Database for Annotation, Visualization, and Integrated Discovery","volume":"4","author":"Dennis","year":"2003","journal-title":"Genome Biol."},{"key":"2023041109271120000_","first-page":"98","article-title":"Global functional profiling of gene expression","volume":"81","author":"Draghici","year":"2003","journal-title":"Genomics"},{"key":"2023041109271120000_","doi-asserted-by":"crossref","first-page":"397","DOI":"10.2307\/2988542","article-title":"Two-sided P-values from discrete asymmetric distributions based on uniformly most powerful unbiased tests","volume":"45","author":"Dunne","year":"1996","journal-title":"The Statistician"},{"key":"2023041109271120000_","author":"eGOn Reference Manual (2004)","year":"2004"},{"key":"2023041109271120000_","doi-asserted-by":"crossref","first-page":"i159","DOI":"10.1093\/bioinformatics\/bti1022","article-title":"Clustering short time series gene expression data","volume":"21","author":"Ernst","year":"2005","journal-title":"Bioinformatics"},{"key":"2023041109271120000_","doi-asserted-by":"crossref","first-page":"39","DOI":"10.2307\/2342435","article-title":"The logic of inductive inference","volume":"98","author":"Fisher","year":"1935","journal-title":"J. Royal Stat. Soc."},{"key":"2023041109271120000_","first-page":"20","article-title":"P-values: interpretation and methodology","volume":"29","author":"Gibbons","year":"1975","journal-title":"Am. 
Stat."},{"key":"2023041109271120000_","doi-asserted-by":"crossref","first-page":"R70","DOI":"10.1186\/gb-2003-4-10-r70","article-title":"Identifying biological themes within lists of genes with EASE","volume":"4","author":"Hosack","year":"2003","journal-title":"Genome Biol."},{"key":"2023041109271120000_","doi-asserted-by":"crossref","first-page":"266","DOI":"10.1006\/geno.2002.6698","article-title":"Profiling gene expression utilizing onto-express","volume":"79","author":"Khatri","year":"2002","journal-title":"Genomics"},{"key":"2023041109271120000_","doi-asserted-by":"crossref","first-page":"3587","DOI":"10.1093\/bioinformatics\/bti565","article-title":"Ontological analysis of gene expression data: current tools, limitations, and open problems","volume":"21","author":"Khatri","year":"2005","journal-title":"Bioinformatics"},{"key":"2023041109271120000_","doi-asserted-by":"crossref","DOI":"10.1007\/978-1-4757-1923-9","volume-title":"Testing Statistical Hypotheses","author":"Lehmann","year":"1986","edition":"2nd edn"},{"key":"2023041109271120000_","doi-asserted-by":"crossref","first-page":"3448","DOI":"10.1093\/bioinformatics\/bti551","article-title":"BiNGO: a Cytoscape plugin to assess overrepresentation of Gene Ontology categories in Biological Networks","volume":"21","author":"Maere","year":"2005","journal-title":"Bioinformatics"},{"key":"2023041109271120000_","doi-asserted-by":"crossref","first-page":"R101","DOI":"10.1186\/gb-2004-5-12-r101","article-title":"GOToolbox: functional analysis of gene datasets based on Gene Ontology","volume":"5","author":"Martin","year":"2004","journal-title":"Genome Biol."},{"key":"2023041109271120000_","doi-asserted-by":"crossref","first-page":"W293","DOI":"10.1093\/nar\/gkh432","article-title":"GFINDer: Genome Function INtegrated Discoverer through dynamic annotation, statistical analysis, and mining","volume":"32","author":"Masseroli","year":"2004","journal-title":"Nucleic Acids 
Res."},{"key":"2023041109271120000_","first-page":"1411","article-title":"Exact inference for categorical data","volume":"Vol. 2","author":"Mehta","year":"1998","journal-title":"Encyclopedia of Biostatistics"},{"key":"2023041109271120000_","volume-title":"Introduction to the Theory of Statistics","author":"Mood","year":"1974","edition":"3rd edn"},{"key":"2023041109271120000_","doi-asserted-by":"crossref","first-page":"R8","DOI":"10.1186\/gb-2005-6-9-r81","article-title":"L2L: a simple tool for discovering the hidden significance in microarray expression data","volume":"6","author":"Newman","year":"2005","journal-title":"Genome Biol."},{"key":"2023041109271120000_","doi-asserted-by":"crossref","first-page":"2636","DOI":"10.1093\/bioinformatics\/bth295","article-title":"THEA: ontology-driven analysis of microarray","volume":"20","author":"Pasquier","year":"2004","journal-title":"Bioinformatics"},{"key":"2023041109271120000_","doi-asserted-by":"crossref","first-page":"35","DOI":"10.1186\/1471-2105-3-35","article-title":"FunSpec: a web-based cluster interpreter for yeast","volume":"3","author":"Robinson","year":"2002","journal-title":"BMC Bioinformatics"},{"key":"2023041109271120000_","doi-asserted-by":"crossref","first-page":"1196","DOI":"10.1093\/bioinformatics\/bth056","article-title":"CLENCH: a program for calculating Cluster ENriCHment using the Gene Ontology","volume":"20","author":"Shah","year":"2004","journal-title":"Bioinformatics"},{"key":"2023041109271120000_","doi-asserted-by":"crossref","first-page":"426","DOI":"10.2307\/2981577","article-title":"Test of significance for 2x2 contingency tables","volume":"147","author":"Yates","year":"1984","journal-title":"J. Royal Stat. Soc. 
Series A"},{"key":"2023041109271120000_","doi-asserted-by":"crossref","first-page":"275","DOI":"10.1093\/bioinformatics\/bth495","article-title":"Ontology Traverser: an R package for GO analysis","volume":"21","author":"Young","year":"2005","journal-title":"Bioinformatics"},{"key":"2023041109271120000_","doi-asserted-by":"crossref","first-page":"R28","DOI":"10.1186\/gb-2003-4-4-r28","article-title":"GoMiner: a resource for biological interpretation of genomic and proteomic data","volume":"4","author":"Zeeberg","year":"2003","journal-title":"Genome Biol."},{"key":"2023041109271120000_","doi-asserted-by":"crossref","first-page":"16","DOI":"10.1186\/1471-2105-5-16","article-title":"GOTree Machine (GOTM): a web-based platform for interpreting sets of interesting genes using Gene Ontology hierarchies","volume":"5","author":"Zhang","year":"2004","journal-title":"BMC Bioinformatics"},{"key":"2023041109271120000_","doi-asserted-by":"crossref","first-page":"261","DOI":"10.2165\/00822942-200403040-00009","article-title":"GoSurfer: a graphical interactive tool for comparative analysis of large gene sets in gene ontology space","volume":"3","author":"Zhong","year":"2004","journal-title":"Appl. 
Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/23\/4\/401\/49830059\/bioinformatics_23_4_401.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/23\/4\/401\/49830059\/bioinformatics_23_4_401.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,2,9]],"date-time":"2024-02-09T14:16:27Z","timestamp":1707488187000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/23\/4\/401\/181853"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2006,12,20]]},"references-count":34,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2007,2,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btl633","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2007,2,15]]},"published":{"date-parts":[[2006,12,20]]}}}