{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,10]],"date-time":"2026-01-10T18:29:22Z","timestamp":1768069762394,"version":"3.49.0"},"reference-count":14,"publisher":"Oxford University Press (OUP)","issue":"22","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2007,11,15]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: High-throughput experiments such as microarray hybridizations often yield long lists of genes found to share a certain characteristic such as differential expression. Exploring Gene Ontology (GO) annotations for such lists of genes has become a widespread practice to get first insights into the potential biological meaning of the experiment. The standard statistical approach to measuring overrepresentation of GO terms cannot cope with the dependencies resulting from the structure of GO because they analyze each term in isolation. Especially the fact that annotations are inherited from more specific descendant terms can result in certain types of false-positive results with potentially misleading biological interpretation, a phenomenon which we term the inheritance problem.<\/jats:p><jats:p>Results: We present here a novel approach to analysis of GO term overrepresentation that determines overrepresentation of terms in the context of annotations to the term's parents. This approach reduces the dependencies between the individual term's measurements, and thereby avoids producing false-positive results owing to the inheritance problem. ROC analysis using study sets with overrepresented GO terms showed a clear advantage for our approach over the standard algorithm with respect to the inheritance problem. Although there can be no gold standard for exploratory methods such as analysis of GO term overrepresentation, analysis of biological datasets suggests that our algorithm tends to identify the core GO terms that are most characteristic of the dataset being analyzed.<\/jats:p><jats:p>Availability: The Ontologizer can be found at the project homepage http:\/\/www.charite.de\/ch\/medgen\/ontologizer<\/jats:p><jats:p>Contact: \u00a0peter.robinson@charite.de and vingron@molgen.mpg.de<\/jats:p>","DOI":"10.1093\/bioinformatics\/btm440","type":"journal-article","created":{"date-parts":[[2007,9,12]],"date-time":"2007-09-12T03:18:10Z","timestamp":1189567090000},"page":"3024-3031","source":"Crossref","is-referenced-by-count":304,"title":["Improved detection of overrepresentation of Gene-Ontology annotations with parent\u2013child analysis"],"prefix":"10.1093","volume":"23","author":[{"given":"Steffen","family":"Grossmann","sequence":"first","affiliation":[{"name":"1 Max-Planck-Institute for Molecular Genetics, Ihnestrasse 73, 14195 Berlin and 2Institute of Medical Genetics, Universit\u00e4tsmedizin Charit\u00e9, Augustenburger Platz 1, 13353 Berlin, Germany"}]},{"given":"Sebastian","family":"Bauer","sequence":"additional","affiliation":[{"name":"1 Max-Planck-Institute for Molecular Genetics, Ihnestrasse 73, 14195 Berlin and 2Institute of Medical Genetics, Universit\u00e4tsmedizin Charit\u00e9, Augustenburger Platz 1, 13353 Berlin, Germany"}]},{"given":"Peter N.","family":"Robinson","sequence":"additional","affiliation":[{"name":"1 Max-Planck-Institute for Molecular Genetics, Ihnestrasse 73, 14195 Berlin and 2Institute of Medical Genetics, Universit\u00e4tsmedizin Charit\u00e9, Augustenburger Platz 1, 13353 Berlin, Germany"}]},{"given":"Martin","family":"Vingron","sequence":"additional","affiliation":[{"name":"1 Max-Planck-Institute for Molecular Genetics, Ihnestrasse 73, 14195 Berlin and 2Institute of Medical Genetics, Universit\u00e4tsmedizin Charit\u00e9, Augustenburger Platz 1, 13353 Berlin, Germany"}]}],"member":"286","published-online":{"date-parts":[[2007,9,11]]},"reference":[{"key":"2023041208262184400_","doi-asserted-by":"crossref","first-page":"1600","DOI":"10.1093\/bioinformatics\/btl140","article-title":"Improved scoring of functional groups from gene expression data by decorrelating GO graph structure","volume":"22","author":"Alexa","year":"2006","journal-title":"Bioinformatics"},{"key":"2023041208262184400_","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1038\/75556","article-title":"Gene Ontology: tool for the unification of biology. The Gene Ontology Consortium","volume":"25","author":"Ashburner","year":"2000","journal-title":"Nat. Genet."},{"key":"2023041208262184400_","first-page":"5","article-title":"The Gene Ontology Annotation (GOA) Database an integrated resource of GO annotations to the UniProt Knowledgebase","volume":"4","author":"Camon","year":"2004","journal-title":"In Silico Biol."},{"key":"2023041208262184400_","doi-asserted-by":"crossref","first-page":"D262","DOI":"10.1093\/nar\/gkh021","article-title":"The Gene Ontology Annotation (GOA) Database: sharing knowledge in Uniprot with Gene Ontology","volume":"32","author":"Camon","year":"2004","journal-title":"Nucleic Acids Res."},{"key":"2023041208262184400_","doi-asserted-by":"crossref","first-page":"69","DOI":"10.1093\/nar\/30.1.69","article-title":"Saccharomyces Genome Database (SGD) provides secondary gene annotation using the Gene Ontology (GO)","volume":"30","author":"Dwight","year":"2002","journal-title":"Nucleic Acids Res."},{"key":"2023041208262184400_","doi-asserted-by":"crossref","first-page":"257","DOI":"10.1093\/bioinformatics\/btl567","article-title":"Using GOstats to test gene lists for GO term association","volume":"23","author":"Falcon","year":"2007","journal-title":"Bioinformatics"},{"key":"2023041208262184400_","doi-asserted-by":"crossref","first-page":"D322","DOI":"10.1093\/nar\/gkj021","article-title":"The Gene Ontology (GO) project in 2006","volume":"34","author":"Gene Ontology Consortium","year":"2006","journal-title":"Nucleic Acids Res."},{"key":"2023041208262184400_","doi-asserted-by":"crossref","first-page":"R80","DOI":"10.1186\/gb-2004-5-10-r80","article-title":"Bioconductor: open software development for computational biology and bioinformatics","volume":"5","author":"Gentleman","year":"2004","journal-title":"Genome Biol."},{"key":"2023041208262184400_","first-page":"85","article-title":"An improved statistic for detecting over-representated Gene Ontology annotations in gene sets","author":"Grossmann","year":"2006"},{"key":"2023041208262184400_","doi-asserted-by":"crossref","first-page":"524","DOI":"10.1038\/ni1188","article-title":"Suppression of allergic inflammation by the Prostaglandin E receptor subtype EP3","volume":"6","author":"Kunikata","year":"2005","journal-title":"Nat. Immunol."},{"key":"2023041208262184400_","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1186\/1471-213X-4-11","article-title":"Regulation of signaling genes by TGFbeta during entry into dauer diapause in C. elegans","volume":"4","author":"Liu","year":"2004","journal-title":"BMC Dev. Biol."},{"key":"2023041208262184400_","doi-asserted-by":"crossref","first-page":"1974","DOI":"10.1073\/pnas.0409522102","article-title":"Conserved patterns of protein interaction in multiple species","volume":"102","author":"Sharan","year":"2005","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023041208262184400_","doi-asserted-by":"crossref","DOI":"10.2202\/1544-6115.1027","article-title":"Linear models and empirical Bayes methods for assessing differential expression in microarray experiments","volume":"3","author":"Smyth","year":"2004","journal-title":"Stat. Appl. Genet. Mol. Biol."},{"key":"2023041208262184400_","volume-title":"Resampling-Based Multiple Testing","author":"Westfall","year":"1993"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/23\/22\/3024\/49857685\/bioinformatics_23_22_3024.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/23\/22\/3024\/49857685\/bioinformatics_23_22_3024.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,1,20]],"date-time":"2025-01-20T23:24:16Z","timestamp":1737415456000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/23\/22\/3024\/208216"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2007,9,11]]},"references-count":14,"journal-issue":{"issue":"22","published-print":{"date-parts":[[2007,11,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btm440","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2007,11,15]]},"published":{"date-parts":[[2007,9,11]]}}}