{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2023,1,26]],"date-time":"2023-01-26T05:20:26Z","timestamp":1674710426787},"reference-count":39,"publisher":"Oxford University Press (OUP)","issue":"1","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2011,1,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Functional enrichment analysis using primary genomics datasets is an emerging approach to complement established methods for functional enrichment based on predefined lists of functionally related genes. Currently used methods depend on creating lists of \u2018significant\u2019 and \u2018non-significant\u2019 genes based on ad hoc significance cutoffs. This can lead to loss of statistical power and can introduce biases affecting the interpretation of experimental results.<\/jats:p>\n               <jats:p>Results: We developed and validated a new statistical framework, generalized random set (GRS) analysis, for comparing the genomic signatures in two datasets without the need for gene categorization. In our tests, GRS produced correct measures of statistical significance, and it showed dramatic improvement in the statistical power over other methods currently used in this setting. We also developed a procedure for identifying genes driving the concordance of the genomics profiles and demonstrated a dramatic improvement in functional coherence of genes identified in such analysis.<\/jats:p>\n               <jats:p>Availability: GRS can be downloaded as part of the R package CLEAN from http:\/\/ClusterAnalysis.org\/. An online implementation is available at http:\/\/GenomicsPortals.org\/.<\/jats:p>\n               <jats:p>Contact: \u00a0mario.medvedovic@uc.edu<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btq593","type":"journal-article","created":{"date-parts":[[2010,10,24]],"date-time":"2010-10-24T00:13:41Z","timestamp":1287879221000},"page":"70-77","source":"Crossref","is-referenced-by-count":11,"title":["Generalized random set framework for functional enrichment analysis using primary genomics datasets"],"prefix":"10.1093","volume":"27","author":[{"given":"Johannes M.","family":"Freudenberg","sequence":"first","affiliation":[{"name":"1 Department of Environmental Health, University of Cincinnati College of Medicine, Cincinnati, OH 45267 and 2Mathematical Sciences Department, University of Cincinnati, Cincinnati, OH 45221, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Siva","family":"Sivaganesan","sequence":"additional","affiliation":[{"name":"1 Department of Environmental Health, University of Cincinnati College of Medicine, Cincinnati, OH 45267 and 2Mathematical Sciences Department, University of Cincinnati, Cincinnati, OH 45221, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Mukta","family":"Phatak","sequence":"additional","affiliation":[{"name":"1 Department of Environmental Health, University of Cincinnati College of Medicine, Cincinnati, OH 45267 and 2Mathematical Sciences Department, University of Cincinnati, Cincinnati, OH 45221, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Kaustubh","family":"Shinde","sequence":"additional","affiliation":[{"name":"1 Department of Environmental Health, University of Cincinnati College of Medicine, Cincinnati, OH 45267 and 2Mathematical Sciences Department, University of Cincinnati, Cincinnati, OH 45221, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Mario","family":"Medvedovic","sequence":"additional","affiliation":[{"name":"1 Department of Environmental Health, University of Cincinnati College of Medicine, Cincinnati, OH 45267 and 2Mathematical Sciences Department, University of Cincinnati, Cincinnati, OH 45221, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2010,10,22]]},"reference":[{"key":"2023012511155608900_B1","doi-asserted-by":"crossref","first-page":"47","DOI":"10.1186\/1471-2105-10-47","article-title":"A general modular framework for gene set enrichment analysis","volume":"10","author":"Ackermann","year":"2009","journal-title":"BMC Bioinformatics"},{"key":"2023012511155608900_B2","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1038\/75556","article-title":"Gene ontology: tool for the unification of biology. The Gene Ontology Consortium","volume":"25","author":"Ashburner","year":"2000","journal-title":"Nat. Genet."},{"key":"2023012511155608900_B3","doi-asserted-by":"crossref","first-page":"D885","DOI":"10.1093\/nar\/gkn764","article-title":"NCBI GEO: archive for high-throughput functional genomic data","volume":"37","author":"Barrett","year":"2009","journal-title":"Nucleic Acids Res."},{"key":"2023012511155608900_B4","doi-asserted-by":"crossref","first-page":"i145","DOI":"10.1093\/bioinformatics\/btp215","article-title":"Probabilistic retrieval and visualization of biologically relevant microarray experiments","volume":"25","author":"Caldas","year":"2009","journal-title":"Bioinformatics"},{"key":"2023012511155608900_B5","volume-title":"Statistical Inference.","author":"Casella","year":"2001"},{"key":"2023012511155608900_B6","doi-asserted-by":"crossref","first-page":"e175","DOI":"10.1093\/nar\/gni179","article-title":"Evolving gene\/transcript definitions significantly alter the interpretation of GeneChip data","volume":"33","author":"Dai","year":"2005","journal-title":"Nucleic Acids Res."},{"key":"2023012511155608900_B7","doi-asserted-by":"crossref","first-page":"411","DOI":"10.1186\/1471-2164-10-411","article-title":"GEM-TREND: a web tool for gene expression data mining toward relevant network discovery","volume":"10","author":"Feng","year":"2009","journal-title":"BMC Genomics"},{"key":"2023012511155608900_B8","doi-asserted-by":"crossref","first-page":"234","DOI":"10.1186\/1471-2105-10-234","article-title":"CLEAN: CLustering Enrichment ANalysis","volume":"10","author":"Freudenberg","year":"2009","journal-title":"BMC Bioinformatics"},{"key":"2023012511155608900_B9","doi-asserted-by":"crossref","first-page":"2692","DOI":"10.1093\/bioinformatics\/btm403","article-title":"Exploring the functional landscape of gene expression: directed search of large microarray compendia","volume":"23","author":"Hibbs","year":"2007","journal-title":"Bioinformatics"},{"key":"2023012511155608900_B10","doi-asserted-by":"crossref","first-page":"e15","DOI":"10.1093\/nar\/gng015","article-title":"Summaries of affymetrix GeneChip probe level data","volume":"31","author":"Irizarry","year":"2003","journal-title":"Nucleic Acids Res."},{"key":"2023012511155608900_B11","doi-asserted-by":"crossref","first-page":"27","DOI":"10.1093\/nar\/28.1.27","article-title":"KEGG: kyoto encyclopedia of genes and genomes","volume":"28","author":"Kanehisa","year":"2000","journal-title":"Nucleic Acids Res."},{"key":"2023012511155608900_B12","doi-asserted-by":"crossref","first-page":"1929","DOI":"10.1126\/science.1132939","article-title":"The Connectivity Map: using gene-expression signatures to connect small molecules, genes, and disease","volume":"313","author":"Lamb","year":"2006","journal-title":"Science"},{"key":"2023012511155608900_B13","doi-asserted-by":"crossref","first-page":"e137","DOI":"10.1093\/nar\/gkn610","article-title":"Gene expression module-based chemical function similarity search","volume":"36","author":"Li","year":"2008","journal-title":"Nucleic Acids Res."},{"key":"2023012511155608900_B14","doi-asserted-by":"crossref","first-page":"D54","DOI":"10.1093\/nar\/gki031","article-title":"Entrez Gene: gene-centered information at NCBI","volume":"33","author":"Maglott","year":"2005","journal-title":"Nucleic Acids Res."},{"key":"2023012511155608900_B15","doi-asserted-by":"crossref","first-page":"80","DOI":"10.1152\/physiolgenomics.00007.2009","article-title":"Influence of fatty acid diets on gene expression in rat mammary epithelial cells","volume":"38","author":"Medvedovic","year":"2009","journal-title":"Physiol. Genomics"},{"key":"2023012511155608900_B16","doi-asserted-by":"crossref","first-page":"13550","DOI":"10.1073\/pnas.0506230102","article-title":"From The Cover: An expression signature for p53 status in human breast cancer predicts mutation status, transcriptional effects, and patient survival","volume":"102","author":"Miller","year":"2005","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012511155608900_B17","doi-asserted-by":"crossref","first-page":"535","DOI":"10.1677\/jme.1.01677","article-title":"Anti-proliferative effect of estrogen in breast cancer cells that re-express ER {alpha} is mediated by aberrant regulation of cell cycle genes","volume":"34","author":"Moggs","year":"2005","journal-title":"J. Mol. Endocrinol."},{"key":"2023012511155608900_B18","doi-asserted-by":"crossref","first-page":"85","DOI":"10.1214\/07-AOAS104","article-title":"Random-set methods identify distinct aspects of the enrichment signal in gene-set analysis","volume":"1","author":"Newton","year":"2007","journal-title":"Ann. Appl. Stat."},{"key":"2023012511155608900_B19","doi-asserted-by":"crossref","first-page":"1828","DOI":"10.1101\/gr.1125403","article-title":"A gene recommender algorithm to identify coexpressed genes in C. elegans","volume":"13","author":"Owen","year":"2003","journal-title":"Genome Res."},{"key":"2023012511155608900_B20","doi-asserted-by":"crossref","first-page":"8961","DOI":"10.1073\/pnas.0502674102","article-title":"Effects of threshold choice on biological conclusions reached during analysis of gene expression by DNA microarrays","volume":"102","author":"Pan","year":"2005","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012511155608900_B21","doi-asserted-by":"crossref","first-page":"D868","DOI":"10.1093\/nar\/gkn889","article-title":"ArrayExpress update\u2013from an archive of functional genomics experiments to the atlas of gene expression","volume":"37","author":"Parkinson","year":"2009","journal-title":"Nucleic Acids Res."},{"key":"2023012511155608900_B22","doi-asserted-by":"crossref","first-page":"S2","DOI":"10.1186\/gb-2008-9-s1-s2","article-title":"A critical assessment of Mus musculus gene function prediction using integrated genomic evidence","volume":"9","author":"Pena-Castillo","year":"2008","journal-title":"Genome Biol."},{"issue":"Suppl.","key":"2023012511155608900_B23","doi-asserted-by":"crossref","first-page":"S31","DOI":"10.1038\/ng1570","article-title":"Integrative analysis of the cancer transcriptome","volume":"37","author":"Rhodes","year":"2005","journal-title":"Nat. Genet."},{"key":"2023012511155608900_B24","doi-asserted-by":"crossref","first-page":"211","DOI":"10.1093\/bioinformatics\/btn592","article-title":"LRpath: a logistic regression approach for identifying enriched biological groups in gene expression data","volume":"25","author":"Sartor","year":"2009","journal-title":"Bioinformatics"},{"key":"2023012511155608900_B25","doi-asserted-by":"crossref","first-page":"538","DOI":"10.1186\/1471-2105-7-538","article-title":"Intensity-based hierarchical Bayes method improves testing for differentially expressed genes in microarray experiments","volume":"7","author":"Sartor","year":"2006","journal-title":"BMC Bioinformatics"},{"key":"2023012511155608900_B26","doi-asserted-by":"crossref","first-page":"5405","DOI":"10.1158\/0008-5472.CAN-07-5206","article-title":"The humoral immune system has a key prognostic impact in node-negative breast cancer","volume":"68","author":"Schmidt","year":"2008","journal-title":"Cancer Res."},{"key":"2023012511155608900_B27","doi-asserted-by":"crossref","first-page":"62","DOI":"10.1198\/000313001300339950","article-title":"Calibration of p-values for testing precise null hypothesis","volume":"55","author":"Sellke","year":"2001","journal-title":"Am. Stat."},{"key":"2023012511155608900_B28","doi-asserted-by":"crossref","first-page":"27","DOI":"10.1186\/1471-2164-11-27","article-title":"Genomics Portals: integrative web-platform for mining genomics data","volume":"11","author":"Shinde","year":"2010","journal-title":"BMC Genomics"},{"key":"2023012511155608900_B29","article-title":"Linear models and empirical Bayes methods for assessing differential expression in microarray experiments","volume":"3","author":"Smyth","year":"2004","journal-title":"Stat. Appli. Genet. Mol. Biol."},{"key":"2023012511155608900_B30","doi-asserted-by":"crossref","first-page":"9440","DOI":"10.1073\/pnas.1530509100","article-title":"Statistical significance for genomewide studies","volume":"100","author":"Storey","year":"2003","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012511155608900_B31","doi-asserted-by":"crossref","first-page":"15545","DOI":"10.1073\/pnas.0506580102","article-title":"Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles","volume":"102","author":"Subramanian","year":"2005","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012511155608900_B32","doi-asserted-by":"crossref","first-page":"51","DOI":"10.1186\/1755-8794-1-51","article-title":"Expression-based Pathway Signature Analysis (EPSA): Mining publicly available microarray data for insight into human disease","volume":"1","author":"Tenenbaum","year":"2008","journal-title":"BMC Med. Genomics"},{"key":"2023012511155608900_B33","doi-asserted-by":"crossref","first-page":"13544","DOI":"10.1073\/pnas.0506577102","article-title":"Discovering statistically significant pathways in expression profiling studies","volume":"102","author":"Tian","year":"2005","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012511155608900_B34","doi-asserted-by":"crossref","first-page":"52","DOI":"10.1016\/j.toxlet.2008.08.009","article-title":"Similar compounds searching system by using the gene expression microarray database","volume":"186","author":"Toyoshiba","year":"2009","journal-title":"Toxicol. Lett."},{"key":"2023012511155608900_B35","doi-asserted-by":"crossref","first-page":"W228","DOI":"10.1093\/nar\/gkq476","article-title":"MARQ: an online tool to mine GEO for experiments with similar or opposite gene expression signatures","volume":"38","author":"Vazquez","year":"2010","journal-title":"Nucleic Acids Res."},{"key":"2023012511155608900_B36","doi-asserted-by":"crossref","first-page":"383","DOI":"10.1186\/1471-2105-8-383","article-title":"ProbCD: enrichment analysis accounting for categorization uncertainty","volume":"8","author":"V\u00eancio","year":"2007","journal-title":"BMC Bioinformatics"},{"key":"2023012511155608900_B37","doi-asserted-by":"crossref","first-page":"80","DOI":"10.2307\/3001968","article-title":"Individual comparisons by ranking methods","volume":"1","author":"Wilcoxon","year":"1945","journal-title":"Biomet. Bull."},{"key":"2023012511155608900_B38","doi-asserted-by":"crossref","first-page":"1694","DOI":"10.1093\/bioinformatics\/btp290","article-title":"A global meta-analysis of microarray expression data to predict unknown gene functions and estimate the literature-data divide","volume":"25","author":"Wren","year":"2009","journal-title":"Bioinformatics"},{"key":"2023012511155608900_B39","doi-asserted-by":"crossref","first-page":"R133","DOI":"10.1186\/gb-2007-8-7-r133","article-title":"Strategy for encoding and comparison of gene expression signatures","volume":"8","author":"Yi","year":"2007","journal-title":"Genome Biol."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/27\/1\/70\/48861415\/bioinformatics_27_1_70.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/27\/1\/70\/48861415\/bioinformatics_27_1_70.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,25]],"date-time":"2023-01-25T11:26:14Z","timestamp":1674645974000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/27\/1\/70\/201264"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2010,10,22]]},"references-count":39,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2011,1,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btq593","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2011,1,1]]},"published":{"date-parts":[[2010,10,22]]}}}