{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,15]],"date-time":"2026-01-15T04:42:28Z","timestamp":1768452148036,"version":"3.49.0"},"reference-count":41,"publisher":"Oxford University Press (OUP)","issue":"19","license":[{"start":{"date-parts":[[2016,10,2]],"date-time":"2016-10-02T00:00:00Z","timestamp":1475366400000},"content-version":"vor","delay-in-days":1543,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/3.0"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2012,10,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: Meta-analysis of genomics data seeks to identify genes associated with a biological phenotype across multiple datasets; however, merging data from different platforms by their features (genes) is challenging. Meta-analysis using functionally or biologically characterized gene sets simplifies data integration is biologically intuitive and is seen as having great potential, but is an emerging field with few established statistical methods.<\/jats:p><jats:p>Results: We transform gene expression profiles into binary gene set profiles by discretizing results of gene set enrichment analyses and apply a new iterative bi-clustering algorithm (iBBiG) to identify groups of gene sets that are coordinately associated with groups of phenotypes across multiple studies. iBBiG is optimized for meta-analysis of large numbers of diverse genomics data that may have unmatched samples. It does not require prior knowledge of the number or size of clusters. When applied to simulated data, it outperforms commonly used clustering methods, discovers overlapping clusters of diverse sizes and is robust in the presence of noise. We apply it to meta-analysis of breast cancer studies, where iBBiG extracted novel gene set\u2014phenotype association that predicted tumor metastases within tumor subtypes.<\/jats:p><jats:p>Availability: Implemented in the Bioconductor package iBBiG<\/jats:p><jats:p>Contact: \u00a0aedin@jimmy.harvard.edu<\/jats:p>","DOI":"10.1093\/bioinformatics\/bts438","type":"journal-article","created":{"date-parts":[[2012,7,13]],"date-time":"2012-07-13T20:44:25Z","timestamp":1342212265000},"page":"2484-2492","source":"Crossref","is-referenced-by-count":44,"title":["iBBiG: iterative binary bi-clustering of gene sets"],"prefix":"10.1093","volume":"28","author":[{"given":"Daniel","family":"Gusenleitner","sequence":"first","affiliation":[{"name":"1 Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute, Boston, MA, USA, 2Department of Statistics, University of Oxford, Oxford, UK, 3Department of Biostatistics, Harvard School of Public Health, Boston, MA, USA and 4Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Eleanor A.","family":"Howe","sequence":"additional","affiliation":[{"name":"1 Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute, Boston, MA, USA, 2Department of Statistics, University of Oxford, Oxford, UK, 3Department of Biostatistics, Harvard School of Public Health, Boston, MA, USA and 4Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, USA"},{"name":"1 Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute, Boston, MA, USA, 2Department of Statistics, University of Oxford, Oxford, UK, 3Department of Biostatistics, Harvard School of Public Health, Boston, MA, USA and 4Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Stefan","family":"Bentink","sequence":"additional","affiliation":[{"name":"1 Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute, Boston, MA, USA, 2Department of Statistics, University of Oxford, Oxford, UK, 3Department of Biostatistics, Harvard School of Public Health, Boston, MA, USA and 4Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, USA"},{"name":"1 Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute, Boston, MA, USA, 2Department of Statistics, University of Oxford, Oxford, UK, 3Department of Biostatistics, Harvard School of Public Health, Boston, MA, USA and 4Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"John","family":"Quackenbush","sequence":"additional","affiliation":[{"name":"1 Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute, Boston, MA, USA, 2Department of Statistics, University of Oxford, Oxford, UK, 3Department of Biostatistics, Harvard School of Public Health, Boston, MA, USA and 4Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, USA"},{"name":"1 Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute, Boston, MA, USA, 2Department of Statistics, University of Oxford, Oxford, UK, 3Department of Biostatistics, Harvard School of Public Health, Boston, MA, USA and 4Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, USA"},{"name":"1 Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute, Boston, MA, USA, 2Department of Statistics, University of Oxford, Oxford, UK, 3Department of Biostatistics, Harvard School of Public Health, Boston, MA, USA and 4Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Aed\u00edn C.","family":"Culhane","sequence":"additional","affiliation":[{"name":"1 Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute, Boston, MA, USA, 2Department of Statistics, University of Oxford, Oxford, UK, 3Department of Biostatistics, Harvard School of Public Health, Boston, MA, USA and 4Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, USA"},{"name":"1 Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute, Boston, MA, USA, 2Department of Statistics, University of Oxford, Oxford, UK, 3Department of Biostatistics, Harvard School of Public Health, Boston, MA, USA and 4Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2012,7,12]]},"reference":[{"key":"2023012513053498800_bts438-B1","doi-asserted-by":"crossref","DOI":"10.1007\/3-211-27389-1_52","article-title":"Offspring selection: a new self-adaptive selection scheme for genetic algorithms","volume-title":"Adaptive and Natural Computing Algorithms","author":"Affenzeller","year":"2005"},{"key":"2023012513053498800_bts438-B2","doi-asserted-by":"crossref","DOI":"10.1093\/nar\/gkn764","article-title":"NCBI GEO: archive for high-throughput functional genomic data","author":"Barrett","year":"2009","journal-title":"Nucleic Acids Res."},{"key":"2023012513053498800_bts438-B3","doi-asserted-by":"crossref","first-page":"289","DOI":"10.1111\/j.2517-6161.1995.tb02031.x","article-title":"Controlling the false discovery rate: a practical and powerful approach to multiple testing","volume":"57","author":"Benjamini","year":"1995","journal-title":"J. Roy. Stat. Soc."},{"key":"2023012513053498800_bts438-B4","doi-asserted-by":"crossref","first-page":"6","DOI":"10.1016\/j.ajhg.2009.11.017","article-title":"Prioritizing GWAS results: a review of statistical methods and recommendations for their application","volume":"86","author":"Cantor","year":"2010","journal-title":"Am. J. Hum. Genet."},{"key":"2023012513053498800_bts438-B5","first-page":"407","article-title":"An analysis of linear ranking and binary tournament selection in genetic algorithms","volume-title":"Proceedings of ICICS. Singapore","author":"Chakraborty","year":"1997"},{"key":"2023012513053498800_bts438-B6","first-page":"93","article-title":"Biclustering of expression data. In","volume":"8","author":"Cheng","year":"2000","journal-title":"Proceedings of ISMB"},{"key":"2023012513053498800_bts438-B7","doi-asserted-by":"crossref","first-page":"D1060","DOI":"10.1093\/nar\/gkr901","article-title":"Genesigdb: a manually curated database and resource for analysis of gene expression signatures","volume":"40","author":"Culhane","year":"2012","journal-title":"Nucleic Acids Res."},{"key":"2023012513053498800_bts438-B8","article-title":"GeneSigDBa curated database of gene expression signatures","author":"Culhane","year":"2009","journal-title":"Nucleic Acids Res."},{"key":"2023012513053498800_bts438-B9","doi-asserted-by":"crossref","DOI":"10.1038\/nature10983","article-title":"The genomic and transcriptomic architecture of 2,000 breast tumours reveals novel subgroups","author":"Curtis","year":"2012","journal-title":"Nature"},{"key":"2023012513053498800_bts438-B10","doi-asserted-by":"crossref","first-page":"980","DOI":"10.1093\/bioinformatics\/btm051","article-title":"Analyzing gene expression data in terms of gene sets: methodological issues","volume":"23","author":"Goeman","year":"2007","journal-title":"Bioinformatics (Oxford, England)"},{"key":"2023012513053498800_bts438-B11","doi-asserted-by":"crossref","first-page":"9362","DOI":"10.1073\/pnas.0903103106","article-title":"Potential etiologic and functional implications of genome-wide association loci for human diseases and traits","volume":"106","author":"Hindorff","year":"2009","journal-title":"Proc. Nat. Acad. Sci. USA"},{"key":"2023012513053498800_bts438-B12","doi-asserted-by":"crossref","first-page":"1520","DOI":"10.1093\/bioinformatics\/btq227","article-title":"FABIA: factor analysis for bicluster acquisition","volume":"26","author":"Hochreiter","year":"2010","journal-title":"Bioinformatics"},{"key":"2023012513053498800_bts438-B13","doi-asserted-by":"crossref","first-page":"3267","DOI":"10.1093\/bioinformatics\/btp588","article-title":"Detailing regulatory networks through large scale data integration","volume":"25","author":"Huttenhower","year":"2009","journal-title":"Bioinformatics"},{"key":"2023012513053498800_bts438-B14","doi-asserted-by":"crossref","first-page":"518","DOI":"10.1093\/bib\/bbq082","article-title":"Literature-aided interpretation of gene expression data with the weighted global test","volume":"12","author":"Jelier","year":"2011","journal-title":"Brief. Bioinformatics"},{"key":"2023012513053498800_bts438-B15","doi-asserted-by":"crossref","first-page":"523","DOI":"10.1109\/IJCNN.2003.1223401","article-title":"Clustering using renyi's entropy","volume-title":"Proceedings of the International Joint Conference on Neural Networks, 2003","author":"Jenssen","year":"2003"},{"key":"2023012513053498800_bts438-B16","doi-asserted-by":"crossref","first-page":"143","DOI":"10.1186\/1471-2407-11-143","article-title":"Correlation of microarray-based breast cancer molecular subtypes and clinical outcomes: implications for treatment optimization","volume":"11","author":"Kao","year":"2011","journal-title":"BMC Cancer"},{"key":"2023012513053498800_bts438-B17","doi-asserted-by":"crossref","first-page":"703","DOI":"10.1101\/gr.648603","article-title":"Spectral biclustering of microarray data: coclustering genes and conditions","volume":"13","author":"Kluger","year":"2003","journal-title":"Genome Res."},{"key":"2023012513053498800_bts438-B18","doi-asserted-by":"crossref","first-page":"341","DOI":"10.1007\/978-1-60327-194-3_16","article-title":"Analysis of biological processes and diseases using text mining approaches","volume":"593","author":"Krallinger","year":"2010","journal-title":"Methods Mol. Biol. (Clifton, NJ)"},{"key":"2023012513053498800_bts438-B19","first-page":"142","article-title":"Minimum entropy clustering and applications to gene expression analysis","volume":"0","author":"Li","year":"2004","journal-title":"CSB Conference"},{"key":"2023012513053498800_bts438-B20","doi-asserted-by":"crossref","first-page":"46","DOI":"10.1186\/1471-2105-12-46","article-title":"GCOD - GeneChip oncology database","volume":"12","author":"Liu","year":"2011","journal-title":"BMC Bioinformatics"},{"key":"2023012513053498800_bts438-B21","doi-asserted-by":"crossref","first-page":"6740","DOI":"10.1073\/pnas.0701138104","article-title":"Lung metastasis genes couple breast tumor size and metastatic spread","volume":"104","author":"Minn","year":"2007","journal-title":"Proc. Nat. Acad. Sci. USA"},{"key":"2023012513053498800_bts438-B22","doi-asserted-by":"crossref","first-page":"e10348","DOI":"10.1371\/journal.pone.0010348","article-title":"Multidimensional gene set analysis of genomic data","volume":"5","author":"Montaner","year":"2010","journal-title":"PLoS One"},{"key":"2023012513053498800_bts438-B23","doi-asserted-by":"crossref","first-page":"267","DOI":"10.1038\/ng1180","article-title":"PGC-1-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes","volume":"34","author":"Mootha","year":"2003","journal-title":"Nat. Genet."},{"key":"2023012513053498800_bts438-B24","first-page":"77","article-title":"Extracting conserved gene expression motifs from gene expression data","volume":"8","author":"Murali","year":"2003","journal-title":"Pac. Symp. Biocomput"},{"key":"2023012513053498800_bts438-B25","doi-asserted-by":"crossref","first-page":"2586","DOI":"10.1093\/bioinformatics\/btn465","article-title":"Gene set enrichment analysis using linear models and diagnostics","volume":"24","author":"Oron","year":"2008","journal-title":"Bioinformatics"},{"key":"2023012513053498800_bts438-B26","doi-asserted-by":"crossref","DOI":"10.1093\/nar\/gkn889","article-title":"ArrayExpress update from an archive of functional genomics experiments to the atlas of gene expression","author":"Parkinson","year":"2009","journal-title":"Nucleic Acids Res."},{"key":"2023012513053498800_bts438-B27","doi-asserted-by":"crossref","first-page":"1122","DOI":"10.1093\/bioinformatics\/btl060","article-title":"A systematic comparison and evaluation of biclustering methods for gene expression data","volume":"22","author":"Prelic","year":"2005","journal-title":"Bioinformatics"},{"key":"2023012513053498800_bts438-B28","doi-asserted-by":"crossref","first-page":"e1000534","DOI":"10.1371\/journal.pgen.1000534","article-title":"Identifying relationships among genomic disease regions: Predicting genes at pathogenic SNP associations and rare deletions","volume":"5","author":"Raychaudhuri","year":"2009","journal-title":"PLoS Genet."},{"key":"2023012513053498800_bts438-B29","first-page":"2738","article-title":"A biclustering algorithm for extracting bit-patterns from binary datasets","volume":"27","author":"Rodriguez-Baena","year":"2011","journal-title":"Bioinformatics (Oxford, England)"},{"key":"2023012513053498800_bts438-B30","doi-asserted-by":"crossref","first-page":"1212","DOI":"10.1093\/bioinformatics\/btn076","article-title":"BicOverlapper: a tool for bicluster visualization","volume":"24","author":"Santamaria","year":"2008","journal-title":"Bioinformatics"},{"key":"2023012513053498800_bts438-B31","doi-asserted-by":"crossref","first-page":"1090","DOI":"10.1038\/ng1434","article-title":"A module map showing conditional activity of expression modules in cancer","volume":"36","author":"Segal","year":"2004","journal-title":"Nature Genetics"},{"key":"2023012513053498800_bts438-B32","doi-asserted-by":"crossref","first-page":"623","DOI":"10.1002\/j.1538-7305.1948.tb00917.x","article-title":"A mathematical theory of communication","volume":"27","author":"Shannon","year":"1948","journal-title":"Bell Syst. Tech. J."},{"key":"2023012513053498800_bts438-B33","doi-asserted-by":"crossref","first-page":"1316","DOI":"10.1093\/bioinformatics\/btq148","article-title":"Meta-analysis for pathway enrichment analysis when combining multiple genomic studies","volume":"26","author":"Shen","year":"2010","journal-title":"Bioinformatics"},{"key":"2023012513053498800_bts438-B34","doi-asserted-by":"crossref","first-page":"74","DOI":"10.1186\/1752-0509-4-74","article-title":"Co-expression module analysis reveals biological processes, genomic gain, and regulatory mechanisms associated with breast cancer progression","volume":"4","author":"Shi","year":"2010","journal-title":"BMC Syst. Biol."},{"key":"2023012513053498800_bts438-B35","doi-asserted-by":"crossref","first-page":"271","DOI":"10.1016\/j.canlet.2008.03.018","article-title":"The inflammatory chemokines CCL2 and CCL5 in breast cancer","volume":"267","author":"Soria","year":"2008","journal-title":"Cancer Lett."},{"key":"2023012513053498800_bts438-B36","doi-asserted-by":"crossref","first-page":"15545","DOI":"10.1073\/pnas.0506580102","article-title":"From the cover: gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles","volume":"102","author":"Subramanian","year":"2005","journal-title":"Proc. Nat. Acad. Sci."},{"key":"2023012513053498800_bts438-B37","doi-asserted-by":"crossref","first-page":"235","DOI":"10.1016\/j.csda.2004.02.003","article-title":"Improved biclustering of microarray data demonstrated through systematic performance tests","volume":"48","author":"Turner","year":"2005","journal-title":"Comput. Stat. Data Anal."},{"key":"2023012513053498800_bts438-B38","doi-asserted-by":"crossref","first-page":"R105","DOI":"10.1186\/gb-2011-12-10-r105","article-title":"Integrating diverse genomic data using gene sets","volume":"12","author":"Tyekucheva","year":"2011","journal-title":"Genome Biol."},{"key":"2023012513053498800_bts438-B39","doi-asserted-by":"crossref","first-page":"530","DOI":"10.1038\/415530a","article-title":"Gene expression profiling predicts clinical outcome of breast cancer","volume":"415","author":"van't Veer","year":"2002","journal-title":"Nature"},{"key":"2023012513053498800_bts438-B40","doi-asserted-by":"crossref","first-page":"e1000070","DOI":"10.1371\/journal.pgen.1000070","article-title":"Gene set enrichment in eQTL data identifies novel annotations and pathway regulators","volume":"4","author":"Wu","year":"2008","journal-title":"PLoS Genetics"},{"key":"2023012513053498800_bts438-B41","first-page":"1113","article-title":"Role of CCL5 in invasion, proliferation and proportion of CD44+\/CD24- phenotype of MCF-7 cells and correlation of CCL5 and CCR5 expression with breast cancer progression","volume":"21","author":"Zhang","year":"2009","journal-title":"Oncol. Rep."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/28\/19\/2484\/48876276\/bioinformatics_28_19_2484.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/28\/19\/2484\/48876276\/bioinformatics_28_19_2484.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,4,26]],"date-time":"2024-04-26T20:02:20Z","timestamp":1714161740000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/28\/19\/2484\/287925"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2012,7,12]]},"references-count":41,"journal-issue":{"issue":"19","published-print":{"date-parts":[[2012,10,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/bts438","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2012,10,1]]},"published":{"date-parts":[[2012,7,12]]}}}