{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,8]],"date-time":"2026-04-08T03:06:18Z","timestamp":1775617578350,"version":"3.50.1"},"reference-count":47,"publisher":"Oxford University Press (OUP)","issue":"12","license":[{"start":{"date-parts":[[2016,10,2]],"date-time":"2016-10-02T00:00:00Z","timestamp":1475366400000},"content-version":"vor","delay-in-days":2354,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/2.0\/uk\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2010,6,15]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: Biclustering of transcriptomic data groups genes and samples simultaneously. It is emerging as a standard tool for extracting knowledge from gene expression measurements. We propose a novel generative approach for biclustering called \u2018FABIA: Factor Analysis for Bicluster Acquisition\u2019. FABIA is based on a multiplicative model, which accounts for linear dependencies between gene expression and conditions, and also captures heavy-tailed distributions as observed in real-world transcriptomic data. The generative framework allows to utilize well-founded model selection methods and to apply Bayesian techniques.<\/jats:p><jats:p>Results: On 100 simulated datasets with known true, artificially implanted biclusters, FABIA clearly outperformed all 11 competitors. On these datasets, FABIA was able to separate spurious biclusters from true biclusters by ranking biclusters according to their information content. FABIA was tested on three microarray datasets with known subclusters, where it was two times the best and once the second best method among the compared biclustering approaches.<\/jats:p><jats:p>Availability: FABIA is available as an R package on Bioconductor (http:\/\/www.bioconductor.org). All datasets, results and software are available at http:\/\/www.bioinf.jku.at\/software\/fabia\/fabia.html<\/jats:p><jats:p>Contact: \u00a0hochreit@bioinf.jku.at<\/jats:p><jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btq227","type":"journal-article","created":{"date-parts":[[2010,4,24]],"date-time":"2010-04-24T00:46:10Z","timestamp":1272069970000},"page":"1520-1527","source":"Crossref","is-referenced-by-count":251,"title":["FABIA: factor analysis for bicluster acquisition"],"prefix":"10.1093","volume":"26","author":[{"given":"Sepp","family":"Hochreiter","sequence":"first","affiliation":[{"name":"1 Institute of Bioinformatics, Johannes Kepler University, Linz, Austria, 2 Institute for Biostatistics and Statistical Bioinformatics, Hasselt University, Hasselt, 3 Johnson & Johnson Pharmaceutical Research & Development, Division of Janssen Pharmaceutica, Beerse, Belgium and 4 Department of Nephrology and Internal Intensive Care, Charit\u00e9, Berlin, Germany"}]},{"given":"Ulrich","family":"Bodenhofer","sequence":"additional","affiliation":[{"name":"1 Institute of Bioinformatics, Johannes Kepler University, Linz, Austria, 2 Institute for Biostatistics and Statistical Bioinformatics, Hasselt University, Hasselt, 3 Johnson & Johnson Pharmaceutical Research & Development, Division of Janssen Pharmaceutica, Beerse, Belgium and 4 Department of Nephrology and Internal Intensive Care, Charit\u00e9, Berlin, Germany"}]},{"given":"Martin","family":"Heusel","sequence":"additional","affiliation":[{"name":"1 Institute of Bioinformatics, Johannes Kepler University, Linz, Austria, 2 Institute for Biostatistics and Statistical Bioinformatics, Hasselt University, Hasselt, 3 Johnson & Johnson Pharmaceutical Research & Development, Division of Janssen Pharmaceutica, Beerse, Belgium and 4 Department of Nephrology and Internal Intensive Care, Charit\u00e9, Berlin, Germany"}]},{"given":"Andreas","family":"Mayr","sequence":"additional","affiliation":[{"name":"1 Institute of Bioinformatics, Johannes Kepler University, Linz, Austria, 2 Institute for Biostatistics and Statistical Bioinformatics, Hasselt University, Hasselt, 3 Johnson & Johnson Pharmaceutical Research & Development, Division of Janssen Pharmaceutica, Beerse, Belgium and 4 Department of Nephrology and Internal Intensive Care, Charit\u00e9, Berlin, Germany"}]},{"given":"Andreas","family":"Mitterecker","sequence":"additional","affiliation":[{"name":"1 Institute of Bioinformatics, Johannes Kepler University, Linz, Austria, 2 Institute for Biostatistics and Statistical Bioinformatics, Hasselt University, Hasselt, 3 Johnson & Johnson Pharmaceutical Research & Development, Division of Janssen Pharmaceutica, Beerse, Belgium and 4 Department of Nephrology and Internal Intensive Care, Charit\u00e9, Berlin, Germany"}]},{"given":"Adetayo","family":"Kasim","sequence":"additional","affiliation":[{"name":"1 Institute of Bioinformatics, Johannes Kepler University, Linz, Austria, 2 Institute for Biostatistics and Statistical Bioinformatics, Hasselt University, Hasselt, 3 Johnson & Johnson Pharmaceutical Research & Development, Division of Janssen Pharmaceutica, Beerse, Belgium and 4 Department of Nephrology and Internal Intensive Care, Charit\u00e9, Berlin, Germany"}]},{"given":"Tatsiana","family":"Khamiakova","sequence":"additional","affiliation":[{"name":"1 Institute of Bioinformatics, Johannes Kepler University, Linz, Austria, 2 Institute for Biostatistics and Statistical Bioinformatics, Hasselt University, Hasselt, 3 Johnson & Johnson Pharmaceutical Research & Development, Division of Janssen Pharmaceutica, Beerse, Belgium and 4 Department of Nephrology and Internal Intensive Care, Charit\u00e9, Berlin, Germany"}]},{"given":"Suzy","family":"Van Sanden","sequence":"additional","affiliation":[{"name":"1 Institute of Bioinformatics, Johannes Kepler University, Linz, Austria, 2 Institute for Biostatistics and Statistical Bioinformatics, Hasselt University, Hasselt, 3 Johnson & Johnson Pharmaceutical Research & Development, Division of Janssen Pharmaceutica, Beerse, Belgium and 4 Department of Nephrology and Internal Intensive Care, Charit\u00e9, Berlin, Germany"}]},{"given":"Dan","family":"Lin","sequence":"additional","affiliation":[{"name":"1 Institute of Bioinformatics, Johannes Kepler University, Linz, Austria, 2 Institute for Biostatistics and Statistical Bioinformatics, Hasselt University, Hasselt, 3 Johnson & Johnson Pharmaceutical Research & Development, Division of Janssen Pharmaceutica, Beerse, Belgium and 4 Department of Nephrology and Internal Intensive Care, Charit\u00e9, Berlin, Germany"}]},{"given":"Willem","family":"Talloen","sequence":"additional","affiliation":[{"name":"1 Institute of Bioinformatics, Johannes Kepler University, Linz, Austria, 2 Institute for Biostatistics and Statistical Bioinformatics, Hasselt University, Hasselt, 3 Johnson & Johnson Pharmaceutical Research & Development, Division of Janssen Pharmaceutica, Beerse, Belgium and 4 Department of Nephrology and Internal Intensive Care, Charit\u00e9, Berlin, Germany"}]},{"given":"Luc","family":"Bijnens","sequence":"additional","affiliation":[{"name":"1 Institute of Bioinformatics, Johannes Kepler University, Linz, Austria, 2 Institute for Biostatistics and Statistical Bioinformatics, Hasselt University, Hasselt, 3 Johnson & Johnson Pharmaceutical Research & Development, Division of Janssen Pharmaceutica, Beerse, Belgium and 4 Department of Nephrology and Internal Intensive Care, Charit\u00e9, Berlin, Germany"}]},{"given":"Hinrich W. H.","family":"G\u00f6hlmann","sequence":"additional","affiliation":[{"name":"1 Institute of Bioinformatics, Johannes Kepler University, Linz, Austria, 2 Institute for Biostatistics and Statistical Bioinformatics, Hasselt University, Hasselt, 3 Johnson & Johnson Pharmaceutical Research & Development, Division of Janssen Pharmaceutica, Beerse, Belgium and 4 Department of Nephrology and Internal Intensive Care, Charit\u00e9, Berlin, Germany"}]},{"given":"Ziv","family":"Shkedy","sequence":"additional","affiliation":[{"name":"1 Institute of Bioinformatics, Johannes Kepler University, Linz, Austria, 2 Institute for Biostatistics and Statistical Bioinformatics, Hasselt University, Hasselt, 3 Johnson & Johnson Pharmaceutical Research & Development, Division of Janssen Pharmaceutica, Beerse, Belgium and 4 Department of Nephrology and Internal Intensive Care, Charit\u00e9, Berlin, Germany"}]},{"given":"Djork-Arn\u00e9","family":"Clevert","sequence":"additional","affiliation":[{"name":"1 Institute of Bioinformatics, Johannes Kepler University, Linz, Austria, 2 Institute for Biostatistics and Statistical Bioinformatics, Hasselt University, Hasselt, 3 Johnson & Johnson Pharmaceutical Research & Development, Division of Janssen Pharmaceutica, Beerse, Belgium and 4 Department of Nephrology and Internal Intensive Care, Charit\u00e9, Berlin, Germany"},{"name":"1 Institute of Bioinformatics, Johannes Kepler University, Linz, Austria, 2 Institute for Biostatistics and Statistical Bioinformatics, Hasselt University, Hasselt, 3 Johnson & Johnson Pharmaceutical Research & Development, Division of Janssen Pharmaceutica, Beerse, Belgium and 4 Department of Nephrology and Internal Intensive Care, Charit\u00e9, Berlin, Germany"}]}],"member":"286","published-online":{"date-parts":[[2010,4,23]]},"reference":[{"key":"2023012508062576600_B1","doi-asserted-by":"crossref","first-page":"1282","DOI":"10.1093\/bioinformatics\/btl099","article-title":"BicAT: a biclustering analysis toolbox","volume":"22","author":"Barkow","year":"2006","journal-title":"Bioinformatics"},{"key":"2023012508062576600_B2","doi-asserted-by":"crossref","first-page":"373","DOI":"10.1089\/10665270360688075","article-title":"Discovering local structure in gene expression data: the order-preserving submatrix problem","volume":"10","author":"Ben-Dor","year":"2003","journal-title":"J. Comput. Biol."},{"key":"2023012508062576600_B3","article-title":"Distributions involving correlated generalized gamma variables","volume-title":"Proceedings of the International Conference on Applied Stochastic Models and Data Analysis","author":"Bithas","year":"2007"},{"key":"2023012508062576600_B4","article-title":"Double conjugated clustering applied to leukemia microarray data","volume-title":"Proceedings of the 2nd SIAM International Conference on Data Mining\/Workshop on Clustering High Dimensional Data","author":"Busygin","year":"2002"},{"key":"2023012508062576600_B5","first-page":"291","article-title":"Bayesian biclustering with the plaid model","volume-title":"Proceedings of the IEEE International Workshop on Machine Learning for Signal Processing","author":"Caldas","year":"2008"},{"key":"2023012508062576600_B6","first-page":"75","article-title":"Analysis of gene expression microarays for phenotype classification","volume-title":"Proceedings of the International Conference on Computational Molecular Biology","author":"Califano","year":"2000"},{"key":"2023012508062576600_B7","first-page":"93","article-title":"Biclustering of expression data","volume-title":"Proceedings of the International Conference on Intelligent Systems for Molecular Biology","author":"Cheng","year":"2000"},{"key":"2023012508062576600_B8","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1111\/j.2517-6161.1977.tb01600.x","article-title":"Maximum likelihood from incomplete data via the EM algorithm","volume":"39","author":"Dempster","year":"1977","journal-title":"J. R. Stat. Soc. B Met."},{"key":"2023012508062576600_B9","doi-asserted-by":"crossref","DOI":"10.1007\/978-94-009-5564-6","volume-title":"An Introduction to Latent Variable Models.","author":"Everitt","year":"1984"},{"key":"2023012508062576600_B10","doi-asserted-by":"crossref","first-page":"209","DOI":"10.1186\/1471-2105-9-209","article-title":"Discovering biclusters in gene expression data based on high-dimensional linear geometries","volume":"9","author":"Gan","year":"2008","journal-title":"BMC Bioinformatics"},{"key":"2023012508062576600_B11","first-page":"679","article-title":"Learning probabilistic models of link structure","volume":"3","author":"Getoor","year":"2002","journal-title":"J. Mach. Learn. Res."},{"key":"2023012508062576600_B12","doi-asserted-by":"crossref","first-page":"12079","DOI":"10.1073\/pnas.210134797","article-title":"Coupled two-way clustering analysis of gene microarray data","volume":"97","author":"Getz","year":"2000","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012508062576600_B13","doi-asserted-by":"crossref","first-page":"2517","DOI":"10.1162\/089976601753196003","article-title":"A variational method for learning sparse and overcomplete representations","volume":"13","author":"Girolami","year":"2001","journal-title":"Neural Comput."},{"issue":"Suppl. 1","key":"2023012508062576600_B14","doi-asserted-by":"crossref","first-page":"S4","DOI":"10.1186\/1471-2164-9-S1-S4","article-title":"Bayesian biclustering of gene expression data","volume":"9","author":"Gu","year":"2008","journal-title":"BMC Genomics"},{"key":"2023012508062576600_B15","doi-asserted-by":"crossref","first-page":"446","DOI":"10.1093\/biostatistics\/kxp003","article-title":"A note on oligonucleotide expression values not being normally distributed","volume":"10","author":"Hardn","year":"2009","journal-title":"Biostatistics"},{"key":"2023012508062576600_B16","doi-asserted-by":"crossref","first-page":"123","DOI":"10.1080\/01621459.1972.10481214","article-title":"Direct clustering of a data matrix","volume":"67","author":"Hartigan","year":"1972","journal-title":"J. Am. Stat. Assoc."},{"key":"2023012508062576600_B17","doi-asserted-by":"crossref","first-page":"943","DOI":"10.1093\/bioinformatics\/btl033","article-title":"A new summarization method for Affymetrix probe level data","volume":"22","author":"Hochreiter","year":"2006","journal-title":"Bioinformatics"},{"key":"2023012508062576600_B18","doi-asserted-by":"crossref","first-page":"e1195","DOI":"10.1371\/journal.pone.0001195","article-title":"Subclass mapping: identifying common subtypes in independent disease data sets","volume":"2","author":"Hoshida","year":"2007","journal-title":"PLoS ONE"},{"key":"2023012508062576600_B19","first-page":"1457","article-title":"Non-negative matrix factorization with sparseness constraints","volume":"5","author":"Hoyer","year":"2004","journal-title":"J. Mach. Learn. Res."},{"key":"2023012508062576600_B20","first-page":"94","article-title":"Survey on independent component analysis","volume":"2","author":"Hyv\u00e4rinen","year":"1999","journal-title":"Neural Comput. Surv."},{"key":"2023012508062576600_B21","doi-asserted-by":"crossref","first-page":"1483","DOI":"10.1162\/neco.1997.9.7.1483","article-title":"A fast fixed-point algorithm for independent component analysis","volume":"9","author":"Hyv\u00e4rinen","year":"1999","journal-title":"Neural Comput."},{"key":"2023012508062576600_B22","doi-asserted-by":"crossref","first-page":"1993","DOI":"10.1093\/bioinformatics\/bth166","article-title":"Defining transcription modules using large-scale gene expression data","volume":"20","author":"Ihmels","year":"2004","journal-title":"Bioinformatics"},{"key":"2023012508062576600_B23","first-page":"201","article-title":"A toolbox for bicluster analysis in R","volume-title":"Compstat 2008 \u2013 Proceedings in Computational Statistics.","author":"Kaiser","year":"2008"},{"key":"2023012508062576600_B24","doi-asserted-by":"crossref","first-page":"703","DOI":"10.1101\/gr.648603","article-title":"Spectral biclustering of microarray data: coclustering genes and conditions","volume":"13","author":"Kluger","year":"2003","journal-title":"Genome Res."},{"key":"2023012508062576600_B25","first-page":"61","article-title":"Plaid models for gene expression data","volume":"12","author":"Lazzeroni","year":"2002","journal-title":"Stat. Sin."},{"key":"2023012508062576600_B26","doi-asserted-by":"crossref","first-page":"e101","DOI":"10.1093\/nar\/gkp491","article-title":"QUBIC: a qualitative biclustering algorithm for analyses of gene expression data","volume":"37","author":"Li","year":"2009","journal-title":"Nucleic Acids Res."},{"key":"2023012508062576600_B27","doi-asserted-by":"crossref","first-page":"24","DOI":"10.1109\/TCBB.2004.2","article-title":"Biclustering algorithms for biological data analysis: a survey","volume":"1","author":"Madeira","year":"2004","journal-title":"IEEE ACM Trans. Comput. Biol."},{"key":"2023012508062576600_B28","doi-asserted-by":"crossref","first-page":"8","DOI":"10.1186\/1748-7188-4-8","article-title":"A polynomial time biclustering algorithm for finding approximate expression patterns in gene expression time series","volume":"4","author":"Madeira","year":"2009","journal-title":"Algorithm Mol. Biol."},{"key":"2023012508062576600_B29","doi-asserted-by":"crossref","first-page":"153","DOI":"10.1109\/TCBB.2008.34","article-title":"Identification of regulatory modules in time series gene expression data using a linear time biclustering algorithm","volume":"7","author":"Madeira","year":"2010","journal-title":"IEEE ACM Trans. Comput. Biol."},{"key":"2023012508062576600_B30","doi-asserted-by":"crossref","first-page":"32","DOI":"10.1137\/0105003","article-title":"Algorithms for the assignment and transportation problems","volume":"5","author":"Munkres","year":"1957","journal-title":"J. Soc. Ind. Appl. Math."},{"key":"2023012508062576600_B31","first-page":"77","article-title":"Extracting conserved gene expression motifs from gene expression data","volume-title":"Pacific Symposium on Biocomputing","author":"Murali","year":"2003"},{"key":"2023012508062576600_B32","first-page":"1059","article-title":"Variational EM algorithms for non-Gaussian latent variable models","volume-title":"Advances in Neural Information Processing Systems 18","author":"Palmer","year":"2006"},{"key":"2023012508062576600_B33","doi-asserted-by":"crossref","first-page":"1122","DOI":"10.1093\/bioinformatics\/btl060","article-title":"A systematic comparison and evaluation of biclustering methods for gene expression data","volume":"22","author":"Prelic","year":"2006","journal-title":"Bioinformatics"},{"key":"2023012508062576600_B34","doi-asserted-by":"crossref","first-page":"280","DOI":"10.1186\/1471-2105-7-280","article-title":"Integrated biclustering of heterogeneous genome-wide datasets for the inference of global regulatory networks","volume":"2","author":"Reiss","year":"2006","journal-title":"BMC Bioinformatics"},{"key":"2023012508062576600_B35","doi-asserted-by":"crossref","first-page":"1937","DOI":"10.1056\/NEJMoa012914","article-title":"The use of molecular profiling to predict survival after chemotherapy for diffuse large-B-cell lymphoma","volume":"346","author":"Rosenwald","year":"2002","journal-title":"New Engl. J. Med."},{"key":"2023012508062576600_B36","doi-asserted-by":"crossref","first-page":"232","DOI":"10.1186\/1471-2105-6-232","article-title":"EXPANDER \u2013 an integrative program suite for microarray data analysis","volume":"6","author":"Shamir","year":"2005","journal-title":"BMC Bioinformatics"},{"issue":"Suppl. 2","key":"2023012508062576600_B37","doi-asserted-by":"crossref","first-page":"ii196","DOI":"10.1093\/bioinformatics\/btg1078","article-title":"Biclustering micrarray data by Gibbs sampling","volume":"19","author":"Sheng","year":"2003","journal-title":"Bioinformatics"},{"key":"2023012508062576600_B38","doi-asserted-by":"crossref","first-page":"4465","DOI":"10.1073\/pnas.012025199","article-title":"Large-scale analysis of the human and mouse transcriptomes","volume":"99","author":"Su","year":"2002","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012508062576600_B39","doi-asserted-by":"crossref","first-page":"2897","DOI":"10.1093\/bioinformatics\/btm478","article-title":"I\/NI-calls for the exclusion of non-informative genes: a highly effective feature filtering tool for microarray data","volume":"23","author":"Talloen","year":"2007","journal-title":"Bioinformatics"},{"issue":"Suppl. 1","key":"2023012508062576600_B40","doi-asserted-by":"crossref","first-page":"S136","DOI":"10.1093\/bioinformatics\/18.suppl_1.S136","article-title":"Discovering statistically significant biclusters in gene expression data","volume":"18","author":"Tanay","year":"2002","journal-title":"Bioinformatics"},{"key":"2023012508062576600_B41","first-page":"41","article-title":"Interrelated two-way clustering: an unsupervised approach for gene expression data analysis","volume-title":"Proceedings of the 2nd IEEE International Symposium on Bioinformatics and Bioengineering","author":"Tang","year":"2001"},{"key":"2023012508062576600_B42","article-title":"Clustering methods for the analysis of DNA microarray data","volume-title":"Technical report","author":"Tibshirani","year":"1999"},{"key":"2023012508062576600_B43","doi-asserted-by":"crossref","first-page":"235","DOI":"10.1016\/j.csda.2004.02.003","article-title":"Improved biclustering of microarray data demonstrated through systematic performance tests","volume":"48","author":"Turner","year":"2003","journal-title":"Comput. Stat. Data Anal."},{"key":"2023012508062576600_B44","article-title":"Robust Algorithms for Inferring Regulatory Networks Based on Gene Expression Measurements and Biological Prior Information","volume-title":"PhD Thesis","author":"Van den Bulcke","year":"2009"},{"key":"2023012508062576600_B45","doi-asserted-by":"crossref","first-page":"530","DOI":"10.1038\/415530a","article-title":"Gene expression profiling predicts clinical outcome of breast cancer","volume":"415","author":"van't Veer","year":"2002","journal-title":"Nature"},{"key":"2023012508062576600_B46","doi-asserted-by":"crossref","first-page":"394","DOI":"10.1145\/564691.564737","article-title":"Clustering by pattern similarity in large data sets","volume-title":"Proceedings of the 2002 ACM SIGMOD International Conference on Management of Data","author":"Wang","year":"2002"},{"key":"2023012508062576600_B47","doi-asserted-by":"crossref","first-page":"771","DOI":"10.1142\/S0218213005002387","article-title":"An improved biclustering method for analyzing gene expression profiles","volume":"14","author":"Yang","year":"2005","journal-title":"Int. J. Artif. Intell. T."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/26\/12\/1520\/48859059\/bioinformatics_26_12_1520.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/26\/12\/1520\/48859059\/bioinformatics_26_12_1520.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,2,20]],"date-time":"2025-02-20T05:05:29Z","timestamp":1740027929000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/26\/12\/1520\/287036"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2010,4,23]]},"references-count":47,"journal-issue":{"issue":"12","published-print":{"date-parts":[[2010,6,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btq227","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2010,6,15]]},"published":{"date-parts":[[2010,4,23]]}}}