{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,12]],"date-time":"2026-03-12T04:58:38Z","timestamp":1773291518148,"version":"3.50.1"},"reference-count":38,"publisher":"Oxford University Press (OUP)","issue":"18","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2012,9,15]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: Large public repositories of gene expression measurements offer the opportunity to position a new experiment into the context of earlier studies. While previous methods rely on experimental annotation or global similarity of expression profiles across genes or gene sets, we compare experiments by measuring similarity based on an unsupervised, data-driven regulatory model around pre-specified genes of interest. Our experiment retrieval approach is novel in two conceptual respects: (i) targetable focus and interpretability: the analysis is targeted at regulatory relationships of genes that are relevant to the analyst or come from prior knowledge; (ii) regulatory model-based similarity measure: related experiments are retrieved based on the strength of inferred regulatory links between genes.<\/jats:p><jats:p>Results: We learn a model for the regulation of specific genes from a data repository and exploit it to construct a similarity metric for an information retrieval task. We use the Fisher kernel, a rigorous similarity measure that typically has been applied to use generative models in discriminative classifiers. Results on human and plant microarray collections indicate that our method is able to substantially improve the retrieval of related experiments against standard methods. Furthermore, it allows the user to interpret biological conditions in terms of changes in link activity patterns. Our study of the osmotic stress network for Arabidopsis thaliana shows that the method successfully identifies relevant relationships around given key genes.<\/jats:p><jats:p>Availability: The code (R) is available at http:\/\/research.ics.tkk.fi\/mi\/software.shtml.<\/jats:p><jats:p>Contact: elisabeth.georgii@aalto.fi; jarkko.salojarvi@helsinki.fi; samuel.kaski@hiit.fi<\/jats:p><jats:p>Supplementary Information: Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/bts361","type":"journal-article","created":{"date-parts":[[2012,6,29]],"date-time":"2012-06-29T00:52:19Z","timestamp":1340931139000},"page":"2349-2356","source":"Crossref","is-referenced-by-count":17,"title":["Targeted retrieval of gene expression measurements using regulatory models"],"prefix":"10.1093","volume":"28","author":[{"given":"Elisabeth","family":"Georgii","sequence":"first","affiliation":[{"name":"1 Helsinki Institute for Information Technology HIIT, Department of Information and Computer Science, Aalto University, 00076 Aalto, Espoo, Finland, 2Division of Plant Biology, Department of Biosciences, University of Helsinki, 00014 University of Helsinki, Helsinki, Finland, 3Veterinary Microbiology and Epidemiology, Department of Veterinary Biosciences, University of Helsinki, 00014 University of Helsinki, Helsinki, Finland and 4Helsinki Institute for Information Technology HIIT, Department of Computer Science, University of Helsinki, 00014 University of Helsinki, Helsinki, Finland"}]},{"given":"Jarkko","family":"Saloj\u00e4rvi","sequence":"additional","affiliation":[{"name":"1 Helsinki Institute for Information Technology HIIT, Department of Information and Computer Science, Aalto University, 00076 Aalto, Espoo, Finland, 2Division of Plant Biology, Department of Biosciences, University of Helsinki, 00014 University of Helsinki, Helsinki, Finland, 3Veterinary Microbiology and Epidemiology, Department of Veterinary Biosciences, University of Helsinki, 00014 University of Helsinki, Helsinki, Finland and 4Helsinki Institute for Information Technology HIIT, Department of Computer Science, University of Helsinki, 00014 University of Helsinki, Helsinki, Finland"},{"name":"1 Helsinki Institute for Information Technology HIIT, Department of Information and Computer Science, Aalto University, 00076 Aalto, Espoo, Finland, 2Division of Plant Biology, Department of Biosciences, University of Helsinki, 00014 University of Helsinki, Helsinki, Finland, 3Veterinary Microbiology and Epidemiology, Department of Veterinary Biosciences, University of Helsinki, 00014 University of Helsinki, Helsinki, Finland and 4Helsinki Institute for Information Technology HIIT, Department of Computer Science, University of Helsinki, 00014 University of Helsinki, Helsinki, Finland"}]},{"given":"Mikael","family":"Brosch\u00e9","sequence":"additional","affiliation":[{"name":"1 Helsinki Institute for Information Technology HIIT, Department of Information and Computer Science, Aalto University, 00076 Aalto, Espoo, Finland, 2Division of Plant Biology, Department of Biosciences, University of Helsinki, 00014 University of Helsinki, Helsinki, Finland, 3Veterinary Microbiology and Epidemiology, Department of Veterinary Biosciences, University of Helsinki, 00014 University of Helsinki, Helsinki, Finland and 4Helsinki Institute for Information Technology HIIT, Department of Computer Science, University of Helsinki, 00014 University of Helsinki, Helsinki, Finland"}]},{"given":"Jaakko","family":"Kangasj\u00e4rvi","sequence":"additional","affiliation":[{"name":"1 Helsinki Institute for Information Technology HIIT, Department of Information and Computer Science, Aalto University, 00076 Aalto, Espoo, Finland, 2Division of Plant Biology, Department of Biosciences, University of Helsinki, 00014 University of Helsinki, Helsinki, Finland, 3Veterinary Microbiology and Epidemiology, Department of Veterinary Biosciences, University of Helsinki, 00014 University of Helsinki, Helsinki, Finland and 4Helsinki Institute for Information Technology HIIT, Department of Computer Science, University of Helsinki, 00014 University of Helsinki, Helsinki, Finland"}]},{"given":"Samuel","family":"Kaski","sequence":"additional","affiliation":[{"name":"1 Helsinki Institute for Information Technology HIIT, Department of Information and Computer Science, Aalto University, 00076 Aalto, Espoo, Finland, 2Division of Plant Biology, Department of Biosciences, University of Helsinki, 00014 University of Helsinki, Helsinki, Finland, 3Veterinary Microbiology and Epidemiology, Department of Veterinary Biosciences, University of Helsinki, 00014 University of Helsinki, Helsinki, Finland and 4Helsinki Institute for Information Technology HIIT, Department of Computer Science, University of Helsinki, 00014 University of Helsinki, Helsinki, Finland"},{"name":"1 Helsinki Institute for Information Technology HIIT, Department of Information and Computer Science, Aalto University, 00076 Aalto, Espoo, Finland, 2Division of Plant Biology, Department of Biosciences, University of Helsinki, 00014 University of Helsinki, Helsinki, Finland, 3Veterinary Microbiology and Epidemiology, Department of Veterinary Biosciences, University of Helsinki, 00014 University of Helsinki, Helsinki, Finland and 4Helsinki Institute for Information Technology HIIT, Department of Computer Science, University of Helsinki, 00014 University of Helsinki, Helsinki, Finland"}]}],"member":"286","published-online":{"date-parts":[[2012,6,27]]},"reference":[{"key":"2023012513001870900_bts361-B1","volume-title":"Introduction to Machine Learning","author":"Alpaydin","year":"2010","edition":"2nd"},{"key":"2023012513001870900_bts361-B2","doi-asserted-by":"crossref","first-page":"205","DOI":"10.1214\/08-EJS314","article-title":"Inferring sparse Gaussian graphical models with latent structure","volume":"3","author":"Ambroise","year":"2009","journal-title":"Electron J. Stat."},{"key":"2023012513001870900_bts361-B3","first-page":"179","article-title":"Statistical analysis of non-lattice data","volume":"24","author":"Besag","year":"1975","journal-title":"J. R. Stat. Soc. Ser. D Statist."},{"key":"2023012513001870900_bts361-B4","doi-asserted-by":"crossref","first-page":"1185","DOI":"10.1104\/pp.105.061275","article-title":"Osmotic signaling in plants. Multiple pathways mediated by emerging kinase families","volume":"138","author":"Boudsocq","year":"2005","journal-title":"Plant Physiology"},{"key":"2023012513001870900_bts361-B5","doi-asserted-by":"crossref","first-page":"i145","DOI":"10.1093\/bioinformatics\/btp215","article-title":"Probabilistic retrieval and visualization of biologically relevant microarray experiments","volume":"25","author":"Caldas","year":"2009","journal-title":"Bioinformatics"},{"key":"2023012513001870900_bts361-B6","doi-asserted-by":"crossref","first-page":"246","DOI":"10.1093\/bioinformatics\/btr634","article-title":"Data-driven information retrieval in heterogeneous collections of transcriptomics data links SIM2s to malignant pleural mesothelioma","volume":"28","author":"Caldas","year":"2012","journal-title":"Bioinformatics"},{"key":"2023012513001870900_bts361-B7","doi-asserted-by":"crossref","first-page":"157","DOI":"10.2307\/2528966","article-title":"Covariance selection","volume":"28","author":"Dempster","year":"1972","journal-title":"Biometrics"},{"issue":"1","key":"2023012513001870900_bts361-B8","doi-asserted-by":"crossref","first-page":"196","DOI":"10.1016\/j.jmva.2004.02.009","article-title":"Sparse graphical models for exploring gene expression data","volume":"90","author":"Dobra","year":"2004","journal-title":"J. Multivariate Analy."},{"key":"2023012513001870900_bts361-B9","doi-asserted-by":"crossref","first-page":"603","DOI":"10.1186\/1471-2105-11-603","article-title":"Content-based microarray search using differential expression profiles","volume":"11","author":"Engreitz","year":"2010","journal-title":"BMC Bioinformatics"},{"key":"2023012513001870900_bts361-B10","doi-asserted-by":"crossref","first-page":"411","DOI":"10.1186\/1471-2164-10-411","article-title":"GEM-TREND: a web tool for gene expression data mining toward relevant network discovery","volume":"10","author":"Feng","year":"2009","journal-title":"BMC Genomics"},{"key":"2023012513001870900_bts361-B11","doi-asserted-by":"crossref","first-page":"3103","DOI":"10.1093\/bioinformatics\/btm462","article-title":"CellMontage: similar expression profile search server","volume":"23","author":"Fujibuchi","year":"2007","journal-title":"Bioinformatics"},{"key":"2023012513001870900_bts361-B12","doi-asserted-by":"crossref","first-page":"W368","DOI":"10.1093\/nar\/gkr440","article-title":"Pilgrm: an interactive data-driven discovery platform for expert biologists","volume":"39","author":"Greene","year":"2011","journal-title":"Nucleic Acids Res."},{"key":"2023012513001870900_bts361-B13","first-page":"49","article-title":"Dependency networks for inference, collaborative filtering, and data visualization","volume":"1","author":"Heckerman","year":"2001","journal-title":"J. Mach. Learn. Res."},{"key":"2023012513001870900_bts361-B14","doi-asserted-by":"crossref","first-page":"2692","DOI":"10.1093\/bioinformatics\/btm403","article-title":"Exploring the functional landscape of gene expression: directed search of large microarray compendia","volume":"23","author":"Hibbs","year":"2007","journal-title":"Bioinformatics"},{"key":"2023012513001870900_bts361-B15","doi-asserted-by":"crossref","first-page":"6823","DOI":"10.1073\/pnas.0912043107","article-title":"Bayesian approach to transforming public gene expression repositories into disease diagnosis databases","volume":"107","author":"Huang","year":"2010","journal-title":"Proc. Nat. Acad. Sci. USA"},{"key":"2023012513001870900_bts361-B16","doi-asserted-by":"crossref","first-page":"i330","DOI":"10.1093\/bioinformatics\/btn160","article-title":"Assessing the functional structure of genomic data","volume":"24","author":"Huttenhower","year":"2008","journal-title":"Bioinformatics"},{"key":"2023012513001870900_bts361-B17","first-page":"487","article-title":"Exploiting generative models in discriminative classifiers","volume-title":"Proceedings of the 1998 Conference on Advances in Neural Information Processing Systems II","author":"Jaakkola","year":"1999"},{"key":"2023012513001870900_bts361-B18","doi-asserted-by":"crossref","first-page":"2713","DOI":"10.1093\/bioinformatics\/btq500","article-title":"Global modeling of transcriptional responses in interaction networks","volume":"26","author":"Lahti","year":"2010","journal-title":"Bioinformatics"},{"key":"2023012513001870900_bts361-B19","doi-asserted-by":"crossref","first-page":"2416","DOI":"10.1093\/bioinformatics\/btq451","article-title":"Cross-species queries of large gene expression databases","volume":"26","author":"Le","year":"2010","journal-title":"Bioinformatics"},{"key":"2023012513001870900_bts361-B20","doi-asserted-by":"crossref","first-page":"322","DOI":"10.1038\/nbt0410-322","article-title":"A global map of human gene expression","volume":"28","author":"Lukk","year":"2010","journal-title":"Nat. Biotechnol."},{"key":"2023012513001870900_bts361-B21","doi-asserted-by":"crossref","first-page":"24","DOI":"10.1109\/TCBB.2004.2","article-title":"Biclustering algorithms for biological data analysis: A survey","volume":"1","author":"Madeira","year":"2004","journal-title":"IEEE\/ACM Trans. Comput. Biol. Bioinformatics"},{"key":"2023012513001870900_bts361-B22","doi-asserted-by":"crossref","first-page":"S5","DOI":"10.1186\/1471-2105-8-S6-S5","article-title":"Inferring cellular networks\u2014a review","volume":"8","author":"Markowetz","year":"2007","journal-title":"BMC Bioinformatics"},{"key":"2023012513001870900_bts361-B23","doi-asserted-by":"crossref","first-page":"1436","DOI":"10.1214\/009053606000000281","article-title":"High dimensional graphs and variable selection with the Lasso","volume":"34","author":"Meinshausen","year":"2006","journal-title":"Ann. Stat."},{"key":"2023012513001870900_bts361-B24","doi-asserted-by":"crossref","first-page":"D868","DOI":"10.1093\/nar\/gkn889","article-title":"Arrayexpress update\u2014from an archive of functional genomics experiments to the atlas of gene expression","volume":"37","author":"Parkinson","year":"2009","journal-title":"Nucleic Acids Res."},{"key":"2023012513001870900_bts361-B25","doi-asserted-by":"crossref","first-page":"18822","DOI":"10.1073\/pnas.0605639103","article-title":"Dual function of an Arabidopsis transcription factor DREB2A in water-stress-responsive and heat-stress-responsive gene expression","volume":"103","author":"Sakuma","year":"2006","journal-title":"Proc. Nat. Acad. Sci. USA"},{"key":"2023012513001870900_bts361-B26","doi-asserted-by":"crossref","DOI":"10.2202\/1544-6115.1175","article-title":"A shrinkage approach to large-scale covariance matrix estimation and implications for functional genomics","volume":"4","author":"Sch\u00e4fer","year":"2005","journal-title":"Stat. Appl. Genet. Mol. Biol."},{"key":"2023012513001870900_bts361-B27","doi-asserted-by":"crossref","DOI":"10.1109\/CVPR.2008.4587367","article-title":"Structure learning in random fields for heart motion abnormality detection","author":"Schmidt","year":"2008","journal-title":"Computer Vision and Pattern Recognition (CVPR 2008)"},{"key":"2023012513001870900_bts361-B28","doi-asserted-by":"crossref","first-page":"1090","DOI":"10.1038\/ng1434","article-title":"A module map showing conditional activity of expression modules in cancer","volume":"36","author":"Segal","year":"2004","journal-title":"Nat Genet"},{"key":"2023012513001870900_bts361-B29","doi-asserted-by":"crossref","first-page":"583429","DOI":"10.1155\/2009\/583429","article-title":"STIFDB\u2014Arabidopsis stress responsive transcription factor database","volume":"2009","author":"Shameer","year":"2009","journal-title":"Int. J. Plant Genomics"},{"key":"2023012513001870900_bts361-B30","doi-asserted-by":"crossref","DOI":"10.1017\/CBO9780511809682","volume-title":"Kernel Methods for Pattern Analysis.","author":"Shawe-Taylor","year":"2004"},{"key":"2023012513001870900_bts361-B31","doi-asserted-by":"crossref","first-page":"1064","DOI":"10.1093\/bioinformatics\/btq080","article-title":"Inferring dynamic gene networks under varying conditions for transcriptomic network comparison","volume":"26","author":"Shimamura","year":"2010","journal-title":"Bioinformatics"},{"key":"2023012513001870900_bts361-B32","doi-asserted-by":"crossref","first-page":"e72","DOI":"10.1093\/nar\/gkl1133","article-title":"Three methods for optimization of cross-laboratory and cross-platform microarray expression data","volume":"35","author":"Stafford","year":"2007","journal-title":"Nucleic Acids Res."},{"key":"2023012513001870900_bts361-B33","doi-asserted-by":"crossref","first-page":"e1000662","DOI":"10.1371\/journal.pcbi.1000662","article-title":"Network-based elucidation of human disease similarities reveals common functional modules enriched for pluripotent drug targets","volume":"6","author":"Suthram","year":"2010","journal-title":"PLoS Comput Biol."},{"key":"2023012513001870900_bts361-B34","doi-asserted-by":"crossref","first-page":"267","DOI":"10.1111\/j.2517-6161.1996.tb02080.x","article-title":"Regression shrinkage and selection via the Lasso","volume":"58","author":"Tibshirani","year":"1996","journal-title":"J. Roy. Stat. Soc. Ser. B Methodol."},{"key":"2023012513001870900_bts361-B35","doi-asserted-by":"crossref","first-page":"515","DOI":"10.1016\/j.bbrc.2008.01.134","article-title":"Functional analysis of an Arabidopsis heat-shock transcription factor HsfA3 in the transcriptional cascade downstream of the DREB2A stress-regulatory system","volume":"368","author":"Yoshida","year":"2008","journal-title":"Biochem Biophys Res Commun"},{"key":"2023012513001870900_bts361-B36","first-page":"701","article-title":"Learning structural changes of Gaussian graphical models in controlled experiments","volume-title":"Proceedings of the Twenty-Sixth Conference Annual Conference on Uncertainty in Artificial Intelligence (UAI-10)","author":"Zhang","year":"2010"},{"key":"2023012513001870900_bts361-B37","doi-asserted-by":"crossref","first-page":"2798","DOI":"10.1093\/bioinformatics\/btn520","article-title":"GEOmetadb: powerful alternative search engine for the Gene Expression Omnibus","volume":"24","author":"Zhu","year":"2008","journal-title":"Bioinformatics"},{"key":"2023012513001870900_bts361-B38","doi-asserted-by":"crossref","first-page":"301","DOI":"10.1111\/j.1467-9868.2005.00503.x","article-title":"Regularization and variable selection via the elastic net","volume":"67","author":"Zou","year":"2005","journal-title":"J. Roy. Stat. Soc. Ser. B Methodol"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/28\/18\/2349\/48884775\/bioinformatics_28_18_2349.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/28\/18\/2349\/48884775\/bioinformatics_28_18_2349.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,4,26]],"date-time":"2024-04-26T06:08:26Z","timestamp":1714111706000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/28\/18\/2349\/242165"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2012,6,27]]},"references-count":38,"journal-issue":{"issue":"18","published-print":{"date-parts":[[2012,9,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/bts361","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2012,9,15]]},"published":{"date-parts":[[2012,6,27]]}}}