{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,8]],"date-time":"2026-02-08T07:50:04Z","timestamp":1770537004508,"version":"3.49.0"},"reference-count":29,"publisher":"Oxford University Press (OUP)","issue":"5","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2011,3,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Large-scale RNA expression measurements are generating enormous quantities of data. During the last two decades, many methods were developed for extracting insights regarding the interrelationships between genes from such data. The mathematical and computational perspectives that underlie these methods are usually algebraic or probabilistic.<\/jats:p>\n               <jats:p>Results: Here, we introduce an unexplored geometric view point where expression levels of genes in multiple experiments are interpreted as vectors in a high-dimensional space. Specifically, we find, for the expression profile of each particular gene, its approximation as a linear combination of profiles of a few other genes. This method is inspired by recent developments in the realm of compressed sensing in the machine learning domain. To demonstrate the power of our approach in extracting valuable information from the expression data, we independently applied it to large-scale experiments carried out on the yeast and malaria parasite whole transcriptomes. The parameters extracted from the sparse reconstruction of the expression profiles, when fed to a supervised learning platform, were used to successfully predict the relationships between genes throughout the Gene Ontology hierarchy and protein\u2013protein interaction map. Extensive assessment of the biological results shows high accuracy in both recovering known predictions and in yielding accurate predictions missing from the current databases. 
We suggest that the geometrical approach presented here is suitable for a broad range of high-dimensional experimental data.<\/jats:p>\n               <jats:p>Contact: \u00a0michall@cc.huji.ac.il<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btr002","type":"journal-article","created":{"date-parts":[[2011,1,23]],"date-time":"2011-01-23T01:16:30Z","timestamp":1295745390000},"page":"655-661","source":"Crossref","is-referenced-by-count":9,"title":["Recovering key biological constituents through sparse representation of gene expression"],"prefix":"10.1093","volume":"27","author":[{"given":"Yosef","family":"Prat","sequence":"first","affiliation":[{"name":"1 School of Computer Science and Engineering, 2Sudarsky Center for Computational Biology and 3Department of Biological Chemistry, Institute of Life Sciences, The Hebrew University of Jerusalem, Jerusalem, Israel"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Menachem","family":"Fromer","sequence":"additional","affiliation":[{"name":"1 School of Computer Science and Engineering, 2Sudarsky Center for Computational Biology and 3Department of Biological Chemistry, Institute of Life Sciences, The Hebrew University of Jerusalem, Jerusalem, Israel"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Nathan","family":"Linial","sequence":"additional","affiliation":[{"name":"1 School of Computer Science and Engineering, 2Sudarsky Center for Computational Biology and 3Department of Biological Chemistry, Institute of Life Sciences, The Hebrew University of Jerusalem, Jerusalem, 
Israel"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Michal","family":"Linial","sequence":"additional","affiliation":[{"name":"1 School of Computer Science and Engineering, 2Sudarsky Center for Computational Biology and 3Department of Biological Chemistry, Institute of Life Sciences, The Hebrew University of Jerusalem, Jerusalem, Israel"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2011,1,21]]},"reference":[{"key":"2023012511564475800_B1","doi-asserted-by":"crossref","first-page":"6745","DOI":"10.1073\/pnas.96.12.6745","article-title":"Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays","volume":"96","author":"Alon","year":"1999","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012511564475800_B2","doi-asserted-by":"crossref","first-page":"10101","DOI":"10.1073\/pnas.97.18.10101","article-title":"Singular value decomposition for genome-wide expression data processing and modeling","volume":"97","author":"Alter","year":"2000","journal-title":"Proc. Natl Acad. Sci. 
USA"},{"key":"2023012511564475800_B3","doi-asserted-by":"crossref","first-page":"589","DOI":"10.1093\/bioinformatics\/btk026","article-title":"A multi-step approach to time series analysis and gene expression clustering","volume":"22","author":"Amato","year":"2006","journal-title":"Bioinformatics"},{"key":"2023012511564475800_B4","doi-asserted-by":"crossref","first-page":"1880","DOI":"10.1002\/pmic.200900723","article-title":"Pathway Palette: a rich internet application for peptide-, protein- and network-oriented analysis of MS data","volume":"10","author":"Askenazi","year":"2010","journal-title":"Proteomics"},{"key":"2023012511564475800_B5","doi-asserted-by":"crossref","first-page":"78","DOI":"10.1038\/nbt924","article-title":"Gaining confidence in high-throughput protein interaction networks","volume":"22","author":"Bader","year":"2004","journal-title":"Nat. Biotechnol."},{"key":"2023012511564475800_B6","doi-asserted-by":"crossref","first-page":"D396","DOI":"10.1093\/nar\/gkn803","article-title":"The GOA database in 2009\u2013an integrated Gene Ontology Annotation resource","volume":"37","author":"Barrell","year":"2009","journal-title":"Nucleic Acids Res."},{"key":"2023012511564475800_B7","doi-asserted-by":"crossref","first-page":"699","DOI":"10.1038\/nrg2144","article-title":"Integrating physical and genetic maps: from genomes to interaction networks","volume":"8","author":"Beyer","year":"2007","journal-title":"Nat. Rev. Genet."},{"key":"2023012511564475800_B8","doi-asserted-by":"crossref","first-page":"589","DOI":"10.1016\/j.crma.2008.03.014","article-title":"The restricted isometry property and its implications for compressed sensing","volume":"346","author":"Candes","year":"2008","journal-title":"Compt. Rend. Math."},{"key":"2023012511564475800_B9","doi-asserted-by":"crossref","first-page":"4203","DOI":"10.1109\/TIT.2005.858979","article-title":"Decoding by linear programming","volume":"51","author":"Candes","year":"2005","journal-title":"IEEE Trans. Inf. 
Theory"},{"key":"2023012511564475800_B10","doi-asserted-by":"crossref","first-page":"R9","DOI":"10.1186\/gb-2004-6-1-r9","article-title":"Integration with the human genome of peptide sequences obtained by high-throughput mass spectrometry","volume":"6","author":"Desiere","year":"2005","journal-title":"Genome Biol."},{"key":"2023012511564475800_B11","doi-asserted-by":"crossref","first-page":"797","DOI":"10.1002\/cpa.20132","article-title":"For most large underdetermined systems of linear equations the minimal l(1)-norm solution is also the sparsest solution","volume":"59","author":"Donoho","year":"2006","journal-title":"Commun. Pure Appl. Math."},{"key":"2023012511564475800_B12","doi-asserted-by":"crossref","first-page":"14863","DOI":"10.1073\/pnas.95.25.14863","article-title":"Cluster analysis and display of genome-wide expression patterns","volume":"95","author":"Eisen","year":"1998","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012511564475800_B13","doi-asserted-by":"crossref","first-page":"119","DOI":"10.1006\/jcss.1997.1504","article-title":"A decision-theoretic generalization of on-line learning and an application to boosting","volume":"55","author":"Freund","year":"1997","journal-title":"J. Comput. Syst. Sci."},{"key":"2023012511564475800_B14","doi-asserted-by":"crossref","first-page":"601","DOI":"10.1089\/106652700750050961","article-title":"Using Bayesian networks to analyze expression data","volume":"7","author":"Friedman","year":"2000","journal-title":"J. Comput. Biol."},{"key":"2023012511564475800_B15","doi-asserted-by":"crossref","first-page":"91","DOI":"10.1038\/nbt.1597","article-title":"Transcriptional profiling of growth perturbations of the human malaria parasite Plasmodium falciparum","volume":"28","author":"Hu","year":"2009","journal-title":"Nat. 
Biotechnol."},{"key":"2023012511564475800_B16","doi-asserted-by":"crossref","first-page":"359","DOI":"10.1186\/1471-2105-7-359","article-title":"Comparison and evaluation of methods for generating differentially expressed gene lists from microarray data","volume":"7","author":"Jeffery","year":"2006","journal-title":"BMC Bioinformatics"},{"key":"2023012511564475800_B17","doi-asserted-by":"crossref","first-page":"21","DOI":"10.1038\/ng0501-21","article-title":"A literature network of human genes for high-throughput analysis of gene expression","volume":"28","author":"Jenssen","year":"2001","journal-title":"Nat. Genet."},{"key":"2023012511564475800_B18","doi-asserted-by":"crossref","first-page":"53","DOI":"10.1186\/1471-2164-10-53","article-title":"Combinatorial effects of environmental parameters on transcriptional regulation in Saccharomyces cerevisiae: a quantitative analysis of a compendium of chemostat-based transcriptome data","volume":"10","author":"Knijnenburg","year":"2009","journal-title":"BMC Genomics"},{"key":"2023012511564475800_B19","doi-asserted-by":"crossref","first-page":"1112","DOI":"10.1101\/gr.225302","article-title":"Interactive exploration of microarray gene expression patterns in a reduced dimensional space","volume":"12","author":"Misra","year":"2002","journal-title":"Genome Res."},{"key":"2023012511564475800_B20","doi-asserted-by":"crossref","first-page":"227","DOI":"10.1137\/S0097539792240406","article-title":"Sparse Approximate Solutions to Linear-Systems","volume":"24","author":"Natarajan","year":"1995","journal-title":"SIAM J. Comput."},{"key":"2023012511564475800_B21","doi-asserted-by":"crossref","first-page":"63","DOI":"10.1038\/msb4100096","article-title":"Weighing our measures of gene expression","volume":"2","author":"Quackenbush","year":"2006","journal-title":"Mol. Syst. 
Biol."},{"key":"2023012511564475800_B22","first-page":"455","article-title":"Principal components analysis to summarize microarray experiments: application to sporulation time series","author":"Raychaudhuri","year":"2000","journal-title":"Pac. Symp. Biocomput."},{"key":"2023012511564475800_B23","doi-asserted-by":"crossref","first-page":"1025","DOI":"10.1002\/cpa.20227","article-title":"On sparse reconstruction from Fourier and Gaussian measurements","volume":"61","author":"Rudelson","year":"2008","journal-title":"Commun. Pure Appl. Math."},{"key":"2023012511564475800_B24","doi-asserted-by":"crossref","first-page":"502","DOI":"10.1038\/ng1033","article-title":"From patterns to pathways: gene expression data analysis comes of age","volume":"32","author":"Slonim","year":"2002","journal-title":"Nat. Genet."},{"key":"2023012511564475800_B25","doi-asserted-by":"crossref","first-page":"D535","DOI":"10.1093\/nar\/gkj109","article-title":"BioGRID: a general repository for interaction datasets","volume":"34","author":"Stark","year":"2006","journal-title":"Nucleic Acids Res."},{"key":"2023012511564475800_B26","doi-asserted-by":"crossref","first-page":"258","DOI":"10.1093\/nar\/gkg034","article-title":"STRING: a database of predicted functional associations between proteins","volume":"31","author":"von Mering","year":"2003","journal-title":"Nucleic Acids Res."},{"key":"2023012511564475800_B27","doi-asserted-by":"crossref","first-page":"2137","DOI":"10.1093\/nar\/gkl219","article-title":"Prediction of yeast protein-protein interaction network: insights from the Gene Ontology and annotations","volume":"34","author":"Wu","year":"2006","journal-title":"Nucleic Acids Res."},{"key":"2023012511564475800_B28","doi-asserted-by":"crossref","first-page":"763","DOI":"10.1093\/bioinformatics\/17.9.763","article-title":"Principal component analysis for clustering gene expression 
data","volume":"17","author":"Yeung","year":"2001","journal-title":"Bioinformatics"},{"key":"2023012511564475800_B29","doi-asserted-by":"crossref","first-page":"12783","DOI":"10.1073\/pnas.192159399","article-title":"Transitive functional annotation by shortest-path analysis of gene expression data","volume":"99","author":"Zhou","year":"2002","journal-title":"Proc. Natl Acad. Sci. USA"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/27\/5\/655\/48864899\/bioinformatics_27_5_655.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/27\/5\/655\/48864899\/bioinformatics_27_5_655.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,25]],"date-time":"2023-01-25T12:35:58Z","timestamp":1674650158000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/27\/5\/655\/1746213"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,1,21]]},"references-count":29,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2011,3,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btr002","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2011,3,1]]},"published":{"date-parts":[[2011,1,21]]}}}