{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,6]],"date-time":"2026-05-06T07:53:54Z","timestamp":1778054034145,"version":"3.51.4"},"reference-count":39,"publisher":"Oxford University Press (OUP)","issue":"15","license":[{"start":{"date-parts":[[2016,10,2]],"date-time":"2016-10-02T00:00:00Z","timestamp":1475366400000},"content-version":"vor","delay-in-days":3041,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/2.0\/uk\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2008,8,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: Modern machine learning methods based on matrix decomposition techniques, like independent component analysis (ICA) or non-negative matrix factorization (NMF), provide new and efficient analysis tools which are currently explored to analyze gene expression profiles. These exploratory feature extraction techniques yield expression modes (ICA) or metagenes (NMF). These extracted features are considered indicative of underlying regulatory processes. They can as well be applied to the classification of gene expression datasets by grouping samples into different categories for diagnostic purposes or group genes into functional categories for further investigation of related metabolic pathways and regulatory networks.<\/jats:p><jats:p>Results: In this study we focus on unsupervised matrix factorization techniques and apply ICA and sparse NMF to microarray datasets. The latter monitor the gene expression levels of human peripheral blood cells during differentiation from monocytes to macrophages. We show that these tools are able to identify relevant signatures in the deduced component matrices and extract informative sets of marker genes from these gene expression profiles. The methods rely on the joint discriminative power of a set of marker genes rather than on single marker genes. 
With these sets of marker genes, corroborated by leave-one-out or random forest cross-validation, the datasets could easily be classified into related diagnostic categories. The latter correspond to either monocytes versus macrophages or healthy vs Niemann Pick C disease patients.<\/jats:p><jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p><jats:p>Contact: \u00a0elmar.lang@biologie.uni-regensburg.de<\/jats:p>","DOI":"10.1093\/bioinformatics\/btn245","type":"journal-article","created":{"date-parts":[[2008,6,6]],"date-time":"2008-06-06T00:45:31Z","timestamp":1212713131000},"page":"1688-1697","source":"Crossref","is-referenced-by-count":33,"title":["Knowledge-based gene expression classification via matrix factorization"],"prefix":"10.1093","volume":"24","author":[{"given":"R.","family":"Schachtner","sequence":"first","affiliation":[{"name":"1 CIML\/Biophysics, University of Regensburg, D-93040 Regensburg, 2CMB\/IBI, GSF Munich, 3Clinical Chemistry, University Hospital Regensburg, D-93042 Regensburg, Germany, 4IEETA\/DETI, Universidade de Aveiro, 3810-193 Aveiro, Portugal, 5Siemens Corporate Technology, Siemens AG, Munich, Germany and 6DATSI\/FI, Universidad Polit'ecnica de Madrid, E-18500 Madrid, Spain"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"D.","family":"Lutter","sequence":"additional","affiliation":[{"name":"1 CIML\/Biophysics, University of Regensburg, D-93040 Regensburg, 2CMB\/IBI, GSF Munich, 3Clinical Chemistry, University Hospital Regensburg, D-93042 Regensburg, Germany, 4IEETA\/DETI, Universidade de Aveiro, 3810-193 Aveiro, Portugal, 5Siemens Corporate Technology, Siemens AG, Munich, Germany and 6DATSI\/FI, Universidad Polit'ecnica de Madrid, E-18500 Madrid, Spain"},{"name":"1 CIML\/Biophysics, University of Regensburg, D-93040 Regensburg, 2CMB\/IBI, GSF Munich, 3Clinical Chemistry, University Hospital Regensburg, D-93042 Regensburg, Germany, 4IEETA\/DETI, Universidade de 
Aveiro, 3810-193 Aveiro, Portugal, 5Siemens Corporate Technology, Siemens AG, Munich, Germany and 6DATSI\/FI, Universidad Polit'ecnica de Madrid, E-18500 Madrid, Spain"},{"name":"1 CIML\/Biophysics, University of Regensburg, D-93040 Regensburg, 2CMB\/IBI, GSF Munich, 3Clinical Chemistry, University Hospital Regensburg, D-93042 Regensburg, Germany, 4IEETA\/DETI, Universidade de Aveiro, 3810-193 Aveiro, Portugal, 5Siemens Corporate Technology, Siemens AG, Munich, Germany and 6DATSI\/FI, Universidad Polit'ecnica de Madrid, E-18500 Madrid, Spain"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"P.","family":"Knollm\u00fcller","sequence":"additional","affiliation":[{"name":"1 CIML\/Biophysics, University of Regensburg, D-93040 Regensburg, 2CMB\/IBI, GSF Munich, 3Clinical Chemistry, University Hospital Regensburg, D-93042 Regensburg, Germany, 4IEETA\/DETI, Universidade de Aveiro, 3810-193 Aveiro, Portugal, 5Siemens Corporate Technology, Siemens AG, Munich, Germany and 6DATSI\/FI, Universidad Polit'ecnica de Madrid, E-18500 Madrid, Spain"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"A. M.","family":"Tom\u00e9","sequence":"additional","affiliation":[{"name":"1 CIML\/Biophysics, University of Regensburg, D-93040 Regensburg, 2CMB\/IBI, GSF Munich, 3Clinical Chemistry, University Hospital Regensburg, D-93042 Regensburg, Germany, 4IEETA\/DETI, Universidade de Aveiro, 3810-193 Aveiro, Portugal, 5Siemens Corporate Technology, Siemens AG, Munich, Germany and 6DATSI\/FI, Universidad Polit'ecnica de Madrid, E-18500 Madrid, Spain"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"F. 
J.","family":"Theis","sequence":"additional","affiliation":[{"name":"1 CIML\/Biophysics, University of Regensburg, D-93040 Regensburg, 2CMB\/IBI, GSF Munich, 3Clinical Chemistry, University Hospital Regensburg, D-93042 Regensburg, Germany, 4IEETA\/DETI, Universidade de Aveiro, 3810-193 Aveiro, Portugal, 5Siemens Corporate Technology, Siemens AG, Munich, Germany and 6DATSI\/FI, Universidad Polit'ecnica de Madrid, E-18500 Madrid, Spain"},{"name":"1 CIML\/Biophysics, University of Regensburg, D-93040 Regensburg, 2CMB\/IBI, GSF Munich, 3Clinical Chemistry, University Hospital Regensburg, D-93042 Regensburg, Germany, 4IEETA\/DETI, Universidade de Aveiro, 3810-193 Aveiro, Portugal, 5Siemens Corporate Technology, Siemens AG, Munich, Germany and 6DATSI\/FI, Universidad Polit'ecnica de Madrid, E-18500 Madrid, Spain"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"G.","family":"Schmitz","sequence":"additional","affiliation":[{"name":"1 CIML\/Biophysics, University of Regensburg, D-93040 Regensburg, 2CMB\/IBI, GSF Munich, 3Clinical Chemistry, University Hospital Regensburg, D-93042 Regensburg, Germany, 4IEETA\/DETI, Universidade de Aveiro, 3810-193 Aveiro, Portugal, 5Siemens Corporate Technology, Siemens AG, Munich, Germany and 6DATSI\/FI, Universidad Polit'ecnica de Madrid, E-18500 Madrid, Spain"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"M.","family":"Stetter","sequence":"additional","affiliation":[{"name":"1 CIML\/Biophysics, University of Regensburg, D-93040 Regensburg, 2CMB\/IBI, GSF Munich, 3Clinical Chemistry, University Hospital Regensburg, D-93042 Regensburg, Germany, 4IEETA\/DETI, Universidade de Aveiro, 3810-193 Aveiro, Portugal, 5Siemens Corporate Technology, Siemens AG, Munich, Germany and 6DATSI\/FI, Universidad Polit'ecnica de Madrid, E-18500 Madrid, Spain"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"P. 
G\u00f3mez","family":"Vilda","sequence":"additional","affiliation":[{"name":"1 CIML\/Biophysics, University of Regensburg, D-93040 Regensburg, 2CMB\/IBI, GSF Munich, 3Clinical Chemistry, University Hospital Regensburg, D-93042 Regensburg, Germany, 4IEETA\/DETI, Universidade de Aveiro, 3810-193 Aveiro, Portugal, 5Siemens Corporate Technology, Siemens AG, Munich, Germany and 6DATSI\/FI, Universidad Polit'ecnica de Madrid, E-18500 Madrid, Spain"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"E. W.","family":"Lang","sequence":"additional","affiliation":[{"name":"1 CIML\/Biophysics, University of Regensburg, D-93040 Regensburg, 2CMB\/IBI, GSF Munich, 3Clinical Chemistry, University Hospital Regensburg, D-93042 Regensburg, Germany, 4IEETA\/DETI, Universidade de Aveiro, 3810-193 Aveiro, Portugal, 5Siemens Corporate Technology, Siemens AG, Munich, Germany and 6DATSI\/FI, Universidad Polit'ecnica de Madrid, E-18500 Madrid, Spain"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2008,6,5]]},"reference":[{"key":"2023020210490766700_B1","volume-title":"Affymetrix Microarray Suite User Guide","author":"Affymetrix","year":"2002"},{"key":"2023020210490766700_B2","doi-asserted-by":"crossref","first-page":"55","DOI":"10.1038\/nrg1749","article-title":"Microarray data analysis: from disarray to consolidation and consensus","volume":"7","author":"Allison","year":"2006","journal-title":"Nat. Rev. Genet."},{"key":"2023020210490766700_B3","doi-asserted-by":"crossref","DOI":"10.1017\/CBO9780511541773","volume-title":"DNA Microarrays and Gene Expression","author":"Baldi","year":"2002"},{"key":"2023020210490766700_B4","doi-asserted-by":"crossref","first-page":"389","DOI":"10.1023\/A:1012487302797","article-title":"Gene selection for cancer classification using support vector machines","volume":"46","author":"Barnhill","year":"2002","journal-title":"Mach. 
Learn"},{"key":"2023020210490766700_B5","doi-asserted-by":"crossref","first-page":"185","DOI":"10.1093\/bioinformatics\/19.2.185","article-title":"A comparison of normalization methods for high density oligonucleotide array data based on bias and variance","volume":"19","author":"Bolstad","year":"2003","journal-title":"Bioinformatics"},{"key":"2023020210490766700_B6","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1023\/A:1010933404324","article-title":"Random forests","volume":"45","author":"Breiman","year":"2001","journal-title":"Mach. Learn"},{"key":"2023020210490766700_B7","first-page":"362","article-title":"Blind beamforming for non-Gaussian signals","volume":"F140","author":"Cardoso","year":"1993","journal-title":"IEEE Proc"},{"key":"2023020210490766700_B8","doi-asserted-by":"crossref","first-page":"161","DOI":"10.1137\/S0895479893259546","article-title":"Jacobi angles for simultaneous diagonalization","volume":"17","author":"Cardoso","year":"1996","journal-title":"SIAM J. Math. Anal. 
Appl"},{"key":"2023020210490766700_B9","doi-asserted-by":"crossref","first-page":"321","DOI":"10.1093\/bioinformatics\/btl609","article-title":"A distribution free summarization method for affymetrix genechip arrays","volume":"23","author":"Chen","year":"2007","journal-title":"Bioinformatics"},{"key":"2023020210490766700_B10","doi-asserted-by":"crossref","first-page":"328","DOI":"10.1186\/1471-2105-8-328","article-title":"Genesrf and varselrf: a web-based tool and r package for gene selection and classification using random forest","volume":"8","author":"Diaz-Uriarte","year":"2007","journal-title":"BMC Bioinformatics"},{"key":"2023020210490766700_B11","doi-asserted-by":"crossref","first-page":"3","DOI":"10.1186\/1471-2105-7-3","article-title":"Gene selection and classification of microarray data using random forest","volume":"7","author":"Diaz-Uriarte","year":"2006","journal-title":"BMC Bioinformatics"},{"key":"2023020210490766700_B12","doi-asserted-by":"crossref","first-page":"107","DOI":"10.1109\/MSP.2005.1407722","article-title":"Genomic signal processing: diagnosis and therapy","volume":"22","author":"Dougherty","year":"2005","journal-title":"IEEE Signal Proc. Mag"},{"key":"2023020210490766700_B13","doi-asserted-by":"crossref","first-page":"46","DOI":"10.1109\/MSP.2005.1550189","article-title":"Research issues in genomic signal processing","volume":"Nov","author":"Dougherty","year":"2005","journal-title":"IEEE Signal Proc. Mag"},{"key":"2023020210490766700_B14","doi-asserted-by":"crossref","first-page":"77","DOI":"10.1198\/016214502753479248","article-title":"Comparison of discrimination methods for classification of tumors using gene expression data","volume":"97","author":"Dudoit","year":"2002","journal-title":"J. Am. Stat. Assoc"},{"key":"2023020210490766700_B15","first-page":"135","article-title":"Co-relations and their measurement, chiefly from anthropometric data","volume":"45","author":"Galton","year":"1888","journal-title":"Proc. R. 
Soc"},{"key":"2023020210490766700_B16","first-page":"238","article-title":"Co-relations and their measurement, chiefly from anthropometric data","volume":"39","author":"Galton","year":"1889","journal-title":"Nature"},{"key":"2023020210490766700_B17","doi-asserted-by":"crossref","DOI":"10.1126\/science.286.5439.531","article-title":"Molecular classification of cancer: class discovery and class prediction by gene expression monitoring","volume":"286","author":"Golub","year":"1999","journal-title":"Science"},{"key":"2023020210490766700_B18","first-page":"1157","article-title":"An introduction to variable and feature selection","volume":"3","author":"Guyon","year":"2003","journal-title":"J. Mach. Learn. Res"},{"key":"2023020210490766700_B19","doi-asserted-by":"crossref","first-page":"943","DOI":"10.1093\/bioinformatics\/btl033","article-title":"A new summarization method for affymetrix probe level data","volume":"22","author":"Hochreiter","year":"2006","journal-title":"Bioinformatics"},{"key":"2023020210490766700_B20","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1093\/nar\/gng015","article-title":"Summaries of affymetrix genechip probe level data","volume":"31","author":"Irizarry","year":"2003","journal-title":"Nucleic Acids Res"},{"key":"2023020210490766700_B21","doi-asserted-by":"crossref","first-page":"R76.1","DOI":"10.1186\/gb-2003-4-11-r76","article-title":"Application of independent component analysis to microarrays","volume":"4","author":"Lee","year":"2003","journal-title":"Genome Biol"},{"key":"2023020210490766700_B22","article-title":"Learning spatially localized, parts-based representation. In","volume":"vol. 
1","author":"Li","year":"2001"},{"key":"2023020210490766700_B23","doi-asserted-by":"crossref","first-page":"51","DOI":"10.1093\/bioinformatics\/18.1.51","article-title":"Linear modes of gene expression determined by independent component analysis","volume":"18","author":"Liebermeister","year":"2002","journal-title":"Bioinformatics"},{"key":"2023020210490766700_B24","doi-asserted-by":"crossref","first-page":"155","DOI":"10.1155\/JBB.2005.155","article-title":"Gene expression data classification with kernel principal component analysis","volume":"2","author":"Liu","year":"2005","journal-title":"J. Biomed. Biotechnol"},{"key":"2023020210490766700_B25","doi-asserted-by":"crossref","DOI":"10.1186\/1471-2105-9-100","article-title":"Analysing M-CSF dependent monocyte\/macrophage differentiation and meta-clustering with independent component analysis derived expression modes","author":"Lutter","year":"2008","journal-title":"BMC Bioinformatics"},{"key":"2023020210490766700_B26","first-page":"161","article-title":"Lagrangian support vector machines","volume":"1","author":"Mangasarian","year":"2001","journal-title":"J. Mach. Learn. Res"},{"key":"2023020210490766700_B27","doi-asserted-by":"crossref","first-page":"559","DOI":"10.1080\/14786440109462720","article-title":"On lines and planes of closest fit to points in space","volume":"2","author":"Pearson","year":"1901","journal-title":"Phil. 
Mag"},{"key":"2023020210490766700_B28","first-page":"418","article-title":"Computational analysis of microarray data","volume":"2","author":"Quackenbush","year":"2001","journal-title":"Nature"},{"key":"2023020210490766700_B29","doi-asserted-by":"crossref","first-page":"6677","DOI":"10.1038\/sj.onc.1207562","article-title":"Independent component analysis of microarray data in the study of endometrial cancer","volume":"23","author":"Saidi","year":"2004","journal-title":"Oncogene"},{"key":"2023020210490766700_B30","article-title":"Blind matrix decomposition techniques to identify marker genes from microarrays. In","volume-title":"Lecture Notes in Computer Science","author":"Schachtner","year":"2007"},{"key":"2023020210490766700_B31","first-page":"4617","article-title":"Routes to identify marker genes for microarray classification. In","author":"Schachtner","year":"2007"},{"key":"2023020210490766700_B32","author":"Sch\u00f6lkopf","year":"2002","journal-title":"Learning with Kernels"},{"key":"2023020210490766700_B33","doi-asserted-by":"crossref","first-page":"31","DOI":"10.1145\/980972.980978","article-title":"Supervised analysis when the number of candidate features (p) greatly exceeds the number of cases (n)","volume":"5","author":"Simon","year":"2003","journal-title":"SIGKDD Explor"},{"key":"2023020210490766700_B34","first-page":"33","article-title":"Prediction and uncertainty in the analysis of gene expression profiles","volume":"2","author":"Spang","year":"2002","journal-title":"In Silico Biol"},{"key":"2023020210490766700_B35","doi-asserted-by":"crossref","first-page":"2897","DOI":"10.1093\/bioinformatics\/btm478","article-title":"I\/NI-calls for the exclusion of non-informative genes: a highly effective filtering tool for microarray data","volume":"23","author":"Talloen","year":"2007","journal-title":"Bioinformatics"},{"key":"2023020210490766700_B36","doi-asserted-by":"crossref","first-page":"520","DOI":"10.1093\/bioinformatics\/17.6.520","article-title":"Missing 
value estimation methods for DNA microarrays","volume":"17","author":"Troyanskaya","year":"2001","journal-title":"Bioinformatics"},{"key":"2023020210490766700_B37","doi-asserted-by":"crossref","first-page":"5116","DOI":"10.1073\/pnas.091062498","article-title":"Significance analysis of microarrays applied to the ionizing radiation response","volume":"98","author":"Tusher","year":"2001","journal-title":"PNAS"},{"key":"2023020210490766700_B38","doi-asserted-by":"crossref","first-page":"333","DOI":"10.1214\/07-AOAS116","article-title":"A statistical framework for the analysis of microarray probe-level data","volume":"1","author":"Wu","year":"2007","journal-title":"Ann. Appl. Stat"},{"key":"2023020210490766700_B39","doi-asserted-by":"crossref","first-page":"909","DOI":"10.1198\/016214504000000683","article-title":"A model-based background adjustment for oligonucleotide expression arrays","volume":"99","author":"Wu","year":"2004","journal-title":"J. Am. Stat. Assoc"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/24\/15\/1688\/49049345\/bioinformatics_24_15_1688.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/24\/15\/1688\/49049345\/bioinformatics_24_15_1688.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,2,27]],"date-time":"2024-02-27T04:31:02Z","timestamp":1709008262000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/24\/15\/1688\/264436"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2008,6,5]]},"references-count":39,"journal-issue":{"issue":"15","published-print":{"date-parts":[[2008,8,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btn245","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[
{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2008,8,1]]},"published":{"date-parts":[[2008,6,5]]}}}