{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,2]],"date-time":"2025-11-02T16:27:56Z","timestamp":1762100876079},"reference-count":36,"publisher":"Springer Science and Business Media LLC","issue":"1","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2008,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Background<\/jats:title><jats:p>Many statistical methods have been proposed to identify disease biomarkers from gene expression profiles. However, from gene expression profile data alone, statistical methods often fail to identify biologically meaningful biomarkers related to a specific disease under study. In this paper, we develop a novel strategy, namely knowledge-guided multi-scale independent component analysis (ICA), to first infer regulatory signals and then identify biologically relevant biomarkers from microarray data.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>Since gene expression levels reflect the joint effect of several underlying biological functions, disease-specific biomarkers may be involved in several distinct biological functions. To identify disease-specific biomarkers that provide unique mechanistic insights, a meta-data \"knowledge gene pool\" (KGP) is first constructed from multiple data sources to provide important information on the likely functions (such as gene ontology information) and regulatory events (such as promoter responsive elements) associated with potential genes of interest. The gene expression and biological meta data associated with the members of the KGP can then be used to guide subsequent analysis. ICA is then applied to multi-scale gene clusters to reveal regulatory modes reflecting the underlying biological mechanisms. Finally disease-specific biomarkers are extracted by their weighted connectivity scores associated with the extracted regulatory modes. A statistical significance test is used to evaluate the significance of transcription factor enrichment for the extracted gene set based on motif information. We applied the proposed method to yeast cell cycle microarray data and Rsf-1-induced ovarian cancer microarray data. The results show that our knowledge-guided ICA approach can extract biologically meaningful regulatory modes and outperform several baseline methods for biomarker identification.<\/jats:p><\/jats:sec><jats:sec><jats:title>Conclusion<\/jats:title><jats:p>We have proposed a novel method, namely knowledge-guided multi-scale ICA, to identify disease-specific biomarkers. The goal is to infer knowledge-relevant regulatory signals and then identify corresponding biomarkers through a multi-scale strategy. The approach has been successfully applied to two expression profiling experiments to demonstrate its improved performance in extracting biologically meaningful and disease-related biomarkers. More importantly, the proposed approach shows promising results to infer novel biomarkers for ovarian cancer and extend current knowledge.<\/jats:p><\/jats:sec>","DOI":"10.1186\/1471-2105-9-416","type":"journal-article","created":{"date-parts":[[2008,10,29]],"date-time":"2008-10-29T19:14:10Z","timestamp":1225307650000},"update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":18,"title":["Knowledge-guided multi-scale independent component analysis for biomarker identification"],"prefix":"10.1186","volume":"9","author":[{"given":"Li","family":"Chen","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jianhua","family":"Xuan","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Chen","family":"Wang","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ie-Ming","family":"Shih","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yue","family":"Wang","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Zhen","family":"Zhang","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Eric","family":"Hoffman","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Robert","family":"Clarke","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2008,10,6]]},"reference":[{"key":"2401_CR1","volume-title":"Statistics: The Exploration and Analysis of Data","author":"J Devore","year":"1997","unstructured":"Devore J, Peck R: Statistics: The Exploration and Analysis of Data. CA Duxbury Press; 1997."},{"issue":"9","key":"2401_CR2","doi-asserted-by":"publisher","first-page":"5116","DOI":"10.1073\/pnas.091062498","volume":"98","author":"VG Tusher","year":"2001","unstructured":"Tusher VG, Tibshirani R, Chu G: Significance analysis of microarrays applied to the ionizing radiation response. Proc Natl Acad Sci USA 2001, 98(9):5116\u20135121. 10.1073\/pnas.091062498","journal-title":"Proc Natl Acad Sci USA"},{"issue":"36","key":"2401_CR3","doi-asserted-by":"publisher","first-page":"12837","DOI":"10.1073\/pnas.0504609102","volume":"102","author":"JD Storey","year":"2005","unstructured":"Storey JD, Xiao W, Leek JT, Tompkins RG, Davis RW: Significance analysis of time course microarray experiments. Proc Natl Acad Sci USA 2005, 102(36):12837\u201312842. 10.1073\/pnas.0504609102","journal-title":"Proc Natl Acad Sci USA"},{"issue":"9","key":"2401_CR4","doi-asserted-by":"publisher","first-page":"1096","DOI":"10.1093\/bioinformatics\/btl056","volume":"22","author":"A Conesa","year":"2006","unstructured":"Conesa A, Nueda MJ, Ferrer A, Talon M: maSigPro: a method to identify significantly differential expression profiles in time-course microarray experiments. Bioinformatics 2006, 22(9):1096\u20131102. 10.1093\/bioinformatics\/btl056","journal-title":"Bioinformatics"},{"key":"2401_CR5","doi-asserted-by":"publisher","first-page":"100","DOI":"10.2307\/2346830","volume":"28","author":"JA Hartigan","year":"1978","unstructured":"Hartigan JA, Wong MA: A K-means clustering algorithm. App Statist 1978, 28: 100\u2013108. 10.2307\/2346830","journal-title":"App Statist"},{"key":"2401_CR6","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-97966-8","volume-title":"Self-Organizing Maps","author":"T Kohonen","year":"1997","unstructured":"Kohonen T: Self-Organizing Maps. NY: Springer; 1997."},{"issue":"1","key":"2401_CR7","doi-asserted-by":"publisher","first-page":"37","DOI":"10.1038\/nrc2294","volume":"8","author":"R Clarke","year":"2008","unstructured":"Clarke R, Ressom HW, Wang A, Xuan J, Liu MC, Gehan EA, Wang Y: The properties of high-dimensional data spaces: implications for exploring gene and protein expression data. Nat Rev Cancer 2008, 8(1):37\u201349. 10.1038\/nrc2294","journal-title":"Nat Rev Cancer"},{"issue":"4","key":"2401_CR8","doi-asserted-by":"publisher","first-page":"382","DOI":"10.1038\/ng1532","volume":"37","author":"K Basso","year":"2005","unstructured":"Basso K, Margolin AA, Stolovitzky G, Klein U, Dalla-Favera R, Califano A: Reverse engineering of regulatory networks in human B cells. Nat Genet 2005, 37(4):382\u2013390. 10.1038\/ng1532","journal-title":"Nat Genet"},{"issue":"2","key":"2401_CR9","doi-asserted-by":"publisher","first-page":"166","DOI":"10.1038\/ng1165","volume":"34","author":"E Segal","year":"2003","unstructured":"Segal E, Shapira M, Regev A, Pe'er D, Botstein D, Koller D, Friedman N: Module networks: identifying regulatory modules and their condition-specific regulators from gene expression data. Nat Genet 2003, 34(2):166\u2013176. 10.1038\/ng1165","journal-title":"Nat Genet"},{"issue":"1","key":"2401_CR10","doi-asserted-by":"publisher","first-page":"51","DOI":"10.1093\/bioinformatics\/18.1.51","volume":"18","author":"W Liebermeister","year":"2002","unstructured":"Liebermeister W: Linear modes of gene expression determined by independent component analysis. Bioinformatics 2002, 18(1):51\u201360. 10.1093\/bioinformatics\/18.1.51","journal-title":"Bioinformatics"},{"key":"2401_CR11","first-page":"332","volume-title":"Blind gene classification on ICA of microarray data","author":"G Hori","year":"2001","unstructured":"Hori G, Inoue M, Nishimura S, Nakahara H: Blind gene classification on ICA of microarray data. ICA: 2001; San Diego, CA; 2001:332\u2013336."},{"issue":"11","key":"2401_CR12","doi-asserted-by":"publisher","first-page":"R76","DOI":"10.1186\/gb-2003-4-11-r76","volume":"4","author":"SI Lee","year":"2003","unstructured":"Lee SI, Batzoglou S: Application of independent component analysis to microarrays. Genome Biol 2003, 4(11):R76. 10.1186\/gb-2003-4-11-r76","journal-title":"Genome Biol"},{"issue":"39","key":"2401_CR13","doi-asserted-by":"publisher","first-page":"6677","DOI":"10.1038\/sj.onc.1207562","volume":"23","author":"SA Saidi","year":"2004","unstructured":"Saidi SA, Holland CM, Kreil DP, MacKay DJ, Charnock-Jones DS, Print CG, Smith SK: Independent component analysis of microarray data in the study of endometrial cancer. Oncogene 2004, 23(39):6677\u20136683. 10.1038\/sj.onc.1207562","journal-title":"Oncogene"},{"key":"2401_CR14","doi-asserted-by":"publisher","DOI":"10.1002\/0471221317","volume-title":"Independent Component Analysis","author":"A Hyvarinen","year":"2001","unstructured":"Hyvarinen A, Karhunen J, Oja E: Independent Component Analysis. John Wiley & Sons; 2001."},{"key":"2401_CR15","doi-asserted-by":"crossref","first-page":"349","DOI":"10.1177\/117762500700100023","volume":"1","author":"T Gong","year":"2007","unstructured":"Gong T, Xuan J, Wang C, Li H, Hoffman E, Clarke R, Wang Y: Gene module identification from microarray data using nonnegative independent component analysis. Gene Regulation and Systems Biology 2007, 1: 349\u2013363.","journal-title":"Gene Regulation and Systems Biology"},{"key":"2401_CR16","volume-title":"The International Conference on Bioinformatics & Computational Biology: 2007","author":"C Wang","year":"2007","unstructured":"Wang C, Xuan J, Gong T, Clarke R, Hoffman E, Wang Y: Stability based dimension estimation of ICA with application to microarray data analysis. The International Conference on Bioinformatics & Computational Biology: 2007 2007."},{"issue":"26","key":"2401_CR17","doi-asserted-by":"publisher","first-page":"15522","DOI":"10.1073\/pnas.2136632100","volume":"100","author":"JC Liao","year":"2003","unstructured":"Liao JC, Boscolo R, Yang YL, Tran LM, Sabatti C, Roychowdhury VP: Network component analysis: reconstruction of regulatory signals in biological systems. Proc Natl Acad Sci USA 2003, 100(26):15522\u201315527. 10.1073\/pnas.2136632100","journal-title":"Proc Natl Acad Sci USA"},{"issue":"6","key":"2401_CR18","doi-asserted-by":"publisher","first-page":"3339","DOI":"10.1073\/pnas.0630591100","volume":"100","author":"EM Conlon","year":"2003","unstructured":"Conlon EM, Liu XS, Lieb JD, Liu JS: Integrating regulatory motif discovery and genome-wide expression analysis. Proc Natl Acad Sci USA 2003, 100(6):3339\u20133344. 10.1073\/pnas.0630591100","journal-title":"Proc Natl Acad Sci USA"},{"issue":"16","key":"2401_CR19","doi-asserted-by":"publisher","first-page":"2005","DOI":"10.1093\/bioinformatics\/btl343","volume":"22","author":"JG Joung","year":"2006","unstructured":"Joung JG, Shin D, Seong RH, Zhang BT: Identification of regulatory modules by co-clustering latent variable models: stem cell differentiation. Bioinformatics 2006, 22(16):2005\u20132011. 10.1093\/bioinformatics\/btl343","journal-title":"Bioinformatics"},{"key":"2401_CR20","volume-title":"Sixth International Conference on Bioinformatics: 2007; Hong Kong, China","author":"C Wang","year":"2007","unstructured":"Wang C, Chen L, Zhao P, Hoffman E, Wang Y, Clarke R, Xuan J: Motifdirected network component analysis for regulatory network inference. Sixth International Conference on Bioinformatics: 2007; Hong Kong, China 2007."},{"key":"2401_CR21","doi-asserted-by":"publisher","first-page":"1483","DOI":"10.1162\/neco.1997.9.7.1483","volume":"9","author":"A Hyvarinen","year":"1997","unstructured":"Hyvarinen A, E O: A fast fixed-point algorithm for independent component analysis. Neural Compuatation 1997, 9: 1483\u20131492. 10.1162\/neco.1997.9.7.1483","journal-title":"Neural Compuatation"},{"key":"2401_CR22","doi-asserted-by":"publisher","first-page":"290","DOI":"10.1186\/1471-2105-7-290","volume":"7","author":"A Frigyesi","year":"2006","unstructured":"Frigyesi A, Veerla S, Lindgren D, Hoglund M: Independent component analysis reveals new and biologically significant structures in micro array data. BMC Bioinformatics 2006, 7: 290. 10.1186\/1471-2105-7-290","journal-title":"BMC Bioinformatics"},{"key":"2401_CR23","doi-asserted-by":"crossref","unstructured":"Matys V, Kel-Margoulis OV, Fricke E, Liebich I, Land S, Barre-Dirrie A, Reuter I, Chekmenev D, Krull M, Hornischer K, et al.: TRANSFAC and its module TRANSCompel: transcriptional gene regulation in eukaryotes. Nucleic Acids Res 2006, (34 Database):D108\u2013110. 10.1093\/nar\/gkj143","DOI":"10.1093\/nar\/gkj143"},{"issue":"1","key":"2401_CR24","doi-asserted-by":"publisher","first-page":"51","DOI":"10.1093\/nar\/gkg129","volume":"31","author":"D Karolchik","year":"2003","unstructured":"Karolchik D, Baertsch R, Diekhans M, Furey TS, Hinrichs A, Lu YT, Roskin KM, Schwartz M, Sugnet CW, Thomas DJ, et al.: The UCSC Genome Browser Database. Nucleic Acids Res 2003, 31(1):51\u201354. 10.1093\/nar\/gkg129","journal-title":"Nucleic Acids Res"},{"issue":"13","key":"2401_CR25","doi-asserted-by":"publisher","first-page":"3576","DOI":"10.1093\/nar\/gkg585","volume":"31","author":"AE Kel","year":"2003","unstructured":"Kel AE, Gossling E, Reuter I, Cheremushkin E, Kel-Margoulis OV, Wingender E: MATCH: A tool for searching transcription factor binding sites in DNA sequences. Nucleic Acids Res 2003, 31(13):3576\u20133579. 10.1093\/nar\/gkg585","journal-title":"Nucleic Acids Res"},{"key":"2401_CR26","volume-title":"Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations","author":"I Witten","year":"2000","unstructured":"Witten I, Frank E: Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations. Morgan Kaufmann; 2000."},{"issue":"12","key":"2401_CR27","doi-asserted-by":"publisher","first-page":"3273","DOI":"10.1091\/mbc.9.12.3273","volume":"9","author":"PT Spellman","year":"1998","unstructured":"Spellman PT, Sherlock G, Zhang MQ, Iyer VR, Anders K, Eisen MB, Brown PO, Botstein D, Futcher B: Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization. Mol Biol Cell 1998, 9(12):3273\u20133297.","journal-title":"Mol Biol Cell"},{"issue":"39","key":"2401_CR28","doi-asserted-by":"publisher","first-page":"14004","DOI":"10.1073\/pnas.0504195102","volume":"102","author":"M Shih Ie","year":"2005","unstructured":"Shih Ie M, Sheu JJ, Santillan A, Nakayama K, Yen MJ, Bristow RE, Vang R, Parmigiani G, Kurman RJ, Trope CG, et al.: Amplification of a chromatin remodeling gene, Rsf-1\/HBXAP, in ovarian carcinoma. Proc Natl Acad Sci USA 2005, 102(39):14004\u201314009. 10.1073\/pnas.0504195102","journal-title":"Proc Natl Acad Sci USA"},{"key":"2401_CR29","volume-title":"Guide to Probe Logarithmic Intensity Error (PLIER) Estimation","author":"Affymetrix","year":"2005","unstructured":"Affymetrix: Guide to Probe Logarithmic Intensity Error (PLIER) Estimation. Edited by: . Affymetrix I Santa Clara, CA; 2005."},{"issue":"1","key":"2401_CR30","doi-asserted-by":"publisher","first-page":"133","DOI":"10.1016\/j.yexcr.2004.04.019","volume":"298","author":"JY Huang","year":"2004","unstructured":"Huang JY, Shen BJ, Tsai WH, Lee SC: Functional interaction between nuclear matrix-associated HBXAP and NF-kappaB. Exp Cell Res 2004, 298(1):133\u2013143. 10.1016\/j.yexcr.2004.04.019","journal-title":"Exp Cell Res"},{"key":"2401_CR31","doi-asserted-by":"publisher","first-page":"2449","DOI":"10.1016\/j.ejca.2005.08.008","volume":"41","author":"M-L Karin","year":"2005","unstructured":"Karin M-L: The Fos family of transcription factors and their role in tumourigenesis, European journal of cancer. European journal of cancer 2005, 41: 2449\u20132461. 10.1016\/j.ejca.2005.08.008","journal-title":"European journal of cancer"},{"issue":"43","key":"2401_CR32","doi-asserted-by":"publisher","first-page":"33718","DOI":"10.1074\/jbc.M003555200","volume":"275","author":"SC Sharma","year":"2000","unstructured":"Sharma SC, Richards JS: Regulation of AP1 (Jun\/Fos) factor expression and activation in ovarian granulosa cells. Relation of JunD and Fra2 to terminal differentiation. J Biol Chem 2000, 275(43):33718\u201333728. 10.1074\/jbc.M003555200","journal-title":"J Biol Chem"},{"issue":"5","key":"2401_CR33","doi-asserted-by":"publisher","first-page":"2769","DOI":"10.4049\/jimmunol.164.5.2769","volume":"164","author":"LF Lee","year":"2000","unstructured":"Lee LF, Hellendall RP, Wang Y, Haskill JS, Mukaida N, Matsushima K, Ting JP: IL-8 reduced tumorigenicity of human ovarian cancer in vivo due to neutrophil infiltration. J Immunol 2000, 164(5):2769\u20132775.","journal-title":"J Immunol"},{"key":"2401_CR34","volume-title":"Ovarian cancer angiogenesis, biology and therapy","author":"L Xu","year":"2000","unstructured":"Xu L: Ovarian cancer angiogenesis, biology and therapy. University of Texas; 2000."},{"issue":"1","key":"2401_CR35","doi-asserted-by":"publisher","first-page":"107","DOI":"10.1210\/mend.12.1.0049","volume":"12","author":"P Topilko","year":"1998","unstructured":"Topilko P, Schneider-Maunoury S, Levi G, Trembleau A, Gourdji D, Driancourt MA, Rao CV, Charnay P: Multiple pituitary and ovarian defects in Krox-24 (NGFI-A, Egr-1)-targeted mice. Mol Endocrinol 1998, 12(1):107\u2013122. 10.1210\/me.12.1.107","journal-title":"Mol Endocrinol"},{"issue":"1","key":"2401_CR36","doi-asserted-by":"crossref","first-page":"6","DOI":"10.1158\/0008-5472.6.65.1","volume":"65","author":"R Hayami","year":"2005","unstructured":"Hayami R, Sato K, Wu W, Nishikawa T, Hiroi J, Ohtani-Kaneko R, Fukuda M, Ohta T: Down-regulation of BRCA1-BARD1 ubiquitin ligase by CDK2. Cancer Res 2005, 65(1):6\u201310.","journal-title":"Cancer Res"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-9-416.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,5,20]],"date-time":"2023-05-20T20:34:16Z","timestamp":1684614856000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-9-416"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2008,10,6]]},"references-count":36,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2008,12]]}},"alternative-id":["2401"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-9-416","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2008,10,6]]},"assertion":[{"value":"23 May 2008","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"6 October 2008","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"6 October 2008","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"416"}}