{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,13]],"date-time":"2026-01-13T09:32:33Z","timestamp":1768296753790,"version":"3.49.0"},"reference-count":23,"publisher":"Springer Science and Business Media LLC","issue":"1","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2009,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>Joint analysis of transcriptomic and proteomic data taken from the same samples has the potential to elucidate complex biological mechanisms. Most current methods that integrate these datasets allow for the computation of the correlation between a gene and protein but only after a one-to-one matching of genes and proteins is done. However, genes and proteins are connected via biological pathways and their relationship is not necessarily one-to-one. In this paper, we investigate the use of Correlated Factor Analysis (CFA) for modeling the correlation of genome-scale gene and protein data. Unlike existing approaches, CFA considers all possible gene-protein pairs and utilizes all gene and protein information in its modeling framework. The Generalized Singular Value Decomposition (gSVD) is another method which takes into account all available transcriptomic and proteomic data. Comparison is made between CFA and gSVD.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Results<\/jats:title>\n            <jats:p>Our simulation study indicates that the CFA estimates can consistently capture the dominant patterns of correlation between two sets of measurements; in contrast, the gSVD estimates cannot do that. Applied to real cancer data, the list of co-regulated genes and proteins identified by CFA has biologically meaningful interpretation, where both the gene and protein expressions are pointing to the same processes. Among the GO terms for which the genes and proteins are most correlated, we observed blood vessel morphogenesis and development.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Conclusion<\/jats:title>\n            <jats:p>We demonstrate that CFA is a useful tool for gene-protein data integration and modeling, where the main question is in finding which patterns of gene expression are most correlated with protein expression.<\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/1471-2105-10-272","type":"journal-article","created":{"date-parts":[[2009,9,1]],"date-time":"2009-09-01T18:14:08Z","timestamp":1251828848000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":14,"title":["Correlating gene and protein expression data using Correlated Factor Analysis"],"prefix":"10.1186","volume":"10","author":[{"given":"Chuen Seng","family":"Tan","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Agus","family":"Salim","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Alexander","family":"Ploner","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Janne","family":"Lehti\u00f6","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Kee Seng","family":"Chia","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yudi","family":"Pawitan","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2009,9,1]]},"reference":[{"issue":"2","key":"3002_CR1","doi-asserted-by":"publisher","first-page":"63","DOI":"10.1080\/07388550701334212","volume":"27","author":"L Nie","year":"2007","unstructured":"Nie L, Wu G, Culley DE, Scholten JCM, Zhang W: Integrative analysis of transcriptomic and proteomic data: challenges, solutions and applications. Crit Rev Biotechnol 2007, 27(2):63\u201375. 10.1080\/07388550701334212","journal-title":"Crit Rev Biotechnol"},{"issue":"4","key":"3002_CR2","doi-asserted-by":"publisher","first-page":"261","DOI":"10.1093\/bfgp\/ell019","volume":"5","author":"KM Waters","year":"2006","unstructured":"Waters KM, Pounds JG, Thrall BD: Data merging for integrated microarray and proteomic analysis. Brief Funct Genomic Proteomic 2006, 5(4):261\u2013272. 10.1093\/bfgp\/ell019","journal-title":"Brief Funct Genomic Proteomic"},{"issue":"3","key":"3002_CR3","doi-asserted-by":"publisher","first-page":"303","DOI":"10.1016\/j.ymeth.2004.08.021","volume":"35","author":"B Cox","year":"2005","unstructured":"Cox B, Kislinger T, Emili A: Integrating gene and protein expression data: pattern analysis and profile mining. Methods 2005, 35(3):303\u2013314. 10.1016\/j.ymeth.2004.08.021","journal-title":"Methods"},{"issue":"13","key":"3002_CR4","doi-asserted-by":"publisher","first-page":"1641","DOI":"10.1093\/bioinformatics\/btl134","volume":"22","author":"L Nie","year":"2006","unstructured":"Nie L, Wu G, Brockman FJ, Zhang W: Integrated analysis of transcriptomic and proteomic data of Desulfovibrio vulgaris: zero-inflated Poisson regression models to predict abundance of undetected proteins. Bioinformatics 2006, 22(13):1641\u20131647. 10.1093\/bioinformatics\/btl134","journal-title":"Bioinformatics"},{"issue":"6","key":"3002_CR5","doi-asserted-by":"publisher","first-page":"520","DOI":"10.1093\/bioinformatics\/17.6.520","volume":"17","author":"O Troyanskaya","year":"2001","unstructured":"Troyanskaya O, Cantor M, Sherlock G, Brown P, Hastie T, Tibshirani R, Botstein D, Altman RB: Missing value estimation methods for DNA microarrays. Bioinformatics 2001, 17(6):520\u2013525. 10.1093\/bioinformatics\/17.6.520","journal-title":"Bioinformatics"},{"issue":"3","key":"3002_CR6","doi-asserted-by":"publisher","first-page":"e34","DOI":"10.1093\/nar\/gnh026","volume":"32","author":"TH Bo","year":"2004","unstructured":"Bo TH, Dysvik B, Jonassen I: LSimpute: accurate estimation of missing values in microarray data with least squares methods. Nucleic Acids Res 2004, 32(3):e34. 10.1093\/nar\/gnh026","journal-title":"Nucleic Acids Res"},{"issue":"13","key":"3002_CR7","doi-asserted-by":"publisher","first-page":"2162","DOI":"10.1002\/pmic.200600898","volume":"7","author":"A Fagan","year":"2007","unstructured":"Fagan A, Culhane AC, Higgins DG: A multivariate analysis approach to the integration of proteomic and gene expression data. Proteomics 2007, 7(13):2162\u20132171. 10.1002\/pmic.200600898","journal-title":"Proteomics"},{"key":"3002_CR8","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1198\/108571107X177078","volume":"12","author":"A Salim","year":"2007","unstructured":"Salim A, Pawitan Y: Model-based maximum covariance analysis for irregularly observed climatological data. J Agric Biol Environ Stat 2007, 12: 1\u201324. 10.1198\/108571107X177078","journal-title":"J Agric Biol Environ Stat"},{"key":"3002_CR9","doi-asserted-by":"publisher","first-page":"2","DOI":"10.1109\/TCBB.2006.10","volume":"3","author":"JA Berger","year":"2006","unstructured":"Berger JA, Hautaniemi S, Mitra SK, Astola J: Jointly analyzing gene expression and copy number data in breast cancer using data reduction models. IEEE\/ACM Trans Comput Biol Bioinform 2006, 3: 2\u201316. 10.1109\/TCBB.2006.10","journal-title":"IEEE\/ACM Trans Comput Biol Bioinform"},{"issue":"6","key":"3002_CR10","doi-asserted-by":"publisher","first-page":"3351","DOI":"10.1073\/pnas.0530258100","volume":"100","author":"O Alter","year":"2003","unstructured":"Alter O, Brown PO, Botstein D: Generalized singular value decomposition for comparative analysis of genome-scale expression data sets of two different organisms. Proc Natl Acad Sci USA 2003, 100(6):3351\u20133356. 10.1073\/pnas.0530258100","journal-title":"Proc Natl Acad Sci USA"},{"issue":"3","key":"3002_CR11","doi-asserted-by":"publisher","first-page":"820","DOI":"10.1158\/1535-7163.MCT-06-0650","volume":"6","author":"UT Shankavaram","year":"2007","unstructured":"Shankavaram UT, Reinhold WC, Nishizuka S, Major S, Morita D, Chary KK, Reimers MA, Scherf U, Kahn A, Dolginow D, Cossman J, Kaldjian EP, Scudiero DA, Petricoin E, Liotta L, Lee JK, Weinstein JN: Transcript and protein expression profiles of the NCI-60 cancer cell panel: an integromic microarray study. Mol Cancer Ther 2007, 6(3):820\u2013832. 10.1158\/1535-7163.MCT-06-0650","journal-title":"Mol Cancer Ther"},{"issue":"24","key":"3002_CR12","doi-asserted-by":"publisher","first-page":"14229","DOI":"10.1073\/pnas.2331323100","volume":"100","author":"S Nishizuka","year":"2003","unstructured":"Nishizuka S, Charboneau L, Young L, Major S, Reinhold WC, Waltham M, Kouros-Mehr H, Bussey KJ, Lee JK, Espina V, Munson PJ, Petricoin E, Liotta LA, Weinstein JN: Proteomic profiling of the NCI-60 cancer cell lines using new high-density reverse-phase lysate microarrays. Proc Natl Acad Sci USA 2003, 100(24):14229\u201314234. 10.1073\/pnas.2331323100","journal-title":"Proc Natl Acad Sci USA"},{"key":"3002_CR13","volume-title":"Statistical Analysis in Climate Research","author":"H von Storch","year":"1999","unstructured":"von Storch H, Zwiers FW: Statistical Analysis in Climate Research. Cambridge University Press; Cambridge; 1999."},{"key":"3002_CR14","volume-title":"R: A Language and Environment for Statistical Computing","author":"R Development Core Team","year":"2008","unstructured":"R Development Core Team:R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing; 2008. [http:\/\/www.r-project.org]"},{"key":"3002_CR15","doi-asserted-by":"publisher","first-page":"25","DOI":"10.1038\/75556","volume":"25","author":"M Ashburner","year":"2000","unstructured":"Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, Harris MA, Hill DP, Issel-Tarver L, Kasarskis A, Lewis S, Matese JC, Richardson JE, Ringwald M, Rubin GM, Sherlock G: Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet 2000, 25: 25\u201329. 10.1038\/75556","journal-title":"Nat Genet"},{"key":"3002_CR16","doi-asserted-by":"publisher","first-page":"9440","DOI":"10.1073\/pnas.1530509100","volume":"100","author":"J Storey","year":"2003","unstructured":"Storey J, Tibshirani R: Statistical significance for genome-wide studies. PNAS 2003, 100: 9440\u20135. 10.1073\/pnas.1530509100","journal-title":"PNAS"},{"issue":"3","key":"3002_CR17","doi-asserted-by":"publisher","first-page":"398","DOI":"10.1137\/0718026","volume":"18","author":"C Paige","year":"1981","unstructured":"Paige C, Saunders M: Towards a Generalized Singular Value Decomposition. SIAM J Numer Anal 1981, 18(3):398\u2013405. 10.1137\/0718026","journal-title":"SIAM J Numer Anal"},{"issue":"1-2","key":"3002_CR18","doi-asserted-by":"publisher","first-page":"63","DOI":"10.1023\/A:1006414621286","volume":"50","author":"U Cavallaro","year":"2000","unstructured":"Cavallaro U, Christofori G: Molecular mechanisms of tumor angiogenesis and tumor progression. J Neurooncol 2000, 50(1\u20132):63\u201370. 10.1023\/A:1006414621286","journal-title":"J Neurooncol"},{"issue":"4","key":"3002_CR19","doi-asserted-by":"publisher","first-page":"449","DOI":"10.1023\/A:1006150007710","volume":"17","author":"J Banyard","year":"1998","unstructured":"Banyard J, Zetter BR: The role of cell motility in prostate cancer. Cancer Metastasis Rev 1998, 17(4):449\u2013458. 10.1023\/A:1006150007710","journal-title":"Cancer Metastasis Rev"},{"key":"3002_CR20","first-page":"39","volume-title":"Skin Cancer","author":"K Nouri","year":"2007","unstructured":"Nouri K, Patel SS, Singh A: Etiology of skin cancer. In Skin Cancer. 1st edition. Edited by: Nouri K. McGraw-Hill; 2007:39\u201345.","edition":"1"},{"key":"3002_CR21","doi-asserted-by":"publisher","first-page":"615","DOI":"10.1146\/annurev.cellbio.17.1.615","volume":"17","author":"R Katso","year":"2001","unstructured":"Katso R, Okkenhaug K, Ahmadi K, White S, Timms J, Waterfield MD: Cellular function of phosphoinositide 3-kinases: implications for development, homeostasis, and cancer. Annu Rev Cell Dev Biol 2001, 17: 615\u2013675. 10.1146\/annurev.cellbio.17.1.615","journal-title":"Annu Rev Cell Dev Biol"},{"issue":"18","key":"3002_CR22","doi-asserted-by":"publisher","first-page":"10101","DOI":"10.1073\/pnas.97.18.10101","volume":"97","author":"O Alter","year":"2000","unstructured":"Alter O, Brown PO, Botstein D: Singular value decomposition for genome-wide expression data processing and modeling. Proc Natl Acad Sci USA 2000, 97(18):10101\u201310106. 10.1073\/pnas.97.18.10101","journal-title":"Proc Natl Acad Sci USA"},{"key":"3002_CR23","doi-asserted-by":"publisher","first-page":"121","DOI":"10.1016\/0167-9473(94)90135-X","volume":"18","author":"B Escofier","year":"1994","unstructured":"Escofier B, Pages J: Multiple Factor Analysis (AFMULT package). Comput Stat Data Anal 1994, 18: 121\u2013140. 10.1016\/0167-9473(94)90135-X","journal-title":"Comput Stat Data Anal"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-10-272.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,8,31]],"date-time":"2021-08-31T21:40:44Z","timestamp":1630446044000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-10-272"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2009,9,1]]},"references-count":23,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2009,12]]}},"alternative-id":["3002"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-10-272","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2009,9,1]]},"assertion":[{"value":"4 November 2008","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"1 September 2009","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"1 September 2009","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"272"}}