{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,28]],"date-time":"2025-11-28T12:10:55Z","timestamp":1764331855806,"version":"3.38.0"},"reference-count":28,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2010,11,9]],"date-time":"2010-11-09T00:00:00Z","timestamp":1289260800000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/2.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2010,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Background<\/jats:title><jats:p>Nonparametric Bayesian techniques have been developed recently to extend the sophistication of factor models, allowing one to infer the number of appropriate factors from the observed data. We consider such techniques for sparse factor analysis, with application to gene-expression data from three virus challenge studies. Particular attention is placed on employing the Beta Process (BP), the Indian Buffet Process (IBP), and related sparseness-promoting techniques to infer a proper number of factors. The posterior density function on the model parameters is computed using Gibbs sampling and variational Bayesian (VB) analysis.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>Time-evolving gene-expression data are considered for respiratory syncytial virus (RSV), Rhino virus, and influenza, using blood samples from healthy human subjects. These data were acquired in three challenge studies, each executed after receiving institutional review board (IRB) approval from Duke University. Comparisons are made between several alternative means of per-forming nonparametric factor analysis on these data, with comparisons as well to sparse-PCA and Penalized Matrix Decomposition (PMD), closely related non-Bayesian approaches.<\/jats:p><\/jats:sec><jats:sec><jats:title>Conclusions<\/jats:title><jats:p>Applying the Beta Process to the factor scores, or to the singular values of a pseudo-SVD construction, the proposed algorithms infer the number of factors in gene-expression data. For real data the \"true\" number of factors is unknown; in our simulations we consider a range of noise variances, and the proposed Bayesian models inferred the number of factors accurately relative to other methods in the literature, such as sparse-PCA and PMD. We have also identified a \"pan-viral\" factor of importance for each of the three viruses considered in this study. We have identified a set of genes associated with this pan-viral factor, of interest for early detection of such viruses based upon the host response, as quantified via gene-expression data.<\/jats:p><\/jats:sec>","DOI":"10.1186\/1471-2105-11-552","type":"journal-article","created":{"date-parts":[[2010,11,9]],"date-time":"2010-11-09T19:23:07Z","timestamp":1289330587000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":26,"title":["Bayesian inference of the number of factors in gene-expression analysis: application to human virus challenge studies"],"prefix":"10.1186","volume":"11","author":[{"given":"Bo","family":"Chen","sequence":"first","affiliation":[]},{"given":"Minhua","family":"Chen","sequence":"additional","affiliation":[]},{"given":"John","family":"Paisley","sequence":"additional","affiliation":[]},{"given":"Aimee","family":"Zaas","sequence":"additional","affiliation":[]},{"given":"Christopher","family":"Woods","sequence":"additional","affiliation":[]},{"given":"Geoffrey S","family":"Ginsburg","sequence":"additional","affiliation":[]},{"suffix":"III","given":"Alfred","family":"Hero","sequence":"additional","affiliation":[]},{"given":"Joseph","family":"Lucas","sequence":"additional","affiliation":[]},{"given":"David","family":"Dunson","sequence":"additional","affiliation":[]},{"given":"Lawrence","family":"Carin","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2010,11,9]]},"reference":[{"key":"4135_CR1","first-page":"723","volume-title":"Bayesian Statistics 7","author":"M West","year":"2003","unstructured":"West M: \"Bayesian factor regression models in the \"large p, small n\" paradigm,\". In Bayesian Statistics 7. Edited by: Bernardo JM, Bayarri M, Berger J, Dawid A, Heckerman D, Smith A, West M. Oxford University Press; 2003:723\u2013732."},{"key":"4135_CR2","doi-asserted-by":"crossref","first-page":"267","DOI":"10.1111\/j.2517-6161.1996.tb02080.x","volume":"58","author":"R Tibshirani","year":"1996","unstructured":"Tibshirani R: \"Regression shrinkage and selection via the lasso,\". Journal of Royal Statistical Society Ser. B 1996, 58: 267\u2013288.","journal-title":"Journal of Royal Statistical Society Ser. B"},{"key":"4135_CR3","doi-asserted-by":"publisher","first-page":"301","DOI":"10.1111\/j.1467-9868.2005.00503.x","volume":"67","author":"H Zou","year":"2005","unstructured":"Zou H, Hastie T: \"Regularization and variable selection via the elastic net,\". Journal of Royal Statistical Society Ser. B 2005, 67: 301\u2013320. 10.1111\/j.1467-9868.2005.00503.x","journal-title":"Journal of Royal Statistical Society Ser. B"},{"key":"4135_CR4","doi-asserted-by":"publisher","first-page":"681","DOI":"10.1198\/016214508000000337","volume":"103","author":"T Park","year":"2008","unstructured":"Park T, Casella G: \"The Bayesian Lasso,\". Journal of the American Statistical Association 2008, 103: 681\u2013686,. 10.1198\/016214508000000337","journal-title":"Journal of the American Statistical Association"},{"key":"4135_CR5","volume-title":"An Introduction to Support Vector Machines","author":"N Cristianini","year":"2000","unstructured":"Cristianini N, Shawe-Taylor J: An Introduction to Support Vector Machines. Cambridge University Press; 2000."},{"key":"4135_CR6","doi-asserted-by":"publisher","first-page":"211","DOI":"10.1162\/15324430152748236","volume":"1","author":"M Tipping","year":"2001","unstructured":"Tipping M: \"Sparse Bayesian learning and the relevance vector machine,\". Journal of Machine Learning Research 2001, 1: 211\u2013244. 10.1162\/15324430152748236","journal-title":"Journal of Machine Learning Research"},{"key":"4135_CR7","volume-title":"IEEE Transactions on Signal Processing","author":"S Ji","year":"2008","unstructured":"Ji S, Xue Y, Carin L: \"Bayesian compressive sensing,\". IEEE Transactions on Signal Processing 2008., 56:"},{"key":"4135_CR8","volume-title":"Proc Conf Neural Information Proc Systems (NIPS), Vancouver, Canada","author":"P Rai","year":"2008","unstructured":"Rai P, Daum'e H III: \"The infinite hierarchical factor regression model,\". Proc Conf Neural Information Proc Systems (NIPS), Vancouver, Canada 2008."},{"key":"4135_CR9","volume-title":"7th International Conference on Independent Component Analysis and Signal Separation","author":"D Knowles","year":"2007","unstructured":"Knowles D, Ghahramani Z: \"Infinite sparse factor analysis and infinite independent components analysis,\". 7th International Conference on Independent Component Analysis and Signal Separation 2007."},{"key":"4135_CR10","first-page":"977","volume-title":"Advances in Neural Information Processing Systems","author":"E Meeds","year":"2007","unstructured":"Meeds E, Ghahramani Z, Neal R, Roweis S: \"Modeling dyadic data with binary latent factors,\". Advances in Neural Information Processing Systems 2007, 977\u2013984."},{"key":"4135_CR11","doi-asserted-by":"publisher","first-page":"1438","DOI":"10.1198\/016214508000000869","volume":"103","author":"C Carvalho","year":"2008","unstructured":"Carvalho C, Chang J, Lucas J, Nevins JR, Wang Q, West M: \"High-dimensional sparse factor modelling: Applications in gene expression genomics,\". Journal of the American Statistical Association 2008, 103: 1438\u20131456. 10.1198\/016214508000000869","journal-title":"Journal of the American Statistical Association"},{"key":"4135_CR12","doi-asserted-by":"publisher","first-page":"2004","DOI":"10.1198\/106186006X113430","volume":"15","author":"H Zou","year":"2006","unstructured":"Zou H, Hastie T, Tibshirani R: \"Sparse principal component analysis,\". Journal of Computational and Graphical Statistics 2006, 15: 2004. 10.1198\/106186006X113430","journal-title":"Journal of Computational and Graphical Statistics"},{"key":"4135_CR13","doi-asserted-by":"publisher","first-page":"515","DOI":"10.1093\/biostatistics\/kxp008","volume":"10","author":"D Witten","year":"2009","unstructured":"Witten D, Tibshirani R, Hastie T: \"A penalized matrix decomposition, with applications to sparse principal components and canonical correlation analysis,\". Biostatistics 2009, 10: 515\u2013534. 10.1093\/biostatistics\/kxp008","journal-title":"Biostatistics"},{"key":"4135_CR14","first-page":"41","volume":"14","author":"H Lopes","year":"2004","unstructured":"Lopes H, West M: \"Bayesian model assessment in factor analysis,\". Statistica Sinica 2004, 14: 41\u201367.","journal-title":"Statistica Sinica"},{"key":"4135_CR15","volume-title":"Random Effect and Latent Variable Model Selection","author":"J Ghosh","year":"2008","unstructured":"Ghosh J, Dunson D: \"Bayesian model selection in factor analytic models,\". In Random Effect and Latent Variable Model Selection. Edited by: Dunson D. John Wiley & Sons; 2008."},{"key":"4135_CR16","doi-asserted-by":"publisher","first-page":"241258,","DOI":"10.1016\/S0378-3758(02)00336-1","volume":"112","author":"J Berger","year":"2003","unstructured":"Berger J, Ghosh J, Mukhopadhyay N: \"Approximation and consistency of Bayes factors as model dimension grows,\". J. Statist. Plann. Inference 2003, 112: 241258,. 10.1016\/S0378-3758(02)00336-1","journal-title":"J. Statist. Plann. Inference"},{"key":"4135_CR17","doi-asserted-by":"publisher","first-page":"1653","DOI":"10.1080\/03610929908832378","volume":"28","author":"S Press","year":"1999","unstructured":"Press S, Shigemasu K: \"A note on choosing the number of factors,\". Comm Statist Theory Methods 1999, 28: 1653\u20131670. 10.1080\/03610929908832378","journal-title":"Comm Statist Theory Methods"},{"key":"4135_CR18","doi-asserted-by":"publisher","first-page":"2339","DOI":"10.2333\/bhmk.29.23","volume":"29","author":"S Lee","year":"2002","unstructured":"Lee S, Song X: \"Bayesian selection on the number of factors in a factor analysis model,\". Behaviormetrika 2002, 29: 2339. 10.2333\/bhmk.29.23","journal-title":"Behaviormetrika"},{"key":"4135_CR19","first-page":"475","volume-title":"Advances in Neural Information Processing Systems","author":"T Griffiths","year":"2005","unstructured":"Griffiths T, Ghahramani Z: \"Infinite latent feature models and the indian buffet process,\". Advances in Neural Information Processing Systems 2005, 475\u2013482."},{"key":"4135_CR20","volume-title":"AISTATS","author":"F Doshi-Velez","year":"2009","unstructured":"Doshi-Velez F, Miller K, Gael JV, The Y: \"Variational inference for the indian buffet process,\". AISTATS 2009."},{"key":"4135_CR21","volume-title":"International Conference on Artificial Intelligence and Statistics","author":"R Thibaux","year":"2007","unstructured":"Thibaux R, Jordan M: \"Hierarchical beta processes and the Indian buffet process,\". International Conference on Artificial Intelligence and Statistics 2007."},{"key":"4135_CR22","volume-title":"Int Conf Machine Learning","author":"J Paisley","year":"2009","unstructured":"Paisley J, Carin L: \"Nonparametric factor analysis with beta process priors,\". Int Conf Machine Learning 2009."},{"key":"4135_CR23","volume-title":"\"Variational algorithms for approximate bayesian inference,\"","author":"M Beal","year":"2003","unstructured":"Beal M: \"Variational algorithms for approximate bayesian inference,\". Ph.D. dissertation, Gatsby Computational Neuroscience Unit, University College London; 2003."},{"key":"4135_CR24","volume-title":"Technical Report, Statistics Department, Stanford University","author":"H Zou","year":"2004","unstructured":"Zou H, Hastie T, Tibshirani R: \"Sparse principal component analysis,\". Technical Report, Statistics Department, Stanford University 2004."},{"key":"4135_CR25","doi-asserted-by":"publisher","first-page":"207","DOI":"10.1016\/j.chom.2009.07.006","volume":"6","author":"AK Zaas","year":"2009","unstructured":"Zaas AK, Chen M, Lucas J, Veldman T, Hero AO, Varkey J, Turner R, Oien C, Kingsmore S, Carin L, Woods CW, Ginsburg GS: \"Peripheral blood gene expression signatures characterize symptomatic respiratory viral infection,\". Cell Host & Microbe 2009, 6: 207\u2013217. 10.1016\/j.chom.2009.07.006","journal-title":"Cell Host & Microbe"},{"key":"4135_CR26","doi-asserted-by":"publisher","first-page":"555","DOI":"10.1198\/016214503000000387","volume":"98","author":"D Dunson","year":"2003","unstructured":"Dunson D: \"Dynamic latent trait models for multidimensional longitudinal data,\". J. Am. Statistical Ass 2003, 98: 555\u2013563. 10.1198\/016214503000000387","journal-title":"J. Am. Statistical Ass"},{"key":"4135_CR27","doi-asserted-by":"publisher","first-page":"2066","DOI":"10.1182\/blood-2006-02-002477","volume":"109","author":"O Ramilo","year":"2007","unstructured":"Ramilo O, Allman W, Chung W, Mejias A, Ardura M, Glaser C, Wittkowski KM, Piqueras B, Banchereau J, Palucka AK, Chaussabel D: \"Gene expression patterns in blood leukocytes discriminate patients with acute infections,\". Blood 2007, 109: 2066\u20132077. 10.1182\/blood-2006-02-002477","journal-title":"Blood"},{"issue":"3","key":"4135_CR28","doi-asserted-by":"publisher","first-page":"1259","DOI":"10.1214\/aos\/1176347749","volume":"18","author":"NL Hjort","year":"1990","unstructured":"Hjort NL: \"Nonparametric bayes estimators based on beta processes in models for life history data,\". Annals of Statistics 1990, 18(3):1259\u20131294. 10.1214\/aos\/1176347749","journal-title":"Annals of Statistics"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-11-552.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1186\/1471-2105-11-552\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-11-552.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,2,27]],"date-time":"2025-02-27T18:25:42Z","timestamp":1740680742000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-11-552"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2010,11,9]]},"references-count":28,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2010,12]]}},"alternative-id":["4135"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-11-552","relation":{},"ISSN":["1471-2105"],"issn-type":[{"type":"electronic","value":"1471-2105"}],"subject":[],"published":{"date-parts":[[2010,11,9]]},"assertion":[{"value":"1 September 2009","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"9 November 2010","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"9 November 2010","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"552"}}