{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,7]],"date-time":"2026-04-07T11:46:03Z","timestamp":1775562363220,"version":"3.50.1"},"reference-count":65,"publisher":"Oxford University Press (OUP)","issue":"16","license":[{"start":{"date-parts":[[2022,6,25]],"date-time":"2022-06-25T00:00:00Z","timestamp":1656115200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"EPSRC\u2019s StatML CDT"},{"name":"Imperial\u2019s CRUK center and Imperial\u2019s Experimental Cancer Medicine center"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2022,8,10]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Motivation<\/jats:title><jats:p>Few Bayesian methods for analyzing high-dimensional sparse survival data provide scalable variable selection, effect estimation and uncertainty quantification. Such methods often either sacrifice uncertainty quantification by computing maximum a posteriori estimates, or quantify the uncertainty at high (unscalable) computational expense.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>We bridge this gap and develop an interpretable and scalable Bayesian proportional hazards model for prediction and variable selection, referred to as sparse variational Bayes. Our method, based on a mean-field variational approximation, overcomes the high computational cost of Markov chain Monte Carlo, whilst retaining useful features, providing a posterior distribution for the parameters and offering a natural mechanism for variable selection via posterior inclusion probabilities. The performance of our proposed method is assessed via extensive simulations and compared against other state-of-the-art Bayesian variable selection methods, demonstrating comparable or better performance. Finally, we demonstrate how the proposed method can be used for variable selection on two transcriptomic datasets with censored survival outcomes, and how the uncertainty quantification offered by our method can be used to provide an interpretable assessment of patient risk.<\/jats:p><\/jats:sec><jats:sec><jats:title>Availability and implementation<\/jats:title><jats:p>our method has been implemented as a freely available R package survival.svb (https:\/\/github.com\/mkomod\/survival.svb).<\/jats:p><\/jats:sec><jats:sec><jats:title>Supplementary information<\/jats:title><jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p><\/jats:sec>","DOI":"10.1093\/bioinformatics\/btac416","type":"journal-article","created":{"date-parts":[[2022,6,25]],"date-time":"2022-06-25T12:41:57Z","timestamp":1656160917000},"page":"3918-3926","source":"Crossref","is-referenced-by-count":7,"title":["Variational Bayes for high-dimensional proportional hazards models with applications within gene expression"],"prefix":"10.1093","volume":"38","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-8662-0953","authenticated-orcid":false,"given":"Michael","family":"Komodromos","sequence":"first","affiliation":[{"name":"Department of Mathematics, Imperial College London , London SW7 2AZ, UK"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Eric O","family":"Aboagye","sequence":"additional","affiliation":[{"name":"Department of Surgery and Cancer, Imperial College London , London W12 0NN, UK"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Marina","family":"Evangelou","sequence":"additional","affiliation":[{"name":"Department of Mathematics, Imperial College London , London SW7 2AZ, UK"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Sarah","family":"Filippi","sequence":"additional","affiliation":[{"name":"Department of Mathematics, Imperial College London , London SW7 2AZ, UK"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Kolyan","family":"Ray","sequence":"additional","affiliation":[{"name":"Department of Mathematics, Imperial College London , London SW7 2AZ, UK"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2022,6,25]]},"reference":[{"key":"2023091911313387700_btac416-B1","doi-asserted-by":"crossref","first-page":"531","DOI":"10.1111\/j.1467-9469.2009.00685.x","article-title":"The dantzig selector in cox\u2019s proportional hazards model","volume":"37","author":"Antoniadis","year":"2010","journal-title":"Scand. J. Stat"},{"key":"2023091911313387700_btac416-B2","author":"Bai","year":"2021"},{"key":"2023091911313387700_btac416-B3","author":"Banerjee","year":"2021"},{"key":"2023091911313387700_btac416-B4","doi-asserted-by":"crossref","first-page":"405","DOI":"10.1214\/19-STS700","article-title":"Lasso meets horseshoe: a survey","volume":"34","author":"Bhadra","year":"2019","journal-title":"Stat. Sci"},{"key":"2023091911313387700_btac416-B5","first-page":"17","article-title":"A correlated topic model of science","volume":"1","author":"Blei","year":"2007","journal-title":"Ann. Appl. Stat"},{"key":"2023091911313387700_btac416-B6","doi-asserted-by":"crossref","first-page":"859","DOI":"10.1080\/01621459.2017.1285773","article-title":"Variational inference: a review for statisticians","volume":"112","author":"Blei","year":"2017","journal-title":"J. Am. Stat. Assoc"},{"key":"2023091911313387700_btac416-B7","doi-asserted-by":"crossref","first-page":"2080","DOI":"10.1093\/bioinformatics\/btm305","article-title":"Predicting survival from microarray data\u2014a comparative study","volume":"23","author":"B\u00f8velstad","year":"2007","journal-title":"Bioinformatics"},{"key":"2023091911313387700_btac416-B8","volume-title":"Algorithms for Minimization without Derivatives","author":"Brent","year":"1973"},{"key":"2023091911313387700_btac416-B9","doi-asserted-by":"crossref","first-page":"73","DOI":"10.1214\/12-BA703","article-title":"Scalable variational inference for Bayesian variable selection in regression, and its accuracy in genetic association studies","volume":"7","author":"Carbonetto","year":"2012","journal-title":"Bayesian Anal"},{"key":"2023091911313387700_btac416-B10","doi-asserted-by":"crossref","first-page":"465","DOI":"10.1093\/biomet\/asq017","article-title":"The horseshoe estimator for sparse signals","volume":"97","author":"Carvalho","year":"2010","journal-title":"Biometrika"},{"key":"2023091911313387700_btac416-B11","doi-asserted-by":"crossref","first-page":"2069","DOI":"10.1214\/12-AOS1029","article-title":"Needles and straw in a haystack: posterior concentration for possibly sparse sequences","volume":"40","author":"Castillo","year":"2012","journal-title":"Ann. Stat"},{"key":"2023091911313387700_btac416-B12","doi-asserted-by":"crossref","first-page":"232","DOI":"10.1038\/sj.bjc.6601118","article-title":"Survival analysis part I: basic concepts and first analyses","volume":"89","author":"Clark","year":"2003","journal-title":"Br. J. Cancer"},{"key":"2023091911313387700_btac416-B13","doi-asserted-by":"crossref","first-page":"161","DOI":"10.1593\/neo.91542","article-title":"Overexpression of elafin in ovarian carcinoma is driven by genomic gains and activation of the nuclear factor \u03baB pathway and is associated with poor overall survival","volume":"12","author":"Clauss","year":"2010","journal-title":"Neoplasia"},{"key":"2023091911313387700_btac416-B14","doi-asserted-by":"crossref","first-page":"187","DOI":"10.1111\/j.2517-6161.1972.tb00899.x","article-title":"Regression models and life-tables","volume":"34","author":"Cox","year":"1972","journal-title":"J. R. Stat. Soc. B"},{"key":"2023091911313387700_btac416-B15","doi-asserted-by":"crossref","first-page":"269","DOI":"10.1093\/biomet\/62.2.269","article-title":"Partial likelihood","volume":"62","author":"Cox","year":"1975","journal-title":"Biometrika"},{"key":"2023091911313387700_btac416-B16","doi-asserted-by":"crossref","first-page":"93","DOI":"10.1007\/s00180-015-0638-y","article-title":"A comparison of variational approximations for fast inference in mixed logit models","volume":"32","author":"Depraetere","year":"2017","journal-title":"Comput. Stat"},{"key":"2023091911313387700_btac416-B17","doi-asserted-by":"crossref","first-page":"1380","DOI":"10.1182\/blood-2012-02-404475","article-title":"NFAT control of innate immunity","volume":"120","author":"Fric","year":"2012","journal-title":"Blood"},{"key":"2023091911313387700_btac416-B18","doi-asserted-by":"crossref","first-page":"881","DOI":"10.1080\/01621459.1993.10476353","article-title":"Variable selection via gibbs sampling","volume":"88","author":"George","year":"1993","journal-title":"J. Am. Stat. Assoc"},{"key":"2023091911313387700_btac416-B19","doi-asserted-by":"crossref","first-page":"3001","DOI":"10.1093\/bioinformatics\/bti422","article-title":"Penalized cox regression analysis in the high-dimensional and low-sample size settings, with applications to microarray gene expression data","volume":"21","author":"Gui","year":"2005","journal-title":"Bioinformatics"},{"key":"2023091911313387700_btac416-B20","doi-asserted-by":"crossref","DOI":"10.1007\/978-1-4757-3447-8","volume-title":"Bayesian Survival Analysis","author":"Ibrahim","year":"2001"},{"key":"2023091911313387700_btac416-B21","doi-asserted-by":"crossref","first-page":"1271","DOI":"10.1038\/s41591-020-0926-0","article-title":"A single-cell landscape of high-grade serous ovarian cancer","volume":"26","author":"Izar","year":"2020","journal-title":"Nat. Med"},{"key":"2023091911313387700_btac416-B22","first-page":"283","article-title":"A Variational Approach to Bayesian Logistic Regression Models and Their Extensions","author":"Jaakkola","year":"1997"},{"key":"2023091911313387700_btac416-B23","first-page":"1819","author":"Jerfel","year":"2021"},{"key":"2023091911313387700_btac416-B24","doi-asserted-by":"crossref","first-page":"214","DOI":"10.1111\/j.2517-6161.1978.tb01666.x","article-title":"Bayesian analysis of survival time data","volume":"40","author":"Kalbfleisch","year":"1978","journal-title":"J. R. Stat. Soc. B"},{"key":"2023091911313387700_btac416-B25","doi-asserted-by":"crossref","first-page":"2136","DOI":"10.1109\/TNNLS.2014.2376974","article-title":"Group factor analysis","volume":"26","author":"Klami","year":"2015","journal-title":"IEEE Trans. Neural Netw. Learn. Syst"},{"key":"2023091911313387700_btac416-B26","author":"Knowles","year":"2011"},{"key":"2023091911313387700_btac416-B27","doi-asserted-by":"crossref","first-page":"843","DOI":"10.1002\/9781119487845.ch30","volume-title":"Handb. Stat. Genomics","author":"Lewin","year":"2019"},{"key":"2023091911313387700_btac416-B28","doi-asserted-by":"crossref","first-page":"1202","DOI":"10.1198\/jasa.2010.tm08177","article-title":"Bayesian variable selection in structured high-dimensional covariate spaces with applications in genomics","volume":"105","author":"Li","year":"2010","journal-title":"J. Am. Stat. Assoc"},{"key":"2023091911313387700_btac416-B29","doi-asserted-by":"crossref","first-page":"1795","DOI":"10.1093\/bib\/bby051","article-title":"Review of applications of high-throughput sequencing in personalized medicine: barriers and facilitators of future progress in research and clinical application","volume":"20","author":"Lightbody","year":"2019","journal-title":"Brief. Bioinform"},{"key":"2023091911313387700_btac416-B30","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s12885-015-1101-8","article-title":"Prediction of resistance to chemotherapy in ovarian cancer: a systematic review","volume":"15","author":"Lloyd","year":"2015","journal-title":"BMC Cancer"},{"key":"2023091911313387700_btac416-B31","doi-asserted-by":"crossref","first-page":"58","DOI":"10.1186\/1471-2105-11-58","article-title":"A variational Bayes algorithm for fast and accurate multiple locus genome-wide association analysis","volume":"11","author":"Logsdon","year":"2010","journal-title":"BMC Bioinformatics"},{"key":"2023091911313387700_btac416-B32","doi-asserted-by":"crossref","first-page":"1286","DOI":"10.1038\/s41416-020-01252-2","article-title":"Discovery of a biomarker candidate for surgical stratification in high-grade serous ovarian cancer","volume":"124","author":"Lu","year":"2021","journal-title":"Br. J. Cancer"},{"key":"2023091911313387700_btac416-B33","doi-asserted-by":"crossref","first-page":"316","DOI":"10.1111\/biom.13132","article-title":"Bayesian data integration and variable selection for pan-cancer survival prediction using protein expression data","volume":"76","author":"Maity","year":"2020","journal-title":"Biometrics"},{"key":"2023091911313387700_btac416-B34","doi-asserted-by":"crossref","first-page":"e1002207","DOI":"10.1371\/journal.pgen.1002207","article-title":"Variance of gene expression identifies altered network constraints in neurological disease","volume":"7","author":"Mar","year":"2011","journal-title":"PLoS Genet"},{"key":"2023091911313387700_btac416-B35","doi-asserted-by":"crossref","first-page":"1023","DOI":"10.1080\/01621459.1988.10478694","article-title":"Bayesian variable selection in linear regression","volume":"83","author":"Mitchell","year":"1988","journal-title":"J. Am. Stat. Assoc"},{"key":"2023091911313387700_btac416-B36","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/bcr3361","article-title":"ABCC5 supports osteoclast formation and promotes breast cancer metastasis to bone","volume":"14","author":"Mourskaia","year":"2012","journal-title":"Breast Cancer Res"},{"key":"2023091911313387700_btac416-B37","doi-asserted-by":"crossref","first-page":"449","DOI":"10.1080\/01621459.2000.10474219","article-title":"On profile likelihood","volume":"95","author":"Murphy","year":"2000","journal-title":"J. Am. Stat. Assoc"},{"key":"2023091911313387700_btac416-B38","doi-asserted-by":"crossref","first-page":"155","DOI":"10.1093\/biostatistics\/5.2.155","article-title":"Detecting differential gene expression with a semiparametric hierarchical mixture method","volume":"5","author":"Newton","year":"2004","journal-title":"Biostatistics"},{"key":"2023091911313387700_btac416-B39","doi-asserted-by":"crossref","first-page":"809","DOI":"10.1214\/20-AOAS1325","article-title":"Bayesian variable selection for survival data using inverse moment priors","volume":"14","author":"Nikooienejad","year":"2020","journal-title":"Ann. Appl. Stat"},{"key":"2023091911313387700_btac416-B40","author":"Ning","year":"2021"},{"key":"2023091911313387700_btac416-B41","first-page":"85","article-title":"A review of Bayesian variable selection methods: what, how and which","volume":"4","author":"O\u2019Hara","year":"2009","journal-title":"Bayesian Anal"},{"key":"2023091911313387700_btac416-B42","first-page":"786","article-title":"The Variational Gaussian Approximaiton Revisited","volume-title":"Neural Comput","author":"Opper","year":"2009"},{"key":"2023091911313387700_btac416-B43","doi-asserted-by":"crossref","first-page":"3549","DOI":"10.1214\/17-EJS1332","article-title":"A variational Bayes approach to variable selection","volume":"11","author":"Ormerod","year":"2017","journal-title":"Electron. J. Stat"},{"key":"2023091911313387700_btac416-B44","doi-asserted-by":"crossref","first-page":"203","DOI":"10.1515\/sagmb-2013-0054","article-title":"Improved variational Bayes inference for transcript expression estimation","volume":"13","author":"Papastamoulis","year":"2014","journal-title":"Stat. Appl. Genet. Mol. Biol"},{"key":"2023091911313387700_btac416-B45","doi-asserted-by":"crossref","first-page":"988","DOI":"10.1038\/s41416-020-0945-0","article-title":"Gremlin-1 augments the oestrogen-related receptor \u03b1 signalling through EGFR activation: implications for the progression of breast cancer","volume":"123","author":"Park","year":"2020","journal-title":"Br. J. Cancer"},{"key":"2023091911313387700_btac416-B46","doi-asserted-by":"crossref","first-page":"681","DOI":"10.1198\/016214508000000337","article-title":"The Bayesian lasso","volume":"103","author":"Park","year":"2008","journal-title":"J. Am. Stat. Assoc"},{"key":"2023091911313387700_btac416-B47","doi-asserted-by":"crossref","first-page":"e18640","DOI":"10.1371\/journal.pone.0018640","article-title":"Identification of prognostic molecular features in the reactive stroma of human breast and prostate cancer","volume":"6","author":"Planche","year":"2011","journal-title":"PLoS One"},{"key":"2023091911313387700_btac416-B48","article-title":"Variational Bayes for high-dimensional linear regression with sparse priors","author":"Ray","year":"2021","journal-title":"J. Am. Stat. Assoc"},{"key":"2023091911313387700_btac416-B49","first-page":"14423","author":"Ray","year":"2020"},{"key":"2023091911313387700_btac416-B50","doi-asserted-by":"crossref","first-page":"9016","DOI":"10.1038\/s41598-021-88512-0","article-title":"The Nek2 centrosome-mitotic kinase contributes to the mesenchymal state, cell invasion, and migration of triple-negative breast cancer cells","volume":"11","author":"Rivera-Rivera","year":"2021","journal-title":"Sci. Rep"},{"key":"2023091911313387700_btac416-B51","doi-asserted-by":"crossref","first-page":"1","DOI":"10.18637\/jss.v039.i05","article-title":"Regularization paths for cox\u2019s proportional hazards model via coordinate descent","volume":"39","author":"Simon","year":"2011","journal-title":"J. Stat. Softw"},{"key":"2023091911313387700_btac416-B52","doi-asserted-by":"crossref","first-page":"3418","DOI":"10.1093\/bioinformatics\/btaa169","article-title":"Interpretable factor models of single-cell RNA-seq via variational autoencoders","volume":"36","author":"Svensson","year":"2020","journal-title":"Bioinformatics"},{"key":"2023091911313387700_btac416-B53","doi-asserted-by":"crossref","first-page":"2799","DOI":"10.1093\/bioinformatics\/btx300","article-title":"The spike-and-slab lasso cox model for survival prediction and associated genes detection","volume":"33","author":"Tang","year":"2017","journal-title":"Bioinformatics"},{"key":"2023091911313387700_btac416-B54","year":"2022"},{"key":"2023091911313387700_btac416-B55","doi-asserted-by":"crossref","first-page":"3025","DOI":"10.1093\/bioinformatics\/bti466","article-title":"A variational Bayesian mixture modelling framework for cluster analysis of gene-expression data","volume":"21","author":"Teschendorff","year":"2005","journal-title":"Bioinformatics"},{"key":"2023091911313387700_btac416-B56","doi-asserted-by":"crossref","first-page":"267","DOI":"10.1111\/j.2517-6161.1996.tb02080.x","article-title":"Regression shrinkage and selection via the lasso","volume":"58","author":"Tibshirani","year":"1996","journal-title":"J. R. Stat. Soc. B"},{"key":"2023091911313387700_btac416-B57","doi-asserted-by":"crossref","first-page":"385","DOI":"10.1002\/(SICI)1097-0258(19970228)16:4<385::AID-SIM380>3.0.CO;2-3","article-title":"The lasso method for variable selection in the cox model","volume":"16","author":"Tibshirani","year":"1997","journal-title":"Stat. Med"},{"key":"2023091911313387700_btac416-B58","author":"Titsias","year":"2011"},{"key":"2023091911313387700_btac416-B59","doi-asserted-by":"crossref","DOI":"10.1007\/978-3-642-45361-8","volume-title":"Molecular Biology","author":"Wid\u0142ak","year":"2013"},{"key":"2023091911313387700_btac416-B60","doi-asserted-by":"crossref","first-page":"29","DOI":"10.1177\/0962280209105024","article-title":"Survival analysis with high-dimensional covariates","volume":"19","author":"Witten","year":"2010","journal-title":"Stat. Methods Med. Res"},{"key":"2023091911313387700_btac416-B61","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s13048-019-0550-0","article-title":"Higher expression of calcineurin predicts poor prognosis in unique subtype of ovarian cancer","volume":"12","author":"Xin","year":"2019","journal-title":"J. Ovarian Res"},{"key":"2023091911313387700_btac416-B62","doi-asserted-by":"crossref","DOI":"10.1186\/bcr2753","article-title":"A multigene predictor of metastatic outcome in early stage hormone receptor-negative and triple-negative breast cancer","volume":"12","author":"Yau","year":"2010","journal-title":"Breast Cancer Res"},{"key":"2023091911313387700_btac416-B63","doi-asserted-by":"crossref","first-page":"2008","DOI":"10.1109\/TPAMI.2018.2889774","article-title":"Advances in variational inference","volume":"41","author":"Zhang","year":"2019","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell"},{"key":"2023091911313387700_btac416-B64","doi-asserted-by":"crossref","first-page":"45","DOI":"10.1186\/s12859-016-1451-5","article-title":"Variational inference for rare variant detection in deep, heterogeneous next-generation sequencing data","volume":"18","author":"Zhang","year":"2017","journal-title":"BMC Bioinformatics"},{"key":"2023091911313387700_btac416-B65","doi-asserted-by":"crossref","first-page":"768","DOI":"10.1111\/j.1467-9868.2005.00527.x","article-title":"Regularization and variable selection via the elastic net","volume":"67","author":"Zou","year":"2005","journal-title":"J. R. Stat. Soc. B"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btac416\/44656205\/btac416.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/38\/16\/3918\/51678605\/btac416.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/38\/16\/3918\/51678605\/btac416.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,9,27]],"date-time":"2024-09-27T20:03:21Z","timestamp":1727467401000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/38\/16\/3918\/6617825"}},"subtitle":[],"editor":[{"given":"Janet","family":"Kelso","sequence":"additional","affiliation":[],"role":[{"role":"editor","vocabulary":"crossref"}]}],"short-title":[],"issued":{"date-parts":[[2022,6,25]]},"references-count":65,"journal-issue":{"issue":"16","published-print":{"date-parts":[[2022,8,10]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btac416","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2022,8,15]]},"published":{"date-parts":[[2022,6,25]]}}}