{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,11]],"date-time":"2025-12-11T20:16:35Z","timestamp":1765484195990,"version":"3.37.3"},"reference-count":38,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2020,7,2]],"date-time":"2020-07-02T00:00:00Z","timestamp":1593648000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2020,7,2]],"date-time":"2020-07-02T00:00:00Z","timestamp":1593648000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2020,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Background<\/jats:title><jats:p>The standard lasso penalty and its extensions are commonly used to develop a regularized regression model while selecting candidate predictor variables on a time-to-event outcome in high-dimensional data. However, these selection methods focus on a homogeneous set of variables and do not take into account the case of predictors belonging to functional groups; typically, genomic data can be grouped according to biological pathways or to different types of collected data. Another challenge is that the standard lasso penalisation is known to have a high false discovery rate.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>We evaluated different penalizations in a Cox model to select grouped variables in order to further penalize variables that, in addition to having a low effect, belong to a group with a low overall effect; and to favor the selection of variables that, in addition to having a large effect, belong to a group with a large overall effect. We considered the case of prespecified and disjoint groups and proposed diverse weights for the adaptive lasso method. In particular we proposed the product Max Single Wald by Single Wald weighting (MSW*SW) which takes into account the information of the group to which it belongs and of this biomarker. Through simulations, we compared the selection and prediction ability of our approach with the standard lasso, the composite Minimax Concave Penalty (cMCP), the group exponential lasso (gel), the Integrative<jats:italic>L<\/jats:italic>1-Penalized Regression with Penalty Factors (IPF-Lasso), and the Sparse Group Lasso (SGL) methods. In addition, we illustrated the methods using gene expression data of 614 breast cancer patients.<\/jats:p><\/jats:sec><jats:sec><jats:title>Conclusions<\/jats:title><jats:p>The adaptive lasso with the MSW*SW weighting method incorporates both the information in the grouping structure and the individual variable. It outperformed the competitors by reducing the false discovery rate without severely increasing the false negative rate.<\/jats:p><\/jats:sec>","DOI":"10.1186\/s12859-020-03618-y","type":"journal-article","created":{"date-parts":[[2020,7,2]],"date-time":"2020-07-02T08:31:03Z","timestamp":1593678663000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":18,"title":["Accounting for grouped predictor variables or pathways in high-dimensional penalized Cox regression models"],"prefix":"10.1186","volume":"21","author":[{"given":"Shaima","family":"Belhechmi","sequence":"first","affiliation":[]},{"given":"Riccardo De","family":"Bin","sequence":"additional","affiliation":[]},{"given":"Federico","family":"Rotolo","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6963-2968","authenticated-orcid":false,"given":"Stefan","family":"Michiels","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2020,7,2]]},"reference":[{"issue":"2","key":"3618_CR1","doi-asserted-by":"crossref","first-page":"187","DOI":"10.1111\/j.2517-6161.1972.tb00899.x","volume":"34","author":"DR Cox","year":"1972","unstructured":"Cox DR. Regression models and life-tables. J R Stat Soc Ser B Methodol. 1972; 34(2):187\u2013202.","journal-title":"J R Stat Soc Ser B Methodol"},{"key":"3618_CR2","doi-asserted-by":"crossref","first-page":"267","DOI":"10.1111\/j.2517-6161.1996.tb02080.x","volume":"58","author":"R Tibshirani","year":"1996","unstructured":"Tibshirani R. Regression shrinkage and selection via the lasso. J R Stat Soc Ser B Methodol. 1996; 58:267\u201388.","journal-title":"J R Stat Soc Ser B Methodol"},{"issue":"4","key":"3618_CR3","doi-asserted-by":"publisher","first-page":"385","DOI":"10.1002\/(SICI)1097-0258(19970228)16:4<385::AID-SIM380>3.0.CO;2-3","volume":"16","author":"R Tibshirani","year":"1997","unstructured":"Tibshirani R. The Lasso Method for Variable Selection in the Cox Model. Stat Med. 1997; 16(4):385\u201395. https:\/\/doi.org\/10.1002\/(SICI)1097-0258(19970228)16:4<385::AID-SIM380>3.0.CO;2-3.","journal-title":"Stat Med"},{"issue":"3","key":"3618_CR4","doi-asserted-by":"publisher","first-page":"1436","DOI":"10.1214\/009053606000000281","volume":"34","author":"N Meinshausen","year":"2006","unstructured":"Meinshausen N, B\u00fchhlmann P. High-dimensional graphs and variable selection with the Lasso. Ann Stat. 2006; 34(3):1436\u201362.","journal-title":"Ann Stat"},{"key":"3618_CR5","first-page":"2541","volume":"7","author":"P Zhao","year":"2006","unstructured":"Zhao P, Yu B. On Model Selection Consistency of Lasso. J Mach Learn Res. 2006; 7:2541\u201363.","journal-title":"J Mach Learn Res"},{"issue":"15","key":"3618_CR6","doi-asserted-by":"publisher","first-page":"2561","DOI":"10.1002\/sim.6927","volume":"35","author":"N Tern\u00e8s","year":"2016","unstructured":"Tern\u00e8s N, Rotolo F, Michiels S. Empirical extensions of the lasso penalty to reduce the false discovery rate in high-dimensional cox regression models. Stat Med. 2016; 35(15):2561\u201373.","journal-title":"Stat Med"},{"issue":"476","key":"3618_CR7","doi-asserted-by":"publisher","first-page":"1418","DOI":"10.1198\/016214506000000735","volume":"101","author":"H Zou","year":"2006","unstructured":"Zou H. The adaptive lasso and its oracle properties. J Am Stat Assoc. 2006; 101(476):1418\u201329.","journal-title":"J Am Stat Assoc"},{"issue":"3","key":"3618_CR8","doi-asserted-by":"publisher","first-page":"691","DOI":"10.1093\/biomet\/asm037","volume":"94","author":"HH Zhang","year":"2007","unstructured":"Zhang HH, Lu W. Adaptive lasso for cox\u2019s proportional hazards model. Biometrika. 2007; 94(3):691\u2013703.","journal-title":"Biometrika"},{"issue":"3","key":"3618_CR9","doi-asserted-by":"publisher","first-page":"369","DOI":"10.4310\/SII.2009.v2.n3.a10","volume":"2","author":"P Breheny","year":"2009","unstructured":"Breheny P, Huang J. Penalized methods for bi-level variable selection. Stat Interface. 2009; 2(3):369\u201380.","journal-title":"Stat Interface"},{"issue":"3","key":"3618_CR10","doi-asserted-by":"publisher","first-page":"731","DOI":"10.1111\/biom.12300","volume":"71","author":"P Breheny","year":"2015","unstructured":"Breheny P. The group exponential lasso for bi-level variable selection. Biometrics. 2015; 71(3):731\u201340. https:\/\/doi.org\/10.1111\/biom.12300.","journal-title":"Biometrics"},{"issue":"2","key":"3618_CR11","doi-asserted-by":"publisher","first-page":"231","DOI":"10.1080\/10618600.2012.681250","volume":"22","author":"N Simon","year":"2013","unstructured":"Simon N, Friedman J, Hastie T, Tibshirani R. A sparse-group lasso. J Comput Graph Stat. 2013; 22(2):231\u201345.","journal-title":"J Comput Graph Stat"},{"issue":"1","key":"3618_CR12","doi-asserted-by":"publisher","first-page":"49","DOI":"10.1111\/j.1467-9868.2005.00532.x","volume":"68","author":"M Yuan","year":"2006","unstructured":"Yuan M, Lin Y. Model selection and estimation in regression with grouped variables. J R Stat Soc Ser B Stat Methodol. 2006; 68(1):49\u201367.","journal-title":"J R Stat Soc Ser B Stat Methodol"},{"key":"3618_CR13","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1155\/2017\/7691937","volume":"2017","author":"A-L Boulesteix","year":"2017","unstructured":"Boulesteix A-L, De Bin R, Jiang X, Fuchs M. IPF-lasso: Integrative-penalized regression with penalty factors for prediction based on multi-omics data. Comput Math Methods Med. 2017; 2017:1\u201314.","journal-title":"Comput Math Methods Med"},{"issue":"476","key":"3618_CR14","doi-asserted-by":"publisher","first-page":"1418","DOI":"10.1198\/016214506000000735","volume":"101","author":"H Zou","year":"2006","unstructured":"Zou H. The adaptive lasso and its oracle properties. J Am Stat Assoc. 2006; 101(476):1418\u201329.","journal-title":"J Am Stat Assoc"},{"key":"3618_CR15","volume-title":"Statistical Learning with Sparsity: the Lasso and Generalizations","author":"R Tibshirani","year":"2015","unstructured":"Tibshirani R, Wainwright M, Hastie T. Statistical Learning with Sparsity: the Lasso and Generalizations. Boca Raton: CRC Press; 2015."},{"issue":"24","key":"3618_CR16","doi-asserted-by":"publisher","first-page":"2305","DOI":"10.1002\/sim.4780122407","volume":"12","author":"PJM Verweij","year":"1993","unstructured":"Verweij PJM, Houwelingen HCV. Cross-validation in survival analysis. Stat Med. 1993; 12(24):2305\u201314. https:\/\/doi.org\/10.1002\/sim.4780122407.","journal-title":"Stat Med"},{"issue":"13","key":"3618_CR17","doi-asserted-by":"publisher","first-page":"1502","DOI":"10.1002\/sim.4022","volume":"30","author":"S Michiels","year":"2011","unstructured":"Michiels S, Potthoff RF, George SL. Multiple testing of treatment-effect-modifying biomarkers in a randomized clinical trial with a survival endpoint. Stat Med. 2011; 30(13):1502\u201318.","journal-title":"Stat Med"},{"issue":"2","key":"3618_CR18","doi-asserted-by":"publisher","first-page":"894","DOI":"10.1214\/09-AOS729","volume":"38","author":"C-H Zhang","year":"2010","unstructured":"Zhang C-H, et al. Nearly unbiased variable selection under minimax concave penalty. Ann Stat. 2010; 38(2):894\u2013942.","journal-title":"Ann Stat"},{"issue":"5","key":"3618_CR19","doi-asserted-by":"publisher","first-page":"1","DOI":"10.18637\/jss.v039.i05","volume":"39","author":"N Simon","year":"2011","unstructured":"Simon N, Friedman J, Hastie T, Tibshirani R. Regularization paths for cox\u2019s proportional hazards model via coordinate descent. J Stat Softw. 2011; 39(5):1.","journal-title":"J Stat Softw"},{"key":"3618_CR20","unstructured":"Friedman J, Hastie T, Simon N, Tibshirani R. Glmnet: Lasso and Elastic-Net Regularized Generalized Linear Models. R-package version 2.0-16. 2018. https:\/\/cran.r-project.org\/web\/packages\/glmnet."},{"key":"3618_CR21","unstructured":"Schafer J, Opgen-Rhein R, Zuber V, Ahdesmaki M, Silva APD, Strimmer. K. Corpcor: Efficient Estimation of Covariance and (Partial) Correlation. R package version 1.6.9. 2017. https:\/\/CRAN.R-project.org\/package=corpcor."},{"key":"3618_CR22","unstructured":"Breheny P, Breheny MP. Package \u2019grpreg\u2019. 2019."},{"key":"3618_CR23","unstructured":"Boulesteix A-L, Fuchs M. Ipflasso: Integrative Lasso with Penalty Factors. R package version 0.1. 2015. https:\/\/CRAN.R-project.org\/package=ipflasso."},{"key":"3618_CR24","unstructured":"Simon N, Friedman J, Hastie T, Tibshirani R. SGL: Fit a GLM (or Cox Model) with a Combination of Lasso and Group Lasso Regularization. R package version 1.1. 2013. https:\/\/CRAN.R-project.org\/package=SGL."},{"issue":"30","key":"3618_CR25","doi-asserted-by":"publisher","first-page":"5381","DOI":"10.1002\/sim.5958","volume":"32","author":"P Blanche","year":"2013","unstructured":"Blanche P, Dartigues J-F, Jacqmin-Gadda H. Estimating and comparing time-dependent areas under receiver operating characteristic curves for censored event times with competing risks. Stat Med. 2013; 32(30):5381\u201397.","journal-title":"Stat Med"},{"key":"3618_CR26","unstructured":"Blanche P, Blanche MP. Package \u2019timeROC\u2019. 2012."},{"issue":"1","key":"3618_CR27","doi-asserted-by":"publisher","first-page":"112","DOI":"10.1093\/bioinformatics\/btx560","volume":"34","author":"N Tern\u00e8s","year":"2018","unstructured":"Tern\u00e8s N, Rotolo F, Michiels S. biospear: an R package for biomarker selection in penalized Cox regression. Bioinformatics. 2018; 34(1):112\u20133. https:\/\/doi.org\/10.1093\/bioinformatics\/btx560.","journal-title":"Bioinformatics"},{"issue":"3","key":"3618_CR28","doi-asserted-by":"publisher","first-page":"499","DOI":"10.1111\/1467-9868.00347","volume":"64","author":"C Genovese","year":"2002","unstructured":"Genovese C, Wasserman L. Operating characteristics and extensions of the false discovery rate procedure. J R Stat Soc Ser B Stat Methodol. 2002; 64(3):499\u2013517.","journal-title":"J R Stat Soc Ser B Stat Methodol"},{"issue":"13","key":"3618_CR29","doi-asserted-by":"publisher","first-page":"3017","DOI":"10.1093\/bioinformatics\/bti448","volume":"21","author":"Y Pawitan","year":"2005","unstructured":"Pawitan Y, Michiels S, Koscielny S, Gusnanto A, Ploner A. False discovery rate, sensitivity and sample size for microarray studies. Bioinformatics. 2005; 21(13):3017\u201324.","journal-title":"Bioinformatics"},{"issue":"478","key":"3618_CR30","doi-asserted-by":"publisher","first-page":"527","DOI":"10.1198\/016214507000000149","volume":"102","author":"H Uno","year":"2007","unstructured":"Uno H, Cai T, Tian L, Wei L. Evaluating prediction rules for t-year survivors with censored regression models. J Am Stat Assoc. 2007; 102(478):527\u201337.","journal-title":"J Am Stat Assoc"},{"issue":"1","key":"3618_CR31","doi-asserted-by":"crossref","first-page":"8","DOI":"10.1002\/cjs.10046","volume":"38","author":"H Hung","year":"2010","unstructured":"Hung H, Chiang C-T. Estimation methods for time-dependent auc models with survival data. Can J Stat. 2010; 38(1):8\u201326.","journal-title":"Can J Stat"},{"issue":"16","key":"3618_CR32","doi-asserted-by":"publisher","first-page":"1996","DOI":"10.1200\/JCO.2011.39.5624","volume":"30","author":"M Ignatiadis","year":"2012","unstructured":"Ignatiadis M, Singhal SK, Desmedt C, Haibe-Kains B, Criscitiello C, Andre F, Loi S, Piccart M, Michiels S, Sotiriou C. Gene modules and response to neoadjuvant chemotherapy in breast cancer subtypes: a pooled analysis. J Clin Oncol Off JAm Soc Clin Oncol. 2012; 30(16):1996\u20132004. https:\/\/doi.org\/10.1200\/JCO.2011.39.5624.","journal-title":"J Clin Oncol Off JAm Soc Clin Oncol"},{"issue":"19","key":"3618_CR33","doi-asserted-by":"publisher","first-page":"2200","DOI":"10.1093\/bioinformatics\/btn374","volume":"24","author":"B Haibe-Kains","year":"2008","unstructured":"Haibe-Kains B, Desmedt C, Sotiriou C, Bontempi G. A comparative study of survival models for breast cancer prognostication based on microarray data: does a single gene beat them all?Bioinformatics. 2008; 24(19):2200\u20138. https:\/\/doi.org\/10.1093\/bioinformatics\/btn374.","journal-title":"Bioinformatics"},{"issue":"1","key":"3618_CR34","doi-asserted-by":"publisher","first-page":"94","DOI":"10.1186\/s12859-019-2656-1","volume":"20","author":"Z Tang","year":"2019","unstructured":"Tang Z, Lei S, Zhang X, Yi Z, Guo B, Chen JY, Shen Y, Yi N. Gsslasso Cox: a Bayesian hierarchical model for predicting survival and detecting associated genes by incorporating pathway information. BMC Bioinformatics. 2019; 20(1):94. https:\/\/doi.org\/10.1186\/s12859-019-2656-1.","journal-title":"BMC Bioinformatics"},{"issue":"1","key":"3618_CR35","doi-asserted-by":"publisher","first-page":"18","DOI":"10.1186\/1471-2105-10-18","volume":"10","author":"H Binder","year":"2009","unstructured":"Binder H, Schumacher M. Incorporating pathway information into boosting estimation of high-dimensional risk prediction models. BMC Bioinformatics. 2009; 10(1):18. https:\/\/doi.org\/10.1186\/1471-2105-10-18.","journal-title":"BMC Bioinformatics"},{"issue":"23","key":"3618_CR36","doi-asserted-by":"publisher","first-page":"3338","DOI":"10.1002\/sim.7821","volume":"37","author":"M Sutton","year":"2018","unstructured":"Sutton M, Thi\u00e9baut R, Liquet B. Sparse partial least squares with group and subgroup structure. Stat Med. 2018; 37(23):3338\u201356.","journal-title":"Stat Med"},{"key":"3618_CR37","doi-asserted-by":"publisher","first-page":"13787","DOI":"10.4137\/CIN.S13787","volume":"13","author":"L Zhang","year":"2014","unstructured":"Zhang L, Morris JS, Zhang J, Orlowski RZ, Baladandayuthapani V. Bayesian joint selection of genes and pathways: Applications in multiple myeloma genomics. Cancer Informat. 2014; 13:13787.","journal-title":"Cancer Informat"},{"key":"3618_CR38","unstructured":"Obozinski G, Jacob L, Vert J-P. Group Lasso with Overlaps: the Latent Group Lasso approach. arXiv:1110.0413 [cs, stat]. 2011. arXiv: 1110.0413."}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-020-03618-y.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s12859-020-03618-y\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-020-03618-y.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,8,9]],"date-time":"2024-08-09T01:49:26Z","timestamp":1723168166000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-020-03618-y"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,7,2]]},"references-count":38,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2020,12]]}},"alternative-id":["3618"],"URL":"https:\/\/doi.org\/10.1186\/s12859-020-03618-y","relation":{},"ISSN":["1471-2105"],"issn-type":[{"type":"electronic","value":"1471-2105"}],"subject":[],"published":{"date-parts":[[2020,7,2]]},"assertion":[{"value":"3 January 2020","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"19 June 2020","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"2 July 2020","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"Not applicable.","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not Applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no competing interests.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"277"}}