{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,15]],"date-time":"2026-06-15T13:45:46Z","timestamp":1781531146251,"version":"3.54.5"},"reference-count":42,"publisher":"Oxford University Press (OUP)","issue":"2","license":[{"start":{"date-parts":[[2021,9,29]],"date-time":"2021-09-29T00:00:00Z","timestamp":1632873600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61471078"],"award-info":[{"award-number":["61471078"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100017683","name":"Dalian Science and Technology Innovation Fund","doi-asserted-by":"publisher","award":["2020JJ27SN066"],"award-info":[{"award-number":["2020JJ27SN066"]}],"id":[{"id":"10.13039\/501100017683","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100012226","name":"Fundamental Research Funds for the Central Universities","doi-asserted-by":"publisher","award":["3132014306"],"award-info":[{"award-number":["3132014306"]}],"id":[{"id":"10.13039\/501100012226","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100012226","name":"Fundamental Research Funds for the Central Universities","doi-asserted-by":"publisher","award":["3132015213"],"award-info":[{"award-number":["3132015213"]}],"id":[{"id":"10.13039\/501100012226","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100012226","name":"Fundamental Research Funds for the Central Universities","doi-asserted-by":"publisher","award":["3132017075"],"award-info":[{"award-number":["3132017075"]}],"id":[{"id":"10.13039\/501100012226","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2022,1,3]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Motivation<\/jats:title><jats:p>Survival analysis using gene expression profiles plays a crucial role in the interpretation of clinical research and assessment of disease therapy programs. Several prediction models have been developed to explore the relationship between patients\u2019 covariates and survival. However, the high-dimensional genomic features limit the prediction performance of the survival model. Thus, an accurate and reliable prediction model is necessary for survival analysis using high-dimensional genomic data.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>In this study, we proposed an improved survival prediction model based on XGBoost framework called XGBLC, which used Lasso-Cox to enhance the ability to analyze high-dimensional genomic data. The novel first- and second-order gradient statistics of Lasso-Cox were defined to construct the loss function of XGBLC. We extensively tested our XGBLC algorithm on both simulated and real-world datasets, and estimated the performance of models with 5-fold cross-validation. Based on 20 cancer datasets from The Cancer Genome Atlas (TCGA), XGBLC outperforms five state-of-the-art survival methods in terms of C-index, Brier score and AUC. The results show that XGBLC still keeps good accuracy and robustness by comparing the performance on the simulated datasets with different scales. The developed prediction model would be beneficial for physicians to understand the effects of patient\u2019s genomic characteristics on survival and make personalized treatment decisions.<\/jats:p><\/jats:sec><jats:sec><jats:title>Availability and implementation<\/jats:title><jats:p>The implementation of XGBLC algorithm based on R language is available at: https:\/\/github.com\/lab319\/XGBLC<\/jats:p><\/jats:sec><jats:sec><jats:title>Supplementary information<\/jats:title><jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p><\/jats:sec>","DOI":"10.1093\/bioinformatics\/btab675","type":"journal-article","created":{"date-parts":[[2021,9,23]],"date-time":"2021-09-23T14:31:23Z","timestamp":1632407483000},"page":"410-418","source":"Crossref","is-referenced-by-count":53,"title":["XGBLC: an improved survival prediction model based on XGBoost"],"prefix":"10.1093","volume":"38","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-1629-1968","authenticated-orcid":false,"given":"Baoshan","family":"Ma","sequence":"first","affiliation":[{"name":"School of Information Science and Technology, Dalian Maritime University , Dalian 116026, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Ge","family":"Yan","sequence":"additional","affiliation":[{"name":"School of Information Science and Technology, Dalian Maritime University , Dalian 116026, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Bingjie","family":"Chai","sequence":"additional","affiliation":[{"name":"School of Information Science and Technology, Dalian Maritime University , Dalian 116026, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Xiaoyu","family":"Hou","sequence":"additional","affiliation":[{"name":"School of Information Science and Technology, Dalian Maritime University , Dalian 116026, China"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"286","published-online":{"date-parts":[[2021,9,29]]},"reference":[{"key":"2023020108470679100_btab675-B1","doi-asserted-by":"crossref","first-page":"E108","DOI":"10.1371\/journal.pbio.0020108","article-title":"Semi-supervised methods to predict patient survival from gene expression data","volume":"2","author":"Air","year":"2004","journal-title":"PLoS Biol"},{"key":"2023020108470679100_btab675-B2","doi-asserted-by":"crossref","first-page":"12","DOI":"10.1016\/j.jchf.2019.06.013","article-title":"Machine learning prediction of mortality and hospitalization in heart failure with preserved ejection fraction","volume":"8","author":"Angraal","year":"2020","journal-title":"JACC Heart Fail"},{"key":"2023020108470679100_btab675-B3","doi-asserted-by":"crossref","first-page":"3946","DOI":"10.1002\/sim.5452","article-title":"Generating survival times to simulate Cox proportional hazards models with time-varying covariates","volume":"31","author":"Austin","year":"2012","journal-title":"Stat. Med"},{"key":"2023020108470679100_btab675-B4","doi-asserted-by":"crossref","first-page":"816","DOI":"10.1038\/nm733","article-title":"Gene-expression profiles predict survival of patients with lung adenocarcinoma","volume":"8","author":"Beer","year":"2002","journal-title":"Nat. Med"},{"key":"2023020108470679100_btab675-B5","doi-asserted-by":"crossref","first-page":"1430","DOI":"10.1002\/bimj.201800376","article-title":"On the validity of time-dependent AUC estimation in the presence of cure fraction","volume":"61","author":"Beyene","year":"2019","journal-title":"Biometrical J"},{"key":"2023020108470679100_btab675-B6","first-page":"593","article-title":"Analysis of survival data","volume":"41","author":"Breslow","year":"1984","journal-title":"New York"},{"key":"2023020108470679100_btab675-B7","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1175\/1520-0493(1950)078<0001:VOFEIT>2.0.CO;2","article-title":"Verification of forecasts expressed in terms of probability","volume":"78","author":"Brier","year":"1950","journal-title":"Mon. Weather Rev"},{"key":"2023020108470679100_btab675-B8","doi-asserted-by":"crossref","first-page":"121","DOI":"10.1023\/A:1009715923555","article-title":"A tutorial on support vector machines for pattern recognition","volume":"2","author":"Burges","year":"1998","journal-title":"Data Min. Knowledge Discov"},{"key":"2023020108470679100_btab675-B9","doi-asserted-by":"crossref","first-page":"785","DOI":"10.1145\/2939672.2939785","volume-title":"The 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD '16)","author":"Chen","year":"2016"},{"key":"2023020108470679100_btab675-B10","first-page":"22","article-title":"The accuracy of clinicians' predictions of survival in advanced cancer: a review","volume":"5","author":"Cheon","year":"2016","journal-title":"Ann. Palliat Med"},{"key":"2023020108470679100_btab675-B11","first-page":"187","article-title":"Regression models and life-tables","volume":"34","author":"Cox","year":"1972","journal-title":"J. R. Stat. Soc. Ser. B (Methodological)"},{"key":"2023020108470679100_btab675-B12","doi-asserted-by":"crossref","first-page":"269","DOI":"10.1093\/biomet\/62.2.269","article-title":"Partial likelihood","volume":"62","author":"Cox","year":"1975","journal-title":"Biometrika"},{"key":"2023020108470679100_btab675-B13","first-page":"157","article-title":"Random forests","volume":"45","author":"Cutler","year":"2004","journal-title":"Mach. Learn"},{"key":"2023020108470679100_btab675-B14","doi-asserted-by":"crossref","first-page":"5137","DOI":"10.1093\/bioinformatics\/btz446","article-title":"Path2Surv: pathway\/gene set-based survival analysis using multiple kernel learning","volume":"35","author":"Dereli","year":"2019","journal-title":"Bioinformatics"},{"key":"2023020108470679100_btab675-B15","first-page":"397","article-title":"Penalized regression: the bridge versus the lasso","volume":"7","author":"Fu","year":"1998","journal-title":"J. Comput. Graph. Stat"},{"key":"2023020108470679100_btab675-B16","doi-asserted-by":"crossref","first-page":"1455","DOI":"10.1162\/089976698300017269","article-title":"An equivalence between sparse approximation and support vector machines","volume":"10","author":"Girosi","year":"1998","journal-title":"Neural Comput"},{"key":"2023020108470679100_btab675-B17","doi-asserted-by":"crossref","first-page":"70","DOI":"10.1002\/bimj.200900028","article-title":"L1 penalized estimation in the Cox proportional hazards model","volume":"52","author":"Goeman","year":"2010","journal-title":"BIOM J"},{"key":"2023020108470679100_btab675-B18","doi-asserted-by":"crossref","first-page":"3001","DOI":"10.1093\/bioinformatics\/bti422","article-title":"Penalized Cox regression analysis in the high-dimensional and low-sample size settings, with applications to microarray gene expression data","volume":"21","author":"Gui","year":"2005","journal-title":"Bioinformatics"},{"key":"2023020108470679100_btab675-B19","doi-asserted-by":"crossref","first-page":"361","DOI":"10.1002\/(SICI)1097-0258(19960229)15:4<361::AID-SIM168>3.0.CO;2-4","article-title":"Multivariable prognostic models: issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors","volume":"15","author":"Harrell","year":"1996","journal-title":"Stat. Med"},{"key":"2023020108470679100_btab675-B20","doi-asserted-by":"crossref","first-page":"337","DOI":"10.1111\/j.0006-341X.2000.00337.x","article-title":"Time-dependent ROC curves for censored survival data and a diagnostic marker","volume":"56","author":"Heagerty","year":"2000","journal-title":"Biometrics"},{"key":"2023020108470679100_btab675-B21","doi-asserted-by":"crossref","first-page":"61","DOI":"10.1016\/j.canlet.2019.12.007","article-title":"Artificial intelligence in cancer diagnosis and prognosis: opportunities and challenges","volume":"471","author":"Huang","year":"2020","journal-title":"Cancer Lett"},{"key":"2023020108470679100_btab675-B22","doi-asserted-by":"crossref","first-page":"841","DOI":"10.1214\/08-AOAS169","article-title":"Random survival forests","volume":"2","author":"Ishwaran","year":"2008","journal-title":"Ann. Appl. Stat"},{"key":"2023020108470679100_btab675-B23","doi-asserted-by":"crossref","first-page":"56","DOI":"10.1186\/s12911-016-0292-5","article-title":"Prognostic factor analysis for breast cancer using gene expression profiles","volume":"16","author":"Joe","year":"2016","journal-title":"BMC Med. Inf. Dec. Making"},{"key":"2023020108470679100_btab675-B24","doi-asserted-by":"crossref","first-page":"457","DOI":"10.1080\/01621459.1958.10501452","article-title":"Nonparametric estimation from incomplete observations","volume":"53","author":"Kaplan","year":"1958","journal-title":"J. Am. Stat. Assoc"},{"key":"2023020108470679100_btab675-B25","doi-asserted-by":"crossref","first-page":"24","DOI":"10.1186\/s12874-018-0482-1","article-title":"DeepSurv: personalized treatment recommender system using a Cox proportional hazards deep neural network","volume":"18","author":"Katzman","year":"2018","journal-title":"BMC Med. Res. Methodol"},{"key":"2023020108470679100_btab675-B26","doi-asserted-by":"crossref","first-page":"57","DOI":"10.2147\/IJN.S40733","article-title":"Feature selection and survival modeling in The Cancer Genome Atlas","volume":"8","author":"Kim","year":"2013","journal-title":"Int. J. Nanomed"},{"key":"2023020108470679100_btab675-B27","doi-asserted-by":"crossref","first-page":"148","DOI":"10.1109\/TBME.2020.2993278","article-title":"Optimizing survival analysis of XGBoost for ties to predict disease progression of breast cancer","volume":"68","author":"Liu","year":"2021","journal-title":"IEEE Trans. Biomed. Eng"},{"key":"2023020108470679100_btab675-B28","first-page":"1207","article-title":"A prognostic 4-lncRNA expression signature for lung squamous cell carcinoma","volume":"46","author":"Luo","year":"2018","journal-title":"Artif. Cells"},{"key":"2023020108470679100_btab675-B29","doi-asserted-by":"crossref","first-page":"1288","DOI":"10.7150\/jca.34585","article-title":"Identification of a sixteen-gene prognostic biomarker for lung adenocarcinoma using a machine learning method","volume":"11","author":"Ma","year":"2020","journal-title":"J. Cancer"},{"key":"2023020108470679100_btab675-B30","first-page":"1573","author":"Mitchel","year":"2019"},{"key":"2023020108470679100_btab675-B31","doi-asserted-by":"crossref","first-page":"12","DOI":"10.2202\/1557-4679.1049","article-title":"Multiple imputation and random forests (MIRF) for unobservable, high-dimensional data","volume":"3","author":"Nonyane","year":"2007","journal-title":"Int. J. Biostat"},{"key":"2023020108470679100_btab675-B32","doi-asserted-by":"crossref","first-page":"2209","DOI":"10.1056\/NEJMoa1516192","article-title":"Genomic classification and prognosis in acute myeloid leukemia","volume":"374","author":"Papaemmanuil","year":"2016","journal-title":"N. Engl. J. Med"},{"key":"2023020108470679100_btab675-B33","doi-asserted-by":"crossref","first-page":"1160","DOI":"10.1200\/JCO.2008.18.1370","article-title":"Supervised risk predictor of breast cancer based on intrinsic subtypes","volume":"27","author":"Parker","year":"2009","journal-title":"J. Clin. Oncol"},{"key":"2023020108470679100_btab675-B34","doi-asserted-by":"crossref","first-page":"397","DOI":"10.1504\/IJBRA.2015.071940","article-title":"A comprehensive evaluation of machine learning techniques for cancer class prediction based on microarray data","volume":"11","author":"Raza","year":"2015","journal-title":"Int. J. Bioinf. Res. Appl"},{"key":"2023020108470679100_btab675-B35","first-page":"655","author":"Shivaswamy","year":"2007"},{"key":"2023020108470679100_btab675-B36","doi-asserted-by":"crossref","first-page":"385","DOI":"10.1002\/(SICI)1097-0258(19970228)16:4<385::AID-SIM380>3.0.CO;2-3","article-title":"The Lasso method for variable selection in the cox model","volume":"16","author":"Tibshirani","year":"1997","journal-title":"Stat. Med"},{"key":"2023020108470679100_btab675-B37","doi-asserted-by":"crossref","first-page":"87","DOI":"10.1093\/bioinformatics\/btq617","article-title":"Improved performance on high-dimensional survival data by application of Survival-SVM","volume":"27","author":"Van Belle","year":"2011","journal-title":"Bioinformatics"},{"key":"2023020108470679100_btab675-B38","doi-asserted-by":"crossref","first-page":"107","DOI":"10.1016\/j.artmed.2011.06.006","article-title":"Support vector methods for survival analysis: a comparison between ranking and regression approaches","volume":"53","author":"Van Belle","year":"2011","journal-title":"Artif. Intell. Med"},{"key":"2023020108470679100_btab675-B39","doi-asserted-by":"crossref","first-page":"1999","DOI":"10.1056\/NEJMoa021967","article-title":"A gene-expression signature as a predictor of survival in breast cancer","volume":"347","author":"Vijver","year":"2002","journal-title":"N. Engl. J. Med"},{"key":"2023020108470679100_btab675-B40","doi-asserted-by":"crossref","first-page":"691","DOI":"10.1093\/biomet\/asm037","article-title":"Adaptive Lasso for Cox's proportional hazards model","volume":"94","author":"Zhang","year":"2007","journal-title":"Biometrika"},{"key":"2023020108470679100_btab675-B41","first-page":"1176935118810215","article-title":"Machine learning with K-means dimensional reduction for predicting survival outcomes in patients with breast cancer","volume":"17","author":"Zhao","year":"2018","journal-title":"Cancer Inf"},{"key":"2023020108470679100_btab675-B42","doi-asserted-by":"crossref","first-page":"3330","DOI":"10.1093\/bioinformatics\/btv374","article-title":"NCC-AUC: an AUC optimization method to identify multi-biomarker panel for cancer prognosis from genomic and clinical data","volume":"31","author":"Zou","year":"2015","journal-title":"Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btab675\/40533133\/btab675.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/38\/2\/410\/49007723\/btab675.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/38\/2\/410\/49007723\/btab675.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,11,9]],"date-time":"2023-11-09T08:08:09Z","timestamp":1699517289000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/38\/2\/410\/6377771"}},"subtitle":[],"editor":[{"given":"Valentina","family":"Boeva","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"editor"}]}],"short-title":[],"issued":{"date-parts":[[2021,9,29]]},"references-count":42,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2022,1,3]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btab675","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2022,1,15]]},"published":{"date-parts":[[2021,9,29]]}}}