{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,13]],"date-time":"2026-05-13T16:58:37Z","timestamp":1778691517754,"version":"3.51.4"},"reference-count":28,"publisher":"Oxford University Press (OUP)","issue":"6","license":[{"start":{"date-parts":[[2022,1,3]],"date-time":"2022-01-03T00:00:00Z","timestamp":1641168000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"name":"Institutional Research","award":["14-189-19"],"award-info":[{"award-number":["14-189-19"]}]},{"DOI":"10.13039\/100000048","name":"American Cancer Society","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100000048","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000002","name":"NIH","doi-asserted-by":"publisher","award":["R01-DE030493"],"award-info":[{"award-number":["R01-DE030493"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Biostatistics and Bioinformatics Shared Resource"},{"DOI":"10.13039\/100005567","name":"H. Lee Moffitt Cancer Center and Research Institute","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100005567","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000054","name":"NCI","doi-asserted-by":"publisher","award":["P30-CA076292"],"award-info":[{"award-number":["P30-CA076292"]}],"id":[{"id":"10.13039\/100000054","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2022,3,4]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Motivation<\/jats:title>\n                    <jats:p>A gradient boosting decision tree (GBDT) is a powerful ensemble machine-learning method that has the potential to accelerate biomarker discovery from high-dimensional molecular data. Recent algorithmic advances, such as extreme gradient boosting (XGB) and light gradient boosting (LGB), have rendered the GBDT training more efficient, scalable and accurate. However, these modern techniques have not yet been widely adopted in discovering biomarkers for censored survival outcomes, which are key clinical outcomes or endpoints in cancer studies.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>In this paper, we present a new R package \u2018Xsurv\u2019 as an integrated solution that applies two modern GBDT training frameworks namely, XGB and LGB, for the modeling of right-censored survival outcomes. Based on our simulations, we benchmark the new approaches against traditional methods including the stepwise Cox regression model and the original gradient boosting function implemented in the package \u2018gbm\u2019. We also demonstrate the application of Xsurv in analyzing a melanoma methylation dataset. Together, these results suggest that Xsurv is a useful and computationally viable tool for screening a large number of prognostic candidate biomarkers, which may facilitate future translational and clinical research.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Availability and implementation<\/jats:title>\n                    <jats:p>\u2018Xsurv\u2019 is freely available as an R package at: https:\/\/github.com\/topycyao\/Xsurv.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Supplementary information<\/jats:title>\n                    <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btab869","type":"journal-article","created":{"date-parts":[[2021,12,29]],"date-time":"2021-12-29T08:08:55Z","timestamp":1640765335000},"page":"1631-1638","source":"Crossref","is-referenced-by-count":62,"title":["Efficient gradient boosting for prognostic biomarker discovery"],"prefix":"10.1093","volume":"38","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-8069-5639","authenticated-orcid":false,"given":"Kaiqiao","family":"Li","sequence":"first","affiliation":[{"name":"Department of Applied Mathematics and Statistics, Stony Brook University , Stony Brook, NY 11794, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Sijie","family":"Yao","sequence":"additional","affiliation":[{"name":"Department of Biostatistics and Bioinformatics, H. Lee Moffitt Cancer Center and Research Institute , Tampa, FL 33612, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Zhenyu","family":"Zhang","sequence":"additional","affiliation":[{"name":"Department of Applied Mathematics and Statistics, Stony Brook University , Stony Brook, NY 11794, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Biwei","family":"Cao","sequence":"additional","affiliation":[{"name":"Department of Biostatistics and Bioinformatics, H. Lee Moffitt Cancer Center and Research Institute , Tampa, FL 33612, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Christopher M","family":"Wilson","sequence":"additional","affiliation":[{"name":"Department of Biostatistics and Bioinformatics, H. Lee Moffitt Cancer Center and Research Institute , Tampa, FL 33612, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Denise","family":"Kalos","sequence":"additional","affiliation":[{"name":"Department of Biostatistics and Bioinformatics, H. Lee Moffitt Cancer Center and Research Institute , Tampa, FL 33612, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7861-916X","authenticated-orcid":false,"given":"Pei Fen","family":"Kuan","sequence":"additional","affiliation":[{"name":"Department of Applied Mathematics and Statistics, Stony Brook University , Stony Brook, NY 11794, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0753-5716","authenticated-orcid":false,"given":"Ruoqing","family":"Zhu","sequence":"additional","affiliation":[{"name":"Department of Statistics, University of Illinois Urbana-Champaign , Champaign, IL 61820, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5775-408X","authenticated-orcid":false,"given":"Xuefeng","family":"Wang","sequence":"additional","affiliation":[{"name":"Department of Biostatistics and Bioinformatics, H. Lee Moffitt Cancer Center and Research Institute , Tampa, FL 33612, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2022,1,3]]},"reference":[{"key":"2023033004314978100_","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1175\/1520-0493(1950)078<0001:VOFEIT>2.0.CO;2","article-title":"Verification of forecasts expressed in terms of probability","volume":"78","author":"Brier","year":"1950","journal-title":"Mon. Weather Rev"},{"key":"2023033004314978100_","doi-asserted-by":"crossref","first-page":"785","DOI":"10.1145\/2939672.2939785","volume-title":"Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining","author":"Chen","year":"2016"},{"key":"2023033004314978100_","author":"Chen","year":"2021"},{"key":"2023033004314978100_","doi-asserted-by":"crossref","first-page":"873595","DOI":"10.1155\/2013\/873595","article-title":"A gradient boosting algorithm for survival analysis via direct optimization of concordance index","volume":"2013","author":"Chen","year":"2013","journal-title":"Comput. Math. Methods Med"},{"key":"2023033004314978100_","doi-asserted-by":"crossref","first-page":"187","DOI":"10.1111\/j.2517-6161.1972.tb00899.x","article-title":"Regression models and life-tables","volume":"34","author":"Cox","year":"1972","journal-title":"J. R. Statist. Soc. Ser. B"},{"key":"2023033004314978100_","author":"Draper","year":"1981","edition":"2nd edn"},{"key":"2023033004314978100_","volume-title":"Mathematical Methods for Digital Computers","author":"Efroymson","year":"1960"},{"key":"2023033004314978100_","doi-asserted-by":"crossref","first-page":"572","DOI":"10.1016\/j.it.2020.04.007","article-title":"The speckled protein (SP) family: Immunity\u2019s chromatin readers","volume":"41","author":"Fraschilla","year":"2020","journal-title":"Trends Immunol"},{"key":"2023033004314978100_","doi-asserted-by":"crossref","first-page":"119","DOI":"10.1006\/jcss.1997.1504","article-title":"A decision-theoretic generalization of on-line learning and an application to boosting","volume":"55","author":"Freund","year":"1997","journal-title":"J. Comput. Syst. Sci"},{"key":"2023033004314978100_","doi-asserted-by":"crossref","first-page":"1189","DOI":"10.1214\/aos\/1013203451","article-title":"Greedy function approximation: A gradient boosting machine","volume":"29","author":"Friedman","year":"2001","journal-title":"Ann. Statist"},{"key":"2023033004314978100_","author":"Greenwell","year":"2007"},{"key":"2023033004314978100_","doi-asserted-by":"crossref","first-page":"2543","DOI":"10.1001\/jama.1982.03320430047030","article-title":"Evaluating the yield of medical tests","volume":"247","author":"Harrell","year":"1982","journal-title":"JAMA"},{"key":"2023033004314978100_","author":"Hastie","year":"2009"},{"key":"2023033004314978100_","first-page":"278","article-title":"Random decision forests","author":"Ho","year":"1995","journal-title":"Proceedings of 3rd International Conference on Document Analysis and Recognition"},{"key":"2023033004314978100_","doi-asserted-by":"crossref","first-page":"1","DOI":"10.2307\/2529336","article-title":"The analysis and selection of variables in linear regression","volume":"32","author":"Hocking","year":"1976","journal-title":"Biometrics"},{"key":"2023033004314978100_","doi-asserted-by":"crossref","first-page":"890","DOI":"10.1002\/sim.8449","article-title":"Mendelian randomization using semiparametric linear transformation models","volume":"39","author":"Huang","year":"2020","journal-title":"Statist. Med"},{"key":"2023033004314978100_","doi-asserted-by":"crossref","first-page":"3090","DOI":"10.1172\/JCI91095","article-title":"DNA methylation-based immune response signature improves patient diagnosis in multiple cancers","volume":"127","author":"Jeschke","year":"2017","journal-title":"J. Clin. Invest"},{"key":"2023033004314978100_","doi-asserted-by":"crossref","first-page":"610","DOI":"10.2174\/092986652507180813110453","article-title":"Advances in usage of venom proteins as diagnostics and therapeutic mediators","volume":"25","author":"Khan","year":"2018","journal-title":"Prot. Pept. Lett"},{"key":"2023033004314978100_","author":"Kuhn","year":"2020"},{"key":"2023033004314978100_","doi-asserted-by":"crossref","first-page":"2403","DOI":"10.1093\/bioinformatics\/bti324","article-title":"Boosting proportional hazards models using smoothing splines, with applications to high-dimensional microarray data","volume":"21","author":"Li","year":"2005","journal-title":"Bioinformatics"},{"key":"2023033004314978100_","author":"Liu","year":"2015"},{"key":"2023033004314978100_","author":"Lundberg","year":"2017"},{"key":"2023033004314978100_","first-page":"512","author":"Mason","year":"1999"},{"key":"2023033004314978100_","doi-asserted-by":"crossref","first-page":"252","DOI":"10.1038\/nrc3239","article-title":"The blockade of immune checkpoints in cancer immunotherapy","volume":"12","author":"Pardoll","year":"2012","journal-title":"Nat. Rev. Cancer"},{"key":"2023033004314978100_","doi-asserted-by":"crossref","first-page":"231","DOI":"10.1080\/10618600.2012.681250","article-title":"A sparse-group lasso","volume":"22","author":"Simon","year":"2013","journal-title":"J. Comput. Graph. Statist"},{"key":"2023033004314978100_","doi-asserted-by":"crossref","first-page":"267","DOI":"10.1111\/j.2517-6161.1996.tb02080.x","article-title":"Regression shrinkage and selection via the lasso","volume":"58","author":"Tibshirani","year":"1996","journal-title":"J. R. Statist. Soc. Ser. B"},{"key":"2023033004314978100_","doi-asserted-by":"crossref","first-page":"385","DOI":"10.1002\/(SICI)1097-0258(19970228)16:4<385::AID-SIM380>3.0.CO;2-3","article-title":"The Lasso method for variable selection in the Cox model","volume":"16","author":"Tibshirani","year":"1997","journal-title":"Statist. Med"},{"key":"2023033004314978100_","doi-asserted-by":"crossref","first-page":"49","DOI":"10.1111\/j.1467-9868.2005.00532.x","article-title":"Model selection and estimation in regression with grouped variables","volume":"68","author":"Yuan","year":"2006","journal-title":"J. R. Statist. Soc. Ser. B"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btab869\/42113890\/btab869.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/38\/6\/1631\/49692935\/btab869.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/38\/6\/1631\/49692935\/btab869.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,9,15]],"date-time":"2024-09-15T05:41:33Z","timestamp":1726378893000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/38\/6\/1631\/6493225"}},"subtitle":[],"editor":[{"given":"Zhiyong","family":"Lu","sequence":"additional","affiliation":[],"role":[{"role":"editor","vocabulary":"crossref"}]}],"short-title":[],"issued":{"date-parts":[[2022,1,3]]},"references-count":28,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2022,3,4]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btab869","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2021.07.06.451263","asserted-by":"object"}]},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2022,3,15]]},"published":{"date-parts":[[2022,1,3]]}}}