{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,26]],"date-time":"2026-02-26T20:34:36Z","timestamp":1772138076391,"version":"3.50.1"},"reference-count":32,"publisher":"Oxford University Press (OUP)","issue":"14","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2007,7,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>Motivation: In the process of developing risk prediction models, various steps of model building and model selection are involved. If this process is not adequately controlled, overfitting may result in serious overoptimism leading to potentially erroneous conclusions.<\/jats:p>\n                  <jats:p>Methods: For right censored time-to-event data, we estimate the prediction error for assessing the performance of a risk prediction model (Gerds and Schumacher, 2006; Graf et al., 1999). Furthermore, resampling methods are used to detect overfitting and resulting overoptimism and to adjust the estimates of prediction error (Gerds and Schumacher, 2007).<\/jats:p>\n                  <jats:p>Results: We show how and to what extent the methodology can be used in situations characterized by a large number of potential predictor variables where overfitting may be expected to be overwhelming. This is illustrated by estimating the prediction error of some recently proposed techniques for fitting a multivariate Cox regression model applied to the data of a prognostic study in patients with diffuse large-B-cell lymphoma (DLBCL).<\/jats:p>\n                  <jats:p>Availability: Resampling-based estimation of prediction error curves is implemented in an R package called pec available from the authors.<\/jats:p>\n                  <jats:p>Contact: \u00a0sec@imbi.uni-freiburg.de<\/jats:p>","DOI":"10.1093\/bioinformatics\/btm232","type":"journal-article","created":{"date-parts":[[2007,5,7]],"date-time":"2007-05-07T20:13:40Z","timestamp":1178568820000},"page":"1768-1774","source":"Crossref","is-referenced-by-count":87,"title":["Assessment of survival prediction models based on microarray data"],"prefix":"10.1093","volume":"23","author":[{"given":"Martin","family":"Schumacher","sequence":"first","affiliation":[{"name":"1 Department of Medical Biometry and Statistics, Institute of Medical Biometry and Medical Informatics, University Medical Center Freiburg and 2Freiburg Center of Data Analysis and Model Building, University Freiburg, Germany"}]},{"given":"Harald","family":"Binder","sequence":"additional","affiliation":[{"name":"1 Department of Medical Biometry and Statistics, Institute of Medical Biometry and Medical Informatics, University Medical Center Freiburg and 2Freiburg Center of Data Analysis and Model Building, University Freiburg, Germany"}]},{"given":"Thomas","family":"Gerds","sequence":"additional","affiliation":[{"name":"1 Department of Medical Biometry and Statistics, Institute of Medical Biometry and Medical Informatics, University Medical Center Freiburg and 2Freiburg Center of Data Analysis and Model Building, University Freiburg, Germany"}]}],"member":"286","published-online":{"date-parts":[[2007,5,7]]},"reference":[{"key":"2023041105234278100_","doi-asserted-by":"crossref","DOI":"10.1007\/978-1-4612-4348-9","volume-title":"Statistical Models Based on Counting Processes","author":"Andersen","year":"1993"},{"key":"2023041105234278100_","doi-asserted-by":"crossref","first-page":"374","DOI":"10.1093\/bioinformatics\/btg419","article-title":"Is cross-validation valid for small-sample microarray classification?","volume":"20","author":"Braga-Neto","year":"2004","journal-title":"Bioinformatics"},{"key":"2023041105234278100_","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1175\/1520-0493(1950)078<0001:VOFEIT>2.0.CO;2","article-title":"Verification of forecasts expressed in terms of probability","volume":"78","author":"Brier","year":"1950","journal-title":"Mon. Weather Rev."},{"key":"2023041105234278100_","doi-asserted-by":"crossref","first-page":"187","DOI":"10.1111\/j.2517-6161.1972.tb00899.x","article-title":"Regression models and life tables","volume":"34","author":"Cox","year":"1972","journal-title":"J. R. Stat. Soc. B."},{"key":"2023041105234278100_","doi-asserted-by":"crossref","first-page":"316","DOI":"10.1080\/01621459.1983.10477973","article-title":"Estimating the error rate of a prediction rule: improvement on cross-validation","volume":"78","author":"Efron","year":"1983","journal-title":"J. Am. Stat. Assoc."},{"key":"2023041105234278100_","first-page":"548","article-title":"Improvements on cross-validation: the 0.632+ bootstrap method","volume":"92","author":"Efron","year":"1997","journal-title":"J. Am. Stat. Assoc."},{"key":"2023041105234278100_","doi-asserted-by":"crossref","first-page":"407","DOI":"10.1214\/009053604000000067","article-title":"Least angle regression","volume":"32","author":"Efron","year":"2004","journal-title":"Ann. Stat."},{"key":"2023041105234278100_","doi-asserted-by":"crossref","first-page":"1979","DOI":"10.1093\/bioinformatics\/bti294","article-title":"Estimating misclassification error with small samples via bootstrap cross-validation","volume":"21","author":"Fu","year":"2005","journal-title":"Bioinformatics"},{"key":"2023041105234278100_","doi-asserted-by":"crossref","first-page":"122","DOI":"10.1080\/01621459.1994.10476452","article-title":"An interpretation of partial least squares","volume":"89","author":"Garthwaite","year":"1994","journal-title":"J. Am. Stat. Assoc."},{"key":"2023041105234278100_","doi-asserted-by":"crossref","first-page":"572","DOI":"10.1093\/biomet\/88.2.572","article-title":"On functional misspecification of covariates in the cox regression model","volume":"88","author":"Gerds","year":"2001","journal-title":"Biometrika"},{"key":"2023041105234278100_","doi-asserted-by":"crossref","first-page":"1029","DOI":"10.1002\/bimj.200610301","article-title":"Consistent estimation of the expected brier score in general survival models with right-censored event times","volume":"48","author":"Gerds","year":"2006","journal-title":"Biom. J."},{"key":"2023041105234278100_","doi-asserted-by":"crossref","DOI":"10.1111\/j.1541-0420.2007.00832.x","article-title":"Efron-type measures of prediction error for survival analysis","author":"Gerds","year":"2007","journal-title":"Biometrics"},{"key":"2023041105234278100_","doi-asserted-by":"crossref","first-page":"2529","DOI":"10.1002\/(SICI)1097-0258(19990915\/30)18:17\/18<2529::AID-SIM274>3.0.CO;2-5","article-title":"Assessment and comparison of prognostic classification schemes for survival data","volume":"18","author":"Graf","year":"1999","journal-title":"Stat. Med."},{"key":"2023041105234278100_","doi-asserted-by":"crossref","first-page":"3001","DOI":"10.1093\/bioinformatics\/bti422","article-title":"Penalized Cox regression analysis in the high-dimensional and low-sample size settings, with applications to microarray gene expression data","volume":"21","author":"Gui","year":"2005","journal-title":"Bioinformatics"},{"key":"2023041105234278100_","doi-asserted-by":"crossref","first-page":"92","DOI":"10.1111\/j.0006-341X.2005.030814.x","article-title":"Survival model predictive accuracy and roc curves","volume":"61","author":"Heagerty","year":"2005","journal-title":"Biometrics"},{"key":"2023041105234278100_","doi-asserted-by":"crossref","first-page":"337","DOI":"10.1111\/j.0006-341X.2000.00337.x","article-title":"Time-dependent roc curves for censored survival data and a diagnostic marker","volume":"56","author":"Heagerty","year":"2000","journal-title":"Biometrics"},{"key":"2023041105234278100_","doi-asserted-by":"crossref","first-page":"55","DOI":"10.1080\/00401706.1970.10488634","article-title":"Ridge regression: biased estimation for nonorthogonal problems","volume":"12","author":"Hoerl","year":"1970","journal-title":"Technometrics"},{"key":"2023041105234278100_","doi-asserted-by":"crossref","first-page":"885","DOI":"10.1200\/JCO.2002.20.4.885","article-title":"Statistical prediction models, artificial neural networks, and the sophism \u2018I am a patient, not a statistic\u2019","volume":"20","author":"Kattan","year":"2002","journal-title":"J. Clin. Oncol."},{"key":"2023041105234278100_","doi-asserted-by":"crossref","first-page":"201","DOI":"10.1080\/00031305.1991.10475802","article-title":"Explained residual variation, explained risk, and goodness of fit","volume":"45","author":"Korn","year":"1991","journal-title":"Am. Stat."},{"issue":"Suppl. 1","key":"2023041105234278100_","doi-asserted-by":"crossref","first-page":"I208","DOI":"10.1093\/bioinformatics\/bth900","article-title":"Partial Cox regression analysis for high-dimensional microarray gene expression data","volume":"20","author":"Li","year":"2004","journal-title":"Bioinformatics"},{"key":"2023041105234278100_","doi-asserted-by":"crossref","first-page":"3301","DOI":"10.1093\/bioinformatics\/bti499","article-title":"Prediction error estimation: a comparison of resampling methods","volume":"21","author":"Molinaro","year":"2005","journal-title":"Bioinformatics"},{"key":"2023041105234278100_","article-title":"l \u00a01 regularization path algorithm for generalized linear models","author":"Park","year":"2006","journal-title":"Technical report"},{"key":"2023041105234278100_","doi-asserted-by":"crossref","first-page":"1937","DOI":"10.1056\/NEJMoa012914","article-title":"The use of molecular profiling to predict survival after chemotherapy for diffuse large-b-cell lymphoma","volume":"346","author":"Rosenwald","year":"2002","journal-title":"N. Engl. J. Med."},{"key":"2023041105234278100_","first-page":"289","article-title":"Prognostic factor studies","volume-title":"Handbook of Statistics in Clinical Oncology","author":"Schumacher","year":"2006","edition":"2nd"},{"key":"2023041105234278100_","doi-asserted-by":"crossref","first-page":"268","DOI":"10.1093\/biostatistics\/kxj006","article-title":"Microarray gene expression data with linked survival phenotypes: diffuse large-B-cell lymphoma revisited","volume":"7","author":"Segal","year":"2006","journal-title":"Biostatistics"},{"key":"2023041105234278100_","doi-asserted-by":"crossref","first-page":"7332","DOI":"10.1200\/JCO.2005.02.8712","article-title":"Roadmap for developing and validating therapeutically relevant genomic classifiers","volume":"23","author":"Simon","year":"2005","journal-title":"J. Clin. Oncol."},{"key":"2023041105234278100_","doi-asserted-by":"crossref","first-page":"14","DOI":"10.1093\/jnci\/95.1.14","article-title":"Pitfalls in the use of DNA microarray data for diagnostic and prognostic classification","volume":"95","author":"Simon","year":"2003","journal-title":"J. Natl Cancer Inst."},{"key":"2023041105234278100_","doi-asserted-by":"crossref","first-page":"267","DOI":"10.1111\/j.2517-6161.1996.tb02080.x","article-title":"Regression shrinkage and selection via the lasso","volume":"58","author":"Tibshirani","year":"1996","journal-title":"J. R. Stat. Soc. Ser. B."},{"key":"2023041105234278100_","doi-asserted-by":"crossref","DOI":"10.1007\/978-0-387-21700-0","volume-title":"Unified Methods for Censored Longitudinal Data and Causality","author":"Van der Laan","year":"2003"},{"key":"2023041105234278100_","doi-asserted-by":"crossref","first-page":"3201","DOI":"10.1002\/sim.2353","article-title":"Cross-validated cox regression on microarray gene expression data","volume":"25","author":"Van Houwelingen","year":"2006","journal-title":"Stat. Med."},{"key":"2023041105234278100_","doi-asserted-by":"crossref","first-page":"35","DOI":"10.1002\/bimj.200410011","article-title":"A comparison of nonparametric error rate estimation methods in classification problems","volume":"46","author":"Wehberg","year":"2004","journal-title":"Biome. J."},{"key":"2023041105234278100_","first-page":"391","article-title":"Estimation of principal components and related models by iterative least squares","volume-title":"Multivariate Analysis","author":"Wold","year":"1966"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/23\/14\/1768\/49815068\/bioinformatics_23_14_1768.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/23\/14\/1768\/49815068\/bioinformatics_23_14_1768.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,1,15]],"date-time":"2025-01-15T21:41:01Z","timestamp":1736977261000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/23\/14\/1768\/188061"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2007,5,7]]},"references-count":32,"journal-issue":{"issue":"14","published-print":{"date-parts":[[2007,7,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btm232","relation":{"has-review":[{"id-type":"doi","id":"10.3410\/f.1088585.542666","asserted-by":"object"}]},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2007,7,15]]},"published":{"date-parts":[[2007,5,7]]}}}