{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,3]],"date-time":"2026-06-03T14:24:37Z","timestamp":1780496677533,"version":"3.54.1"},"reference-count":53,"publisher":"Oxford University Press (OUP)","issue":"19","license":[{"start":{"date-parts":[[2016,10,2]],"date-time":"2016-10-02T00:00:00Z","timestamp":1475366400000},"content-version":"vor","delay-in-days":2999,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/2.0\/uk\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2008,10,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: Survival prediction of breast cancer (BC) patients independently of treatment, also known as prognostication, is a complex task since clinically similar breast tumors, in addition to be molecularly heterogeneous, may exhibit different clinical outcomes. In recent years, the analysis of gene expression profiles by means of sophisticated data mining tools emerged as a promising technology to bring additional insights into BC biology and to improve the quality of prognostication. The aim of this work is to assess quantitatively the accuracy of prediction obtained with state-of-the-art data analysis techniques for BC microarray data through an independent and thorough framework.<\/jats:p><jats:p>Results: Due to the large number of variables, the reduced amount of samples and the high degree of noise, complex prediction methods are highly exposed to performance degradation despite the use of cross-validation techniques. Our analysis shows that the most complex methods are not significantly better than the simplest one, a univariate model relying on a single proliferation gene. This result suggests that proliferation might be the most relevant biological process for BC prognostication and that the loss of interpretability deriving from the use of overcomplex methods may be not sufficiently counterbalanced by an improvement of the quality of prediction.<\/jats:p><jats:p>Availability: The comparison study is implemented in an R package called survcomp and is available from http:\/\/www.ulb.ac.be\/di\/map\/bhaibeka\/software\/survcomp\/.<\/jats:p><jats:p>Contact: \u00a0bhaibeka@ulb.ac.be<\/jats:p><jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btn374","type":"journal-article","created":{"date-parts":[[2008,7,18]],"date-time":"2008-07-18T00:25:07Z","timestamp":1216340707000},"page":"2200-2208","source":"Crossref","is-referenced-by-count":194,"title":["A comparative study of survival models for breast cancer prognostication based on microarray data: does a single gene beat them all?"],"prefix":"10.1093","volume":"24","author":[{"given":"B.","family":"Haibe-Kains","sequence":"first","affiliation":[{"name":"1 Machine Learning Group, Department of Computer Science and 2Functional Genomics Unit, Department of Medical Oncology, Institut Jules Bordet, Universit\u00e9 Libre de Bruxelles, Brussels, Belgium"},{"name":"1 Machine Learning Group, Department of Computer Science and 2Functional Genomics Unit, Department of Medical Oncology, Institut Jules Bordet, Universit\u00e9 Libre de Bruxelles, Brussels, Belgium"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"C.","family":"Desmedt","sequence":"additional","affiliation":[{"name":"1 Machine Learning Group, Department of Computer Science and 2Functional Genomics Unit, Department of Medical Oncology, Institut Jules Bordet, Universit\u00e9 Libre de Bruxelles, Brussels, Belgium"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"C.","family":"Sotiriou","sequence":"additional","affiliation":[{"name":"1 Machine Learning Group, Department of Computer Science and 2Functional Genomics Unit, Department of Medical Oncology, Institut Jules Bordet, Universit\u00e9 Libre de Bruxelles, Brussels, Belgium"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"G.","family":"Bontempi","sequence":"additional","affiliation":[{"name":"1 Machine Learning Group, Department of Computer Science and 2Functional Genomics Unit, Department of Medical Oncology, Institut Jules Bordet, Universit\u00e9 Libre de Bruxelles, Brussels, Belgium"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"286","published-online":{"date-parts":[[2008,7,17]]},"reference":[{"key":"2023020211115192200_B1","doi-asserted-by":"crossref","first-page":"1299","DOI":"10.1214\/aos\/1176325630","article-title":"Nearest neighbor estimation of a bivariate distribution under random censoring","volume":"22","author":"Akritas","year":"1994","journal-title":"Ann. Stat"},{"key":"2023020211115192200_B2","doi-asserted-by":"crossref","first-page":"D562","DOI":"10.1093\/nar\/gki022","article-title":"NCBI GEO: mining millions of expression profiles \u2013 database and tool","volume":"33","author":"Barrett","year":"2005","journal-title":"Nucleic Acids Res"},{"key":"2023020211115192200_B3","doi-asserted-by":"crossref","first-page":"293","DOI":"10.1109\/TCBB.2007.1014","article-title":"A blocking strategy to improve gene selection for classification of gene expression data","volume":"4","author":"Bontempi","year":"2007","journal-title":"IEEE\/ACM Trans. Comput. Biol. Bioinform"},{"key":"2023020211115192200_B4","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1175\/1520-0493(1950)078<0001:VOFEIT>2.0.CO;2","article-title":"Verification of forecasts expressed in terms of probabilities","volume":"78","author":"Brier","year":"1950","journal-title":"Mon. Weather Rev"},{"key":"2023020211115192200_B5","doi-asserted-by":"crossref","first-page":"1183","DOI":"10.1093\/jnci\/djj329","article-title":"Validation and clinical utility of a 70-gene prognostic signature for patients with node-negative breast cancer","volume":"98","author":"Buyse","year":"2006","journal-title":"J. Natl. Cancer Inst"},{"key":"2023020211115192200_B6","doi-asserted-by":"crossref","first-page":"187","DOI":"10.1111\/j.2517-6161.1972.tb00899.x","article-title":"Regression models and life tables","volume":"34","author":"Cox","year":"1972","journal-title":"J. R Stat. Soc. Ser B"},{"key":"2023020211115192200_B7","doi-asserted-by":"crossref","first-page":"3207","DOI":"10.1158\/1078-0432.CCR-06-2765","article-title":"Strong time-dependency of the 76-gene prognostic signature for node-negative breast cancer patients in the transbig multi-centre independent validation series","volume":"13","author":"Desmedt","year":"2007","journal-title":"Clin. Cancer Res"},{"key":"2023020211115192200_B8","doi-asserted-by":"crossref","DOI":"10.1158\/1078-0432.CCR-07-4756","article-title":"Biological processes associated with breast cancer clinical outcome depend on the molecular subtypes","volume-title":"Clin. Cancer Res.","author":"Desmedt","year":"2008"},{"key":"2023020211115192200_B9","doi-asserted-by":"crossref","first-page":"77","DOI":"10.1198\/016214502753479248","article-title":"Comparison of discrimination methods for the classification of tumors using gene expression data","volume":"97","author":"Dudoit","year":"2002","journal-title":"J. Am. Stat. Assoc"},{"key":"2023020211115192200_B10","doi-asserted-by":"crossref","first-page":"21058","DOI":"10.1200\/jco.2007.25.18_suppl.21058","article-title":"Transforming genomic grade index (GGI) into a user-friendly qRT-PCR tool which will assist clinicians and patients in optimizing treatment of early breast cancer","volume":"25","author":"Durbecq","year":"2007","journal-title":"Journal of Clinical Oncology"},{"key":"2023020211115192200_B11","doi-asserted-by":"crossref","first-page":"979","DOI":"10.1093\/jnci\/93.13.979","article-title":"National institutes of health consensus development conference statement: adjuvant therapy for breast cancer","volume":"93","author":"Eifel","year":"2001","journal-title":"J. Natl. Cancer Inst"},{"key":"2023020211115192200_B12","doi-asserted-by":"crossref","first-page":"171","DOI":"10.1093\/bioinformatics\/bth469","article-title":"Outcome signature genes in breast cancer: is there a unique set","volume":"21","author":"Ein-Dor","year":"2005","journal-title":"Bioinformatics"},{"key":"2023020211115192200_B13","doi-asserted-by":"crossref","DOI":"10.1200\/JCO.2005.03.9115","article-title":"Multicenter validation of a gene expression\u2013based prognostic signature in lymph node\u2013negative primary breast cancer","volume":"24","author":"Foekens","year":"2006","journal-title":"J. Clin. Oncol"},{"key":"2023020211115192200_B14","doi-asserted-by":"crossref","first-page":"207","DOI":"10.1007\/BF01840834","article-title":"The nottingham prognostic index in primary breast cancer","volume":"22","author":"Galea","year":"1992","journal-title":"Breast Cancer Res. Treat"},{"key":"2023020211115192200_B15","doi-asserted-by":"crossref","DOI":"10.2202\/1544-6115.1034","article-title":"Reproducible research: a bioinformatics case study","volume":"4","author":"Gentleman","year":"2005","journal-title":"Stat. Appl. Genet. Mol. Biol"},{"key":"2023020211115192200_B16","doi-asserted-by":"crossref","first-page":"572","DOI":"10.1093\/biomet\/88.2.572","article-title":"On functional misspecification of covariates in the cox regression model","volume":"88","author":"Gerds","year":"2001","journal-title":"Biometrika"},{"key":"2023020211115192200_B17","doi-asserted-by":"crossref","first-page":"1029","DOI":"10.1002\/bimj.200610301","article-title":"Consistent estimation of the expected brier score in general survival models with right-censored event times","volume":"6","author":"Gerds","year":"2006","journal-title":"Biometrical J"},{"key":"2023020211115192200_B18","doi-asserted-by":"crossref","first-page":"3357","DOI":"10.1200\/JCO.2003.04.576","article-title":"Meeting highlights: updated international expert consensus on the primary therapy of early breast cancer","volume":"21","author":"Goldhirsh","year":"2003","journal-title":"J. Clin.Oncol"},{"key":"2023020211115192200_B19","doi-asserted-by":"crossref","first-page":"2529","DOI":"10.1002\/(SICI)1097-0258(19990915\/30)18:17\/18<2529::AID-SIM274>3.0.CO;2-5","article-title":"Assessment and comparison of prognostic classification schemes for survival data","volume":"18","author":"Graf","year":"1999","journal-title":"Stat. Med"},{"key":"2023020211115192200_B20","first-page":"237","article-title":"Computational intelligence in clinical oncology : lessons learned from an analysis of a clinical study","volume-title":"Applications of Computational Intelligence in Biomedicine and Bioinformatics: Current Trends and Open Problems of Studies in Computational Intelligence.","author":"Haibe-Kains","year":"2008"},{"key":"2023020211115192200_B21","doi-asserted-by":"crossref","first-page":"57","DOI":"10.1016\/S0092-8674(00)81683-9","article-title":"The hallmarks of cancer","volume":"100","author":"Hanahan","year":"2000","journal-title":"Cell"},{"key":"2023020211115192200_B22","doi-asserted-by":"crossref","first-page":"361","DOI":"10.1002\/(SICI)1097-0258(19960229)15:4<361::AID-SIM168>3.0.CO;2-4","article-title":"Tutorial in biostatistics: multivariable prognostic models: issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors","volume":"15","author":"Harrell","year":"1996","journal-title":"Stat. Med"},{"key":"2023020211115192200_B23","doi-asserted-by":"crossref","first-page":"337","DOI":"10.1111\/j.0006-341X.2000.00337.x","article-title":"Time-dependent ROC curves for censored survival data and a diagnostic marker","volume":"56","author":"Heagerty","year":"2000","journal-title":"Biometrics"},{"key":"2023020211115192200_B24","doi-asserted-by":"crossref","first-page":"350","DOI":"10.2307\/2289186","article-title":"Statistical methods for meta-analysis","volume":"82","author":"Hedges","year":"1987","journal-title":"J. Am. Stat. Assoc"},{"key":"2023020211115192200_B25","doi-asserted-by":"crossref","first-page":"457","DOI":"10.1080\/01621459.1958.10501452","article-title":"Nonparametric estimation from incomplete observations","volume":"53","author":"Kaplan","year":"1958","journal-title":"J. Am. Stat. Assoc"},{"key":"2023020211115192200_B26","doi-asserted-by":"crossref","first-page":"226","DOI":"10.1109\/34.667881","article-title":"On combining classifiers","volume":"20","author":"Kittler","year":"1998","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell"},{"key":"2023020211115192200_B27","doi-asserted-by":"crossref","first-page":"1479","DOI":"10.1136\/bmj.322.7300.1479","article-title":"Forest plots: trying to see the wood and the trees","volume":"322","author":"Lewis","year":"2001","journal-title":"Brit. Med. J"},{"key":"2023020211115192200_B28","doi-asserted-by":"crossref","first-page":"1239","DOI":"10.1200\/JCO.2006.07.1522","article-title":"Definition of clinically distinct molecular subtypes in estrogen receptor positive breast carcinomas through use of genomic grade","volume":"25","author":"Loi","year":"2007","journal-title":"J. Clin. Oncol"},{"key":"2023020211115192200_B29","doi-asserted-by":"crossref","first-page":"488","DOI":"10.1016\/S0140-6736(05)17866-0","article-title":"Prediction of cancer outcome with microarrays: a multiple random validation strategy","volume":"365","author":"Michiels","year":"2005","journal-title":"Lancet"},{"key":"2023020211115192200_B30","doi-asserted-by":"crossref","first-page":"13550","DOI":"10.1073\/pnas.0506230102","article-title":"An expression signature for p53 status in human breast cancer predicts mutation status, transcriptional effects, and patient survival","volume":"102","author":"Miller","year":"2005","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023020211115192200_B31","doi-asserted-by":"crossref","first-page":"2716","DOI":"10.1200\/JCO.2005.06.178","article-title":"Population-based validation of the prognostic model adjuvant! for early breast cancer","volume":"23","author":"Olivotto","year":"2005","journal-title":"J. Clin. Oncol"},{"key":"2023020211115192200_B32","doi-asserted-by":"crossref","first-page":"659","DOI":"10.1111\/j.1467-9868.2007.00607.x","article-title":"L1 regularization path algorithm for generalized linear models","volume":"69","author":"Park","year":"2007","journal-title":"J. R. Stat. Soc"},{"key":"2023020211115192200_B33","doi-asserted-by":"crossref","first-page":"2109","DOI":"10.1002\/sim.1802","article-title":"Overall C as a measure of discrimination in survival analysis: model specic population value and condence interval estimation","volume":"23","author":"Pencina","year":"2004","journal-title":"Stat. Med"},{"key":"2023020211115192200_B34","doi-asserted-by":"crossref","first-page":"747","DOI":"10.1038\/35021093","article-title":"Molecular portraits of human breast tumours","volume":"406","author":"Perou","year":"2000","journal-title":"Nature"},{"key":"2023020211115192200_B35","volume-title":"R: A language and environment for statistical computing.","author":"R Development Core Team","year":"2007"},{"key":"2023020211115192200_B36","first-page":"13","article-title":"Histological typing of breast tumors","volume":"2","author":"Scarff","year":"1968","journal-title":"International histological classification of tumours"},{"key":"2023020211115192200_B37","doi-asserted-by":"crossref","first-page":"1768","DOI":"10.1093\/bioinformatics\/btm232","article-title":"Assessment of survival prediction models based on microarray data","volume":"23","author":"Schumacher","year":"2007","journal-title":"Bioinformatics"},{"key":"2023020211115192200_B38","doi-asserted-by":"crossref","first-page":"7332","DOI":"10.1200\/JCO.2005.02.8712","article-title":"Roadmap for developing and validating therapeutically relevant genomic classifiers","volume":"23","author":"Simon","year":"2005","journal-title":"J. Clin. Oncol"},{"key":"2023020211115192200_B39","doi-asserted-by":"crossref","first-page":"545","DOI":"10.1038\/nrc2173","article-title":"Taking gene-expression profiling to the clinic: when will molecular signatures become relevant to patient care","volume":"7","author":"Sotiriou","year":"2007","journal-title":"Nat. Cancer Rev"},{"key":"2023020211115192200_B40","doi-asserted-by":"crossref","first-page":"10393","DOI":"10.1073\/pnas.1732912100","article-title":"Breast cancer classification and prognosis based on gene expression profiles from a population-based study","volume":"100","author":"Sotiriou","year":"2003","journal-title":"Proc. Natl Acad. Sci"},{"key":"2023020211115192200_B41","first-page":"S86","article-title":"Comprehensive molecular analysis of several prognostic signatures using molecular indices related to hallmarks of breast cancer: proliferation index appears to be the most significant component of all signatures","volume-title":"Breast Cancer Research and Treatment.","author":"Sotiriou","year":"2006"},{"key":"2023020211115192200_B42","doi-asserted-by":"crossref","first-page":"262","DOI":"10.1093\/jnci\/djj052","article-title":"Gene expression profiling in breast cancer: understanding the molecular basis of histologic grade to improve prognosis","volume":"f98","author":"Sotiriou","year":"2006","journal-title":"J. Natl Cancer Inst"},{"key":"2023020211115192200_B43","doi-asserted-by":"crossref","first-page":"10581","DOI":"10.1200\/jco.2007.25.18_suppl.10581","article-title":"Biological mechanisms that trigger breast cancer (bc) tumor progression are molecular subtype dependent. ASCO Annual Meeting Proceedings","volume":"25","author":"Sotiriou","year":"2007","journal-title":"J. Clin. Oncol"},{"key":"2023020211115192200_B44","doi-asserted-by":"crossref","first-page":"1285","DOI":"10.1126\/science.3287615","article-title":"Measuring the accuracy of diagnostic systems","volume":"240","author":"Sweets","year":"1988","journal-title":"Science"},{"key":"2023020211115192200_B45","doi-asserted-by":"crossref","DOI":"10.1007\/978-1-4757-3294-8","article-title":"Modeling Survival Data: Extending the Cox Model","volume-title":"Statistics for Biology and Health Series.","author":"Therneau","year":"2000"},{"key":"2023020211115192200_B46","doi-asserted-by":"crossref","first-page":"5355","DOI":"10.1158\/1078-0432.CCR-07-0249","article-title":"Comparison of gene sets for expression profiling: prediction of metastasis from low-malignant breast cancer","volume":"13","author":"Thomassen","year":"2007","journal-title":"Clin. Cancer Res"},{"key":"2023020211115192200_B47","doi-asserted-by":"crossref","first-page":"1999","DOI":"10.1056\/NEJMoa021967","article-title":"A gene expression signature as a predictor of survival in breast cancer","volume":"347","author":"van de Vijver","year":"2002","journal-title":"N. Engl. J. Med"},{"key":"2023020211115192200_B48","doi-asserted-by":"crossref","first-page":"3201","DOI":"10.1002\/sim.2353","article-title":"Cross-validated cox regression on microarray gene expression data","volume":"25","author":"van Houwelingen","year":"2006","journal-title":"Stat. Med"},{"key":"2023020211115192200_B49","doi-asserted-by":"crossref","first-page":"530","DOI":"10.1038\/415530a","article-title":"Gene expression profiling predicts clinical outcome of breast cancer","volume":"415","author":"van't Veer","year":"2002","journal-title":"Nature"},{"key":"2023020211115192200_B50","doi-asserted-by":"crossref","first-page":"1471","DOI":"10.1186\/1471-2105-7-91","article-title":"Bias in error estimation when using cross-validation for model selection","volume":"7","author":"Varma","year":"2006","journal-title":"BMC Bioinformatics"},{"key":"2023020211115192200_B51","doi-asserted-by":"crossref","first-page":"671","DOI":"10.1016\/S0140-6736(05)17947-1","article-title":"Gene-expression profiles to predict distant metastasis of lymph-node-negative primary breast cancer","volume":"365","author":"Wang","year":"2005","journal-title":"Lancet"},{"key":"2023020211115192200_B52","doi-asserted-by":"crossref","first-page":"80","DOI":"10.2307\/3001968","article-title":"Individual comparisons by ranking methods","volume":"1","author":"Wilcoxon","year":"1945","journal-title":"Biometrics. Bull"},{"key":"2023020211115192200_B53","doi-asserted-by":"crossref","first-page":"182","DOI":"10.1186\/1471-2407-7-182","article-title":"Pathway analysis of gene signatures predicting metastasis of node-negative primary breast cancer","volume":"7","author":"Yu","year":"2007","journal-title":"BMC Cancer"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/24\/19\/2200\/49051271\/bioinformatics_24_19_2200.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/24\/19\/2200\/49051271\/bioinformatics_24_19_2200.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,1,31]],"date-time":"2025-01-31T02:16:02Z","timestamp":1738289762000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/24\/19\/2200\/245858"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2008,7,17]]},"references-count":53,"journal-issue":{"issue":"19","published-print":{"date-parts":[[2008,10,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btn374","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2008,10,1]]},"published":{"date-parts":[[2008,7,17]]}}}