{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,3]],"date-time":"2026-03-03T15:46:55Z","timestamp":1772552815409,"version":"3.50.1"},"reference-count":52,"publisher":"Oxford University Press (OUP)","issue":"3","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2011,2,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: We propose a Bayesian ensemble method for survival prediction in high-dimensional gene expression data. We specify a fully Bayesian hierarchical approach based on an ensemble \u2018sum-of-trees\u2019 model and illustrate our method using three popular survival models. Our non-parametric method incorporates both additive and interaction effects between genes, which results in high predictive accuracy compared with other methods. In addition, our method provides model-free variable selection of important prognostic markers based on controlling the false discovery rates; thus providing a unified procedure to select relevant genes and predict survivor functions.<\/jats:p><jats:p>Results: We assess the performance of our method several simulated and real microarray datasets. We show that our method selects genes potentially related to the development of the disease as well as yields predictive performance that is very competitive to many other existing methods.<\/jats:p><jats:p>Availability: \u00a0http:\/\/works.bepress.com\/veera\/1\/.<\/jats:p><jats:p>Contact: \u00a0veera@mdanderson.org<\/jats:p><jats:p>Supplementary Information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btq660","type":"journal-article","created":{"date-parts":[[2010,12,10]],"date-time":"2010-12-10T01:29:53Z","timestamp":1291944593000},"page":"359-367","source":"Crossref","is-referenced-by-count":61,"title":["Bayesian ensemble methods for survival prediction in gene expression data"],"prefix":"10.1093","volume":"27","author":[{"given":"Vinicius","family":"Bonato","sequence":"first","affiliation":[{"name":"1 Pfizer Inc., Groton, CT 06340, 2Department of Biostatistics, 3Department of Bioinformatics and Computational Biology, 4Department of Radiation Oncology and 5Department of Pathology, The University of Texas, M. D. Anderson Cancer Center, Houston, TX 77030, USA"}]},{"given":"Veerabhadran","family":"Baladandayuthapani","sequence":"additional","affiliation":[{"name":"1 Pfizer Inc., Groton, CT 06340, 2Department of Biostatistics, 3Department of Bioinformatics and Computational Biology, 4Department of Radiation Oncology and 5Department of Pathology, The University of Texas, M. D. Anderson Cancer Center, Houston, TX 77030, USA"}]},{"given":"Bradley M.","family":"Broom","sequence":"additional","affiliation":[{"name":"1 Pfizer Inc., Groton, CT 06340, 2Department of Biostatistics, 3Department of Bioinformatics and Computational Biology, 4Department of Radiation Oncology and 5Department of Pathology, The University of Texas, M. D. Anderson Cancer Center, Houston, TX 77030, USA"}]},{"given":"Erik P.","family":"Sulman","sequence":"additional","affiliation":[{"name":"1 Pfizer Inc., Groton, CT 06340, 2Department of Biostatistics, 3Department of Bioinformatics and Computational Biology, 4Department of Radiation Oncology and 5Department of Pathology, The University of Texas, M. D. Anderson Cancer Center, Houston, TX 77030, USA"}]},{"given":"Kenneth D.","family":"Aldape","sequence":"additional","affiliation":[{"name":"1 Pfizer Inc., Groton, CT 06340, 2Department of Biostatistics, 3Department of Bioinformatics and Computational Biology, 4Department of Radiation Oncology and 5Department of Pathology, The University of Texas, M. D. Anderson Cancer Center, Houston, TX 77030, USA"}]},{"given":"Kim-Anh","family":"Do","sequence":"additional","affiliation":[{"name":"1 Pfizer Inc., Groton, CT 06340, 2Department of Biostatistics, 3Department of Bioinformatics and Computational Biology, 4Department of Radiation Oncology and 5Department of Pathology, The University of Texas, M. D. Anderson Cancer Center, Houston, TX 77030, USA"}]}],"member":"286","published-online":{"date-parts":[[2010,12,8]]},"reference":[{"key":"2023012511553610100_B1","doi-asserted-by":"crossref","first-page":"709","DOI":"10.1007\/BF00209493","article-title":"Redefinition of the coding sequence of the MXI1 gene and identification of a polymorphic repeat in the 3-prime non-coding region that allows the detection of loss of heterozygosity of chromosome 10q25 in glioblastomas","volume":"95","author":"Albarosa","year":"1995","journal-title":"Hum. Genet."},{"key":"2023012511553610100_B2","doi-asserted-by":"crossref","first-page":"3686","DOI":"10.1158\/1078-0432.CCR-04-2398","article-title":"Patterns of gene expression that characterize long-term survival in advanced stage serous ovarian cancers","volume":"11","author":"Berchuck","year":"2005","journal-title":"Clin. Cancer Res."},{"key":"2023012511553610100_B3","doi-asserted-by":"crossref","first-page":"18","DOI":"10.1186\/1471-2105-10-18","article-title":"Incorporating pathway information into boosting estimation of high-dimensional risk prediction models","volume":"10","author":"Binder","year":"2009","journal-title":"BMC Bioinformatics"},{"key":"2023012511553610100_B4","doi-asserted-by":"crossref","first-page":"123","DOI":"10.1007\/BF00058655","article-title":"Bagging predictors","volume":"24","author":"Breiman","year":"1996","journal-title":"Mach. Learn."},{"key":"2023012511553610100_B5","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1023\/A:1010933404324","article-title":"Random forests","volume":"45","author":"Breiman","year":"2001","journal-title":"Mach. Learn."},{"key":"2023012511553610100_B6","doi-asserted-by":"crossref","first-page":"326","DOI":"10.1504\/IJBRA.2010.035997","article-title":"Bagged gene shaving for the robust clustering of high-throughput data","volume":"6","author":"Broom","year":"2010","journal-title":"Int. J. Bioinformatics Res. Appl."},{"key":"2023012511553610100_B7","doi-asserted-by":"crossref","first-page":"1061","DOI":"10.1038\/nature07385","article-title":"Comprehensive genomic characterization defines human glioblastoma genes and core pathways","volume":"455","author":"The Cancer Genome Atlas Research Network","year":"2008","journal-title":"Nature"},{"key":"2023012511553610100_B8","doi-asserted-by":"crossref","first-page":"5439","DOI":"10.1182\/blood-2006-11-058040","article-title":"Prognostic significance of host immune gene polymorphisms in follicular lymphoma survival","volume":"109","author":"Cerhan","year":"2007","journal-title":"Blood"},{"key":"2023012511553610100_B9","doi-asserted-by":"crossref","first-page":"935","DOI":"10.1080\/01621459.1998.10473750","article-title":"Bayesian CART model search (with discussion)","volume":"93","author":"Chipman","year":"1998","journal-title":"J. Am. Stat. Assoc."},{"key":"2023012511553610100_B10","doi-asserted-by":"crossref","first-page":"266","DOI":"10.1214\/09-AOAS285","article-title":"BART: Bayesian Additive Regression Trees","volume":"4","author":"Chipman","year":"2010","journal-title":"Ann. Appl. Stat."},{"key":"2023012511553610100_B11","doi-asserted-by":"crossref","first-page":"238","DOI":"10.1016\/j.stamet.2007.09.003","article-title":"Bayesian Weibull tree models for survival analysis of clinico-genomic data","volume":"5","author":"Clarke","year":"2008","journal-title":"Stat. Methodol."},{"key":"2023012511553610100_B12","doi-asserted-by":"crossref","first-page":"187","DOI":"10.1111\/j.2517-6161.1972.tb00899.x","article-title":"Regression models and life tables","volume":"34","author":"Cox","year":"1972","journal-title":"J. R. Stat. Soc. B"},{"key":"2023012511553610100_B13","doi-asserted-by":"crossref","first-page":"259","DOI":"10.1111\/j.1541-0420.2006.00660.x","article-title":"Predicting patient survival from microarray data by accelerated failure time modeling using partial least squares and LASSO","volume":"63","author":"Datta","year":"2007","journal-title":"Biometrics"},{"key":"2023012511553610100_B14","doi-asserted-by":"crossref","first-page":"363","DOI":"10.1093\/biomet\/85.2.363","article-title":"A Bayesian CART algorithm","volume":"85","author":"Denison","year":"1998","journal-title":"Biometrika"},{"key":"2023012511553610100_B15","doi-asserted-by":"crossref","first-page":"1499","DOI":"10.1038\/nbt1205-1499","article-title":"How does gene expression clustering work?","volume":"23","author":"D'haeseleer","year":"2005","journal-title":"Nat. Biotechnol."},{"key":"2023012511553610100_B16","doi-asserted-by":"crossref","first-page":"1189","DOI":"10.1214\/aos\/1013203451","article-title":"Greedy function approximation: a gradient boosting machine","volume":"29","author":"Friedman","year":"2001","journal-title":"Ann. Stat."},{"key":"2023012511553610100_B17","doi-asserted-by":"crossref","first-page":"13","DOI":"10.1016\/j.molcel.2005.03.008","article-title":"PHLPP: a phosphatase that directly dephosphorylates Akt, promotes apoptosis, and suppresses tumor growth","volume":"18","author":"Gao","year":"2005","journal-title":"Mol. Cell"},{"key":"2023012511553610100_B18","volume-title":"Markov Chain Monte Carlo in Practice: Interdisciplinary Statistics","author":"Gilks","year":"1996"},{"key":"2023012511553610100_B19","doi-asserted-by":"crossref","first-page":"2529","DOI":"10.1002\/(SICI)1097-0258(19990915\/30)18:17\/18<2529::AID-SIM274>3.0.CO;2-5","article-title":"Assessment and comparison of prognostic classification schemes for survival data","volume":"18","author":"Graf","year":"1999","journal-title":"Stat. Med."},{"key":"2023012511553610100_B20","doi-asserted-by":"crossref","first-page":"3001","DOI":"10.1093\/bioinformatics\/bti422","article-title":"Penalized Cox regression analysis in the high-dimensional and low-sample size settings, with applications to microarray gene expression data","volume":"21","author":"Gui","year":"2005","journal-title":"Bioinformatics"},{"key":"2023012511553610100_B21","doi-asserted-by":"crossref","DOI":"10.1007\/978-1-4757-3462-1","volume-title":"Regression modeling strategies, with applications to linear models, survival analysis and logistic regression","author":"Harrell","year":"2001"},{"key":"2023012511553610100_B22","doi-asserted-by":"crossref","DOI":"10.1186\/gb-2000-1-2-research0003","article-title":"\u2018Gene shaving\u2019 as a method for identifying distinct sets of genes with similar expression patterns","volume":"1","author":"Hastie","year":"2000","journal-title":"Genome Biol."},{"key":"2023012511553610100_B23","doi-asserted-by":"crossref","first-page":"43152","DOI":"10.1074\/jbc.M204858200","article-title":"Cloning and characterization of GRIPE, a novel interacting partner of the transcription factor E12 in developing mouse forebrain","volume":"277","author":"Heng","year":"2002","journal-title":"J. Biol. Chem."},{"key":"2023012511553610100_B24","doi-asserted-by":"crossref","first-page":"77","DOI":"10.1002\/sim.1593","article-title":"Bagging survival trees","volume":"23","author":"Hothorn","year":"2004","journal-title":"Stat. Med."},{"key":"2023012511553610100_B25","doi-asserted-by":"crossref","first-page":"355","DOI":"10.1093\/biostatistics\/kxj011","article-title":"Survival ensembles","volume":"7","author":"Hothorn","year":"2006","journal-title":"Biostatistics"},{"key":"2023012511553610100_B26","doi-asserted-by":"crossref","DOI":"10.1007\/978-1-4757-3447-8","volume-title":"Bayesian Survival Analysis.","author":"Ibrahim","year":"2001"},{"key":"2023012511553610100_B27","doi-asserted-by":"crossref","first-page":"591","DOI":"10.1198\/016214504000000638","article-title":"Relative risk forests for exercise heart rate recovery as a predictor of mortality","volume":"99","author":"Ishwaran","year":"2004","journal-title":"J. Am. Stat. Assoc."},{"key":"2023012511553610100_B28","doi-asserted-by":"crossref","first-page":"841","DOI":"10.1214\/08-AOAS169","article-title":"Random survival forests","volume":"2","author":"Ishwaran","year":"2008","journal-title":"Ann. Appl. Stat."},{"key":"2023012511553610100_B29","doi-asserted-by":"crossref","first-page":"214","DOI":"10.1111\/j.2517-6161.1978.tb01666.x","article-title":"Non-parametric Bayesian analysis of survival time data","volume":"40","author":"Kalbfleisch","year":"1978","journal-title":"J. R. Stat. Soc. B"},{"key":"2023012511553610100_B30","volume-title":"Survival Analysis - Techniques for Censored and Truncated Data.","author":"Klein","year":"1997"},{"key":"2023012511553610100_B31","first-page":"756","article-title":"Bayesian methods for variable selection in survival models with application to DNA microarray data","volume":"66","author":"Lee","year":"2004","journal-title":"Sankhya"},{"key":"2023012511553610100_B32","doi-asserted-by":"crossref","first-page":"869","DOI":"10.1016\/j.csda.2004.03.017","article-title":"An extensive comparison of recent classification tools applied to microarray data","volume":"48","author":"Lee","year":"2005","journal-title":"Comput. Stat. Data Anal."},{"key":"2023012511553610100_B33","doi-asserted-by":"crossref","first-page":"i208","DOI":"10.1093\/bioinformatics\/bth900","article-title":"Partial Cox regression analysis for high-dimensional microarray gene expression data","volume":"20","author":"Li","year":"2004","journal-title":"Bioinformatics"},{"key":"2023012511553610100_B34","doi-asserted-by":"crossref","first-page":"479","DOI":"10.1111\/j.1541-0420.2007.00895.x","article-title":"Bayesian analysis of mass spectrometry data using wavelet-based functional mixed models","volume":"64","author":"Morris","year":"2008","journal-title":"Biometrics"},{"key":"2023012511553610100_B35","doi-asserted-by":"crossref","first-page":"990","DOI":"10.1198\/016214504000001646","article-title":"Optimal sample size for multiple testing: the case of gene expression microarrays","volume":"99","author":"M\u00fceller","year":"2004","journal-title":"J. Am. Stat. Assoc."},{"key":"2023012511553610100_B36","doi-asserted-by":"crossref","first-page":"155","DOI":"10.1093\/biostatistics\/5.2.155","article-title":"Detecting differential gene expression with a semiparametric hierarchical mixture method","volume":"5","author":"Newton","year":"2004","journal-title":"Biostatistics"},{"key":"2023012511553610100_B37","doi-asserted-by":"crossref","first-page":"1625","DOI":"10.1093\/bioinformatics\/18.12.1625","article-title":"Partial least squares proportional hazard regression for application to DNA microarray survival data","volume":"18","author":"Nguyen","year":"2002","journal-title":"Bioinformatics"},{"key":"2023012511553610100_B38","doi-asserted-by":"crossref","first-page":"S120","DOI":"10.1093\/bioinformatics\/18.suppl_1.S120","article-title":"Linking gene expression data with patient survival times using partial least squares","volume":"18","author":"Park","year":"2002","journal-title":"Bioinformatics"},{"key":"2023012511553610100_B39","doi-asserted-by":"crossref","first-page":"587","DOI":"10.1093\/biostatistics\/kxh011","article-title":"Bayesian analysis of binary prediction tree models for retrospectively sampled outcomes","volume":"5","author":"Pittman","year":"2004","journal-title":"Biostatistics"},{"key":"2023012511553610100_B40","doi-asserted-by":"crossref","first-page":"204","DOI":"10.1097\/PAP.0b013e3181a9d4bf","article-title":"Multigene classifiers, prognostic factors, and predictors of breast cancer clinical outcome","volume":"16","author":"Ross","year":"2009","journal-title":"Adv. Anat. Pathol."},{"key":"2023012511553610100_B41","doi-asserted-by":"crossref","first-page":"269","DOI":"10.1186\/1471-2105-9-269","article-title":"Flexible boosting of accelerated failure time models","volume":"9","author":"Schmid","year":"2008","journal-title":"BMC Bioinformatics"},{"key":"2023012511553610100_B42","doi-asserted-by":"crossref","first-page":"1768","DOI":"10.1093\/bioinformatics\/btm232","article-title":"Assessment of survival prediction models based on microarray data","volume":"23","author":"Schumacher","year":"2007","journal-title":"Bioinformatics"},{"key":"2023012511553610100_B43","doi-asserted-by":"crossref","first-page":"2262","DOI":"10.1093\/bioinformatics\/btl362","article-title":"Bayesian variable selection for the analysis of microarray data with censored outcomes","volume":"22","author":"Sha","year":"2006","journal-title":"Bioinformatics"},{"key":"2023012511553610100_B44","doi-asserted-by":"crossref","first-page":"103","DOI":"10.1159\/000134693","article-title":"Cloning and characterization of BAI2 and BAI3, novel genes homologous to brain-specific angiogenesis inhibitor 1 (BAI1)","volume":"79","author":"Shiratsuchi","year":"1997","journal-title":"Cytogenet. Cell Genet."},{"key":"2023012511553610100_B45","doi-asserted-by":"crossref","first-page":"2013","DOI":"10.1214\/aos\/1074290335","article-title":"The positive false discovery rate: a Bayesian interpretation and the q-value","volume":"31","author":"Storey","year":"2003","journal-title":"Ann. Stat."},{"key":"2023012511553610100_B46","doi-asserted-by":"crossref","first-page":"528","DOI":"10.1080\/01621459.1987.10478458","article-title":"The calculation of posterior distributions by data augmentation","volume":"82","author":"Tanner","year":"1987","journal-title":"J. Am. Stat. Assoc."},{"key":"2023012511553610100_B47","first-page":"4811","article-title":"Multinuclearity and increased ploidy caused by overexpression of the aurora- and Ipl1-like midbody-associated protein mitotic kinase in human cancer cells","volume":"58","author":"Tatsuka","year":"1998","journal-title":"Cancer Res."},{"key":"2023012511553610100_B48","doi-asserted-by":"crossref","first-page":"385","DOI":"10.1002\/(SICI)1097-0258(19970228)16:4<385::AID-SIM380>3.0.CO;2-3","article-title":"The Lasso method for variable selection in the Cox model","volume":"16","author":"Tibshirani","year":"1997","journal-title":"Stat. Med."},{"key":"2023012511553610100_B49","doi-asserted-by":"crossref","first-page":"530","DOI":"10.1038\/415530a","article-title":"Gene expression profiling predicts clinical outcome of breast cancer","volume":"415","author":"Van't Veer","year":"2002","journal-title":"Nature"},{"key":"2023012511553610100_B50","doi-asserted-by":"crossref","first-page":"1590","DOI":"10.1016\/j.csda.2008.05.021","article-title":"Survival prediction using gene expression data: a review and comparison","volume":"53","author":"van Wieringen","year":"2009","journal-title":"Comput. Stat. Data Anal."},{"key":"2023012511553610100_B51","doi-asserted-by":"crossref","first-page":"53848","DOI":"10.1074\/jbc.M403114200","article-title":"Epidermal growth factor receptor-deficient mice have delayed primary endochondral ossification because of defective osteoclast recruitment","volume":"279","author":"Wang","year":"2004","journal-title":"J. Biol. Chem."},{"key":"2023012511553610100_B52","doi-asserted-by":"crossref","first-page":"733","DOI":"10.1093\/oso\/9780198526155.003.0053","article-title":"Bayesian factor regression models in the \u201clarge p, small n\u201d paradigm","volume-title":"Bayesian Statistics 7","author":"West","year":"2003"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/27\/3\/359\/48866751\/bioinformatics_27_3_359.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/27\/3\/359\/48866751\/bioinformatics_27_3_359.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,2,28]],"date-time":"2025-02-28T16:33:57Z","timestamp":1740760437000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/27\/3\/359\/320135"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2010,12,8]]},"references-count":52,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2011,2,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btq660","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2011,2,1]]},"published":{"date-parts":[[2010,12,8]]}}}