{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,24]],"date-time":"2026-02-24T20:22:23Z","timestamp":1771964543294,"version":"3.50.1"},"reference-count":28,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2020,6,21]],"date-time":"2020-06-21T00:00:00Z","timestamp":1592697600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2020,6,21]],"date-time":"2020-06-21T00:00:00Z","timestamp":1592697600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/100009148","name":"Queen Mary University of London","doi-asserted-by":"crossref","id":[{"id":"10.13039\/100009148","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Comput Stat"],"published-print":{"date-parts":[[2021,3]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>In this paper we propose a Dirichlet process mixture model for censored survival data with covariates. This model is suitable in two scenarios. First, this method can be used to identify clusters determined by both the censored survival data and the predictors. Second, this method is suitable for highly correlated predictors, in cases when the usual survival models cannot be implemented because they would be unstable due to multicollinearity. The Dirichlet process mixture model links a response vector to covariate data through cluster membership and in this paper this model is extended for mixtures of Weibull distributions, which can be used to model survival times and also allow for censoring. We propose two variants of this model, one with a shape parameter common to all clusters (referred to as a global parameter) for the Weibull distributions and one with a cluster-specific shape parameter. The first satisfies the proportional hazard assumption, while the latter is very flexible, as it has the advantage of allowing estimation of the survival curve whether or not the proportional hazards assumption is satisfied. We present a simulation study and, to demonstrate the applicability of the method in practice, a real application to sleep surveys in older women from The Australian Longitudinal Study on Women\u2019s Health. The method developed in the paper is available in the R package PReMiuM.<\/jats:p>","DOI":"10.1007\/s00180-020-01000-3","type":"journal-article","created":{"date-parts":[[2020,6,21]],"date-time":"2020-06-21T05:02:37Z","timestamp":1592715757000},"page":"35-60","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":9,"title":["Clustering method for censored and collinear survival data"],"prefix":"10.1007","volume":"36","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-1870-3017","authenticated-orcid":false,"given":"Silvia","family":"Liverani","sequence":"first","affiliation":[]},{"given":"Lucy","family":"Leigh","sequence":"additional","affiliation":[]},{"given":"Irene L.","family":"Hudson","sequence":"additional","affiliation":[]},{"given":"Julie E.","family":"Byles","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2020,6,21]]},"reference":[{"issue":"485","key":"1000_CR1","doi-asserted-by":"publisher","first-page":"26","DOI":"10.1198\/jasa.2009.0001","volume":"104","author":"JL Bigelow","year":"2009","unstructured":"Bigelow JL, Dunson DB (2009) Bayesian semiparametric joint models for functional predictors. J Am Stat Assoc 104(485):26\u201336","journal-title":"J Am Stat Assoc"},{"key":"1000_CR2","first-page":"203","volume":"6","author":"GRM Borzadaran","year":"2011","unstructured":"Borzadaran GRM, Borzadaran HAM (2011) Log-concavity property for some well-known distributions. Surv Math Appl 6:203\u2013219","journal-title":"Surv Math Appl"},{"issue":"488","key":"1000_CR3","doi-asserted-by":"publisher","first-page":"1646","DOI":"10.1198\/jasa.2009.tm08302","volume":"104","author":"Y Chung","year":"2009","unstructured":"Chung Y, Dunson DB (2009) Nonparametric bayes conditional distribution modeling with variable selection. J Am Stat Assoc 104(488):1646\u20131660","journal-title":"J Am Stat Assoc"},{"key":"1000_CR4","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1016\/j.envint.2016.02.011","volume":"91","author":"E Coker","year":"2016","unstructured":"Coker E, Liverani S, Ghosh JK, Jerrett M, Beckerman B, Li A, Ritz B, Molitor J (2016) Multi-pollutant exposure profiles associated with term low birth weight in Los Angeles County. Environ Int 91:1\u201313","journal-title":"Environ Int"},{"key":"1000_CR5","unstructured":"Department of Health and Aged Care (2001) Measuring remoteness: accessibility\/remoteness Index of Australia (ARIA) revised edition, Volume 14. Occasional papers: new series"},{"issue":"484","key":"1000_CR6","doi-asserted-by":"publisher","first-page":"1508","DOI":"10.1198\/016214508000001039","volume":"103","author":"DB Dunson","year":"2008","unstructured":"Dunson DB, Herring AB, Siega-Riz AM (2008) Bayesian inference on changes in response densities over predictor clusters. J Am Stat Assoc 103(484):1508\u20131517","journal-title":"J Am Stat Assoc"},{"key":"1000_CR7","doi-asserted-by":"publisher","first-page":"337","DOI":"10.2307\/2347565","volume":"41","author":"WR Gilks","year":"1992","unstructured":"Gilks WR, Wild P (1992) Adaptive rejection sampling for gibbs sampling. Appl Stat 41:337\u2013348","journal-title":"Appl Stat"},{"issue":"420","key":"1000_CR8","doi-asserted-by":"publisher","first-page":"942","DOI":"10.1080\/01621459.1992.10476248","volume":"87","author":"RJ Gray","year":"1992","unstructured":"Gray RJ (1992) Flexible methods for analyzing survival data using splines, with applications to breast cancer prognosis. J Am Stat Assoc 87(420):942\u2013951","journal-title":"J Am Stat Assoc"},{"issue":"1","key":"1000_CR9","doi-asserted-by":"publisher","first-page":"129","DOI":"10.1186\/1471-2288-13-129","volume":"13","author":"DI Hastie","year":"2013","unstructured":"Hastie DI, Liverani S, Azizi L, Richardson S, St\u00fccker I (2013) A semi-parametric approach to estimate risk functions associated with multi-dimensional exposure profiles: application to smoking and lung cancer. BMC Med Res Methodol 13(1):129","journal-title":"BMC Med Res Methodol"},{"issue":"5","key":"1000_CR10","doi-asserted-by":"publisher","first-page":"1023","DOI":"10.1007\/s11222-014-9471-3","volume":"25","author":"DI Hastie","year":"2015","unstructured":"Hastie DI, Liverani S, Richardson S (2015) Sampling from dirichlet process mixture models with unknown concentration parameter: mixing issues in large data implementations. Stat Comput 25(5):1023\u20131037","journal-title":"Stat Comput"},{"issue":"3","key":"1000_CR11","doi-asserted-by":"publisher","first-page":"221","DOI":"10.1016\/0271-7123(81)90005-5","volume":"15","author":"SM Hunt","year":"1981","unstructured":"Hunt SM, McKenna SP, McEwen J, Williams J, Papp E (1981) The nottingham health profile: subjective health status and medical consultations. Soc Sci Med Part A Med Psychol Med Sociol 15(3):221\u2013229","journal-title":"Soc Sci Med Part A Med Psychol Med Sociol"},{"issue":"3","key":"1000_CR12","doi-asserted-by":"publisher","first-page":"578","DOI":"10.1016\/j.jspi.2004.08.009","volume":"136","author":"A Kottas","year":"2006","unstructured":"Kottas A (2006) Nonparametric Bayesian survival analysis using mixtures of weibull distributions. J Stat Plan Inference 136(3):578\u2013596","journal-title":"J Stat Plan Inference"},{"issue":"6","key":"1000_CR13","doi-asserted-by":"publisher","first-page":"648","DOI":"10.1111\/jsr.12324","volume":"24","author":"L Leigh","year":"2015","unstructured":"Leigh L, Hudson IL, Byles JE (2015) Sleeping difficulty, disease and mortality in older women: a latent class analysis and distal survival analysis. J Sleep Res 24(6):648\u2013657","journal-title":"J Sleep Res"},{"issue":"2","key":"1000_CR14","doi-asserted-by":"publisher","first-page":"185","DOI":"10.18642\/jsata_7100121735","volume":"16","author":"L Leigh","year":"2016","unstructured":"Leigh L, Hudson IL, Byles JE (2016a) Joint modelling of the relationship between sleep, disease and mortality, exclusively in a cohort of older australian women (aged 70\u201375 years at baseline). J Stat Adv Theory Appl 16(2):185\u2013254","journal-title":"J Stat Adv Theory Appl"},{"issue":"6","key":"1000_CR15","doi-asserted-by":"publisher","first-page":"1090","DOI":"10.1177\/0898264315624907","volume":"28","author":"L Leigh","year":"2016","unstructured":"Leigh L, Hudson IL, Byles JE (2016b) Sleep difficulty and disease in a cohort of very old women. J Aging Health 28(6):1090\u20131104","journal-title":"J Aging Health"},{"issue":"7","key":"1000_CR16","doi-asserted-by":"publisher","first-page":"1","DOI":"10.18637\/jss.v064.i07","volume":"64","author":"S Liverani","year":"2015","unstructured":"Liverani S, Hastie DI, Azizi L, Papathomas M, Richardson S (2015) PReMiuM: an R package for profile regression mixture models using dirichlet processes. J Stat Softw 64(7):1\u201330","journal-title":"J Stat Softw"},{"key":"1000_CR17","doi-asserted-by":"publisher","first-page":"63","DOI":"10.1016\/j.sste.2016.04.003","volume":"18","author":"S Liverani","year":"2016","unstructured":"Liverani S, Lavigne A, Blangiardo M (2016) Modelling collinear and spatially correlated data. Spatial Spatio-temporal Epidemiol 18:63\u201373","journal-title":"Spatial Spatio-temporal Epidemiol"},{"issue":"6","key":"1000_CR18","doi-asserted-by":"publisher","first-page":"368","DOI":"10.1136\/oemed-2015-103177","volume":"73","author":"F Mattei","year":"2016","unstructured":"Mattei F, Liverani S, Guida F, Matrat M, Cen\u00e9e S, Azizi L, Menvielle G, Sanchez M, Pilorget C, Lap\u00f4tre-Ledoux B et al (2016) Multidimensional analysis of the effect of occupational exposure to organic solvents on lung cancer risk: the ICARE study. Occup Environ Med 73(6):368\u2013377","journal-title":"Occup Environ Med"},{"issue":"6","key":"1000_CR19","doi-asserted-by":"publisher","first-page":"1198","DOI":"10.1161\/HYPERTENSIONAHA.114.03799","volume":"64","author":"Molitor, J., I. J. Brown, Q. Chan, M. Papathomas, S. Liverani, N. Molitor, S. Richardson, L. Van Horn, M. L. Daviglus, A. Dyer, J. Stamler, P. Elliott, and I. R. Group","year":"2014","unstructured":"Molitor, J., I. J. Brown, Q. Chan, M. Papathomas, S. Liverani, N. Molitor, S. Richardson, L. Van Horn, M. L. Daviglus, A. Dyer, J. Stamler, P. Elliott, and I. R. Group (2014) Blood pressure differences associated with optimal macronutrient intake trial for heart health (OMNIHEART)-like diet compared with a typical American diet. Hypertension 64(6):1198\u20131204","journal-title":"Hypertension"},{"issue":"3","key":"1000_CR20","doi-asserted-by":"publisher","first-page":"484","DOI":"10.1093\/biostatistics\/kxq013","volume":"11","author":"J Molitor","year":"2010","unstructured":"Molitor J, Papathomas M, Jerrett M, Richardson S (2010) Bayesian profile regression with an application to the National Survey of Children\u2019s Health. Biostatistics 11(3):484\u2013498","journal-title":"Biostatistics"},{"issue":"18","key":"1000_CR21","doi-asserted-by":"publisher","first-page":"7754","DOI":"10.1021\/es104017x","volume":"45","author":"J Molitor","year":"2011","unstructured":"Molitor J, Su JG, Molitor N-T, Rubio VG, Richardson S, Hastie D, Morello-Frosch R, Jerrett M (2011) Identifying vulnerable populations through an examination of the association between multipollutant profiles and poverty. Environ Sci Technol 45(18):7754\u20137760","journal-title":"Environ Sci Technol"},{"issue":"6","key":"1000_CR22","doi-asserted-by":"publisher","first-page":"663","DOI":"10.1002\/gepi.21661","volume":"36","author":"M Papathomas","year":"2012","unstructured":"Papathomas M, Molitor J, Hoggart C, Hastie D, Richardson S (2012) Exploring data from genetic association studies using Bayesian variable selection and the dirichlet process: application to searching for gene$$\\times $$ gene patterns. Genet Epidemiol 36(6):663\u2013674","journal-title":"Genet Epidemiol"},{"issue":"1","key":"1000_CR23","doi-asserted-by":"publisher","first-page":"84","DOI":"10.1289\/ehp.1002118","volume":"119","author":"M Papathomas","year":"2011","unstructured":"Papathomas M, Molitor J, Richardson S, Riboli E, Vineis P (2011) Examining the joint effect of multiple risk factors using exposure risk profiles: lung cancer in nonsmokers. Environ Health Perspect 119(1):84","journal-title":"Environ Health Perspect"},{"key":"1000_CR24","doi-asserted-by":"publisher","first-page":"56","DOI":"10.1016\/j.envint.2015.02.010","volume":"79","author":"M Pirani","year":"2015","unstructured":"Pirani M, Best N, Blangiardo M, Liverani S, Atkinson RW, Fuller GW (2015) Analysing the health effects of simultaneous exposure to physical and chemical properties of airborne particles. Environ Int 79:56\u201364","journal-title":"Environ Int"},{"issue":"5","key":"1000_CR25","doi-asserted-by":"publisher","first-page":"526","DOI":"10.1111\/j.1467-842X.2000.tb00504.x","volume":"24","author":"J Powers","year":"2000","unstructured":"Powers J, Ball J, Adamson L, Dobson A (2000) Effectiveness of the national death index for establishing the vital status of older women in the Australian longitudinal study on women\u2019s health. Aust N Z J Public Health 24(5):526\u2013528","journal-title":"Aust N Z J Public Health"},{"key":"1000_CR26","doi-asserted-by":"crossref","unstructured":"Teh YW (2011) Dirichlet process. In: Encyclopedia of machine learning, pp 280\u2013287. Springer","DOI":"10.1007\/978-0-387-30164-8_219"},{"key":"1000_CR27","volume-title":"Physical and mental health summary scales\u2014a user\u2019s manual","author":"J Ware","year":"1994","unstructured":"Ware J, Kosinski M, Keller S (1994) Physical and mental health summary scales\u2014a user\u2019s manual. New England Medical Center, The Health Institute, Boston"},{"issue":"3","key":"1000_CR28","doi-asserted-by":"publisher","first-page":"333","DOI":"10.1007\/s10985-007-9045-1","volume":"13","author":"X Xue","year":"2007","unstructured":"Xue X, Kim MY, Shore RE (2007) Cox regression analysis in presence of collinearity: an application to assessment of health risks associated with occupational radiation exposure. Lifetime Data Anal 13(3):333\u2013350","journal-title":"Lifetime Data Anal"}],"container-title":["Computational Statistics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s00180-020-01000-3.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s00180-020-01000-3\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s00180-020-01000-3.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,6,20]],"date-time":"2021-06-20T23:04:12Z","timestamp":1624230252000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s00180-020-01000-3"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,6,21]]},"references-count":28,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2021,3]]}},"alternative-id":["1000"],"URL":"https:\/\/doi.org\/10.1007\/s00180-020-01000-3","relation":{},"ISSN":["0943-4062","1613-9658"],"issn-type":[{"value":"0943-4062","type":"print"},{"value":"1613-9658","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,6,21]]},"assertion":[{"value":"8 October 2019","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"10 June 2020","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"21 June 2020","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}]}}