{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,5]],"date-time":"2026-03-05T14:36:34Z","timestamp":1772721394421,"version":"3.50.1"},"reference-count":41,"publisher":"Springer Science and Business Media LLC","issue":"2","license":[{"start":{"date-parts":[[2026,1,27]],"date-time":"2026-01-27T00:00:00Z","timestamp":1769472000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2026,1,27]],"date-time":"2026-01-27T00:00:00Z","timestamp":1769472000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"name":"Tokyo University of Science"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Stat Comput"],"published-print":{"date-parts":[[2026,4]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>Robust Bayesian inference using density power divergence (DPD) has emerged as a promising approach for handling outliers in statistical estimation. Although the DPD-based posterior offers theoretical guarantees of robustness, its practical implementation faces significant computational challenges, particularly for general parametric models with intractable integral terms. These challenges are specifically pronounced in high-dimensional settings, where traditional numerical integration methods are inadequate and computationally expensive. Herein, we propose a novel approximate sampling methodology that addresses these limitations by integrating the loss-likelihood bootstrap with a stochastic gradient descent algorithm specifically designed for DPD-based estimation. Our approach enables efficient and scalable sampling from DPD-based posteriors for a broad class of parametric models, including those with intractable integrals. We further extend it to accommodate generalized linear models. Through comprehensive simulation studies, we demonstrate that our method efficiently samples from DPD-based posteriors, offering superior computational scalability compared to conventional methods, specifically in high-dimensional settings. The results also highlight its ability to handle complex parametric models with intractable integral terms. The Supplementary Materials for this article are available online.<\/jats:p>","DOI":"10.1007\/s11222-025-10807-3","type":"journal-article","created":{"date-parts":[[2026,1,27]],"date-time":"2026-01-27T05:37:08Z","timestamp":1769492228000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["Sampling from density power divergence-based generalized posterior distribution via stochastic optimization"],"prefix":"10.1007","volume":"36","author":[{"given":"Naruki","family":"Sonobe","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Tomotaka","family":"Momozaki","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Tomoyuki","family":"Nakagawa","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2026,1,27]]},"reference":[{"key":"10807_CR1","doi-asserted-by":"publisher","unstructured":"Basak, S., Basu, A., Jones, M.C.: On the \u2018optimal\u2019 density power divergence tuning parameter. J. Appl. Stat. 48(3), 536\u2013556 (2021). https:\/\/doi.org\/10.1080\/02664763.2020.1736524, pMID: 35706540","DOI":"10.1080\/02664763.2020.1736524"},{"issue":"3","key":"10807_CR2","doi-asserted-by":"publisher","first-page":"549","DOI":"10.1093\/biomet\/85.3.549","volume":"85","author":"A Basu","year":"1998","unstructured":"Basu, A., Harris, I.R., Hjort, N.L., et al.: Robust and efficient estimation by minimising a density power divergence. Biometrika 85(3), 549\u2013559 (1998)","journal-title":"Biometrika"},{"issue":"5","key":"10807_CR3","doi-asserted-by":"publisher","first-page":"1103","DOI":"10.1111\/rssb.12158","volume":"78","author":"PG Bissiri","year":"2016","unstructured":"Bissiri, P.G., Holmes, C.C., Walker, S.G.: A general framework for updating belief distributions. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 78(5), 1103\u20131130 (2016)","journal-title":"Journal of the Royal Statistical Society: Series B (Statistical Methodology)"},{"key":"10807_CR4","doi-asserted-by":"publisher","unstructured":"Carpenter B, Gelman A, Hoffman MD, et\u00a0al (2017) Stan: A probabilistic programming language. Journal of Statistical Software 76(1):1\u201332. https:\/\/doi.org\/10.18637\/jss.v076.i01","DOI":"10.18637\/jss.v076.i01"},{"issue":"1","key":"10807_CR5","doi-asserted-by":"publisher","first-page":"123","DOI":"10.1111\/sjos.12168","volume":"43","author":"AP Dawid","year":"2016","unstructured":"Dawid, A.P., Musio, M., Ventura, L.: Minimum scoring rule inference. Scand. J. Stat. 43(1), 123\u2013138 (2016). https:\/\/doi.org\/10.1111\/sjos.12168","journal-title":"Scand. J. Stat."},{"key":"10807_CR6","unstructured":"Dellaporta C, Knoblauch J, Damoulas T, et\u00a0al (2022) Robust Bayesian inference for simulator-based models via the MMD posterior bootstrap. In: Camps-Valls G, Ruiz FJR, Valera I (eds) Proceedings of The 25th International Conference on Artificial Intelligence and Statistics, Proceedings of Machine Learning Research, vol 151. PMLR, pp 943\u2013970, https:\/\/proceedings.mlr.press\/v151\/dellaporta22a.html"},{"key":"10807_CR7","first-page":"762","volume":"802","author":"S Eguchi","year":"2001","unstructured":"Eguchi, S., Kano, Y.: Robustifing maximum likelihood estimation by psi-divergence. ISM Research Memorandum 802, 762\u2013763 (2001)","journal-title":"ISM Research Memorandum"},{"key":"10807_CR8","unstructured":"Fong E, Lyddon S, Holmes C (2019) Scalable nonparametric sampling from multimodal posteriors with the posterior bootstrap. In: Chaudhuri K, Salakhutdinov R (eds) Proceedings of the 36th International Conference on Machine Learning, Proceedings of Machine Learning Research, vol\u00a097. PMLR, pp 1952\u20131962, https:\/\/proceedings.mlr.press\/v97\/fong19a.html"},{"key":"10807_CR9","unstructured":"Frazier DT, Knoblauch J, Drovandi C (2024) The impact of loss estimation on Gibbs measures. arXiv:2404.15649"},{"issue":"9","key":"10807_CR10","doi-asserted-by":"publisher","first-page":"2053","DOI":"10.1016\/j.jmva.2008.02.004","volume":"99","author":"H Fujisawa","year":"2008","unstructured":"Fujisawa, H., Eguchi, S.: Robust parameter estimation with a small bias against heavy contamination. J. Multivar. Anal. 99(9), 2053\u20132081 (2008). https:\/\/doi.org\/10.1016\/j.jmva.2008.02.004","journal-title":"J. Multivar. Anal."},{"key":"10807_CR11","doi-asserted-by":"crossref","unstructured":"Ghadimi S, Lan G (2013) Stochastic first- and zeroth-order methods for nonconvex stochastic programming. SIAM Journal on Optimization 23(4):2341\u20132368. https:\/\/doi.org\/10.1137\/120880811","DOI":"10.1137\/120880811"},{"issue":"2","key":"10807_CR12","doi-asserted-by":"publisher","first-page":"413","DOI":"10.1007\/s10463-014-0499-0","volume":"68","author":"A Ghosh","year":"2016","unstructured":"Ghosh, A., Basu, A.: Robust bayes estimation using the density power divergence. Ann. Inst. Stat. Math. 68(2), 413\u2013437 (2016)","journal-title":"Ann. Inst. Stat. Math."},{"key":"10807_CR13","doi-asserted-by":"crossref","unstructured":"Ghosh A, Majumder T, Basu A (2022) General robust bayes pseudo-posteriors: Exponential convergence results with applications. Statistica Sinica 32(2):787\u2013823. https:\/\/www.jstor.org\/stable\/27118797","DOI":"10.5705\/ss.202019.0450"},{"issue":"3","key":"10807_CR14","doi-asserted-by":"publisher","first-page":"728","DOI":"10.1007\/s11749-018-0597-z","volume":"28","author":"F Giummol\u00e8","year":"2019","unstructured":"Giummol\u00e8, F., Mameli, V., Ruli, E., et al.: Objective Bayesian inference with proper scoring rules. TEST 28(3), 728\u2013755 (2019). https:\/\/doi.org\/10.1007\/s11749-018-0597-z","journal-title":"TEST"},{"issue":"4","key":"10807_CR15","doi-asserted-by":"publisher","first-page":"1069","DOI":"10.1214\/17-BA1085","volume":"12","author":"P Gr\u00fcnwald","year":"2017","unstructured":"Gr\u00fcnwald, P., van Ommen, T.: Inconsistency of Bayesian inference for misspecified linear models, and a proposal for repairing it. Bayesian Anal. 12(4), 1069\u20131103 (2017). https:\/\/doi.org\/10.1214\/17-BA1085","journal-title":"Bayesian Anal."},{"key":"10807_CR16","doi-asserted-by":"crossref","unstructured":"Hilbe JM (2025) COUNT: Functions, Data and Code for Count Data. https:\/\/CRAN.R-project.org\/package=COUNT, r package version 1.3.5","DOI":"10.1007\/978-3-662-69359-9_371"},{"issue":"2","key":"10807_CR17","doi-asserted-by":"publisher","first-page":"497","DOI":"10.1093\/biomet\/asx010","volume":"104","author":"CC Holmes","year":"2017","unstructured":"Holmes, C.C., Walker, S.G.: Assigning a value to a power likelihood in a general Bayesian model. Biometrika 104(2), 497\u2013503 (2017). https:\/\/doi.org\/10.1093\/biomet\/asx010","journal-title":"Biometrika"},{"issue":"3","key":"10807_CR18","doi-asserted-by":"publisher","first-page":"556","DOI":"10.1007\/s11749-014-0360-z","volume":"23","author":"G Hooker","year":"2014","unstructured":"Hooker, G., Vidyashankar, A.N.: Bayesian model robustness via disparities. TEST 23(3), 556\u2013584 (2014). https:\/\/doi.org\/10.1007\/s11749-014-0360-z","journal-title":"TEST"},{"issue":"6","key":"10807_CR19","doi-asserted-by":"publisher","first-page":"442","DOI":"10.3390\/e20060442","volume":"20","author":"J Jewson","year":"2018","unstructured":"Jewson, J., Smith, J.Q., Holmes, C.: Principles of Bayesian inference using general divergence criteria. Entropy 20(6), 442 (2018)","journal-title":"Entropy"},{"key":"10807_CR20","doi-asserted-by":"crossref","unstructured":"Jewson J, Ghalebikesabi S, Holmes CC (2023) Differentially private statistical inference through $$\\beta $$-divergence one posterior sampling. In: Oh A, Naumann T, Globerson A, et\u00a0al (eds) Advances in Neural Information Processing Systems, vol\u00a036. Curran Associates, Inc., pp 76974\u201377001, https:\/\/proceedings.neurips.cc\/paper_files\/paper\/2023\/file\/f3024ea88cec9f45a411cf4d51ab649c-Paper-Conference.pdf","DOI":"10.52202\/075280-3365"},{"key":"10807_CR21","doi-asserted-by":"publisher","unstructured":"Jewson J, Smith JQ, Holmes C (2024) On the stability of general Bayesian inference. Bayesian Analysis pp 1\u201331. https:\/\/doi.org\/10.1214\/24-BA1502","DOI":"10.1214\/24-BA1502"},{"issue":"2","key":"10807_CR22","doi-asserted-by":"publisher","first-page":"465","DOI":"10.1007\/s42081-019-00049-9","volume":"2","author":"T Kawashima","year":"2019","unstructured":"Kawashima, T., Fujisawa, H.: Robust and sparse regression in generalized linear model by stochastic optimization. Japanese Journal of Statistics and Data Science 2(2), 465\u2013489 (2019). https:\/\/doi.org\/10.1007\/s42081-019-00049-9","journal-title":"Japanese Journal of Statistics and Data Science"},{"key":"10807_CR23","unstructured":"Knoblauch J, Jewson JE, Damoulas T (2018) Doubly robust Bayesian inference for non-stationary streaming data with $$\\beta $$-divergences. In: Bengio S, Wallach H, Larochelle H, et\u00a0al (eds) Advances in Neural Information Processing Systems, vol\u00a031. Curran Associates, Inc., https:\/\/proceedings.neurips.cc\/paper_files\/paper\/2018\/file\/a3f390d88e4c41f2747bfa2f1b5f87db-Paper.pdf"},{"key":"10807_CR24","unstructured":"Lyddon S, Walker S, Holmes C (2018) Nonparametric learning from Bayesian models with randomized objective functions. In: Proceedings of the 32nd International Conference on Neural Information Processing Systems. Curran Associates Inc., Red Hook, NY, USA, NIPS\u201918, p 2075\u20132085"},{"issue":"2","key":"10807_CR25","doi-asserted-by":"publisher","first-page":"465","DOI":"10.1093\/biomet\/asz006","volume":"106","author":"SP Lyddon","year":"2019","unstructured":"Lyddon, S.P., Holmes, C., Walker, S.: General Bayesian updating and the loss-likelihood bootstrap. Biometrika 106(2), 465\u2013478 (2019)","journal-title":"Biometrika"},{"key":"10807_CR26","doi-asserted-by":"publisher","unstructured":"Martin AD, Quinn KM, Park JH (2011) MCMCpack: Markov chain Monte Carlo in R. Journal of Statistical Software 42(9):1\u201321. https:\/\/doi.org\/10.18637\/jss.v042.i09","DOI":"10.18637\/jss.v042.i09"},{"issue":"3","key":"10807_CR27","doi-asserted-by":"publisher","first-page":"997","DOI":"10.1111\/rssb.12500","volume":"84","author":"T Matsubara","year":"2022","unstructured":"Matsubara, T., Knoblauch, J., Briol, F.X., et al.: Robust generalised Bayesian inference for intractable likelihoods. J. R. Stat. Soc. Ser. B Stat Methodol. 84(3), 997\u20131022 (2022). https:\/\/doi.org\/10.1111\/rssb.12500","journal-title":"J. R. Stat. Soc. Ser. B Stat Methodol."},{"key":"10807_CR28","unstructured":"Miller JW (2021) Asymptotic normality, concentration, and coverage of generalized posteriors. Journal of Machine Learning Research 22(168):1\u201353. http:\/\/jmlr.org\/papers\/v22\/20-469.html"},{"issue":"8","key":"10807_CR29","doi-asserted-by":"publisher","first-page":"1859","DOI":"10.1162\/089976602760128045","volume":"14","author":"M Minami","year":"2002","unstructured":"Minami, M., Eguchi, S.: Robust blind source separation by beta divergence. Neural Comput. 14(8), 1859\u20131886 (2002)","journal-title":"Neural Comput."},{"issue":"2","key":"10807_CR30","doi-asserted-by":"publisher","first-page":"343","DOI":"10.1080\/03610926.2018.1543765","volume":"49","author":"T Nakagawa","year":"2020","unstructured":"Nakagawa, T., Hashimoto, S.: Robust Bayesian inference via $$\\gamma $$-divergence. Communications in Statistics-Theory and Methods 49(2), 343\u2013360 (2020)","journal-title":"Communications in Statistics-Theory and Methods"},{"issue":"1","key":"10807_CR31","doi-asserted-by":"publisher","first-page":"3","DOI":"10.1111\/j.2517-6161.1994.tb01956.x","volume":"56","author":"MA Newton","year":"1994","unstructured":"Newton, M.A., Raftery, A.E.: Approximate Bayesian inference with the weighted likelihood bootstrap. J. R. Stat. Soc. Ser. B Stat Methodol. 56(1), 3\u201326 (1994)","journal-title":"J. R. Stat. Soc. Ser. B Stat Methodol."},{"issue":"2","key":"10807_CR32","doi-asserted-by":"publisher","first-page":"421","DOI":"10.1002\/cjs.11570","volume":"49","author":"MA Newton","year":"2021","unstructured":"Newton, M.A., Polson, N.G., Xu, J.: Weighted Bayesian bootstrap for scalable posterior distributions. Canadian Journal of Statistics 49(2), 421\u2013437 (2021)","journal-title":"Canadian Journal of Statistics"},{"key":"10807_CR33","doi-asserted-by":"crossref","unstructured":"Niederreiter, H.: Random Number Generation and Quasi-Monte Carlo Methods. Society for Industrial and Applied Mathematics, DOI 10(1137\/1), 9781611970081 (1992)","DOI":"10.1137\/1.9781611970081"},{"issue":"5","key":"10807_CR34","doi-asserted-by":"publisher","first-page":"851","DOI":"10.1007\/s10463-024-00906-9","volume":"76","author":"A Okuno","year":"2024","unstructured":"Okuno, A.: Minimizing robust density power-based divergences for general parametric density models. Ann. Inst. Stat. Math. 76(5), 851\u2013875 (2024). https:\/\/doi.org\/10.1007\/s10463-024-00906-9","journal-title":"Ann. Inst. Stat. Math."},{"key":"10807_CR35","doi-asserted-by":"publisher","unstructured":"Pacchiardi, L., Khoo, S., Dutta, R.: Generalized Bayesian likelihood-free inference. Electronic Journal of Statistics 18(2), 3628\u20133686 (2024). https:\/\/doi.org\/10.1214\/24-EJS2283","DOI":"10.1214\/24-EJS2283"},{"key":"10807_CR36","unstructured":"Rayana S (2016) ODDS library. https:\/\/shebuti.com\/outlier-detection-datasets-odds\/"},{"key":"10807_CR37","doi-asserted-by":"publisher","unstructured":"Syring, N., Martin, R.: Calibrating general posterior credible regions. Biometrika 106(2), 479\u2013486 (2019). https:\/\/doi.org\/10.1093\/biomet\/asy054","DOI":"10.1093\/biomet\/asy054"},{"key":"10807_CR38","doi-asserted-by":"publisher","first-page":"5","DOI":"10.1016\/j.spl.2016.01.014","volume":"112","author":"SG Walker","year":"2016","unstructured":"Walker, S.G.: Bayesian information in an experiment and the Fisher information distance. Statistics & Probability Letters 112, 5\u20139 (2016). https:\/\/doi.org\/10.1016\/j.spl.2016.01.014","journal-title":"Statistics & Probability Letters"},{"issue":"7","key":"10807_CR39","doi-asserted-by":"publisher","first-page":"581","DOI":"10.1080\/00949650412331299120","volume":"75","author":"J Warwick","year":"2005","unstructured":"Warwick, J., Jones, M.C.: Choosing a robustness tuning parameter. J. Stat. Comput. Simul. 75(7), 581\u2013588 (2005). https:\/\/doi.org\/10.1080\/00949650412331299120","journal-title":"J. Stat. Comput. Simul."},{"issue":"1","key":"10807_CR40","doi-asserted-by":"publisher","first-page":"105","DOI":"10.1214\/21-BA1302","volume":"18","author":"PS Wu","year":"2023","unstructured":"Wu, P.S., Martin, R.: A comparison of learning rate selection methods in generalized Bayesian inference. Bayesian Anal. 18(1), 105\u2013132 (2023). https:\/\/doi.org\/10.1214\/21-BA1302","journal-title":"Bayesian Anal."},{"key":"10807_CR41","doi-asserted-by":"publisher","unstructured":"Yonekura S, Sugasawa S (2023) Adaptation of the tuning parameter in general Bayesian inference with robust divergence. Statistics and Computing 33(39). https:\/\/doi.org\/10.1007\/s11222-023-10205-7","DOI":"10.1007\/s11222-023-10205-7"}],"container-title":["Statistics and Computing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11222-025-10807-3.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s11222-025-10807-3","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11222-025-10807-3.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,3,5]],"date-time":"2026-03-05T13:26:10Z","timestamp":1772717170000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s11222-025-10807-3"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2026,1,27]]},"references-count":41,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2026,4]]}},"alternative-id":["10807"],"URL":"https:\/\/doi.org\/10.1007\/s11222-025-10807-3","relation":{},"ISSN":["0960-3174","1573-1375"],"issn-type":[{"value":"0960-3174","type":"print"},{"value":"1573-1375","type":"electronic"}],"subject":[],"published":{"date-parts":[[2026,1,27]]},"assertion":[{"value":"10 May 2025","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"15 December 2025","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"27 January 2026","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest\/Competing interests"}},{"value":"Not applicable.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"All authors have read and agreed to the published version of the manuscript.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"Not applicable.","order":5,"name":"Ethics","group":{"name":"EthicsHeading","label":"Materials availability"}},{"value":"The R code used to reproduce the experimental results is available in the following GitHub repository:\n                      \n                      .","order":6,"name":"Ethics","group":{"name":"EthicsHeading","label":"Code availability"}},{"value":"The authors declare no competing interests.","order":7,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"73"}}