{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,17]],"date-time":"2026-03-17T07:31:00Z","timestamp":1773732660838,"version":"3.50.1"},"reference-count":60,"publisher":"MDPI AG","issue":"1","license":[{"start":{"date-parts":[[2021,1,19]],"date-time":"2021-01-19T00:00:00Z","timestamp":1611014400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/100014440","name":"Ministerio de Ciencia, Innovaci\u00f3n y Universidades","doi-asserted-by":"publisher","award":["PU16-05034"],"award-info":[{"award-number":["PU16-05034"]}],"id":[{"id":"10.13039\/100014440","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100010198","name":"Ministerio de Econom\u00eda, Industria y Competitividad, Gobierno de Espa\u00f1a","doi-asserted-by":"publisher","award":["MTM2017-86875-C3-1-R"],"award-info":[{"award-number":["MTM2017-86875-C3-1-R"]}],"id":[{"id":"10.13039\/501100010198","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100010198","name":"Ministerio de Econom\u00eda, Industria y Competitividad, Gobierno de Espa\u00f1a","doi-asserted-by":"publisher","award":["CEX2019-000904-S"],"award-info":[{"award-number":["CEX2019-000904-S"]}],"id":[{"id":"10.13039\/501100010198","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001961","name":"AXA Research Fund","doi-asserted-by":"publisher","award":["AXA-ICMAT  Chair  in  Adversarial  Risk  Analysis"],"award-info":[{"award-number":["AXA-ICMAT  Chair  in  Adversarial  Risk  Analysis"]}],"id":[{"id":"10.13039\/501100001961","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["MS-1638521"],"award-info":[{"award-number":["MS-1638521"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Entropy"],"abstract":"<jats:p>In this work, a framework to boost the efficiency of Bayesian inference in probabilistic models is introduced by embedding a Markov chain sampler within a variational posterior approximation. We call this framework \u201crefined variational approximation\u201d. Its strengths are its ease of implementation and the automatic tuning of sampler parameters, leading to a faster mixing time through automatic differentiation. Several strategies to approximate evidence lower bound (ELBO) computation are also introduced. Its efficient performance is showcased experimentally using state-space models for time-series data, a variational encoder for density estimation and a conditional variational autoencoder as a deep Bayes classifier.<\/jats:p>","DOI":"10.3390\/e23010123","type":"journal-article","created":{"date-parts":[[2021,1,19]],"date-time":"2021-01-19T04:55:31Z","timestamp":1611032131000},"page":"123","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":5,"title":["Variationally Inferred Sampling through a Refined Bound"],"prefix":"10.3390","volume":"23","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-0349-0714","authenticated-orcid":false,"given":"V\u00edctor","family":"Gallego","sequence":"first","affiliation":[{"name":"Institute of Mathematical Sciences (ICMAT), 28049 Madrid, Spain"},{"name":"Statistical and Applied Mathematical Sciences Institute, Durham, NC 7333, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"David","family":"R\u00edos Insua","sequence":"additional","affiliation":[{"name":"Institute of Mathematical Sciences (ICMAT), 28049 Madrid, Spain"},{"name":"School of Management, University of Shanghai for Science and Technology, Shanghai 201206, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2021,1,19]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"859","DOI":"10.1080\/01621459.2017.1285773","article-title":"Variational inference: A review for statisticians","volume":"112","author":"Blei","year":"2017","journal-title":"J. Am. Stat. Assoc."},{"key":"ref_2","unstructured":"Insua, D., Ruggeri, F., and Wiper, M. (2012). Bayesian Analysis of Stochastic Process Models, John Wiley & Sons."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Alquier, P. (2020). Approximate Bayesian Inference. Entropy, 22.","DOI":"10.3390\/e22111272"},{"key":"ref_4","first-page":"430","article-title":"Automatic differentiation variational inference","volume":"18","author":"Kucukelbir","year":"2017","journal-title":"J. Mach. Learn. Res."},{"key":"ref_5","unstructured":"Riquelme, C., Johnson, M., and Hoffman, M. (2018, January 15). Failure modes of variational inference for decision making. Proceedings of the Prediction and Generative Modeling in RL Workshop (AAMAS, ICML, IJCAI), Stockholm, Sweden."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"269","DOI":"10.1111\/j.1467-9868.2009.00736.x","article-title":"Particle Markov chain Monte Carlo methods","volume":"72","author":"Andrieu","year":"2010","journal-title":"J. R. Stat. Soc. Ser. B Stat. Methodol."},{"key":"ref_7","first-page":"2","article-title":"MCMC using Hamiltonian dynamics","volume":"Volume 2","author":"Neal","year":"2011","journal-title":"Handbook of Markov Chain Monte Carlo"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"143","DOI":"10.3758\/s13423-016-1015-8","article-title":"A simple introduction to Markov Chain Monte\u2013Carlo sampling","volume":"25","author":"Cassey","year":"2018","journal-title":"Psychon. Bull. Rev."},{"key":"ref_9","unstructured":"Nalisnick, E., Hertel, L., and Smyth, P. (2016, January 10). Approximate inference for deep latent gaussian mixtures. Proceedings of the NIPS Workshop on Bayesian Deep Learning, Barcelona, Spain."},{"key":"ref_10","unstructured":"Salimans, T., Kingma, D., and Welling, M. (2015, January 6\u201311). Markov chain Monte Carlo and variational inference: Bridging the gap. Proceedings of the International Conference on Machine Learning, Lille, France."},{"key":"ref_11","unstructured":"Tran, D., Ranganath, R., and Blei, D.M. (2016, January 2\u20134). The variational Gaussian process. Proceedings of the 4th International Conference on Learning Representations, San Juan, Puerto Rico."},{"key":"ref_12","unstructured":"Wood, F., Meent, J.W., and Mansinghka, V. (2014, January 22\u201325). A new approach to probabilistic programming inference. Proceedings of the Artificial Intelligence and Statistics, Reykjavik, Iceland."},{"key":"ref_13","unstructured":"Ge, H., Xu, K., and Ghahramani, Z. (2018, January 9\u201311). Turing: A language for flexible probabilistic inference. Proceedings of the International Conference on Artificial Intelligence and Statistics, Lanzarote, Spain."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"59","DOI":"10.1214\/088342307000000014","article-title":"A general framework for the parametrization of hierarchical models","volume":"22","author":"Papaspiliopoulos","year":"2007","journal-title":"Stat. Sci."},{"key":"ref_15","unstructured":"Hoffman, M., Sountsov, P., Dillon, J.V., Langmore, I., Tran, D., and Vasudevan, S. (2019). Neutra-lizing bad geometry in hamiltonian Monte Carlo using neural transport. arXiv."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"260601","DOI":"10.1103\/PhysRevLett.121.260601","article-title":"Neural Network Renormalization Group","volume":"121","author":"Li","year":"2018","journal-title":"Phys. Rev. Lett."},{"key":"ref_17","unstructured":"Parno, M., and Marzouk, Y. (2014). Transport map accelerated markov chain monte carlo. arXiv."},{"key":"ref_18","unstructured":"Rezende, D., and Mohamed, S. (2015, January 6\u201311). Variational Inference with Normalizing Flows. Proceedings of the International Conference on Machine Learning, Lille, France."},{"key":"ref_19","unstructured":"Chen, C., Li, C., Chen, L., Wang, W., Pu, Y., and Carin, L. (2018, January 25\u201331). Continuous-Time Flows for Efficient Inference and Density Estimation. Proceedings of the International Conference on Machine Learning, Vienna, Austria."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"43","DOI":"10.1016\/j.neunet.2018.10.002","article-title":"Variational inference with Gaussian mixture model and householder flow","volume":"109","author":"Liu","year":"2019","journal-title":"Neural Netw."},{"key":"ref_21","unstructured":"Goodfellow, I., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S., Courville, A., and Bengio, Y. (2014, January 8\u201313). Generative adversarial nets. Proceedings of the Advances in Neural Information Processing Systems, Montreal, QC, USA."},{"key":"ref_22","first-page":"4873","article-title":"Stochastic Gradient Descent as Approximate Bayesian Inference","volume":"18","author":"Mandt","year":"2017","journal-title":"J. Mach. Learn. Res."},{"key":"ref_23","unstructured":"Husz\u00e1r, F. (2017). Variational inference using implicit distributions. arXiv."},{"key":"ref_24","unstructured":"Titsias, M.K., and Ruiz, F. (2019, January 16\u201318). Unbiased Implicit Variational Inference. Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, Naha, Japan."},{"key":"ref_25","unstructured":"Yin, M., and Zhou, M. (2018). Semi-Implicit Variational Inference. arXiv."},{"key":"ref_26","unstructured":"Hoffman, M.D. (2017, January 22\u201331). Learning deep latent Gaussian models with Markov chain Monte Carlo. Proceedings of the 34th International Conference on Machine Learning, Sydney, Australia."},{"key":"ref_27","unstructured":"Feng, Y., Wang, D., and Liu, Q. (2017). Learning to draw samples with amortized stein variational gradient descent. arXiv."},{"key":"ref_28","unstructured":"Cremer, C., Li, X., and Duvenaud, D. (2018). Inference suboptimality in variational autoencoders. arXiv."},{"key":"ref_29","unstructured":"Ruiz, F., and Titsias, M. (2019, January 10\u201315). A Contrastive Divergence for Combining Variational Inference and MCMC. Proceedings of the International Conference on Machine Learning, Long Beach, CA, USA."},{"key":"ref_30","unstructured":"Dai, B., Dai, H., He, N., Liu, W., Liu, Z., Chen, J., Xiao, L., and Song, L. (2018, January 3\u20138). Coupled variational bayes via optimization embedding. Proceedings of the Advances in Neural Information Processing Systems, Montreal, QC, USA."},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Fang, L., Li, C., Gao, J., Dong, W., and Chen, C. (2019). Implicit Deep Latent Variable Models for Text Generation. arXiv.","DOI":"10.18653\/v1\/D19-1407"},{"key":"ref_32","unstructured":"Welling, M., and Teh, Y.W. (2014, January 11\u201313). Bayesian learning via stochastic gradient Langevin dynamics. Proceedings of the 28th International Conference on Machine Learning (ICML-11), Montreal, QC, USA."},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Li, C., Chen, C., Carlson, D., and Carin, L. (2016, January 12\u201317). Preconditioned stochastic gradient Langevin dynamics for deep neural networks. Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, Phoenix, AZ, USA.","DOI":"10.1609\/aaai.v30i1.10200"},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Li, C., Chen, C., Fan, K., and Carin, L. (2016, January 12\u201317). High-order stochastic gradient thermostats for Bayesian learning of deep models. Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, Phoenix, AZ, USA.","DOI":"10.1609\/aaai.v30i1.10199"},{"key":"ref_35","unstructured":"Abbati, G., Tosi, A., Osborne, M., and Flaxman, S. (2018, January 9\u201311). Adageo: Adaptive geometric learning for optimization and sampling. Proceedings of the International Conference on Artificial Intelligence and Statistics, Canary Islands, Spain."},{"key":"ref_36","unstructured":"Gallego, V., and Insua, D.R. (2018). Stochastic Gradient MCMC with Repulsive Forces. arXiv."},{"key":"ref_37","unstructured":"Ma, Y.A., Chen, T., and Fox, E. (2015, January 7\u201312). A complete recipe for stochastic gradient MCMC. Proceedings of the Advances in Neural Information Processing Systems, Montreal, QC, Canada."},{"key":"ref_38","first-page":"5595","article-title":"Automatic differentiation in machine learning: A survey","volume":"18","author":"Baydin","year":"2017","journal-title":"J. Mach. Learn. Res."},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Pavliotis, G. (2014). Stochastic Processes and Applications: Diffusion Processes, the Fokker-Planck and Langevin Equations. Texts in Applied Mathematics, Springer.","DOI":"10.1007\/978-1-4939-1323-7"},{"key":"ref_40","unstructured":"Liu, Q., and Wang, D. (2016, January 5\u201310). Stein variational gradient descent: A general purpose Bayesian inference algorithm. Proceedings of the Advances In Neural Information Processing Systems, Barcelona, Spain."},{"key":"ref_41","unstructured":"Kingma, D.P., and Ba, J. (2014). Adam: A method for stochastic optimization. arXiv."},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Graves, T.L. (2011). Automatic step size selection in random walk Metropolis algorithms. arXiv.","DOI":"10.2172\/1057119"},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Brooks, S., Gelman, A., Jones, G., and Meng, X.L. (2011). Handbook of Markov Chain Monte Carlo, CRC Press.","DOI":"10.1201\/b10905"},{"key":"ref_44","unstructured":"Murray, I., and Salakhutdinov, R. (2020, June 12). Notes on the KL-Divergence between a Markov Chain and Its Equilibrium Distribution; 2008. Available online: http:\/\/www.cs.toronto.edu\/~rsalakhu\/papers\/mckl.pdf."},{"key":"ref_45","unstructured":"Franceschi, L., Donini, M., Frasconi, P., and Pontil, M. (2017, January 22\u201331). Forward and reverse gradient-based hyperparameter optimization. Proceedings of the 34th International Conference on Machine Learning, Sydney, Australia."},{"key":"ref_46","unstructured":"Wallach, H., Larochelle, H., Beygelzimer, A., d\u2019Alch\u00e9-Buc, F., Fox, E., and Garnett, R. (2019). PyTorch: An Imperative Style, High-Performance Deep Learning Library. Advances in Neural Information Processing Systems 32, Curran Associates, Inc."},{"key":"ref_47","doi-asserted-by":"crossref","first-page":"257","DOI":"10.1109\/5.18626","article-title":"A tutorial on hidden Markov models and selected applications in speech recognition","volume":"77","author":"Rabiner","year":"1989","journal-title":"Proc. IEEE"},{"key":"ref_48","unstructured":"Zarchan, P., and Musoff, H. (2013). Fundamentals of Kalman filtering: A Practical Approach, American Institute of Aeronautics and Astronautics, Inc."},{"key":"ref_49","doi-asserted-by":"crossref","first-page":"359","DOI":"10.1198\/016214506000001437","article-title":"Strictly proper scoring rules, prediction, and estimation","volume":"102","author":"Gneiting","year":"2007","journal-title":"J. Am. Stat. Assoc."},{"key":"ref_50","unstructured":"Keeling, C.D. (2005). Atmospheric Carbon Dioxide Record from Mauna Loa, Scripps Institution of Oceanography, The University of California."},{"key":"ref_51","unstructured":"Kingma, D.P., and Welling, M. (2013). Auto-encoding variational bayes. arXiv."},{"key":"ref_52","unstructured":"LeCun, Y., and Cortes, C. (2020, May 12). MNIST handwritten Digit Database. Available online: http:\/\/yann.lecun.com\/exdb\/mnist\/."},{"key":"ref_53","unstructured":"Xiao, H., Rasul, K., and Vollgraf, R. (2017). Fashion-MNIST: A Novel Image Dataset for Benchmarking Machine Learning Algorithms. arXiv."},{"key":"ref_54","unstructured":"Shi, J., Sun, S., and Zhu, J. (2018, January 25\u201331). A Spectral Approach to Gradient Estimation for Implicit Distributions. Proceedings of the International Conference on Machine Learning, Vienna, Austria."},{"key":"ref_55","unstructured":"Duvenaud, D., Maclaurin, D., and Adams, R. (2016, January 9\u201311). Early stopping as nonparametric variational inference. Proceedings of the Artificial Intelligence and Statistics, Cadiz, Spain."},{"key":"ref_56","doi-asserted-by":"crossref","first-page":"325","DOI":"10.1023\/A:1008929526011","article-title":"WinBUGS-a Bayesian modelling framework: Concepts, structure, and extensibility","volume":"10","author":"Lunn","year":"2000","journal-title":"Stat. Comput."},{"key":"ref_57","doi-asserted-by":"crossref","unstructured":"Carpenter, B., Gelman, A., Hoffman, M.D., Lee, D., Goodrich, B., Betancourt, M., Brubaker, M., Guo, J., Li, P., and Riddell, A. (2017). Stan: A probabilistic programming language. J. Stat. Softw., 76.","DOI":"10.18637\/jss.v076.i01"},{"key":"ref_58","unstructured":"Tran, D., Hoffman, M.W., Moore, D., Suter, C., Vasudevan, S., and Radul, A. (2018, January 3\u20138). Simple, distributed, and accelerated probabilistic programming. Proceedings of the Advances in Neural Information Processing Systems, Montreal, QC, Canada."},{"key":"ref_59","unstructured":"Bingham, E., Chen, J.P., Jankowiak, M., Obermeyer, F., Pradhan, N., Karaletsos, T., Singh, R., Szerlip, P., Horsfall, P., and Goodman, N.D. (2018). Pyro: Deep Universal Probabilistic Programming. arXiv."},{"key":"ref_60","unstructured":"West, M., and Harrison, J. (2006). Bayesian Forecasting and Dynamic Models, Springer."}],"container-title":["Entropy"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1099-4300\/23\/1\/123\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T05:12:38Z","timestamp":1760159558000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1099-4300\/23\/1\/123"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,1,19]]},"references-count":60,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2021,1]]}},"alternative-id":["e23010123"],"URL":"https:\/\/doi.org\/10.3390\/e23010123","relation":{},"ISSN":["1099-4300"],"issn-type":[{"value":"1099-4300","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,1,19]]}}}