{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,7]],"date-time":"2025-11-07T09:31:36Z","timestamp":1762507896465,"version":"build-2065373602"},"reference-count":49,"publisher":"MDPI AG","issue":"11","license":[{"start":{"date-parts":[[2019,11,10]],"date-time":"2019-11-10T00:00:00Z","timestamp":1573344000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/100007219","name":"Natural Science Foundation of Shanghai","doi-asserted-by":"publisher","award":["19ZR1415800"],"award-info":[{"award-number":["19ZR1415800"]}],"id":[{"id":"10.13039\/100007219","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Entropy"],"abstract":"<jats:p>In Bayesian machine learning, sampling methods provide the asymptotically unbiased estimation for the inference of the complex probability distributions, where Markov chain Monte Carlo (MCMC) is one of the most popular sampling methods. However, MCMC can lead to high autocorrelation of samples or poor performances in some complex distributions. In this paper, we introduce Langevin diffusions to normalization flows to construct a brand-new dynamical sampling method. We propose the modified Kullback-Leibler divergence as the loss function to train the sampler, which ensures that the samples generated from the proposed method can converge to the target distribution. Since the gradient function of the target distribution is used during the process of calculating the modified Kullback-Leibler, which makes the integral of the modified Kullback-Leibler intractable. We utilize the Monte Carlo estimator to approximate this integral. We also discuss the situation when the target distribution is unnormalized. We illustrate the properties and performances of the proposed method on varieties of complex distributions and real datasets. The experiments indicate that the proposed method not only takes the advantage of the flexibility of neural networks but also utilizes the property of rapid convergence to the target distribution of the dynamics system and demonstrate superior performances competing with dynamics based MCMC samplers.<\/jats:p>","DOI":"10.3390\/e21111096","type":"journal-article","created":{"date-parts":[[2019,11,12]],"date-time":"2019-11-12T04:07:07Z","timestamp":1573531627000},"page":"1096","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":3,"title":["Dynamical Sampling with Langevin Normalization Flows"],"prefix":"10.3390","volume":"21","author":[{"given":"Minghao","family":"Gu","sequence":"first","affiliation":[{"name":"School of Computer Science and Technology, East China Normal University, 3663 North Zhongshan Road, Shanghai 200241, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Shiliang","family":"Sun","sequence":"additional","affiliation":[{"name":"School of Computer Science and Technology, East China Normal University, 3663 North Zhongshan Road, Shanghai 200241, China"},{"name":"Shanghai Institute of Intelligent Science and Technology, Tongji University, Shanghai 201804, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yan","family":"Liu","sequence":"additional","affiliation":[{"name":"School of Data Science and Engineering, East China Normal University, 3663 North Zhongshan Road, Shanghai 200241, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2019,11,10]]},"reference":[{"key":"ref_1","first-page":"1","article-title":"Inverse Bayesian inference in swarming behaviour of soldier crabs","volume":"376","author":"Gunji","year":"2018","journal-title":"Philos. Trans. R. Soc. Math. Phys. Eng. Sci."},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Imani, M., Ghoreishi, S.F., Allaire, D., and Braga-Neto, U.M. (2019, January 27\u201328). MFBO-SSM: Multi-fidelity Bayesian optimization for fast inference in state-space models. Proceedings of the AAAI Conference on Artificial Intelligence, Honolulu, HI, USA.","DOI":"10.1609\/aaai.v33i01.33017858"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"3074","DOI":"10.3390\/e16063074","article-title":"Information-geometric Markov chain Monte Carlo methods using diffusions","volume":"16","author":"Livingstone","year":"2014","journal-title":"Entropy"},{"key":"ref_4","unstructured":"Robert, C.P., and Casella, G. (2013). Monte Carlo Statistical Methods, Springer."},{"key":"ref_5","unstructured":"Altieri, N., and Duvenaud, D. (2015, November 10). Variational Inference with Gradient Flows. Available online: http:\/\/approximateinference.org\/accepted\/AltieriDuvenaud2015.pdf."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"859","DOI":"10.1080\/01621459.2017.1285773","article-title":"Variational inference: a review for statisticians","volume":"112","author":"Blei","year":"2017","journal-title":"J. Am. Stat. Assoc."},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Hock, K., and Earle, K. (2016). Markov chain Monte Carlo used in parameter inference of magnetic resonance spectra. Entropy, 18.","DOI":"10.3390\/e18020057"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"1441","DOI":"10.3390\/e17031441","article-title":"Approximated information analysis in Bayesian inference","volume":"17","author":"Seo","year":"2015","journal-title":"Entropy"},{"key":"ref_9","unstructured":"Imani, M., Ghoreishi, S.F., and Braga-Neto, U.M. (2018, January 3\u20138). Bayesian control of large MDPs with unknown dynamics in data-poor environments. Proceedings of the Advances in Neural Information Processing Systems (NIPS 2018), Montreal, QC, Canada."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"2039","DOI":"10.1007\/s00521-013-1445-4","article-title":"A review of deterministic approximate inference techniques for Bayesian machine learning","volume":"23","author":"Sun","year":"2013","journal-title":"Neural Comput. Appl."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"705","DOI":"10.1214\/aos\/1056562461","article-title":"Slice sampling","volume":"31","author":"Neal","year":"2003","journal-title":"Ann. Stat."},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Li, Q., and Newton, K. (2019). Diffusion equation-assisted Markov chain Monte Carlo methods for the inverse radiative transfer equation. Entropy, 21.","DOI":"10.3390\/e21030291"},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Skeel, R., and Fang, Y. (2017). Comparing Markov chain samplers for molecular simulation. Entropy, 19.","DOI":"10.20944\/preprints201709.0021.v1"},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Brooks, S., Gelman, A., Jones, G., and Meng, X. (2011). Handbook of Markov chain Monte Carlo, CRC Press.","DOI":"10.1201\/b10905"},{"key":"ref_15","first-page":"1593","article-title":"The no-u-turn sampler: Adaptively setting path lengths in Hamiltonian Monte Carlo","volume":"15","author":"Hokman","year":"2014","journal-title":"J. Mach. Learn. Res."},{"key":"ref_16","unstructured":"Wang, Z., Mohamed, S., and Freitas, N. (2013, January 16\u201321). Adaptive Hamiltonian and Riemann manifold Monte Carlo. Proceedings of the International Conference on Machine Learning, Atlanta, GA, USA."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"216","DOI":"10.1016\/0370-2693(87)91197-X","article-title":"Hybrid Monte Carlo","volume":"195","author":"Duane","year":"1987","journal-title":"Phys. Lett. B"},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"957","DOI":"10.1080\/01621459.2000.10474285","article-title":"Computational and inferential difficulties with mixture posterior distributions","volume":"95","author":"Celeux","year":"2000","journal-title":"J. Am. Stat. Assoc."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"125","DOI":"10.1023\/A:1008923215028","article-title":"Annealed importance sampling","volume":"11","author":"Neal","year":"2001","journal-title":"Stat. Comput."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Rudoy, D., and Wolfe, P.J. (2006, January 6\u20139). Monte Carlo methods for multi-modal distributions. Proceedings of the Fortieth Asilomar Conference on Signals, Systems and Computers, Pacific Grove, CA, USA.","DOI":"10.1109\/ACSSC.2006.355120"},{"key":"ref_21","unstructured":"Sminchisescu, C., and Welling, M. (2007, January 21\u201324). Generalized darting Monte Carlo. Proceedings of the Artificial Intelligence and Statistics, San Juan, Puerto Rico."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"1454","DOI":"10.1198\/jasa.2009.tm08393","article-title":"Learn from thy neighbor: parallel-chain and regional adaptive MCMC","volume":"104","author":"Craiu","year":"2009","journal-title":"J. Am. Stat. Assoc."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"123","DOI":"10.1111\/j.1467-9868.2010.00765.x","article-title":"Riemann manifold Langevin and Hamiltonian Monte Carlo methods","volume":"73","author":"Girolami","year":"2011","journal-title":"J. R. Stat. Soc."},{"key":"ref_24","unstructured":"Tripuraneni, N., Rowland, M., Ghahramani, Z., and Turner, R. (2017, January 6\u201311). Magnetic Hamiltonian Monte Carlo. Proceedings of the International Conference on Machine Learning, Sydney, NSW, Australia."},{"key":"ref_25","unstructured":"Ahn, S., Chen, Y., and Welling, M. (2013, January 29\u201330). Distributed and adaptive darting Monte Carlo through regenerations. Proceedings of the Artificial Intelligence and Statistics, Scottsdale, AZ, USA."},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Lan, S., Streets, J., and Shahbaba, B. (2014, January 27\u201331). Wormhole Hamiltonian Monte Carlo. Proceedings of the AAAI Conference on Artificial Intelligence, Quebec City, QC, Canada.","DOI":"10.1609\/aaai.v28i1.9006"},{"key":"ref_27","unstructured":"Rezende, D., and Mohamed, S. (2015, January 6\u201311). Variational inference with normalizing flows. Proceedings of the International Conference on Machine Learning, Lille, France."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"97","DOI":"10.1093\/biomet\/57.1.97","article-title":"Monte Carlo sampling methods using Markov chains and their applications","volume":"57","author":"Hastings","year":"1970","journal-title":"Biometrika"},{"key":"ref_29","unstructured":"Song, J., Zhao, S., and Ermon, S. (2017, January 4\u20139). A-nice-mc: Adversarial training for MCMC. Proceedings of the Advances in Neural Information Processing Systems, Long Beach, CA, USA."},{"key":"ref_30","unstructured":"Zhang, Y., Wang, X., Chen, C., Henao, R., Fan, K., and Carin, L. (2016, January 5\u201310). Towards unifying Hamiltonian Monte Carlo and slice sampling. Proceedings of the Advances in Neural Information Processing Systems, Barcelona, Spain."},{"key":"ref_31","unstructured":"Dinh, L., Sohl-Dickstein, J., and Bengio, S. (2016). Density estimation using real NVP. arXiv."},{"key":"ref_32","unstructured":"Paige, B., and Wood, F. (2016, January 19\u201324). Inference networks for sequential Monte Carlo in graphical models. Proceedings of the International Conference on Machine Learning, New York City, NY, USA."},{"key":"ref_33","unstructured":"Papamakarios, G., and Murray, I. (2016, January 5\u201310). Fast \u03b5-free inference of simulation models with Bayesian conditional density estimation. Proceedings of the Advances in Neural Information Processing Systems, Barcelona, Spain."},{"key":"ref_34","unstructured":"Ball\u00e9, J., Laparra, V., and Simoncelli, E.P. (2015). Density modeling of images using a generalized normalization transformation. arXiv."},{"key":"ref_35","unstructured":"Kingma, D.P., Salimans, T., Jozefowicz, R., Chen, X., Sutskever, I., and Welling, M. (2016, January 5\u201310). Improved variational inference with inverse autoregressive flow. Proceedings of the Advances in Neural Information Processing Systems, Barcelona, Spain."},{"key":"ref_36","unstructured":"Dinh, L., Krueger, D., and Bengio, Y. (2014). NICE: Non-linear independent components estimation. arXiv."},{"key":"ref_37","unstructured":"Oord, A.V.D., Dieleman, S., Zen, H., Simonyan, K., Vinyals, O., Graves, A., Kalchbrenner, N., Senior, A., and Kavukcuoglu, K. (2016). Wavenet: A generative model for raw audio. arXiv."},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Durrett, R. (2018). Stochastic Calculus: A Practical Introduction, CRC Press.","DOI":"10.1201\/9780203738283"},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"\u00d8ksendal, B. (2003). Stochastic differential equations. Stochastic Differential Equations, Springer.","DOI":"10.1007\/978-3-642-14394-6"},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"337","DOI":"10.1023\/A:1023562417138","article-title":"Langevin diffusions and Metropolis-Hastings algorithms","volume":"4","author":"Roberts","year":"2002","journal-title":"Methodol. Comput. Appl. Probab."},{"key":"ref_41","unstructured":"Kloeden, P.E., and Platen, E. (2013). Numerical Solution of Stochastic Differential Equations, Springer."},{"key":"ref_42","doi-asserted-by":"crossref","first-page":"2797","DOI":"10.1007\/s00180-013-0429-2","article-title":"On the flexibility of the design of multiple try Metropolis schemes","volume":"28","author":"Martino","year":"2013","journal-title":"Comput. Stat."},{"key":"ref_43","first-page":"723","article-title":"A kernel two-sample test","volume":"13","author":"Roberts","year":"2012","journal-title":"J. Mach. Learn. Res."},{"key":"ref_44","unstructured":"Sohl-Dickstein, J., Mudigonda, M., and DeWeese, M.R. (2014, January 21\u201326). Hamiltonian Monte Carlo without detailed balance. Proceedings of the International Conference on Machine Learning, Beijing, China."},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Livingstone, S., Betancourt, M., Byrne, S., and Girolami, M. (2016). On the geometric ergodicity of Hamiltonian Monte Carlo. arXiv.","DOI":"10.3150\/16-BEJ810"},{"key":"ref_46","doi-asserted-by":"crossref","unstructured":"Freedman, D.A. (2009). Statistical Models: Theory and Practice, Cambridge University Press.","DOI":"10.1017\/CBO9780511815867"},{"key":"ref_47","doi-asserted-by":"crossref","first-page":"720","DOI":"10.1162\/neco.1992.4.5.720","article-title":"The evidence framework applied to classification networks","volume":"4","author":"MacKay","year":"1992","journal-title":"Neural Comput."},{"key":"ref_48","unstructured":"Dua, D.M., and Graff, C. (2017, November 10). UCI Machine Learning Repository. Available online: https:\/\/archive.ics.uci.edu\/ml\/index.php."},{"key":"ref_49","doi-asserted-by":"crossref","first-page":"839","DOI":"10.1148\/radiology.148.3.6878708","article-title":"A method of comparing the areas under receiver operating characteristic curves derived from the same cases","volume":"148","author":"Hanley","year":"1983","journal-title":"Radiology"}],"container-title":["Entropy"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1099-4300\/21\/11\/1096\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T13:33:19Z","timestamp":1760189599000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1099-4300\/21\/11\/1096"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,11,10]]},"references-count":49,"journal-issue":{"issue":"11","published-online":{"date-parts":[[2019,11]]}},"alternative-id":["e21111096"],"URL":"https:\/\/doi.org\/10.3390\/e21111096","relation":{},"ISSN":["1099-4300"],"issn-type":[{"type":"electronic","value":"1099-4300"}],"subject":[],"published":{"date-parts":[[2019,11,10]]}}}