{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,21]],"date-time":"2026-03-21T19:47:52Z","timestamp":1774122472556,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":27,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,10,26]],"date-time":"2022-10-26T00:00:00Z","timestamp":1666742400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,11,2]]},"DOI":"10.1145\/3533271.3561731","type":"proceedings-article","created":{"date-parts":[[2022,10,20]],"date-time":"2022-10-20T22:20:22Z","timestamp":1666304422000},"page":"361-368","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":19,"title":["Deep Hedging: Continuous Reinforcement Learning for Hedging of General Portfolios across Multiple Risk Aversions"],"prefix":"10.1145","author":[{"given":"Phillip","family":"Murray","sequence":"first","affiliation":[{"name":"JP Morgan, United Kingdom and Imperial College London, United Kingdom"}]},{"given":"Ben","family":"Wood","sequence":"additional","affiliation":[{"name":"JP Morgan, United Kingdom"}]},{"given":"Hans","family":"Buehler","sequence":"additional","affiliation":[{"name":"Technical University of Munich, Germany"}]},{"given":"Magnus","family":"Wiese","sequence":"additional","affiliation":[{"name":"University of Kaiserslautern, Germany"}]},{"given":"Mikko","family":"Pakkanen","sequence":"additional","affiliation":[{"name":"Imperial College London, United Kingdom"}]}],"member":"320","published-online":{"date-parts":[[2022,10,26]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"Simulation of square-root processes. 
Encyclopedia of Quantitative Finance(2010), 1642\u20131649","author":"Andersen BG","unstructured":"Leif\u00a0BG Andersen, Peter J\u00e4ckel, and Christian Kahl. 2010. Simulation of square-root processes. Encyclopedia of Quantitative Finance(2010), 1642\u20131649."},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.3390\/jrfm13070158"},{"key":"e_1_3_2_1_3_1","unstructured":"Lorenzo Bisi. 2022. Algorithms for risk-averse reinforcement learning. (2022)."},{"key":"e_1_3_2_1_4_1","volume-title":"Exact simulation of stochastic volatility and other affine jump diffusion processes. Operations research 54, 2","author":"Broadie Mark","year":"2006","unstructured":"Mark Broadie and \u00d6zg\u00fcr Kaya. 2006. Exact simulation of stochastic volatility and other affine jump diffusion processes. Operations research 54, 2 (2006), 217\u2013231."},{"key":"#cr-split#-e_1_3_2_1_5_1.1","doi-asserted-by":"crossref","unstructured":"H. Buehler L. Gonon J. Teichmann and B. Wood. 2019. Deep hedging. Quantitative Finance(2019) 1-21. https:\/\/doi.org\/10.1080\/14697688.2019.1571683 arXiv:https:\/\/doi.org\/10.1080\/14697688.2019.1571683 10.1080\/14697688.2019.1571683","DOI":"10.1080\/14697688.2019.1571683"},{"key":"#cr-split#-e_1_3_2_1_5_1.2","doi-asserted-by":"crossref","unstructured":"H. Buehler L. Gonon J. Teichmann and B. Wood. 2019. Deep hedging. Quantitative Finance(2019) 1-21. 
https:\/\/doi.org\/10.1080\/14697688.2019.1571683 arXiv:https:\/\/doi.org\/10.1080\/14697688.2019.1571683","DOI":"10.1080\/14697688.2019.1571683"},{"key":"e_1_3_2_1_6_1","volume-title":"Deep hedging: Learning to remove the drift. Risk (March","author":"Buehler Hans","year":"2022","unstructured":"Hans Buehler, Phillip Murray, Mikko\u00a0S. Pakkanen, and Ben Wood. March 2022. Deep hedging: Learning to remove the drift. Risk (March 2022)."},{"key":"e_1_3_2_1_7_1","volume-title":"Algorithms for CVaR optimization in MDPs. Advances in neural information processing systems 27","author":"Chow Yinlam","year":"2014","unstructured":"Yinlam Chow and Mohammad Ghavamzadeh. 2014. Algorithms for CVaR optimization in MDPs. Advances in neural information processing systems 27 (2014)."},{"key":"e_1_3_2_1_8_1","volume-title":"Risk-sensitive and robust decision-making: a cvar optimization approach. Advances in neural information processing systems 28","author":"Chow Yinlam","year":"2015","unstructured":"Yinlam Chow, Aviv Tamar, Shie Mannor, and Marco Pavone. 2015. Risk-sensitive and robust decision-making: a cvar optimization approach. Advances in neural information processing systems 28 (2015)."},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v32i1.11791"},{"key":"e_1_3_2_1_10_1","volume-title":"Multistage stochastic programs with the entropic risk measure. 
Preprint","author":"Dowson Oscar","year":"2020","unstructured":"Oscar Dowson, David\u00a0P Morton, and Bernardo\u00a0K Pagnoncelli. 2020. Multistage stochastic programs with the entropic risk measure. Preprint (2020)."},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.3905\/jfds.2020.1.045"},{"key":"e_1_3_2_1_12_1","volume-title":"Convex and coherent risk measures. Encyclopedia of Quantitative Finance(2010), 355\u2013363","author":"F\u00f6llmer Hans","unstructured":"Hans F\u00f6llmer and Alexander Schied. 2010. Convex and coherent risk measures. Encyclopedia of Quantitative Finance(2010), 355\u2013363."},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.5555\/2789272.2886795"},{"key":"e_1_3_2_1_14_1","first-page":"17298","article-title":"Risk-Aware Transfer in Reinforcement Learning using Successor Features","volume":"34","author":"Gimelfarb Michael","year":"2021","unstructured":"Michael Gimelfarb, Andr\u00e9 Barreto, Scott Sanner, and Chi-Guhn Lee. 2021. Risk-Aware Transfer in Reinforcement Learning using Successor Features. Advances in Neural Information Processing Systems 34 (2021), 17298\u201317310.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"crossref","unstructured":"
Ben Hambly Renyuan Xu and Huining Yang. 2021. Recent advances in reinforcement learning in finance. arXiv preprint arXiv:2112.04553(2021).","DOI":"10.2139\/ssrn.3971071"},{"key":"e_1_3_2_1_16_1","volume-title":"A closed-form solution for options with stochastic volatility with applications to bond and currency options. The review of financial studies 6, 2","author":"Heston L","year":"1993","unstructured":"Steven\u00a0L Heston. 1993. A closed-form solution for options with stochastic volatility with applications to bond and currency options. The review of financial studies 6, 2 (1993), 327\u2013343."},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.3390\/risks9070138"},{"key":"e_1_3_2_1_18_1","volume-title":"International conference on machine learning. PMLR, 448\u2013456","author":"Ioffe Sergey","year":"2015","unstructured":"Sergey Ioffe and Christian Szegedy. 2015. Batch normalization: Accelerating deep network training by reducing internal covariate shift. In International conference on machine learning. PMLR, 448\u2013456."},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00780-021-00467-2"},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.3905\/jfds.2019.1.1.159"},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11579-009-0019-9"},{"key":"e_1_3_2_1_22_1","unstructured":"Timothy\u00a0P Lillicrap Jonathan\u00a0J Hunt Alexander Pritzel Nicolas Heess Tom Erez Yuval Tassa David Silver and Daan Wierstra. 2015. Continuous control with deep reinforcement learning. 
arXiv preprint arXiv:1509.02971(2015)."},{"key":"e_1_3_2_1_23_1","volume-title":"Reinforcement Learning: An Introduction(2 ed.)","author":"Sutton S.","year":"2018","unstructured":"Richard\u00a0S. Sutton and Andrew\u00a0G. Barto. 2018. Reinforcement Learning: An Introduction(2 ed.). MIT Press, Cambridge."},{"key":"e_1_3_2_1_24_1","volume-title":"Sequential decision making with coherent risk","author":"Tamar Aviv","year":"2016","unstructured":"Aviv Tamar, Yinlam Chow, Mohammad Ghavamzadeh, and Shie Mannor. 2016. Sequential decision making with coherent risk. IEEE transactions on automatic control 62, 7 (2016), 3323\u20133338."},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/3383455.3422532"},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"crossref","unstructured":"Magnus Wiese Ben Wood Alexandre Pachoud Ralf Korn Hans Buehler Phillip Murray and Lianjun Bai. 2021. Multi-asset spot and option market simulation. 
arXiv preprint arXiv:2112.06823(2021).","DOI":"10.2139\/ssrn.3980817"}],"event":{"name":"ICAIF '22: 3rd ACM International Conference on AI in Finance","location":"New York NY USA","acronym":"ICAIF '22","sponsor":["ACM Association for Computing Machinery"]},"container-title":["Proceedings of the Third ACM International Conference on AI in Finance"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3533271.3561731","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3533271.3561731","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T19:00:39Z","timestamp":1750186839000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3533271.3561731"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,10,26]]},"references-count":27,"alternative-id":["10.1145\/3533271.3561731","10.1145\/3533271"],"URL":"https:\/\/doi.org\/10.1145\/3533271.3561731","relation":{},"subject":[],"published":{"date-parts":[[2022,10,26]]},"assertion":[{"value":"2022-10-26","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}