{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:17:33Z","timestamp":1750220253881,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":17,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,2,18]],"date-time":"2022-02-18T00:00:00Z","timestamp":1645142400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,2,18]]},"DOI":"10.1145\/3523111.3523127","type":"proceedings-article","created":{"date-parts":[[2022,5,2]],"date-time":"2022-05-02T15:24:24Z","timestamp":1651505064000},"page":"104-109","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":2,"title":["Mean-variance Based Risk-sensitive Reinforcement Learning with Interpretable Attention"],"prefix":"10.1145","author":[{"given":"Woo Kyung","family":"Kim","sequence":"first","affiliation":[{"name":"Department of Computer Science and Engineering, Sungkyunkwan University, South Korea"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Youngseok","family":"Lee","sequence":"additional","affiliation":[{"name":"Department of Electrical and Computer Engineering, Sungkyunkwan University, South Korea"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Honguk","family":"Woo","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Engineering, Sungkyunkwan University, South 
Korea"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2022,5,2]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1109\/IVS.2019.8813791"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA48506.2021.9560962"},{"key":"e_1_3_2_1_3_1","volume-title":"Proc. of the 35th International Conference on Machine Learning (ICML)","author":"Dabney Will","year":"2018","unstructured":"Will Dabney, Georg Ostrovski, David Silver, and Remi Munos. 2018. Implicit Quantile Networks for Distributional Reinforcement Learning. In Proc. of the 35th International Conference on Machine Learning (ICML). Stockholm, Sweden, 1096\u20131105."},{"key":"e_1_3_2_1_4_1","volume-title":"Proc. of the 35th International Conference on Machine Learning (ICML)","author":"Greydanus Samuel","year":"2018","unstructured":"Samuel Greydanus, Anurag Koul, Jonathan Dodge, and Alan Fern. 2018. Visualizing and Understanding Atari Agents. In Proc. of the 35th International Conference on Machine Learning (ICML). Stockholm, Sweden, 1792\u20131801."},{"key":"e_1_3_2_1_5_1","volume-title":"Proc. of the 35th International Conference on Machine Learning (ICML)","author":"Haarnoja Tuomas","year":"2018","unstructured":"Tuomas Haarnoja, Aurick Zhou, Pieter Abbeel, and Sergey Levine. 2018. Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor. In Proc. of the 35th International Conference on Machine Learning (ICML). Stockholm, Sweden, 1861\u20131870."},
{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1016\/B978-1-55860-335-6.50021-0"},{"key":"e_1_3_2_1_7_1","volume-title":"Proc. of the IJCAI 2019 Workshop on Explainable Artificial Intelligence","author":"Juozapaitis Zoe","year":"2019","unstructured":"Zoe Juozapaitis, Anurag Koul, Alan Fern, Martin Erwig, and Finale Doshi-Velez. 2019. Explainable reinforcement learning via reward decomposition. In Proc. of the IJCAI 2019 Workshop on Explainable Artificial Intelligence. Macau, China, 47\u201353."},{"key":"e_1_3_2_1_8_1","unstructured":"Xiaoteng Ma Qiyuan Zhang Li Xia Zhengyuan Zhou Jun Yang and Qianchuan Zhao. 2020. Distributional Soft Actor Critic for Risk Sensitive Learning. CoRR abs\/2004.14547(2020). https:\/\/arxiv.org\/abs\/2004.14547"},
{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v34i03.5631"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.5555\/3454287.3455394"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCVW.2019.00522"},{"key":"e_1_3_2_1_12_1","first-page":"6","article-title":"Autonomous UAV Navigation Using Reinforcement Learning","volume":"9","author":"Pham X.","year":"2019","unstructured":"Huy\u00a0X. Pham, Hung\u00a0M. La, David Feil-Seifer, and Luan\u00a0Van Nguyen. 2019. Autonomous UAV Navigation Using Reinforcement Learning. International Journal of Machine Learning and Computing (IJMLC) 9, 6 (Dec 2019), 756\u2013761.","journal-title":"International Journal of Machine Learning and Computing (IJMLC)"},{"key":"e_1_3_2_1_13_1","unstructured":"Ivan Sorokin Alexey Seleznev Mikhail Pavlov Aleksandr Fedorov and Anastasiia Ignateva. 2015. Deep Attention Recurrent Q-Network. CoRR abs\/1512.01693(2015). http:\/\/arxiv.org\/abs\/1512.01693"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/3383455.3422519"},{"key":"e_1_3_2_1_15_1","volume-title":"Proc. of the 35th International Conference on Machine Learning (ICML)","author":"Verma Abhinav","year":"2018","unstructured":"Abhinav Verma, Vijayaraghavan Murali, Rishabh Singh, Pushmeet Kohli, and Swarat Chaudhuri. 2018. Programmatically Interpretable Reinforcement Learning. In Proc. of the 35th International Conference on Machine Learning (ICML). Stockholm, Sweden, 5045\u20135054."},
{"key":"e_1_3_2_1_16_1","volume-title":"\u00a0S. Torr","author":"Yang Zhao","year":"2018","unstructured":"Zhao Yang, Song Bai, Li Zhang, and Philip H.\u00a0S. Torr. 2018. Learn to Interpret Atari Agents. CoRR abs\/1812.11276(2018). http:\/\/arxiv.org\/abs\/1812.11276"},{"key":"e_1_3_2_1_17_1","volume-title":"Proc. of the 35th AAAI Conference on Artificial Intelligence. Virtual Only, 10905\u201310913","author":"Zhang Shangtong","year":"2020","unstructured":"Shangtong Zhang, Bo Liu, and Shimon Whiteson. 2020. Mean-Variance Policy Iteration for Risk-Averse Reinforcement Learning. In Proc. of the 35th AAAI Conference on Artificial Intelligence. 
Virtual Only, 10905\u201310913."}],"event":{"name":"ICMVA 2022: 2022 the 5th International Conference on Machine Vision and Applications","acronym":"ICMVA 2022","location":"Singapore Singapore"},"container-title":["2022 the 5th International Conference on Machine Vision and Applications (ICMVA)"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3523111.3523127","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3523111.3523127","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T19:30:43Z","timestamp":1750188643000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3523111.3523127"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,2,18]]},"references-count":17,"alternative-id":["10.1145\/3523111.3523127","10.1145\/3523111"],"URL":"https:\/\/doi.org\/10.1145\/3523111.3523127","relation":{},"subject":[],"published":{"date-parts":[[2022,2,18]]},"assertion":[{"value":"2022-05-02","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}