{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:13:45Z","timestamp":1750220025356,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":35,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,9,5]],"date-time":"2022-09-05T00:00:00Z","timestamp":1662336000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"EPSRC, AHRC, Innovate UK","award":["EP\/M023265\/1"],"award-info":[{"award-number":["EP\/M023265\/1"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,9,5]]},"DOI":"10.1145\/3555858.3555878","type":"proceedings-article","created":{"date-parts":[[2022,11,4]],"date-time":"2022-11-04T15:48:39Z","timestamp":1667576919000},"page":"1-11","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":2,"title":["Imitating Playstyle with Dynamic Time Warping Imitation"],"prefix":"10.1145","author":[{"given":"Mark","family":"Ferguson","sequence":"first","affiliation":[{"name":"University of York, United Kingdom"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Sam","family":"Devlin","sequence":"additional","affiliation":[{"name":"Microsoft Research, United Kingdom"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Daniel","family":"Kudenko","sequence":"additional","affiliation":[{"name":"Leibniz University, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"James Alfred","family":"Walker","sequence":"additional","affiliation":[{"name":"University of York, United Kingdom"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2022,11,4]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/1015330.1015430"},{"key":"e_1_3_2_1_2_1","volume-title":"International Conference on Machine Learning. PMLR, 507\u2013517","author":"Badia Adri\u00e0\u00a0Puigdom\u00e8nech","year":"2020","unstructured":"Adri\u00e0\u00a0Puigdom\u00e8nech Badia , Bilal Piot , Steven Kapturowski , Pablo Sprechmann , Alex Vitvitskyi , Zhaohan\u00a0Daniel Guo , and Charles Blundell . 2020 . Agent57: Outperforming the atari human benchmark . In International Conference on Machine Learning. PMLR, 507\u2013517 . Adri\u00e0\u00a0Puigdom\u00e8nech Badia, Bilal Piot, Steven Kapturowski, Pablo Sprechmann, Alex Vitvitskyi, Zhaohan\u00a0Daniel Guo, and Charles Blundell. 2020. Agent57: Outperforming the atari human benchmark. In International Conference on Machine Learning. PMLR, 507\u2013517."},{"key":"e_1_3_2_1_3_1","unstructured":"Lukas Biewald. 2020. Experiment Tracking with Weights and Biases. https:\/\/www.wandb.com\/ Software available from wandb.com.  Lukas Biewald. 2020. Experiment Tracking with Weights and Biases. https:\/\/www.wandb.com\/ Software available from wandb.com."},{"key":"e_1_3_2_1_4_1","unstructured":"Mariusz Bojarski Davide Del\u00a0Testa Daniel Dworakowski Bernhard Firner Beat Flepp Prasoon Goyal Lawrence\u00a0D Jackel Mathew Monfort Urs Muller Jiakai Zhang 2016. End to end learning for self-driving cars. arXiv preprint arXiv:1604.07316(2016).  Mariusz Bojarski Davide Del\u00a0Testa Daniel Dworakowski Bernhard Firner Beat Flepp Prasoon Goyal Lawrence\u00a0D Jackel Mathew Monfort Urs Muller Jiakai Zhang 2016. End to end learning for self-driving cars. arXiv preprint arXiv:1604.07316(2016)."},{"key":"e_1_3_2_1_5_1","first-page":"5174","article-title":"On the utility of learning about humans for human-ai coordination","volume":"32","author":"Carroll Micah","year":"2019","unstructured":"Micah Carroll , Rohin Shah , Mark\u00a0 K Ho , Tom Griffiths , Sanjit Seshia , Pieter Abbeel , and Anca Dragan . 2019 . On the utility of learning about humans for human-ai coordination . Advances in Neural Information Processing Systems 32 (2019), 5174 \u2013 5185 . Micah Carroll, Rohin Shah, Mark\u00a0K Ho, Tom Griffiths, Sanjit Seshia, Pieter Abbeel, and Anca Dragan. 2019. On the utility of learning about humans for human-ai coordination. Advances in Neural Information Processing Systems 32 (2019), 5174\u20135185.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_1_6_1","volume-title":"Faulty reward functions in the wild. Internet: https:\/\/blog. openai. com\/faulty-reward-functions","author":"Clark Jack","year":"2016","unstructured":"Jack Clark and Dario Amodei . 2016. Faulty reward functions in the wild. Internet: https:\/\/blog. openai. com\/faulty-reward-functions ( 2016 ). Jack Clark and Dario Amodei. 2016. Faulty reward functions in the wild. Internet: https:\/\/blog. openai. com\/faulty-reward-functions (2016)."},{"key":"e_1_3_2_1_7_1","volume-title":"Primal Wasserstein Imitation Learning. In ICLR 2021-Ninth International Conference on Learning Representations.","author":"Dadashi Robert","year":"2021","unstructured":"Robert Dadashi , L\u00e9onard Hussenot , Matthieu Geist , and Olivier Pietquin . 2021 . Primal Wasserstein Imitation Learning. In ICLR 2021-Ninth International Conference on Learning Representations. Robert Dadashi, L\u00e9onard Hussenot, Matthieu Geist, and Olivier Pietquin. 2021. Primal Wasserstein Imitation Learning. In ICLR 2021-Ninth International Conference on Learning Representations."},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/CIG.2016.7860423"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/CIG.2012.6374152"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/3402942.3402960"},{"key":"e_1_3_2_1_11_1","volume-title":"State-only Imitation with Transition Dynamics Mismatch. In International Conference on Learning Representations.","author":"Gangwani Tanmay","year":"2019","unstructured":"Tanmay Gangwani and Jian Peng . 2019 . State-only Imitation with Transition Dynamics Mismatch. In International Conference on Learning Representations. Tanmay Gangwani and Jian Peng. 2019. State-only Imitation with Transition Dynamics Mismatch. In International Conference on Learning Representations."},{"key":"e_1_3_2_1_12_1","volume-title":"Generative adversarial nets. Advances in neural information processing systems 27","author":"Goodfellow Ian","year":"2014","unstructured":"Ian Goodfellow , Jean Pouget-Abadie , Mehdi Mirza , Bing Xu , David Warde-Farley , Sherjil Ozair , Aaron Courville , and Yoshua Bengio . 2014. Generative adversarial nets. Advances in neural information processing systems 27 ( 2014 ). Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. 2014. Generative adversarial nets. Advances in neural information processing systems 27 (2014)."},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/CoG52621.2021.9619048"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/CIG.2017.8080424"},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/CIG.2018.8490417"},{"key":"e_1_3_2_1_16_1","volume-title":"Multi-modal imitation learning from unstructured demonstrations using generative adversarial nets. Advances in neural information processing systems 30","author":"Hausman Karol","year":"2017","unstructured":"Karol Hausman , Yevgen Chebotar , Stefan Schaal , Gaurav Sukhatme , and Joseph\u00a0 J Lim . 2017. Multi-modal imitation learning from unstructured demonstrations using generative adversarial nets. Advances in neural information processing systems 30 ( 2017 ). Karol Hausman, Yevgen Chebotar, Stefan Schaal, Gaurav Sukhatme, and Joseph\u00a0J Lim. 2017. Multi-modal imitation learning from unstructured demonstrations using generative adversarial nets. Advances in neural information processing systems 30 (2017)."},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1007\/s40869-018-0051-1"},{"key":"e_1_3_2_1_18_1","volume-title":"Generative adversarial imitation learning. Advances in neural information processing systems 29","author":"Ho Jonathan","year":"2016","unstructured":"Jonathan Ho and Stefano Ermon . 2016. Generative adversarial imitation learning. Advances in neural information processing systems 29 ( 2016 ), 4565\u20134573. Jonathan Ho and Stefano Ermon. 2016. Generative adversarial imitation learning. Advances in neural information processing systems 29 (2016), 4565\u20134573."},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/TG.2018.2808198"},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/CIG.2014.6932911"},{"key":"e_1_3_2_1_21_1","unstructured":"M Jaderberg WM Czarnecki I Dunning L Marris G Lever AG Castaneda 2018. Human-level performance in first-person multiplayer games with population-based deep reinforcement learning. arXiv. arXiv preprint arXiv:1807.01281(2018).  M Jaderberg WM Czarnecki I Dunning L Marris G Lever AG Castaneda 2018. Human-level performance in first-person multiplayer games with population-based deep reinforcement learning. arXiv. arXiv preprint arXiv:1807.01281(2018)."},{"key":"e_1_3_2_1_22_1","unstructured":"Max Jaderberg Volodymyr Mnih Wojciech\u00a0Marian Czarnecki Tom Schaul Joel\u00a0Z Leibo David Silver and Koray Kavukcuoglu. 2016. Reinforcement learning with unsupervised auxiliary tasks. arXiv preprint arXiv:1611.05397(2016).  Max Jaderberg Volodymyr Mnih Wojciech\u00a0Marian Czarnecki Tom Schaul Joel\u00a0Z Leibo David Silver and Koray Kavukcuoglu. 2016. Reinforcement learning with unsupervised auxiliary tasks. arXiv preprint arXiv:1611.05397(2016)."},{"key":"e_1_3_2_1_23_1","volume-title":"Infogail: Interpretable imitation learning from visual demonstrations. Advances in Neural Information Processing Systems 30","author":"Li Yunzhu","year":"2017","unstructured":"Yunzhu Li , Jiaming Song , and Stefano Ermon . 2017 . Infogail: Interpretable imitation learning from visual demonstrations. Advances in Neural Information Processing Systems 30 (2017). Yunzhu Li, Jiaming Song, and Stefano Ermon. 2017. Infogail: Interpretable imitation learning from visual demonstrations. Advances in Neural Information Processing Systems 30 (2017)."},{"key":"e_1_3_2_1_24_1","volume-title":"State Alignment-based Imitation Learning. In International Conference on Learning Representations.","author":"Liu Fangchen","year":"2019","unstructured":"Fangchen Liu , Zhan Ling , Tongzhou Mu , and Hao Su . 2019 . State Alignment-based Imitation Learning. In International Conference on Learning Representations. Fangchen Liu, Zhan Ling, Tongzhou Mu, and Hao Su. 2019. State Alignment-based Imitation Learning. In International Conference on Learning Representations."},{"key":"e_1_3_2_1_25_1","volume-title":"Human-level control through deep reinforcement learning. nature 518, 7540","author":"Mnih Volodymyr","year":"2015","unstructured":"Volodymyr Mnih , Koray Kavukcuoglu , David Silver , Andrei\u00a0 A Rusu , Joel Veness , Marc\u00a0 G Bellemare , Alex Graves , Martin Riedmiller , Andreas\u00a0 K Fidjeland , Georg Ostrovski , 2015. Human-level control through deep reinforcement learning. nature 518, 7540 ( 2015 ), 529\u2013533. Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Andrei\u00a0A Rusu, Joel Veness, Marc\u00a0G Bellemare, Alex Graves, Martin Riedmiller, Andreas\u00a0K Fidjeland, Georg Ostrovski, 2015. Human-level control through deep reinforcement learning. nature 518, 7540 (2015), 529\u2013533."},{"key":"e_1_3_2_1_26_1","unstructured":"Andrew\u00a0Y Ng Stuart\u00a0J Russell 2000. Algorithms for inverse reinforcement learning.. In Icml Vol.\u00a01. 2.  Andrew\u00a0Y Ng Stuart\u00a0J Russell 2000. Algorithms for inverse reinforcement learning.. In Icml Vol.\u00a01. 2."},{"key":"e_1_3_2_1_27_1","unstructured":"Jette Randl\u00f8v and Preben Alstr\u00f8m. 1998. Learning to Drive a Bicycle Using Reinforcement Learning and Shaping.. In ICML Vol.\u00a098. Citeseer 463\u2013471.  Jette Randl\u00f8v and Preben Alstr\u00f8m. 1998. Learning to Drive a Bicycle Using Reinforcement Learning and Shaping.. In ICML Vol.\u00a098. Citeseer 463\u2013471."},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1016\/B978-1-55860-247-2.50055-3"},{"key":"e_1_3_2_1_29_1","unstructured":"Kun Shao Zhentao Tang Yuanheng Zhu Nannan Li and Dongbin Zhao. 2019. A survey of deep reinforcement learning in video games. arXiv preprint arXiv:1912.10944(2019).  Kun Shao Zhentao Tang Yuanheng Zhu Nannan Li and Dongbin Zhao. 2019. A survey of deep reinforcement learning in video games. arXiv preprint arXiv:1912.10944(2019)."},{"volume-title":"Reinforcement learning: An introduction","author":"Sutton S","key":"e_1_3_2_1_30_1","unstructured":"Richard\u00a0 S Sutton and Andrew\u00a0 G Barto . 2018. Reinforcement learning: An introduction . MIT press . Richard\u00a0S Sutton and Andrew\u00a0G Barto. 2018. Reinforcement learning: An introduction. MIT press."},{"key":"e_1_3_2_1_31_1","unstructured":"Fabien Tenc\u00e9 C\u00e9dric Buche Pierre De\u00a0Loor and Olivier Marc. 2010. The challenge of believability in video games: Definitions agents models and imitation learning. arXiv preprint arXiv:1009.0451(2010).  Fabien Tenc\u00e9 C\u00e9dric Buche Pierre De\u00a0Loor and Olivier Marc. 2010. The challenge of believability in video games: Definitions agents models and imitation learning. arXiv preprint arXiv:1009.0451(2010)."},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.24963\/ijcai.2018\/687"},{"key":"e_1_3_2_1_33_1","unstructured":"Faraz Torabi Garrett Warnell and Peter Stone. 2018. Generative adversarial imitation from observation. arXiv preprint arXiv:1807.06158(2018).  Faraz Torabi Garrett Warnell and Peter Stone. 2018. Generative adversarial imitation from observation. arXiv preprint arXiv:1807.06158(2018)."},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1145\/1496984.1496997"},{"key":"e_1_3_2_1_35_1","first-page":"12402","article-title":"Off-policy imitation learning from observations","volume":"33","author":"Zhu Zhuangdi","year":"2020","unstructured":"Zhuangdi Zhu , Kaixiang Lin , Bo Dai , and Jiayu Zhou . 2020 . Off-policy imitation learning from observations . Advances in Neural Information Processing Systems 33 (2020), 12402 \u2013 12413 . Zhuangdi Zhu, Kaixiang Lin, Bo Dai, and Jiayu Zhou. 2020. Off-policy imitation learning from observations. Advances in Neural Information Processing Systems 33 (2020), 12402\u201312413.","journal-title":"Advances in Neural Information Processing Systems"}],"event":{"name":"FDG22: 17th International Conference on the Foundations of Digital Games","acronym":"FDG22","location":"Athens Greece"},"container-title":["Proceedings of the 17th International Conference on the Foundations of Digital Games"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3555858.3555878","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3555858.3555878","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T17:51:37Z","timestamp":1750182697000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3555858.3555878"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,9,5]]},"references-count":35,"alternative-id":["10.1145\/3555858.3555878","10.1145\/3555858"],"URL":"https:\/\/doi.org\/10.1145\/3555858.3555878","relation":{},"subject":[],"published":{"date-parts":[[2022,9,5]]},"assertion":[{"value":"2022-11-04","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}