{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:19:20Z","timestamp":1750220360418,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":33,"publisher":"ACM","license":[{"start":{"date-parts":[[2021,8,3]],"date-time":"2021-08-03T00:00:00Z","timestamp":1627948800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2021,8,3]]},"DOI":"10.1145\/3472538.3472546","type":"proceedings-article","created":{"date-parts":[[2021,10,21]],"date-time":"2021-10-21T22:48:54Z","timestamp":1634856534000},"page":"1-7","source":"Crossref","is-referenced-by-count":0,"title":["Meta-Learning a Solution to the Hanabi Ad-Hoc Challenge"],"prefix":"10.1145","author":[{"given":"Aron","family":"Sarmasi","sequence":"first","affiliation":[{"name":"UC Davis, United States"}]},{"given":"Timothy","family":"Zhang","sequence":"additional","affiliation":[{"name":"University of California, Davis, United States"}]},{"given":"Chu-Hung","family":"Cheng","sequence":"additional","affiliation":[{"name":"UC Davis, United States"}]},{"given":"Huyen","family":"Pham","sequence":"additional","affiliation":[{"name":"UC Davis, United States"}]},{"given":"Xuanchen","family":"Zhou","sequence":"additional","affiliation":[{"name":"UC Davis, United States"}]},{"given":"Duong","family":"Nguyen","sequence":"additional","affiliation":[{"name":"University of California, Davis, United States"}]},{"given":"Soumil","family":"Shekdar","sequence":"additional","affiliation":[{"name":"UC Davis, United States"}]},{"given":"Joshua","family":"McCoy","sequence":"additional","affiliation":[{"name":"University of California, Davis, United 
States"}]}],"member":"320","published-online":{"date-parts":[[2021,10,21]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"Albrecht and Peter Stone","author":"V.","year":"2018","unstructured":"Stefano\u00a0 V. Albrecht and Peter Stone . 2018 . Autonomous agents modelling other agents: A comprehensive survey and open problems. Artificial Intelligence 258 (may 2018), 66\u201395. https:\/\/doi.org\/10.1016\/j.artint.2018.01.002 arxiv:1709.08071 Stefano\u00a0V. Albrecht and Peter Stone. 2018. Autonomous agents modelling other agents: A comprehensive survey and open problems. Artificial Intelligence 258 (may 2018), 66\u201395. https:\/\/doi.org\/10.1016\/j.artint.2018.01.002 arxiv:1709.08071"},{"key":"e_1_3_2_1_2_1","volume-title":"International Conference on Learning Representations. https:\/\/openreview.net\/forum?id=HJGven05Y7","author":"Antoniou Antreas","year":"2019","unstructured":"Antreas Antoniou , Harrison Edwards , and Amos Storkey . 2019 . How to train your MAML . In International Conference on Learning Representations. https:\/\/openreview.net\/forum?id=HJGven05Y7 Antreas Antoniou, Harrison Edwards, and Amos Storkey. 2019. How to train your MAML. In International Conference on Learning Representations. https:\/\/openreview.net\/forum?id=HJGven05Y7"},{"key":"e_1_3_2_1_5_1","unstructured":"Antoine Bauza. 2010. Hanabi. https:\/\/www.boardgamegeek.com\/boardgame\/98778\/hanabi  Antoine Bauza. 2010. Hanabi. https:\/\/www.boardgamegeek.com\/boardgame\/98778\/hanabi"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/CIG.2019.8847944"},{"key":"e_1_3_2_1_7_1","volume-title":"Dopamine: A Research Framework For Deep Reinforcement Learning. In unpublished. arxiv:1812.06110v1https:\/\/github.com\/google\/dopamine","author":"Castro Pablo\u00a0Samuel","year":"2018","unstructured":"Pablo\u00a0Samuel Castro , Subhodeep Moitra , Carles Gelada , Saurabh Kumar , Marc\u00a0 G Bellemare , and Google Brain . 2018 . 
Dopamine: A Research Framework For Deep Reinforcement Learning. In unpublished. arxiv:1812.06110v1https:\/\/github.com\/google\/dopamine Pablo\u00a0Samuel Castro, Subhodeep Moitra, Carles Gelada, Saurabh Kumar, Marc\u00a0G Bellemare, and Google Brain. 2018. Dopamine: A Research Framework For Deep Reinforcement Learning. In unpublished. arxiv:1812.06110v1https:\/\/github.com\/google\/dopamine"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.4169\/math.mag.88.5.323"},{"key":"e_1_3_2_1_9_1","unstructured":"Yan Duan Marcin Andrychowicz Bradly Stadie Jonathan Ho Jonas Schneider Ilya Sutskever Pieter Abbeel and Wojciech Zaremba. 2017. One-Shot Imitation Learning. In NIPS.  Yan Duan Marcin Andrychowicz Bradly Stadie Jonathan Ho Jonas Schneider Ilya Sutskever Pieter Abbeel and Wojciech Zaremba. 2017. One-Shot Imitation Learning. In NIPS."},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"crossref","unstructured":"Markus Eger and Daniel Gruss. 2019. Wait a Second: Playing Hanabi without Giving Hints. (2019). https:\/\/doi.org\/10.1145\/3337722  Markus Eger and Daniel Gruss. 2019. Wait a Second: Playing Hanabi without Giving Hints. (2019). https:\/\/doi.org\/10.1145\/3337722","DOI":"10.1145\/3337722.3337744"},{"key":"e_1_3_2_1_11_1","unstructured":"Chelsea Finn Pieter Abbeel and Sergey Levine. 2017. Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks. In ICML. arxiv:1703.03400v3  Chelsea Finn Pieter Abbeel and Sergey Levine. 2017. Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks. In ICML. arxiv:1703.03400v3"},{"key":"e_1_3_2_1_12_1","unstructured":"Chelsea Finn Tianhe Yu Tianhao Zhang Pieter Abbeel and Sergey Levine. 2017. One-Shot Visual Imitation Learning via Meta-Learning. In CoRL. arxiv:1709.04905v1  Chelsea Finn Tianhe Yu Tianhao Zhang Pieter Abbeel and Sergey Levine. 2017. One-Shot Visual Imitation Learning via Meta-Learning. In CoRL. arxiv:1709.04905v1"},{"key":"e_1_3_2_1_13_1","unstructured":"Victor Garcia and Joan Bruna. 
2018. Few-Shot Learning With Graph Neural Net-Works. In ICLR. arxiv:1711.04043v3  Victor Garcia and Joan Bruna. 2018. Few-Shot Learning With Graph Neural Net-Works. In ICLR. arxiv:1711.04043v3"},{"key":"e_1_3_2_1_14_1","unstructured":"Erin Grant Chelsea Finn Sergey Levine Trevor Darrell and Thomas Griffiths. 2018. Recasting Gradient-Based Meta-Learning as Hierarchical Bayes. In ICLR. arxiv:1801.08930v1  Erin Grant Chelsea Finn Sergey Levine Trevor Darrell and Thomas Griffiths. 2018. Recasting Gradient-Based Meta-Learning as Hierarchical Bayes. In ICLR. arxiv:1801.08930v1"},{"key":"e_1_3_2_1_15_1","unstructured":"He He Jordan Boyd-Graber Kevin Kwok and Hal Daum\u00e9. 2016. Opponent Modeling in Deep Reinforcement Learning. In ICML. arxiv:1609.05559  He He Jordan Boyd-Graber Kevin Kwok and Hal Daum\u00e9. 2016. Opponent Modeling in Deep Reinforcement Learning. In ICML. arxiv:1609.05559"},{"key":"e_1_3_2_1_16_1","unstructured":"Jonathan Ho and Stefano Ermon. 2016. Generative Adversarial Imitation Learning. In NIPS. arxiv:1606.03476  Jonathan Ho and Stefano Ermon. 2016. Generative Adversarial Imitation Learning. In NIPS. arxiv:1606.03476"},{"key":"e_1_3_2_1_17_1","volume-title":"Human-level concept learning through probabilistic program induction. Science 350, 6266","author":"Lake M","year":"2015","unstructured":"Brenden\u00a0 M Lake , Ruslan Salakhutdinov , and Joshua\u00a0 B Tenenbaum . 2015. Human-level concept learning through probabilistic program induction. Science 350, 6266 ( 2015 ), 1332\u20131338. Brenden\u00a0M Lake, Ruslan Salakhutdinov, and Joshua\u00a0B Tenenbaum. 2015. Human-level concept learning through probabilistic program induction. Science 350, 6266 (2015), 1332\u20131338."},{"key":"e_1_3_2_1_18_1","unstructured":"Hoang\u00a0M. Le Yisong Yue Peter Carr and Patrick Lucey. 2017. Coordinated Multi-Agent Imitation Learning. In ICML. arxiv:1703.03121  Hoang\u00a0M. Le Yisong Yue Peter Carr and Patrick Lucey. 2017. 
Coordinated Multi-Agent Imitation Learning. In ICML. arxiv:1703.03121"},{"key":"e_1_3_2_1_20_1","unstructured":"Chris Metzen and James Phinney. 1998. StarCraft: Remastered. https:\/\/starcraft.com\/en-us\/  Chris Metzen and James Phinney. 1998. StarCraft: Remastered. https:\/\/starcraft.com\/en-us\/"},{"key":"e_1_3_2_1_21_1","unstructured":"Nikhil Mishra Mostafa Rohaninejad Xi Chen and Pieter Abbeel. 2018. A Simple Neural Attentive Meta-Learner. In ICLR. arxiv:1707.03141  Nikhil Mishra Mostafa Rohaninejad Xi Chen and Pieter Abbeel. 2018. A Simple Neural Attentive Meta-Learner. In ICLR. arxiv:1707.03141"},{"key":"e_1_3_2_1_22_1","unstructured":"Volodymyr Mnih Adri\u00e0\u00a0Puigdom\u00e8nech Badia Mehdi Mirza Alex Graves Tim Harley Timothy\u00a0P Lillicrap David Silver and Koray Kavukcuoglu. 2016. Asynchronous Methods for Deep Reinforcement Learning. JMLR 48(2016). arxiv:1602.01783v2  Volodymyr Mnih Adri\u00e0\u00a0Puigdom\u00e8nech Badia Mehdi Mirza Alex Graves Tim Harley Timothy\u00a0P Lillicrap David Silver and Koray Kavukcuoglu. 2016. Asynchronous Methods for Deep Reinforcement Learning. JMLR 48(2016). arxiv:1602.01783v2"},{"key":"e_1_3_2_1_23_1","unstructured":"Jean-Baptiste Mouret and Jeff Clune. 2015. Illuminating search spaces by mapping elites. arXiv preprint arXiv:1504.04909(2015).  Jean-Baptiste Mouret and Jeff Clune. 2015. Illuminating search spaces by mapping elites. arXiv preprint arXiv:1504.04909(2015)."},{"key":"e_1_3_2_1_24_1","unstructured":"Arthur O\u2019Dwyer. 2018. Github - quuxplusone\/hanabi: Framework for writing bots that play Hanabi.https:\/\/github.com\/Quuxplusone\/Hanabi  Arthur O\u2019Dwyer. 2018. Github - quuxplusone\/hanabi: Framework for writing bots that play Hanabi.https:\/\/github.com\/Quuxplusone\/Hanabi"},{"key":"e_1_3_2_1_25_1","volume-title":"Towards Understanding the Effectiveness of MAML. In International Conference on Learning Representations. 
https:\/\/openreview.net\/forum?id=rkgMkCEtPB","author":"Raghu Aniruddh","year":"2020","unstructured":"Aniruddh Raghu , Maithra Raghu , Samy Bengio , and Oriol Vinyals . 2020 . Rapid Learning or Feature Reuse? Towards Understanding the Effectiveness of MAML. In International Conference on Learning Representations. https:\/\/openreview.net\/forum?id=rkgMkCEtPB Aniruddh Raghu, Maithra Raghu, Samy Bengio, and Oriol Vinyals. 2020. Rapid Learning or Feature Reuse? Towards Understanding the Effectiveness of MAML. In International Conference on Learning Representations. https:\/\/openreview.net\/forum?id=rkgMkCEtPB"},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10514-009-9121-3"},{"key":"e_1_3_2_1_27_1","first-page":"661","article-title":"Efficient Reductions for Imitation Learning","volume":"9","author":"Ross St\u00e9phane","year":"2010","unstructured":"St\u00e9phane Ross and J Andrew Bagnell . 2010 . Efficient Reductions for Imitation Learning . JMLR 9 (2010), 661 \u2013 668 . St\u00e9phane Ross and J Andrew Bagnell. 2010. Efficient Reductions for Imitation Learning. JMLR 9(2010), 661\u2013668.","journal-title":"JMLR"},{"key":"e_1_3_2_1_28_1","unstructured":"Andrei\u00a0A Rusu Dushyant Rao Jakub Sygnowski Oriol Vinyals Razvan Pascanu Simon Osindero and Raia Hadsell. 2019. Meta-Learning With Latent Embedding Optimization. In ICLR. arxiv:1807.05960v3  Andrei\u00a0A Rusu Dushyant Rao Jakub Sygnowski Oriol Vinyals Razvan Pascanu Simon Osindero and Raia Hadsell. 2019. Meta-Learning With Latent Embedding Optimization. In ICLR. arxiv:1807.05960v3"},{"key":"e_1_3_2_1_29_1","volume-title":"HOAD: The Hanabi Open Agent Dataset. In AAMAS. Montreal, Canada.","author":"Sarmasi Aron","year":"2021","unstructured":"Aron Sarmasi , Timothy Zhang , Chu-Hung Cheng , Huyen Pham , Xuanchen Zhou , Duong Nguyen , Soumil Shekdar , and Joshua McCoy . 2021 . HOAD: The Hanabi Open Agent Dataset. In AAMAS. Montreal, Canada. 
Aron Sarmasi, Timothy Zhang, Chu-Hung Cheng, Huyen Pham, Xuanchen Zhou, Duong Nguyen, Soumil Shekdar, and Joshua McCoy. 2021. HOAD: The Hanabi Open Agent Dataset. In AAMAS. Montreal, Canada."},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"crossref","unstructured":"David Silver Aja Huang Chris\u00a0J Maddison Arthur Guez Laurent Sifre George Van Den Driessche Julian Schrittwieser Ioannis Antonoglou Veda Panneershelvam Marc Lanctot Sander Dieleman Dominik Grewe John Nham Nal Kalchbrenner Ilya Sutskever Timothy Lillicrap Madeleine Leach Koray Kavukcuoglu Thore Graepel and Demis Hassabis. 2016. Mastering the game of Go with deep neural networks and tree search. Nature 529(2016). https:\/\/doi.org\/10.1038\/nature16961  David Silver Aja Huang Chris\u00a0J Maddison Arthur Guez Laurent Sifre George Van Den Driessche Julian Schrittwieser Ioannis Antonoglou Veda Panneershelvam Marc Lanctot Sander Dieleman Dominik Grewe John Nham Nal Kalchbrenner Ilya Sutskever Timothy Lillicrap Madeleine Leach Koray Kavukcuoglu Thore Graepel and Demis Hassabis. 2016. Mastering the game of Go with deep neural networks and tree search. Nature 529(2016). https:\/\/doi.org\/10.1038\/nature16961","DOI":"10.1038\/nature16961"},{"key":"e_1_3_2_1_31_1","unstructured":"Jake Snell Kevin Swersky and Twitter\u00a0Richard Zemel. 2017. Prototypical Networks for Few-shot Learning. In NIPS.  Jake Snell Kevin Swersky and Twitter\u00a0Richard Zemel. 2017. Prototypical Networks for Few-shot Learning. In NIPS."},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"crossref","unstructured":"Flood Sung Yang Yongxin Li Zhang Tao Xiang Philip\u00a0HS Torr and Timothy\u00a0M Hospedales. 2018. Learning to Compare: Relation Network for Few-Shot Learning. In CVPR. 1199\u20131208.  Flood Sung Yang Yongxin Li Zhang Tao Xiang Philip\u00a0HS Torr and Timothy\u00a0M Hospedales. 2018. Learning to Compare: Relation Network for Few-Shot Learning. In CVPR. 
1199\u20131208.","DOI":"10.1109\/CVPR.2018.00131"},{"key":"e_1_3_2_1_33_1","unstructured":"Oriol Vinyals Google Deepmind Charles Blundell Timothy Lillicrap Koray Kavukcuoglu and Daan Wierstra. 2016. Matching Networks for One Shot Learning. In NIPS.  Oriol Vinyals Google Deepmind Charles Blundell Timothy Lillicrap Koray Kavukcuoglu and Daan Wierstra. 2016. Matching Networks for One Shot Learning. In NIPS."},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"crossref","unstructured":"Joseph Walton-Rivers Piers\u00a0R. Williams Richard Bartle Diego Perez-Liebana and Simon\u00a0M. Lucas. 2017. Evaluating and Modelling Hanabi-Playing Agents. In CEC. 1382 \u2013 1389.  Joseph Walton-Rivers Piers\u00a0R. Williams Richard Bartle Diego Perez-Liebana and Simon\u00a0M. Lucas. 2017. Evaluating and Modelling Hanabi-Playing Agents. In CEC. 1382 \u2013 1389.","DOI":"10.1109\/CEC.2017.7969465"},{"key":"e_1_3_2_1_35_1","volume-title":"Robust Imitation of Diverse Behaviors. In Conference on Neural Information Processing Systems. arxiv:1707","author":"Wang Ziyu","year":"2017","unstructured":"Ziyu Wang , Josh Merel , Scott Reed , Greg Wayne , Nando de Freitas , and Nicolas Heess . 2017 . Robust Imitation of Diverse Behaviors. In Conference on Neural Information Processing Systems. arxiv:1707 .02747 Ziyu Wang, Josh Merel, Scott Reed, Greg Wayne, Nando de Freitas, and Nicolas Heess. 2017. Robust Imitation of Diverse Behaviors. In Conference on Neural Information Processing Systems. arxiv:1707.02747"},{"key":"e_1_3_2_1_37_1","unstructured":"James Zamiell. [n.d.]. GitHub - Zamiell\/hanabi-conventions: A list of Hanabi strategies. https:\/\/github.com\/Zamiell\/hanabi-conventions  James Zamiell. [n.d.]. GitHub - Zamiell\/hanabi-conventions: A list of Hanabi strategies. 
https:\/\/github.com\/Zamiell\/hanabi-conventions"}],"event":{"name":"FDG'21: The 16th International Conference on the Foundations of Digital Games 2021","acronym":"FDG'21","location":"Montreal QC Canada"},"container-title":["The 16th International Conference on the Foundations of Digital Games (FDG) 2021"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3472538.3472546","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3472538.3472546","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T20:17:23Z","timestamp":1750191443000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3472538.3472546"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,8,3]]},"references-count":33,"alternative-id":["10.1145\/3472538.3472546","10.1145\/3472538"],"URL":"https:\/\/doi.org\/10.1145\/3472538.3472546","relation":{},"subject":[],"published":{"date-parts":[[2021,8,3]]}}}