{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,10]],"date-time":"2026-03-10T11:20:34Z","timestamp":1773141634842,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":31,"publisher":"ACM","license":[{"start":{"date-parts":[[2021,3,8]],"date-time":"2021-03-08T00:00:00Z","timestamp":1615161600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"NSF","award":["EFMA-1832795"],"award-info":[{"award-number":["EFMA-1832795"]}]},{"name":"NSF","award":["CNS-1305072"],"award-info":[{"award-number":["CNS-1305072"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,3,8]]},"DOI":"10.1145\/3434074.3447168","type":"proceedings-article","created":{"date-parts":[[2021,3,8]],"date-time":"2021-03-08T01:33:11Z","timestamp":1615167191000},"page":"242-246","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":11,"title":["Competitive Physical Human-Robot Game Play"],"prefix":"10.1145","author":[{"given":"Boling","family":"Yang","sequence":"first","affiliation":[{"name":"University of Washington, Seattle, WA, USA"}]},{"given":"Xiangyu","family":"Xie","sequence":"additional","affiliation":[{"name":"University of Washington, Seattle, WA, USA"}]},{"given":"Golnaz","family":"Habibi","sequence":"additional","affiliation":[{"name":"Massachusetts Institute of Technology, Boston, MA, USA"}]},{"given":"Joshua R.","family":"Smith","sequence":"additional","affiliation":[{"name":"University of Washington, Seattle, WA, USA"}]}],"member":"320","published-online":{"date-parts":[[2021,3,8]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"Emergent complexity via multi-agent competition. arXiv preprint arXiv:1710.03748","author":"Bansal Trapit","year":"2017","unstructured":"Trapit Bansal , Jakub Pachocki , Szymon Sidor , Ilya Sutskever , and Igor Mordatch . 2017. Emergent complexity via multi-agent competition. arXiv preprint arXiv:1710.03748 ( 2017 ). Trapit Bansal, Jakub Pachocki, Szymon Sidor, Ilya Sutskever, and Igor Mordatch. 2017. Emergent complexity via multi-agent competition. arXiv preprint arXiv:1710.03748 (2017)."},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/3319502.3374818"},{"key":"e_1_3_2_1_3_1","volume-title":"A Joseph Hoane Jr, and Feng-hsiung Hsu","author":"Campbell Murray","year":"2002","unstructured":"Murray Campbell , A Joseph Hoane Jr, and Feng-hsiung Hsu . 2002 . Deep blue. Artificial intelligence 134, 1--2 (2002), 57--83. Murray Campbell, A Joseph Hoane Jr, and Feng-hsiung Hsu. 2002. Deep blue. Artificial intelligence 134, 1--2 (2002), 57--83."},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/3359616"},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/ROMAN.2007.4415256"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/ROMAN.2010.5598658"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1089\/g4h.2013.0088"},{"key":"e_1_3_2_1_8_1","volume-title":"Learning with opponent-learning awareness. arXiv preprint arXiv:1709.04326","author":"Foerster Jakob N","year":"2017","unstructured":"Jakob N Foerster , Richard Y Chen , Maruan Al-Shedivat , Shimon Whiteson , Pieter Abbeel , and Igor Mordatch . 2017. Learning with opponent-learning awareness. arXiv preprint arXiv:1709.04326 ( 2017 ). Jakob N Foerster, Richard Y Chen, Maruan Al-Shedivat, Shimon Whiteson, Pieter Abbeel, and Igor Mordatch. 2017. Learning with opponent-learning awareness. arXiv preprint arXiv:1709.04326 (2017)."},{"key":"e_1_3_2_1_9_1","volume-title":"International conference on machine learning. 1804--1813","author":"He He","year":"2016","unstructured":"He He , Jordan Boyd-Graber , Kevin Kwok , and Hal Daum\u00e9 III. 2016 . Opponent modeling in deep reinforcement learning . In International conference on machine learning. 1804--1813 . He He, Jordan Boyd-Graber, Kevin Kwok, and Hal Daum\u00e9 III. 2016. Opponent modeling in deep reinforcement learning. In International conference on machine learning. 1804--1813."},{"key":"e_1_3_2_1_10_1","volume-title":"The four-phase model of interest development. Educational psychologist 41, 2","author":"Hidi Suzanne","year":"2006","unstructured":"Suzanne Hidi and K Ann Renninger . 2006. The four-phase model of interest development. Educational psychologist 41, 2 ( 2006 ), 111--127. Suzanne Hidi and K Ann Renninger. 2006. The four-phase model of interest development. Educational psychologist 41, 2 (2006), 111--127."},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1016\/0167-2789(90)90076-2"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/HRI.2019.8673201"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/HRI.2019.8673116"},{"key":"e_1_3_2_1_14_1","volume-title":"Continuous control with deep reinforcement learning. arXiv preprint arXiv:1509.02971","author":"Lillicrap Timothy P","year":"2015","unstructured":"Timothy P Lillicrap , Jonathan J Hunt , Alexander Pritzel , Nicolas Heess , Tom Erez , Yuval Tassa , David Silver , and Daan Wierstra . 2015. Continuous control with deep reinforcement learning. arXiv preprint arXiv:1509.02971 ( 2015 ). Timothy P Lillicrap, Jonathan J Hunt, Alexander Pritzel, Nicolas Heess, Tom Erez, Yuval Tassa, David Silver, and Daan Wierstra. 2015. Continuous control with deep reinforcement learning. arXiv preprint arXiv:1509.02971 (2015)."},{"key":"e_1_3_2_1_15_1","volume-title":"OpenAI Pieter Abbeel, and Igor Mordatch","author":"Lowe Ryan","year":"2017","unstructured":"Ryan Lowe , Yi I Wu , Aviv Tamar , Jean Harb , OpenAI Pieter Abbeel, and Igor Mordatch . 2017 . Multi-agent actor-critic for mixed cooperative-competitive environments. In Advances in neural information processing systems. 6379--6390. Ryan Lowe, Yi I Wu, Aviv Tamar, Jean Harb, OpenAI Pieter Abbeel, and Igor Mordatch. 2017. Multi-agent actor-critic for mixed cooperative-competitive environments. In Advances in neural information processing systems. 6379--6390."},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10514-017-9655-8"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1007\/s12008-015-0259-2"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/1121241.1121311"},{"key":"e_1_3_2_1_19_1","volume-title":"Robust adversarial reinforcement learning. arXiv preprint arXiv:1703.02702","author":"Pinto Lerrel","year":"2017","unstructured":"Lerrel Pinto , James Davidson , Rahul Sukthankar , and Abhinav Gupta . 2017. Robust adversarial reinforcement learning. arXiv preprint arXiv:1703.02702 ( 2017 ). Lerrel Pinto, James Davidson, Rahul Sukthankar, and Abhinav Gupta. 2017. Robust adversarial reinforcement learning. arXiv preprint arXiv:1703.02702 (2017)."},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1037\/a0032688"},{"key":"e_1_3_2_1_21_1","volume-title":"Katie Salen Tekinba?, and Eric Zimmerman","author":"Salen Katie","year":"2004","unstructured":"Katie Salen , Katie Salen Tekinba?, and Eric Zimmerman . 2004 . Rules of play: Game design fundamentals. MIT press . Katie Salen, Katie Salen Tekinba?, and Eric Zimmerman. 2004. Rules of play: Game design fundamentals. MIT press."},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2017.7989125"},{"key":"e_1_3_2_1_23_1","volume-title":"Checkers is solved. science 317, 5844","author":"Schaeffer Jonathan","year":"2007","unstructured":"Jonathan Schaeffer , Neil Burch , Yngvi Bj\u00f6rnsson , Akihiro Kishimoto , Martin M\u00fcller , Robert Lake , Paul Lu , and Steve Sutphen . 2007. Checkers is solved. science 317, 5844 ( 2007 ), 1518--1522. Jonathan Schaeffer, Neil Burch, Yngvi Bj\u00f6rnsson, Akihiro Kishimoto, Martin M\u00fcller, Robert Lake, Paul Lu, and Steve Sutphen. 2007. Checkers is solved. science 317, 5844 (2007), 1518--1522."},{"key":"e_1_3_2_1_24_1","volume-title":"Proximal policy optimization algorithms. arXiv preprint arXiv:1707.06347","author":"Schulman John","year":"2017","unstructured":"John Schulman , Filip Wolski , Prafulla Dhariwal , Alec Radford , and Oleg Klimov . 2017. Proximal policy optimization algorithms. arXiv preprint arXiv:1707.06347 ( 2017 ). John Schulman, Filip Wolski, Prafulla Dhariwal, Alec Radford, and Oleg Klimov. 2017. Proximal policy optimization algorithms. arXiv preprint arXiv:1707.06347 (2017)."},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1109\/HRI.2010.5453193"},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"crossref","unstructured":"David Silver Thomas Hubert Julian Schrittwieser Ioannis Antonoglou Matthew Lai Arthur Guez Marc Lanctot Laurent Sifre Dharshan Kumaran Thore Graepel etal 2018. A general reinforcement learning algorithm that masters chess shogi and Go through self-play. Science 362 6419 (2018) 1140--1144.  David Silver Thomas Hubert Julian Schrittwieser Ioannis Antonoglou Matthew Lai Arthur Guez Marc Lanctot Laurent Sifre Dharshan Kumaran Thore Graepel et al. 2018. A general reinforcement learning algorithm that masters chess shogi and Go through self-play. Science 362 6419 (2018) 1140--1144.","DOI":"10.1126\/science.aar6404"},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"crossref","unstructured":"David Silver Julian Schrittwieser Karen Simonyan Ioannis Antonoglou Aja Huang Arthur Guez Thomas Hubert Lucas Baker Matthew Lai Adrian Bolton etal 2017. Mastering the game of go without human knowledge. nature 550 7676 (2017) 354--359.  David Silver Julian Schrittwieser Karen Simonyan Ioannis Antonoglou Aja Huang Arthur Guez Thomas Hubert Lucas Baker Matthew Lai Adrian Bolton et al. 2017. Mastering the game of go without human knowledge. nature 550 7676 (2017) 354--359.","DOI":"10.1038\/nature24270"},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.5555\/1622467.1622471"},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/ROMAN.2014.6926267"},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0172395"},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1556\/APhysiol.97.2010.1.3"}],"event":{"name":"HRI '21: ACM\/IEEE International Conference on Human-Robot Interaction","location":"Boulder CO USA","acronym":"HRI '21","sponsor":["SIGAI ACM Special Interest Group on Artificial Intelligence","SIGCHI ACM Special Interest Group on Computer-Human Interaction"]},"container-title":["Companion of the 2021 ACM\/IEEE International Conference on Human-Robot Interaction"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3434074.3447168","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3434074.3447168","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3434074.3447168","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T20:47:28Z","timestamp":1750193248000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3434074.3447168"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,3,8]]},"references-count":31,"alternative-id":["10.1145\/3434074.3447168","10.1145\/3434074"],"URL":"https:\/\/doi.org\/10.1145\/3434074.3447168","relation":{},"subject":[],"published":{"date-parts":[[2021,3,8]]},"assertion":[{"value":"2021-03-08","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}