{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:14:53Z","timestamp":1750220093818,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":26,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,7,8]],"date-time":"2022-07-08T00:00:00Z","timestamp":1657238400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["IIS-1815886"],"award-info":[{"award-number":["IIS-1815886"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,7,8]]},"DOI":"10.1145\/3512290.3528860","type":"proceedings-article","created":{"date-parts":[[2022,8,17]],"date-time":"2022-08-17T15:32:35Z","timestamp":1660750355000},"page":"350-358","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":2,"title":["Diversifying behaviors for learning in asymmetric multiagent systems"],"prefix":"10.1145","author":[{"given":"Gaurav","family":"Dixit","sequence":"first","affiliation":[{"name":"Oregon State University"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Everardo","family":"Gonzalez","sequence":"additional","affiliation":[{"name":"Oregon State University"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Kagan","family":"Tumer","sequence":"additional","affiliation":[{"name":"Oregon State 
University"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2022,7,8]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/3071178.3071186"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/3377930.3389809"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/3377930.3389809"},{"key":"e_1_3_2_1_4_1","volume-title":"Difference Rewards Policy Gradients. arXiv preprint arXiv:2012.11258","author":"Castellini Jacopo","year":"2020","unstructured":"Jacopo Castellini, Sam Devlin, Frans A Oliehoek, and Rahul Savani. 2020. Difference Rewards Policy Gradients. arXiv preprint arXiv:2012.11258 (2020)."},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/3377930.3390217"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/3321707.3321804"},{"key":"e_1_3_2_1_7_1","unstructured":"Hoong Chuin Lau Due Thien Nguyen Akshat Kumar. 2018. Credit assignment for collective multiagent RL with global rewards. In Advances in Neural Information Processing Systems (NIPS 2018): Montreal Canada December 2-8. 8102--8113."},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.amc.2018.11.052"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"crossref","unstructured":"Jared Hill James Archibald Wynn Stirling and Richard Frost. 2005. 
A multi-agent system architecture for distributed air traffic control. In AIAA guidance navigation and control conference and exhibit. 6049.","DOI":"10.2514\/6.2005-6049"},{"key":"e_1_3_2_1_10_1","volume-title":"Evolutionary reinforcement learning for sample-efficient multiagent coordination. arXiv preprint arXiv:1906.07315","author":"Khadka Shauharda","year":"2019","unstructured":"Shauharda Khadka, Somdeb Majumdar, Santiago Miret, Stephen McAleer, and Kagan Tumer. 2019. Evolutionary reinforcement learning for sample-efficient multiagent coordination. arXiv preprint arXiv:1906.07315 (2019)."},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1177\/0278364913495721"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.5555\/3306127.3331809"},{"key":"e_1_3_2_1_13_1","volume-title":"Continuous control with deep reinforcement learning. arXiv preprint arXiv:1509.02971","author":"Lillicrap Timothy P","year":"2015","unstructured":"Timothy P Lillicrap, Jonathan J Hunt, Alexander Pritzel, Nicolas Heess, Tom Erez, Yuval Tassa, David Silver, and Daan Wierstra. 2015. Continuous control with deep reinforcement learning. arXiv preprint arXiv:1509.02971 (2015)."},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1017\/S0269888918000292"},{"key":"e_1_3_2_1_15_1","volume-title":"Illuminating search spaces by mapping elites. arXiv preprint arXiv:1504.04909","author":"Mouret Jean-Baptiste","year":"2015","unstructured":"Jean-Baptiste Mouret and Jeff Clune. 2015. 
Illuminating search spaces by mapping elites. arXiv preprint arXiv:1504.04909 (2015)."},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-77538-8_48"},{"key":"e_1_3_2_1_17_1","unstructured":"OpenAI. 2018. OpenAI Five. https:\/\/blog.openai.com\/openai-five\/."},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPRW.2017.70"},{"key":"e_1_3_2_1_19_1","volume-title":"An empirical analysis of collaboration methods in cooperative coevolutionary algorithms. Journal Spector","author":"Paul WR","year":"2002","unstructured":"WR Paul, CL William, and KA De Jong. 2002. An empirical analysis of collaboration methods in cooperative coevolutionary algorithms. Journal Spector (2002), 15."},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/IROS.2016.7759651"},{"key":"e_1_3_2_1_21_1","volume-title":"Fitness Critics for Multiagent Learning. In 2019 International Symposium on Multi-Robot and Multi-Agent Systems (MRS). IEEE, 222--224","author":"Rockefeller Golden","year":"2019","unstructured":"Golden Rockefeller, Patrick Mannion, and Kagan Tumer. 2019. Fitness Critics for Multiagent Learning. In 2019 International Symposium on Multi-Robot and Multi-Agent Systems (MRS). IEEE, 222--224."},{"key":"e_1_3_2_1_22_1","volume-title":"Proximal policy optimization algorithms. 
arXiv preprint arXiv:1707.06347","author":"Schulman John","year":"2017","unstructured":"John Schulman, Filip Wolski, Prafulla Dhariwal, Alec Radford, and Oleg Klimov. 2017. Proximal policy optimization algorithms. arXiv preprint arXiv:1707.06347 (2017)."},{"key":"e_1_3_2_1_23_1","unstructured":"Hongyao Tang Jianye Hao Tangjie Lv Yingfeng Chen Zongzhang Zhang Hangtian Jia Chunxu Ren Yan Zheng Zhaopeng Meng Changjie Fan and Li Wang. 2019. Hierarchical Deep Multiagent Reinforcement Learning with Temporal Abstraction. arXiv:1809.09332 [cs.LG]"},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/9.664154"},{"key":"e_1_3_2_1_25_1","unstructured":"
Oriol Vinyals Igor Babuschkin Junyoung Chung Michael Mathieu Max Jaderberg Wojtek Czarnecki Andrew Dudzik Aja Huang Petko Georgiev Richard Powell Timo Ewalds Dan Horgan Manuel Kroiss Ivo Danihelka John Agapiou Junhyuk Oh Valentin Dalibard David Choi Laurent Sifre Yury Sulsky Sasha Vezhnevets James Molloy Trevor Cai David Budden Tom Paine Caglar Gulcehre Ziyu Wang Tobias Pfaff Toby Pohlen Dani Yogatama Julia Cohen Katrina McKinney Oliver Smith Tom Schaul Timothy Lillicrap Chris Apps Koray Kavukcuoglu Demis Hassabis and David Silver. 2019. AlphaStar: Mastering the Real-Time Strategy Game StarCraft II. https:\/\/deepmind.com\/blog\/alphastar-mastering-real-time-strategy-game-starcraft-ii\/."},{"key":"e_1_3_2_1_26_1","volume-title":"Paired open-ended trailblazer (poet): Endlessly generating increasingly complex and diverse learning environments and their solutions. arXiv preprint arXiv:1901.01753","author":"Wang Rui","year":"2019","unstructured":"Rui Wang, Joel Lehman, Jeff Clune, and Kenneth O Stanley. 2019. Paired open-ended trailblazer (poet): Endlessly generating increasingly complex and diverse learning environments and their solutions. 
arXiv preprint arXiv:1901.01753 (2019)."}],"event":{"name":"GECCO '22: Genetic and Evolutionary Computation Conference","sponsor":["SIGEVO ACM Special Interest Group on Genetic and Evolutionary Computation"],"location":"Boston Massachusetts","acronym":"GECCO '22"},"container-title":["Proceedings of the Genetic and Evolutionary Computation Conference"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3512290.3528860","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3512290.3528860","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3512290.3528860","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T18:09:57Z","timestamp":1750183797000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3512290.3528860"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,7,8]]},"references-count":26,"alternative-id":["10.1145\/3512290.3528860","10.1145\/3512290"],"URL":"https:\/\/doi.org\/10.1145\/3512290.3528860","relation":{},"subject":[],"published":{"date-parts":[[2022,7,8]]},"assertion":[{"value":"2022-07-08","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}