{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,27]],"date-time":"2026-05-27T16:24:23Z","timestamp":1779899063599,"version":"3.53.1"},"reference-count":159,"publisher":"Association for Computing Machinery (ACM)","issue":"5","license":[{"start":{"date-parts":[[2025,1,24]],"date-time":"2025-01-24T00:00:00Z","timestamp":1737676800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Comput. Surv."],"published-print":{"date-parts":[[2025,5,31]]},"abstract":"<jats:p>Multi-Agent Reinforcement Learning (MARL) is susceptible to Adversarial Machine Learning (AML) attacks. Execution-time AML attacks against MARL are complex due to effects that propagate across time and between agents. To understand the interaction between AML and MARL, this survey covers attacks and defences for MARL, Multi-Agent Learning (MAL), and Deep Reinforcement Learning (DRL). This survey proposes a novel perspective on AML attacks based on attack vectors. This survey also proposes a framework that addresses gaps in current modelling frameworks and enables the comparison of different attacks against MARL. Lastly, the survey identifies knowledge gaps and future avenues of research.<\/jats:p>","DOI":"10.1145\/3708320","type":"journal-article","created":{"date-parts":[[2024,12,18]],"date-time":"2024-12-18T10:51:46Z","timestamp":1734519106000},"page":"1-35","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":32,"title":["Adversarial Machine Learning Attacks and Defences in Multi-Agent Reinforcement Learning"],"prefix":"10.1145","volume":"57","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-2229-4532","authenticated-orcid":false,"given":"Maxwell","family":"Standen","sequence":"first","affiliation":[{"name":"School of Computer and Mathematical Sciences, The University of Adelaide, Adelaide, Australia and Defence Science and Technology Group, Edinburgh, Australia"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2288-8699","authenticated-orcid":false,"given":"Junae","family":"Kim","sequence":"additional","affiliation":[{"name":"Defence Science and Technology Group, Edinburgh Australia"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2501-1155","authenticated-orcid":false,"given":"Claudia","family":"Szabo","sequence":"additional","affiliation":[{"name":"School of Computer and Mathematical Sciences, The University of Adelaide, Adelaide, Australia"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"320","published-online":{"date-parts":[[2025,1,24]]},"reference":[{"key":"e_1_3_2_2_2","doi-asserted-by":"publisher","DOI":"10.1109\/MSP.2017.2743240"},{"key":"e_1_3_2_3_2","doi-asserted-by":"publisher","DOI":"10.24963\/ijcai.2021\/591"},{"key":"e_1_3_2_4_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v32i1.11672"},{"key":"e_1_3_2_5_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-62416-7_19"},{"key":"e_1_3_2_6_2","unstructured":"Vahid Behzadan and Arslan Munir. 2017. Whatever Does Not Kill Deep Reinforcement Learning Makes It Stronger. arXiv: https:\/\/arxiv.org\/abs\/1712.09344"},{"key":"e_1_3_2_7_2","first-page":"2633","volume-title":"Proceedings of the International Conference on Autonomous Agents and Multiagent Systems","author":"Belaire Roman","year":"2024","unstructured":"Roman Belaire, Pradeep Varakantham, Thanh Nguyen, and David Lo. 2024. Regret-based defense in adversarial reinforcement learning. In Proceedings of the International Conference on Autonomous Agents and Multiagent Systems. 2633\u20132640."},{"key":"e_1_3_2_8_2","unstructured":"Arjun Nitin Bhagoji Warren He Bo Li and Dawn Song. 2017. Exploring the Space of Black-box Attacks on Deep Neural Networks. arXiv: https:\/\/arxiv.org\/abs\/1712.09491"},{"key":"e_1_3_2_9_2","first-page":"1394","volume-title":"Proceedings of the Conference on Robot Learning","author":"Blumenkamp Jan","year":"2021","unstructured":"Jan Blumenkamp and Amanda Prorok. 2021. The emergence of adversarial communication in multi-agent reinforcement learning. In Proceedings of the Conference on Robot Learning. 1394\u20131414."},{"key":"e_1_3_2_10_2","unstructured":"Tom B. Brown Dandelion Man\u00e9 Aurko Roy Mart\u00edn Abadi and Justin Gilmer. 2017. Adversarial Patch. https:\/\/machine-learning-and-security.github.io\/"},{"key":"e_1_3_2_11_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA46639.2022.9811574"},{"key":"e_1_3_2_12_2","first-page":"13","volume-title":"Proceedings of the Advances in Neural Information Processing Systems","volume":"36","author":"Bukharin A.","year":"2023","unstructured":"A. Bukharin, Y. Li, Y. Yu, Q. Zhang, Z. Chen, S. Zuo, C. Zhang, S. Zhang, and T. Zhao. 2023. Robust multi-agent reinforcement learning via adversarial regularization: Theoretical foundation and stable algorithms. In Proceedings of the Advances in Neural Information Processing Systems, Vol. 36. 13 pages."},{"key":"e_1_3_2_13_2","doi-asserted-by":"publisher","DOI":"10.1109\/TSMCC.2007.913919"},{"key":"e_1_3_2_14_2","doi-asserted-by":"publisher","DOI":"10.1145\/3128572.3140444"},{"key":"e_1_3_2_15_2","doi-asserted-by":"publisher","DOI":"10.1109\/SP.2017.49"},{"key":"e_1_3_2_16_2","unstructured":"Stephen Casper Taylor Killian Gabriel Kreiman and Dylan Hadfield-Menell. 2022. White-Box Adversarial Policies in Deep Reinforcement Learning. arXiv: https:\/\/arxiv.org\/abs\/2209.02167"},{"key":"e_1_3_2_17_2","doi-asserted-by":"publisher","DOI":"10.5555\/1756006.1859921"},{"key":"e_1_3_2_18_2","doi-asserted-by":"publisher","DOI":"10.1145\/3320269.3384715"},{"key":"e_1_3_2_19_2","doi-asserted-by":"publisher","DOI":"10.1186\/s42400-019-0027-x"},{"key":"e_1_3_2_20_2","unstructured":"Yize Chen Daniel Arnold Yuanyuan Shi and Sean Peisert. 2021. Understanding the Safety Requirements for Learning-based Power Systems Operations. arXiv: https:\/\/arxiv.org\/abs\/2110.04983"},{"key":"e_1_3_2_21_2","first-page":"11","volume-title":"Proceedings of the Advances in Neural Information Processing Systems","volume":"31","author":"Cheng Ricson","year":"2018","unstructured":"Ricson Cheng, Ziyan Wang, and Katerina Fragkiadaki. 2018. Geometry-aware recurrent neural networks for active visual recognition. In Proceedings of the Advances in Neural Information Processing Systems, Vol. 31. 11 pages."},{"key":"e_1_3_2_22_2","first-page":"31","volume-title":"Proceedings of the Advances in Neural Information Processing Systems","volume":"36","author":"Cheng Z.","year":"2023","unstructured":"Z. Cheng, X. Wu, J. Yu, W. Sun, W. Guo, and X. Xing. 2023. StateMask: Explaining deep reinforcement learning through state mask. In Proceedings of the Advances in Neural Information Processing Systems, Vol. 36. 31 pages."},{"key":"e_1_3_2_23_2","first-page":"1538","volume-title":"Proceedings of the International Conference on Machine Learning","volume":"2019","author":"Das Abhishek","year":"2019","unstructured":"Abhishek Das, Th\u00e9ophile Gervet, Joshua Romoff, Dhruv Batra, Devi Parikh, Mike Rabbat, and Joelle Pineau. 2019. TarMAC: Targeted multi-agent communication. In Proceedings of the International Conference on Machine Learning, Vol. 2019-June. 1538\u20131546."},{"key":"e_1_3_2_24_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v38i16.29682"},{"key":"e_1_3_2_25_2","doi-asserted-by":"publisher","DOI":"10.1109\/CSR51186.2021.9527986"},{"key":"e_1_3_2_26_2","doi-asserted-by":"publisher","DOI":"10.1109\/TNNLS.2021.3056046"},{"key":"e_1_3_2_27_2","doi-asserted-by":"publisher","DOI":"10.1109\/ITSC.2018.8569635"},{"key":"e_1_3_2_28_2","first-page":"2145","volume-title":"Proceedings of the Advances in Neural Information Processing Systems","author":"Foerster Jakob","year":"2016","unstructured":"Jakob Foerster, Ioannis Alexandros Assael, Nando De Freitas, and Shimon Whiteson. 2016. Learning to communicate with deep multi-agent reinforcement learning. In Proceedings of the Advances in Neural Information Processing Systems. 2145\u20132153."},{"key":"e_1_3_2_29_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v32i1.11794"},{"key":"e_1_3_2_30_2","doi-asserted-by":"publisher","DOI":"10.1109\/WACV45572.2020.9093310"},{"key":"e_1_3_2_31_2","unstructured":"Ted Fujimoto and Arthur Paul Pedersen. 2022. Adversarial Attacks in Cooperative AI. arXiv: https:\/\/arxiv.org\/abs\/2111.14833"},{"key":"e_1_3_2_32_2","doi-asserted-by":"publisher","unstructured":"Javier Garc\u00eda Rub\u00e9n Majadas and Fernando Fern\u00e1ndez. 2020. Learning adversarial attack policies through multi-objective reinforcement learning. Engineering Applications of Artificial Intelligence 96 (2020) 104021. DOI:10.1016\/j.engappai.2020.104021","DOI":"10.1016\/j.engappai.2020.104021"},{"key":"e_1_3_2_33_2","doi-asserted-by":"publisher","unstructured":"Javier Garc\u00eda and Ismael Sagredo. 2022. Instance-based defense against adversarial attacks in deep reinforcement learning. Engineering Applications of Artificial Intelligence 107 (2022) 104514. DOI:10.1016\/j.engappai.2021.104514","DOI":"10.1016\/j.engappai.2021.104514"},{"key":"e_1_3_2_34_2","first-page":"27","volume-title":"Proceedings of the Advances in Neural Information Processing Systems","volume":"36","author":"Ghai U.","year":"2023","unstructured":"U. Ghai, A. Gupta, W. Xia, K. Singh, and E. Hazan. 2023. Online nonstochastic model-free reinforcement learning. In Proceedings of the Advances in Neural Information Processing Systems, Vol. 36. 27 pages."},{"key":"e_1_3_2_35_2","first-page":"16","volume-title":"Proceedings of the International Conference on Learning Representations","author":"Gleave Adam","year":"2020","unstructured":"Adam Gleave, Michael Dennis, Cody Wild, Neel Kant, Sergey Levine, and Stuart Russell. 2020. Adversarial policies: Attacking deep reinforcement learning. In Proceedings of the International Conference on Learning Representations. 16 pages."},{"key":"e_1_3_2_36_2","first-page":"11","volume-title":"Proceedings of the International Conference on Learning Representations","author":"Goodfellow Ian J","year":"2015","unstructured":"Ian J Goodfellow, Jonathon Shlens, and Christian Szegedy. 2015. Explaining and harnessing adversarial examples. In Proceedings of the International Conference on Learning Representations. 11 pages."},{"key":"e_1_3_2_37_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10994-020-05929-w"},{"key":"e_1_3_2_38_2","first-page":"12","volume-title":"Proceedings of the International Conference on Learning Representations","author":"Guo Chuan","year":"2018","unstructured":"Chuan Guo, Mayank Rana, Moustapha Cisse, and Laurens Van Der Maaten. 2018. Countering adversarial images using input transformations. In Proceedings of the International Conference on Learning Representations. 12 pages."},{"key":"e_1_3_2_39_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPRW56347.2022.00022"},{"key":"e_1_3_2_40_2","first-page":"3910","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Guo Wenbo","year":"2021","unstructured":"Wenbo Guo, Xian Wu, Sui Huang, and Xinyu Xing. 2021. Adversarial policy learning in two-player competitive games. In Proceedings of the International Conference on Machine Learning. 3910\u20133919."},{"key":"e_1_3_2_41_2","first-page":"12222","volume-title":"Proceedings of the Advances in Neural Information Processing Systems","volume":"34","author":"Guo Wenbo","year":"2021","unstructured":"Wenbo Guo, Xian Wu, Usmann Khan, and Xinyu Xing. 2021. EDGE: Explaining deep reinforcement learning policies. In Proceedings of the Advances in Neural Information Processing Systems, Vol. 34. 12222\u201312236."},{"key":"e_1_3_2_42_2","first-page":"3943","volume-title":"Proceedings of the USENIX Security Symposium","volume":"6","author":"Guo W.","year":"2023","unstructured":"W. Guo, X. Wu, L. Wang, X. Xing, and D. Song. 2023. PATROL: Provable defense against adversarial policy in two-player games. In Proceedings of the USENIX Security Symposium, Vol. 6. 3943\u20133960."},{"key":"e_1_3_2_43_2","doi-asserted-by":"crossref","unstructured":"Jayesh K. Gupta Maxim Egorov and Mykel Kochenderfer. 2017. Cooperative multi-agent control using deep reinforcement learning. In Autonomous Agents and Multiagent Systems Springer International Publishing Cham 66\u201383.","DOI":"10.1007\/978-3-319-71682-4_5"},{"key":"e_1_3_2_44_2","first-page":"37","article-title":"What is the solution for state-adversarial multi-agent reinforcement learning?","volume":"2024","author":"Han Songyang","year":"2024","unstructured":"Songyang Han, Sanbao Su, Sihong He, Shuo Han, Haizhao Yang, and Fei Miao. 2024. What is the solution for state-adversarial multi-agent reinforcement learning? Transactions on Machine Learning Research 2024 (2024), 37 pages.","journal-title":"Transactions on Machine Learning Research"},{"key":"e_1_3_2_45_2","doi-asserted-by":"publisher","DOI":"10.1109\/IJCNN48605.2020.9206634"},{"key":"e_1_3_2_46_2","doi-asserted-by":"publisher","DOI":"10.5555\/1597148.1597262"},{"key":"e_1_3_2_47_2","first-page":"29","volume-title":"Proceedings of the AAAI Fall Symposium Series","author":"Hausknecht Matthew","year":"2015","unstructured":"Matthew Hausknecht and Peter Stone. 2015. Deep recurrent q-learning for partially observable MDPs. In Proceedings of the AAAI Fall Symposium Series. 29\u201337."},{"key":"e_1_3_2_48_2","volume-title":"Cooperation and Communication in Multiagent Deep Reinforcement Learning","author":"Hausknecht Matthew John","year":"2016","unstructured":"Matthew John Hausknecht. 2016. Cooperation and Communication in Multiagent Deep Reinforcement Learning. Ph. D. Dissertation. The University of Texas."},{"key":"e_1_3_2_49_2","doi-asserted-by":"publisher","DOI":"10.1145\/3394486.3403089"},{"key":"e_1_3_2_50_2","first-page":"7","volume-title":"Proceedings of the International Conference on Learning Representations","author":"Huang Sandy","year":"2017","unstructured":"Sandy Huang, Nicolas Papernot, Ian Goodfellow, Yan Duan, and Pieter Abbeel. 2017. Adversarial attacks on neural network policies. In Proceedings of the International Conference on Learning Representations. 7 pages."},{"key":"e_1_3_2_51_2","first-page":"548","volume-title":"Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems","volume":"2020","author":"Hussenot L\u00e9onard","year":"2020","unstructured":"L\u00e9onard Hussenot, Matthieu Geist, and Olivier Pietquin. 2020. CopyCAT: Taking control of neural policies with constant attacks. In Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, Vol. 2020-May. 548\u2013556."},{"key":"e_1_3_2_52_2","doi-asserted-by":"publisher","DOI":"10.1109\/TAI.2021.3111139"},{"key":"e_1_3_2_53_2","first-page":"557","volume-title":"Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems","volume":"2020","author":"Inkawhich Matthew","year":"2020","unstructured":"Matthew Inkawhich, Yiran Chen, and Hai Li. 2020. Snooping attacks on deep reinforcement learning. In Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, Vol. 2020-May. 557\u2013565."},{"key":"e_1_3_2_54_2","first-page":"5372","volume-title":"Proceedings of the International Conference on Machine Learning","volume":"2019","author":"Jaques Natasha","year":"2019","unstructured":"Natasha Jaques, Angeliki Lazaridou, Edward Hughes, Caglar Gulcehre, Pedro A. Ortega, D. J. Strouse, Joel Z. Leibo, and Nando de Freitas. 2019. Social influence as intrinsic motivation for multi-agent deep reinforcement learning. In Proceedings of the International Conference on Machine Learning, Vol. 2019-June. 5372\u20135381."},{"key":"e_1_3_2_55_2","doi-asserted-by":"publisher","DOI":"10.24963\/ijcai.2023\/19"},{"key":"e_1_3_2_56_2","doi-asserted-by":"publisher","DOI":"10.1109\/LRA.2022.3141829"},{"key":"e_1_3_2_57_2","doi-asserted-by":"publisher","DOI":"10.1109\/DAC18072.2020.9218663"},{"key":"e_1_3_2_58_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.infsof.2008.09.009"},{"key":"e_1_3_2_59_2","unstructured":"Ezgi Korkmaz. 2020. Daylight: Assessing Generalization Skills of Deep Reinforcement Learning Agents. https:\/\/openreview.net\/forum?id=Z3XVHSbSawb"},{"key":"e_1_3_2_60_2","first-page":"6","volume-title":"Proceedings of the NeurIPS Workshop on Distribution Shifts: Connecting Methods and Applications","author":"Korkmaz Ezgi","year":"2021","unstructured":"Ezgi Korkmaz. 2021. Adversarial training blocks generalization in neural policies. In Proceedings of the NeurIPS Workshop on Distribution Shifts: Connecting Methods and Applications. 6 pages."},{"key":"e_1_3_2_61_2","unstructured":"Ezgi Korkmaz. 2021. Assessing Deep Reinforcement Learning Policies via Natural Corruptions at the Edge of Imperceptibility. https:\/\/openreview.net\/forum?id=kTcRljax0x9"},{"key":"e_1_3_2_62_2","first-page":"1661","volume-title":"Proceedings of the Conference on Uncertainty in Artificial Intelligence","author":"Korkmaz Ezgi","year":"2021","unstructured":"Ezgi Korkmaz. 2021. Investigating vulnerabilities of deep neural policies. In Proceedings of the Conference on Uncertainty in Artificial Intelligence. 1661\u20131670."},{"key":"e_1_3_2_63_2","first-page":"6","volume-title":"Proceedings of the Workshop on Adversarial Machine Learning","author":"Korkmaz Ezgi","year":"2021","unstructured":"Ezgi Korkmaz. 2021. Non-robust feature mapping in deep reinforcement learning. In Proceedings of the Workshop on Adversarial Machine Learning. 6 pages."},{"key":"e_1_3_2_64_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v37i7.26009"},{"key":"e_1_3_2_65_2","first-page":"17534","volume-title":"Proceedings of the International Conference on Machine Learning","volume":"202","author":"Korkmaz E.","year":"2023","unstructured":"E. Korkmaz and J. Brown-Cohen. 2023. Detecting adversarial directions in deep reinforcement learning to make robust decisions. In Proceedings of the International Conference on Machine Learning, Vol. 202. 17534\u201317543."},{"key":"e_1_3_2_66_2","first-page":"6","volume-title":"Proceedings of the International Conference on Learning Representations","author":"Kos Jernej","year":"2017","unstructured":"Jernej Kos and Dawn Song. 2017. Delving into adversarial attacks on deep policies. In Proceedings of the International Conference on Learning Representations. 6 pages."},{"key":"e_1_3_2_67_2","first-page":"29","volume-title":"Proceedings of the International Conference on Learning Representations","author":"Kumar Aounon","year":"2022","unstructured":"Aounon Kumar, Alexander Levine, and Soheil Feizi. 2022. Policy smoothing for provably robust reinforcement learning. In Proceedings of the International Conference on Learning Representations. 29 pages."},{"key":"e_1_3_2_68_2","first-page":"1761","volume-title":"Proceedings of the IEEE International Conference on Machine Learning and Applications","author":"Kumar R Praveen","year":"2021","unstructured":"R Praveen Kumar, I. Niranjan Kumar, Sujith Sivasankaran, A. Mohan Vamsi, and Vineeth Vijayaraghavan. 2021. Critical state detection for adversarial attacks in deep reinforcement learning. In Proceedings of the IEEE International Conference on Machine Learning and Applications. 1761\u20131766."},{"key":"e_1_3_2_69_2","first-page":"17","volume-title":"Proceedings of the International Conference on Learning Representations","author":"Kurakin Alexey","year":"2017","unstructured":"Alexey Kurakin, Ian Goodfellow, and Samy Bengio. 2017. Adversarial machine learning at scale. In Proceedings of the International Conference on Learning Representations. 17 pages."},{"key":"e_1_3_2_70_2","first-page":"11","volume-title":"Proceedings of the Advances in Neural Information Processing Systems","author":"Laidlaw Cassidy","year":"2019","unstructured":"Cassidy Laidlaw and Soheil Feizi. 2019. Functional adversarial attacks. In Proceedings of the Advances in Neural Information Processing Systems. 11 pages."},{"key":"e_1_3_2_71_2","first-page":"10","volume-title":"Proceedings of the Advances in Neural Information Processing Systems","volume":"32","author":"Lecarpentier Erwan","year":"2019","unstructured":"Erwan Lecarpentier and Emmanuel Rachelson. 2019. Non-stationary markov decision processes a worst-case approach using model-based reinforcement learning. In Proceedings of the Advances in Neural Information Processing Systems, Vol. 32. 10 pages."},{"key":"e_1_3_2_72_2","doi-asserted-by":"publisher","DOI":"10.1145\/3450267.3450537"},{"key":"e_1_3_2_73_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v34i04.5887"},{"key":"e_1_3_2_74_2","first-page":"275","volume-title":"Proceedings of the European Conference on Cyber Warfare and Security","author":"Lemay Antoine","year":"2019","unstructured":"Antoine Lemay and Sylvain Leblanc. 2019. Operational tempo in cyber operations. In Proceedings of the European Conference on Cyber Warfare and Security. ACAD CONFERENCES LTD Location NR READING, 275\u2013281."},{"key":"e_1_3_2_75_2","doi-asserted-by":"publisher","DOI":"10.5555\/3545946.3598671"},{"key":"e_1_3_2_76_2","first-page":"22547","volume-title":"Proceedings of the Advances in Neural Information Processing Systems","volume":"35","author":"Liang Yongyuan","year":"2022","unstructured":"Yongyuan Liang, Yanchao Sun, Ruijie Zheng, and Furong Huang. 2022. Efficient adversarial training without attacking: Worst-case-aware robust reinforcement learning. In Proceedings of the Advances in Neural Information Processing Systems, Vol. 35. 22547\u201322561."},{"key":"e_1_3_2_77_2","doi-asserted-by":"publisher","DOI":"10.1109\/SPW50608.2020.00027"},{"key":"e_1_3_2_78_2","doi-asserted-by":"publisher","DOI":"10.24963\/ijcai.2017\/525"},{"key":"e_1_3_2_79_2","first-page":"22249","volume-title":"Proceedings of the International Conference on Machine Learning","volume":"202","author":"Liu Z.","year":"2023","unstructured":"Z. Liu, Z. Guo, Z. Cen, H. Zhang, Y. Yao, H. Hu, and D. Zhao. 2023. Towards robust and safe reinforcement learning with benign off-policy data. In Proceedings of the International Conference on Machine Learning, Vol. 202. 22249\u201322265."},{"key":"e_1_3_2_80_2","first-page":"693","volume-title":"Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems","volume":"2","author":"Lowe Ryan","year":"2019","unstructured":"Ryan Lowe, Jakob Foerster, Y.-Lan Boureau, Joelle Pineau, and Yann Dauphin. 2019. On the pitfalls of measuring emergent communication. In Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, Vol. 2. 693\u2013701."},{"key":"e_1_3_2_81_2","first-page":"22917","volume-title":"Proceedings of the International Conference on Machine Learning","volume":"202","author":"Lu C.","year":"2023","unstructured":"C. Lu, T. Willi, A. Letcher, and J. Foerster. 2023. Adversarial cheap talk. In Proceedings of the International Conference on Machine Learning, Vol. 202. 22917\u201322941."},{"key":"e_1_3_2_82_2","first-page":"23","volume-title":"Proceedings of the International Conference on Learning Representations","author":"Madry Aleksander","year":"2018","unstructured":"Aleksander Madry, Aleksandar Makelov, Ludwig Schmidt, Dimitris Tsipras, and Adrian Vladu. 2018. Towards deep learning models resistant to adversarial attacks. In Proceedings of the International Conference on Learning Representations. 23 pages."},{"key":"e_1_3_2_83_2","unstructured":"Antoine Marot Isabelle Guyon Benjamin Donnot Gabriel Dulac-Arnold Patrick Panciatici Mariette Awad Aidan O\u2019Sullivan Adrian Kelly and Zigfried Hampel-Arias. 2020. L2RPN: Learning to Run a Power Network in a Sustainable World NeurIPS2020 challenge design. 26 pages. https:\/\/www.semanticscholar.org\/paper\/L2RPN%3A-Learning-to-Run-a-Power-Network-in-a-World-Marot-Guyon\/1b389944395c210e92dea97a882b5c309cd622e4"},{"key":"e_1_3_2_84_2","unstructured":"Rupert Mitchell Jan Blumenkamp and Amanda Prorok. 2020. Gaussian process based message filtering for robust multi-agent cooperation in the presence of adversarial communication. arXiv: https:\/\/arxiv.org\/abs\/2012.00508"},{"key":"e_1_3_2_85_2","first-page":"9","volume-title":"Proceedings of the NIPS Deep Learning Workshop","author":"Mnih Volodymyr","year":"2013","unstructured":"Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, and Martin Riedmiller. 2013. Playing atari with deep reinforcement learning. In Proceedings of the NIPS Deep Learning Workshop. 9 pages."},{"key":"e_1_3_2_86_2","doi-asserted-by":"publisher","DOI":"10.1109\/CSR54599.2022.9850345"},{"key":"e_1_3_2_87_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.17"},{"key":"e_1_3_2_88_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v38i19.30139"},{"key":"e_1_3_2_89_2","doi-asserted-by":"publisher","DOI":"10.51593\/2022CA003"},{"key":"e_1_3_2_90_2","doi-asserted-by":"publisher","unstructured":"Kohei Ohashi Kosuke Nakanishi Wataru Sasaki Yuji Yasui and Shin Ishii. 2021. Deep adversarial reinforcement learning with noise compensation by autoencoder. IEEE Access 9 (2021) 143901\u2013143912. DOI:10.1109\/ACCESS.2021.3121751","DOI":"10.1109\/ACCESS.2021.3121751"},{"key":"e_1_3_2_91_2","first-page":"26156","volume-title":"Proceedings of the Advances in Neural Information Processing Systems","volume":"34","author":"Oikarinen Tuomas","year":"2021","unstructured":"Tuomas Oikarinen, Wang Zhang, Alexandre Megretski, Tsui-Wei Weng, and Luca Daniel. 2021. Robust deep reinforcement learning through adversarial loss. In Proceedings of the Advances in Neural Information Processing Systems, Vol. 34. 26156\u201326167."},{"key":"e_1_3_2_92_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-28929-8"},{"key":"e_1_3_2_93_2","unstructured":"OpenAI Christopher Berner Greg Brockman Brooke Chan Vicki Cheung Przemys\u0142aw D\u0119biak Christy Dennison David Farhi Quirin Fischer Shariq Hashme Chris Hesse Rafal J\u00f3zefowicz Scott Gray Catherine Olsson Jakub Pachocki Michael Petrov Henrique P. d. O. Pinto Jonathan Raiman Tim Salimans Jeremy Schlatter Jonas Schneider Szymon Sidor Ilya Sutskever Jie Tang Filip Wolski and Susan Zhang. 2019. Dota 2 with Large Scale Deep Reinforcement Learning. arXiv: https:\/\/arxiv.org\/abs\/1912.06680"},{"key":"e_1_3_2_94_2","unstructured":"Alexander Pan Yongkyun Lee Huan Zhang Yize Chen and Yuanyuan Shi. 2021. Improving Robustness of Reinforcement Learning for Power System Control with Adversarial Training. arXiv: https:\/\/arxiv.org\/abs\/2110.08956"},{"key":"e_1_3_2_95_2","first-page":"1010","volume-title":"Proceedings of the International Conference on Autonomous Agents and Multiagent Systems","author":"Pan Xinlei","year":"2022","unstructured":"Xinlei Pan, Chaowei Xiao, Warren He, Shuang Yang, Jian Peng, Mingjie Sun, Jinfeng Yi, Zijiang Yang, Mingyan Liu, and Bo Li. 2022. Characterizing attacks on deep reinforcement learning. In Proceedings of the International Conference on Autonomous Agents and Multiagent Systems. 1010\u20131018."},{"key":"e_1_3_2_96_2","unstructured":"Nicolas Papernot Patrick McDaniel and Ian Goodfellow. 2016. Transferability in Machine Learning: from Phenomena to Black-Box Attacks using Adversarial Samples. arXiv: https:\/\/arxiv.org\/abs\/1605.07277"},{"key":"e_1_3_2_97_2","doi-asserted-by":"publisher","DOI":"10.1109\/EuroSP.2016.36"},{"key":"e_1_3_2_98_2","first-page":"2040","volume-title":"Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems","volume":"3","author":"Pattanaik Anay","year":"2018","unstructured":"Anay Pattanaik, Zhenyi Tang, Shuijing Liu, Gautham Bommannan, and Girish Chowdhary. 2018. Robust deep reinforcement learning with adversarial attacks. In Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, Vol. 3. 2040\u20132042."},{"key":"e_1_3_2_99_2","first-page":"1055","volume-title":"Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems","volume":"2020","author":"Phan Thomy","year":"2020","unstructured":"Thomy Phan, Thomas Gabor, Andreas Sedlmeier, Fabian Ritz, Bernhard Kempter, Cornel Klein, Horst Sauer, Reiner Schmid, Jan Wieghardt, Marc Zeller, and Claudia Linnhoff-Popien. 2020. Learning and testing resilience in cooperative multi-agent systems. In Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, Vol. 2020-May. 1055\u20131063."},{"key":"e_1_3_2_100_2","first-page":"4310","volume-title":"Proceedings of the International Conference on Machine Learning","volume":"6","author":"Pinto Lerrel","year":"2017","unstructured":"Lerrel Pinto, James Davidson, Rahul Sukthankar, and Abhinav Gupta. 2017. Robust adversarial reinforcement learning. In Proceedings of the International Conference on Machine Learning, Vol. 6. 4310\u20134319."},{"key":"e_1_3_2_101_2","first-page":"4365","volume-title":"Proceedings of the USENIX Security Symposium","author":"Poddebniak Damian","year":"2021","unstructured":"Damian Poddebniak, Fabian Ising, Hanno B\u00f6ck, and Sebastian Schinzel. 2021. Why TLS is better without STARTTLS: A security analysis of STARTTLS in the email context. In Proceedings of the USENIX Security Symposium. 4365\u20134382."},{"key":"e_1_3_2_102_2","unstructured":"Amanda Prorok Matthew Malencia Luca Carlone Gaurav S. Sukhatme Brian M. Sadler and Vijay Kumar. 2021. Beyond Robustness: A Taxonomy of Approaches towards Resilient Multi-Robot Systems."},{"key":"e_1_3_2_103_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11042-021-11437-3"},{"key":"e_1_3_2_104_2","first-page":"15","article-title":"Understanding adversarial attacks on observations in deep reinforcement learning","author":"Qiaoben You","year":"2024","unstructured":"You Qiaoben, Chen Ying, Xinning Zhou, Hang Su, Jun Zhu, and Bo Zhang. 2024. Understanding adversarial attacks on observations in deep reinforcement learning. Science China Information Sciences 67, 5 (2024), 15 pages.","journal-title":"Science China Information Sciences"},{"key":"e_1_3_2_105_2","first-page":"8","volume-title":"Proceedings of the Workshop on Adversarial Machine Learning","author":"Qiaoben You","year":"2021","unstructured":"You Qiaoben, Xinning Zhou, Chen Ying, and Jun Zhu. 2021. Strategically-timed state-observation attacks on deep reinforcement learning agents. In Proceedings of the Workshop on Adversarial Machine Learning. 8 pages."},{"key":"e_1_3_2_106_2","doi-asserted-by":"publisher","DOI":"10.1145\/3625236"},{"key":"e_1_3_2_107_2","doi-asserted-by":"publisher","DOI":"10.1109\/TNNLS.2021.3133537"},{"key":"e_1_3_2_108_2","doi-asserted-by":"publisher","DOI":"10.1109\/HST56032.2022.10025434"},{"key":"e_1_3_2_109_2","doi-asserted-by":"publisher","DOI":"10.1145\/3605764.3623913"},{"key":"e_1_3_2_110_2","doi-asserted-by":"publisher","DOI":"10.24963\/ijcai.2023\/54"},{"key":"e_1_3_2_111_2","first-page":"1","volume-title":"Proceedings of the International Conference on Machine Learning","volume":"21","author":"Rashid Tabish","year":"2020","unstructured":"Tabish Rashid, Mikayel Samvelyan, Christian Schroeder De Witt, Gregory Farquhar, Jakob Foerster, and Shimon Whiteson. 2020. Monotonic value function factorisation for deep multi-agent reinforcement learning. In Proceedings of the International Conference on Machine Learning, Vol. 21. 1\u201351."},{"key":"e_1_3_2_112_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.eng.2019.12.012"},{"key":"e_1_3_2_113_2","doi-asserted-by":"publisher","DOI":"10.23919\/ACC50511.2021.9483025"},{"key":"e_1_3_2_114_2","first-page":"1630","volume-title":"Proceedings of the International Conference on Autonomous Agents and Multiagent Systems","author":"Samvelyan Mikayel","year":"2024","unstructured":"Mikayel Samvelyan, Davide Paglieri, Minqi Jiang, Jack Parker-Holder, and Tim Rockt\u00e4schel. 2024. Multi-agent diagnostics for robustness via illuminated diversity. In Proceedings of the International Conference on Autonomous Agents and Multiagent Systems. 1630\u20131644."},{"key":"e_1_3_2_115_2","first-page":"2186","volume-title":"Proceedings of the International Conference on Autonomous Agents and MultiAgent Systems","volume":"4","author":"Samvelyan Mikayel","year":"2019","unstructured":"Mikayel Samvelyan, Tabish Rashid, Christian Schroeder De Witt, Gregory Farquhar, Nantas Nardelli, Tim GJ Rudner, Chia-Man Hung, Philip H. S. Torr, Jakob Foerster, and Shimon Whiteson. 2019. The starcraft multi-agent challenge. In Proceedings of the International Conference on Autonomous Agents and MultiAgent Systems, Vol. 4. 2186\u20132188."},{"key":"e_1_3_2_116_2","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btx196"},{"key":"e_1_3_2_117_2","doi-asserted-by":"publisher","DOI":"10.1038\/nature16961"},{"key":"e_1_3_2_118_2","first-page":"7","volume-title":"Proceedings of the International Workshop on Adaptive Cyber Defense","author":"Standen Maxwell","year":"2021","unstructured":"Maxwell Standen, Martin Lucas, David Bowman, Toby J. Richer, and Junae Kim. 2021. CybORG: A gym for the development of autonomous cyber agents. In Proceedings of the International Workshop on Adaptive Cyber Defense. 7 pages."},{"key":"e_1_3_2_119_2","doi-asserted-by":"publisher","DOI":"10.1109\/TEVC.2019.2890858"},{"key":"e_1_3_2_120_2","first-page":"2252","volume-title":"Proceedings of the Advances in Neural Information Processing Systems","author":"Sukhbaatar Sainbayar","year":"2016","unstructured":"Sainbayar Sukhbaatar, Arthur Szlam, and Rob Fergus. 2016. Learning multiagent communication with backpropagation. In Proceedings of the Advances in Neural Information Processing Systems. 2252\u20132260."},{"key":"e_1_3_2_121_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v34i04.6047"},{"key":"e_1_3_2_122_2","first-page":"30","volume-title":"Proceedings of the International Conference on Learning Representations","author":"Sun Yanchao","year":"2023","unstructured":"Yanchao Sun, Ruijie Zheng, Parisa Hassanzadeh, Yongyuan Liang, Soheil Feizi, Sumitra Ganesh, and Furong Huang. 2023. Certifiably robust policy learning against adversarial communication in multi-agent systems. In Proceedings of the International Conference on Learning Representations. 30 pages."},{"key":"e_1_3_2_123_2","first-page":"40","volume-title":"Proceedings of the International Conference on Learning Representations","author":"Sun Yanchao","year":"2021","unstructured":"Yanchao Sun, Ruijie Zheng, Yongyuan Liang, and Furong Huang. 2021. Who is the strongest enemy? Towards optimal and efficient evasion attacks in deep RL. In Proceedings of the International Conference on Learning Representations. 40 pages."},{"key":"e_1_3_2_124_2","first-page":"10","volume-title":"Proceedings of the International Conference on Learning Representations","author":"Szegedy Christian","year":"2014","unstructured":"Christian Szegedy, Wojciech Zaremba, Ilya Sutskever, Joan Bruna, Dumitru Erhan, Ian Goodfellow, and Rob Fergus. 2014. Intriguing properties of neural networks. In Proceedings of the International Conference on Learning Representations. 10 pages."},{"key":"e_1_3_2_125_2","first-page":"35","volume-title":"A Taxonomy and Terminology of Adversarial Machine Learning","author":"Tabassi Elham","year":"2019","unstructured":"Elham Tabassi, Kevin J. Burns, Michael Hadjimichael, Andres D. Molina-Markham, and Julian T. Sexton. 2019. A Taxonomy and Terminology of Adversarial Machine Learning. preprint 8269. National Institute of Standards and Technology. 35 pages."},{"key":"e_1_3_2_126_2","doi-asserted-by":"publisher","DOI":"10.1145\/3292500.3330724"},{"key":"e_1_3_2_127_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-031-17143-7_19"},{"key":"e_1_3_2_128_2","first-page":"6215","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Tessler Chen","year":"2019","unstructured":"Chen Tessler, Yonathan Efroni, and Shie Mannor. 2019. Action robust reinforcement learning and applications in continuous control. In Proceedings of the International Conference on Machine Learning. 6215\u20136224."},{"key":"e_1_3_2_129_2","doi-asserted-by":"publisher","DOI":"10.24963\/ijcai.2022\/484"},{"key":"e_1_3_2_130_2","doi-asserted-by":"publisher","DOI":"10.1109\/IROS.2012.6386109"},{"key":"e_1_3_2_131_2","first-page":"9","volume-title":"Proceedings of the ACM Computer Science in Cars Symposium","author":"Tretschk Edgar","year":"2018","unstructured":"Edgar Tretschk, Seong Joon Oh, and Mario Fritz. 2018. Sequential attacks on agents for long-term adversarial goals. In Proceedings of the ACM Computer Science in Cars Symposium. 9 pages."},{"key":"e_1_3_2_132_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.00767"},{"key":"e_1_3_2_133_2","first-page":"7995","volume-title":"Proceedings of the International Conference on Machine Learning","volume":"11","author":"Uesato Jonathan","year":"2018","unstructured":"Jonathan Uesato, Brendan O\u2019donoghue, Pushmeet Kohli, and Aaron Oord. 2018. Adversarial risk and the dangers of evaluating against weak attacks. In Proceedings of the International Conference on Machine Learning, Vol. 11. 7995\u20138007."},{"key":"e_1_3_2_134_2","doi-asserted-by":"publisher","DOI":"10.1038\/s41586-019-1724-z"},{"key":"e_1_3_2_135_2","doi-asserted-by":"publisher","DOI":"10.24963\/ijcai.2022\/549"},{"key":"e_1_3_2_136_2","unstructured":"Pengyue Wang Yan Li Shashi Shekhar and William F. Northrop. 2020. Adversarial Attacks on Reinforcement Learning based Energy Management Systems of Extended Range Electric Delivery Vehicles. arXiv: https:\/\/arxiv.org\/abs\/2006.00817"},{"key":"e_1_3_2_137_2","first-page":"15","volume-title":"Proceedings of the International Conference on Learning Representations","author":"Wang Tonghan","year":"2020","unstructured":"Tonghan Wang, Jianhao Wang, Chongyi Zheng, and Chongjie Zhang. 2020. Learning nearly decomposable value functions via communication minimization. In Proceedings of the International Conference on Learning Representations. 15 pages."},{"key":"e_1_3_2_138_2","unstructured":"Yulong Wang Tong Sun Shenghong Li Xin Yuan Wei Ni Ekram Hossain and H. Vincent Poor. 2023. Adversarial Attacks and Defenses in Machine Learning-Powered Networks: A Contemporary Survey. arXiv: https:\/\/arxiv.org\/abs\/2303.06302"},{"key":"e_1_3_2_139_2","first-page":"13","volume-title":"Proceedings of the International Conference on Learning Representations","author":"Weng Tsui-Wei","year":"2020","unstructured":"Tsui-Wei Weng, Krishnamurthy (Dj) Dvijotham, Jonathan Uesato, Kai Xiao, Sven Gowal, Robert Stanforth, and Pushmeet Kohli. 2020. Toward evaluating robustness of deep reinforcement learning with continuous control. In Proceedings of the International Conference on Learning Representations. 13 pages."},{"key":"e_1_3_2_140_2","doi-asserted-by":"publisher","DOI":"10.1145\/2601248.2601268"},{"key":"e_1_3_2_141_2","first-page":"34","volume-title":"Proceedings of the International Conference on Learning Representations","author":"Wu Fan","year":"2022","unstructured":"Fan Wu, Linyi Li, Zijian Huang, Yevgeniy Vorobeychik, Ding Zhao, and Bo Li. 2022. CROP: Certifying robust policies for reinforcement learning through functional smoothing. In Proceedings of the International Conference on Learning Representations. 34 pages."},{"key":"e_1_3_2_142_2","first-page":"24177","volume-title":"Proceedings of the International Conference on Machine Learning","volume":"162","author":"Wu J.","year":"2022","unstructured":"J. Wu and Y. Vorobeychik. 2022. Robust deep reinforcement learning through bootstrapped opportunistic curriculum. In Proceedings of the International Conference on Machine Learning, Vol. 162. 24177\u201324211."},{"key":"e_1_3_2_143_2","first-page":"1883","volume-title":"Proceedings of the USENIX Security Symposium","author":"Wu Xian","year":"2021","unstructured":"Xian Wu, Wenbo Guo, Hua Wei, and Xinyu Xing. 2021. Adversarial policy training against deep reinforcement learning. In Proceedings of the USENIX Security Symposium. 1883\u20131900."},{"key":"e_1_3_2_144_2","doi-asserted-by":"publisher","DOI":"10.24963\/ijcai.2018\/543"},{"key":"e_1_3_2_145_2","first-page":"1418","volume-title":"Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems","volume":"3","author":"Xue Wanqi","year":"2022","unstructured":"Wanqi Xue, Wei Qiu, Bo An, Zinovi Rabinovich, Svetlana Obraztsova, and Chai Kiat Yeo. 2022. Mis-spoke or mis-lead: Achieving robustness in multi-agent communicative reinforcement learning. In Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, Vol. 3. 1418\u20131426."},{"key":"e_1_3_2_146_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v36i8.20862"},{"key":"e_1_3_2_147_2","first-page":"16","volume-title":"Proceedings of the Advances in Neural Information Processing Systems","volume":"35","author":"Yang R.","year":"2022","unstructured":"R. Yang, C. Bai, X. Ma, Z. Wang, C. Zhang, and L. Han. 2022. RORL: Robust offline reinforcement learning via conservative smoothing. In Proceedings of the Advances in Neural Information Processing Systems, Vol. 35. 16 pages."},{"key":"e_1_3_2_148_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2016.04.001"},{"key":"e_1_3_2_149_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v38i19.30176"},{"key":"e_1_3_2_150_2","first-page":"3673","volume-title":"Proceedings of the Workshop on Adversarial Machine Learning","author":"Ying Chengyang","year":"2021","unstructured":"Chengyang Ying, Xinning Zhou, Dong Yan, and Jun Zhu. 2021. Towards safe reinforcement learning via constraining conditional value at risk. In Proceedings of the Workshop on Adversarial Machine Learning. 3673\u20133680."},{"key":"e_1_3_2_151_2","first-page":"24611","volume-title":"Proceedings of the Advances in Neural Information Processing Systems","volume":"35","author":"Yu Chao","year":"2022","unstructured":"Chao Yu, Akash Velu, Eugene Vinitsky, Jiaxuan Gao, Yu Wang, Alexandre Bayen, and Yi Wu. 2022. The surprising effectiveness of ppo in cooperative multi-agent games. In Proceedings of the Advances in Neural Information Processing Systems, Vol. 35. 24611\u201324624."},{"key":"e_1_3_2_152_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v38i16.29708"},{"key":"e_1_3_2_153_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v36i8.20876"},{"key":"e_1_3_2_154_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v37i10.26388"},{"key":"e_1_3_2_155_2","first-page":"16","volume-title":"Proceedings of the International Conference on Learning Representations","author":"Zhang Huan","year":"2021","unstructured":"Huan Zhang, Hongge Chen, Duane Boning, and Cho-Jui Hsieh. 2021. Robust reinforcement learning on state observations with learned optimal adversary. In Proceedings of the International Conference on Learning Representations. 16 pages."},{"key":"e_1_3_2_156_2","first-page":"21024","volume-title":"Proceedings of the Advances in Neural Information Processing Systems","volume":"33","author":"Zhang Huan","year":"2020","unstructured":"Huan Zhang, Hongge Chen, Chaowei Xiao, Bo Li, Mingyan Liu, Duane Boning, and Cho-Jui Hsieh. 2020. Robust deep reinforcement learning against adversarial perturbations on state observations. In Proceedings of the Advances in Neural Information Processing Systems, Vol. 33. 21024\u201321037."},{"key":"e_1_3_2_157_2","first-page":"2075","volume-title":"Proceedings of the International Conference on Autonomous Agents and Multiagent Systems","author":"Zhang Mingyue","year":"2024","unstructured":"Mingyue Zhang, Nianyu Li, Jialong Li, Jiachun Liao, and Jiamou Liu. 2024. Memory-based resilient control against non-cooperation in multi-agent flocking. In Proceedings of the International Conference on Autonomous Agents and Multiagent Systems. 2075\u20132084."},{"key":"e_1_3_2_158_2","doi-asserted-by":"publisher","DOI":"10.1109\/TSG.2021.3062700"},{"key":"e_1_3_2_159_2","unstructured":"Changxi Zhu Mehdi Dastani and Shihan Wang. 2022. A Survey of Multi-Agent Reinforcement Learning with Communication. arXiv: https:\/\/arxiv.org\/abs\/2203.08975"},{"key":"e_1_3_2_160_2","first-page":"8","volume-title":"Proceedings of the Advances in Neural Information Processing Systems","volume":"20","author":"Zinkevich Martin","year":"2007","unstructured":"Martin Zinkevich, Michael Johanson, Michael Bowling, and Carmelo Piccione. 2007. Regret Minimization in Games with Incomplete Information. In Proceedings of the Advances in Neural Information Processing Systems, Vol. 20. 8 pages."}],"container-title":["ACM Computing Surveys"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3708320","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3708320","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T01:09:45Z","timestamp":1750295385000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3708320"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,1,24]]},"references-count":159,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2025,5,31]]}},"alternative-id":["10.1145\/3708320"],"URL":"https:\/\/doi.org\/10.1145\/3708320","relation":{},"ISSN":["0360-0300","1557-7341"],"issn-type":[{"value":"0360-0300","type":"print"},{"value":"1557-7341","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,1,24]]},"assertion":[{"value":"2023-12-05","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2024-11-22","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2025-01-24","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}