{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:09:45Z","timestamp":1750219785931,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":14,"publisher":"ACM","license":[{"start":{"date-parts":[[2023,7,15]],"date-time":"2023-07-15T00:00:00Z","timestamp":1689379200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2023,7,15]]},"DOI":"10.1145\/3583133.3590741","type":"proceedings-article","created":{"date-parts":[[2023,7,24]],"date-time":"2023-07-24T23:30:33Z","timestamp":1690241433000},"page":"279-282","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["Overcoming Deceptive Rewards with Quality-Diversity"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0009-0009-4233-247X","authenticated-orcid":false,"given":"Arno","family":"Feiden","sequence":"first","affiliation":[{"name":"Fraunhofer SCAI, Sankt Augustin, Germany"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8334-3695","authenticated-orcid":false,"given":"Jochen","family":"Garcke","sequence":"additional","affiliation":[{"name":"Institut f\u00fcr Numerische Simulation \/ Universit\u00e4t Bonn, Bonn, Germany"},{"name":"Fraunhofer SCAI, Sankt Augustin, Germany"}]}],"member":"320","published-online":{"date-parts":[[2023,7,24]]},"reference":[{"volume-title":"Quality-Diversity Optimization: A Novel Branch of Stochastic Optimization","author":"Chatzilygeroudis Konstantinos","key":"e_1_3_2_1_1_1","unstructured":"Konstantinos Chatzilygeroudis , Antoine Cully , Vassilis Vassiliades , and Jean-Baptiste Mouret . 2021. Quality-Diversity Optimization: A Novel Branch of Stochastic Optimization . Springer International Publishing , Cham , 109--135. Konstantinos Chatzilygeroudis, Antoine Cully, Vassilis Vassiliades, and Jean-Baptiste Mouret. 2021. Quality-Diversity Optimization: A Novel Branch of Stochastic Optimization. Springer International Publishing, Cham, 109--135."},{"key":"e_1_3_2_1_2_1","volume-title":"Joel Lehman, Kenneth O. Stanley, and Jeff Clune.","author":"Conti Edoardo","year":"2018","unstructured":"Edoardo Conti , Vashisht Madhavan , Felipe Petroski Such , Joel Lehman, Kenneth O. Stanley, and Jeff Clune. 2018 . Improving Exploration in Evolution Strategies for Deep Reinforcement Learning via a Population of Novelty-Seeking Agents. In NeurIPS '18. 5032--5043. Edoardo Conti, Vashisht Madhavan, Felipe Petroski Such, Joel Lehman, Kenneth O. Stanley, and Jeff Clune. 2018. Improving Exploration in Evolution Strategies for Deep Reinforcement Learning via a Population of Novelty-Seeking Agents. In NeurIPS'18. 5032--5043."},{"key":"e_1_3_2_1_3_1","volume-title":"Robots that can adapt like animals. Nature 521 (05","author":"Cully Antoine","year":"2015","unstructured":"Antoine Cully , Jeff Clune , Danesh Tarapore , and Jean-Baptiste Mouret . 2015. Robots that can adapt like animals. Nature 521 (05 2015 ), 503--507. Antoine Cully, Jeff Clune, Danesh Tarapore, and Jean-Baptiste Mouret. 2015. Robots that can adapt like animals. Nature 521 (05 2015), 503--507."},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/TEVC.2017.2704781"},{"key":"e_1_3_2_1_5_1","volume-title":"First return, then explore. Nature 590 (02","author":"Ecoffet Adrien","year":"2021","unstructured":"Adrien Ecoffet , Joost Huizinga , Joel Lehman , Kenneth Stanley , and Jeff Clune . 2021. First return, then explore. Nature 590 (02 2021 ), 580--586. Adrien Ecoffet, Joost Huizinga, Joel Lehman, Kenneth Stanley, and Jeff Clune. 2021. First return, then explore. Nature 590 (02 2021), 580--586."},{"key":"e_1_3_2_1_6_1","volume-title":"ICML'18 (PMLR","volume":"1870","author":"Haarnoja Tuomas","year":"2018","unstructured":"Tuomas Haarnoja , Aurick Zhou , Pieter Abbeel , and Sergey Levine . 2018 . Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor . In ICML'18 (PMLR , Vol. 80). 1861-- 1870 . Tuomas Haarnoja, Aurick Zhou, Pieter Abbeel, and Sergey Levine. 2018. Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor. In ICML'18 (PMLR, Vol. 80). 1861--1870."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-63710-1_4"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1162\/EVCO_a_00025"},{"key":"e_1_3_2_1_9_1","unstructured":"Jean-Baptiste Mouret and Jeff Clune. 2015. Illuminating search spaces by mapping elites. arXiv:1504.04909  Jean-Baptiste Mouret and Jeff Clune. 2015. Illuminating search spaces by mapping elites. arXiv:1504.04909"},{"key":"e_1_3_2_1_10_1","volume-title":"Diversity Policy Gradient for Sample Efficient Quality-Diversity Optimization (GECCO '22)","author":"Pierrot Thomas","year":"2022","unstructured":"Thomas Pierrot , Valentin Mac\u00e9 , Felix Chalumeau , Arthur Flajolet , Geoffrey Cideron , Karim Beguir , Antoine Cully , Olivier Sigaud , and Nicolas Perrin-Gilbert . 2022 . Diversity Policy Gradient for Sample Efficient Quality-Diversity Optimization (GECCO '22) . 1075--1083. Thomas Pierrot, Valentin Mac\u00e9, Felix Chalumeau, Arthur Flajolet, Geoffrey Cideron, Karim Beguir, Antoine Cully, Olivier Sigaud, and Nicolas Perrin-Gilbert. 2022. Diversity Policy Gradient for Sample Efficient Quality-Diversity Optimization (GECCO '22). 1075--1083."},{"key":"e_1_3_2_1_11_1","volume-title":"Stanley","author":"Pugh Justin K.","year":"2015","unstructured":"Justin K. Pugh , L. B. Soros , Paul A. Szerlip , and Kenneth O . Stanley . 2015 . Confronting the Challenge of Quality Diversity (GECCO '15). 967--974. Justin K. Pugh, L. B. Soros, Paul A. Szerlip, and Kenneth O. Stanley. 2015. Confronting the Challenge of Quality Diversity (GECCO '15). 967--974."},{"key":"e_1_3_2_1_12_1","first-page":"1","article-title":"Stable-Baselines3: Reliable Reinforcement Learning Implementations","volume":"22","author":"Raffin Antonin","year":"2021","unstructured":"Antonin Raffin , Ashley Hill , Adam Gleave , Anssi Kanervisto , Maximilian Ernestus , and Noah Dormann . 2021 . Stable-Baselines3: Reliable Reinforcement Learning Implementations . JMLR 22 , 268 (2021), 1 -- 8 . Antonin Raffin, Ashley Hill, Adam Gleave, Anssi Kanervisto, Maximilian Ernestus, and Noah Dormann. 2021. Stable-Baselines3: Reliable Reinforcement Learning Implementations. JMLR 22, 268 (2021), 1--8.","journal-title":"JMLR"},{"key":"e_1_3_2_1_13_1","volume-title":"Sam Sommerer, Nathan Dennler, and Stefanos Nikolaidis.","author":"Tjanaka Bryon","year":"2021","unstructured":"Bryon Tjanaka , Matthew C. Fontaine , David H. Lee , Yulun Zhang , Trung Tran Minh Vu , Sam Sommerer, Nathan Dennler, and Stefanos Nikolaidis. 2021 . pyribs: A bare-bones Python library for quality diversity optimization. https:\/\/github.com\/icaros-usc\/pyribs. Bryon Tjanaka, Matthew C. Fontaine, David H. Lee, Yulun Zhang, Trung Tran Minh Vu, Sam Sommerer, Nathan Dennler, and Stefanos Nikolaidis. 2021. pyribs: A bare-bones Python library for quality diversity optimization. https:\/\/github.com\/icaros-usc\/pyribs."},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/TEVC.2017.2735550"}],"event":{"name":"GECCO '23 Companion: Companion Conference on Genetic and Evolutionary Computation","sponsor":["SIGEVO ACM Special Interest Group on Genetic and Evolutionary Computation"],"location":"Lisbon Portugal","acronym":"GECCO '23 Companion"},"container-title":["Proceedings of the Companion Conference on Genetic and Evolutionary Computation"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3583133.3590741","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3583133.3590741","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T16:37:50Z","timestamp":1750178270000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3583133.3590741"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,7,15]]},"references-count":14,"alternative-id":["10.1145\/3583133.3590741","10.1145\/3583133"],"URL":"https:\/\/doi.org\/10.1145\/3583133.3590741","relation":{},"subject":[],"published":{"date-parts":[[2023,7,15]]},"assertion":[{"value":"2023-07-24","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}