{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,3]],"date-time":"2025-11-03T13:42:06Z","timestamp":1762177326398,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":34,"publisher":"ACM","license":[{"start":{"date-parts":[[2020,6,25]],"date-time":"2020-06-25T00:00:00Z","timestamp":1593043200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/100000181","name":"Air Force Office of Scientific Research","doi-asserted-by":"publisher","award":["FA9550-19-1-0195"],"award-info":[{"award-number":["FA9550-19-1-0195"]}],"id":[{"id":"10.13039\/100000181","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000104","name":"National Aeronautics and Space Administration","doi-asserted-by":"publisher","award":["80NSSC18K0941"],"award-info":[{"award-number":["80NSSC18K0941"]}],"id":[{"id":"10.13039\/100000104","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2020,6,25]]},"DOI":"10.1145\/3377930.3390220","type":"proceedings-article","created":{"date-parts":[[2020,6,29]],"date-time":"2020-06-29T19:29:12Z","timestamp":1593458952000},"page":"453-461","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":10,"title":["Multi-fitness learning for behavior-driven cooperation"],"prefix":"10.1145","author":[{"given":"Connor","family":"Yates","sequence":"first","affiliation":[{"name":"Oregon State University"}]},{"given":"Reid","family":"Christopher","sequence":"additional","affiliation":[{"name":"Oregon State University"}]},{"given":"Kagan","family":"Tumer","sequence":"additional","affiliation":[{"name":"Oregon State University"}]}],"member":"320","published-online":{"date-parts":[[2020,6,26]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/2330163.2330306"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10458-008-9046-9"},{"key":"e_1_3_2_1_3_1","volume-title":"European Workshop on Reinforcement Learning. Springer, 249--260","author":"Boutsioukis Georgios","year":"2011","unstructured":"Georgios Boutsioukis , Ioannis Partalas , and Ioannis Vlahavas . 2011 . Transfer learning in multi-agent reinforcement learning domains . In European Workshop on Reinforcement Learning. Springer, 249--260 . Georgios Boutsioukis, Ioannis Partalas, and Ioannis Vlahavas. 2011. Transfer learning in multi-agent reinforcement learning domains. In European Workshop on Reinforcement Learning. Springer, 249--260."},{"key":"e_1_3_2_1_4_1","volume-title":"Proceedings of the 6th European Congress and Exhibition on Intelligent Transport Systems and Services. 203--225","author":"Branke J\u00fcrgen","year":"2007","unstructured":"J\u00fcrgen Branke , Peter Goldate , and Holger Prothmann . 2007 . Actuated traffic signal optimization using evolutionary algorithms . In Proceedings of the 6th European Congress and Exhibition on Intelligent Transport Systems and Services. 203--225 . J\u00fcrgen Branke, Peter Goldate, and Holger Prothmann. 2007. Actuated traffic signal optimization using evolutionary algorithms. In Proceedings of the 6th European Congress and Exhibition on Intelligent Transport Systems and Services. 203--225."},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/IJCNN.2014.6889732"},{"volume-title":"Twenty-Eighth AAAI Conference on Artificial Intelligence.","author":"Brys Tim","key":"e_1_3_2_1_6_1","unstructured":"Tim Brys , Ann Now\u00e9 , Daniel Kudenko , and Matthew E. Taylor . 2014. Combining Multiple Correlated Reward and Shaping Signals by Measuring Confidence . In Twenty-Eighth AAAI Conference on Artificial Intelligence. Tim Brys, Ann Now\u00e9, Daniel Kudenko, and Matthew E. Taylor. 2014. Combining Multiple Correlated Reward and Shaping Signals by Measuring Confidence. In Twenty-Eighth AAAI Conference on Artificial Intelligence."},{"key":"e_1_3_2_1_7_1","volume-title":"Autonomous multiagent space exploration with high-level human feedback. Journal of Aerospace Information Systems","author":"Colby Mitchell","year":"2016","unstructured":"Mitchell Colby , Logan Yliniemi , and Kagan Tumer . 2016. Autonomous multiagent space exploration with high-level human feedback. Journal of Aerospace Information Systems ( 2016 ), 301--315. Mitchell Colby, Logan Yliniemi, and Kagan Tumer. 2016. Autonomous multiagent space exploration with high-level human feedback. Journal of Aerospace Information Systems (2016), 301--315."},{"key":"e_1_3_2_1_8_1","volume-title":"A fast and elitist multiobjective genetic algorithm: NSGA-II","author":"Deb Kalyanmoy","year":"2002","unstructured":"Kalyanmoy Deb , Amrit Pratap , Sameer Agarwal , and TAMT Meyarivan . 2002. A fast and elitist multiobjective genetic algorithm: NSGA-II . IEEE transactions on evolutionary computation 6, 2 ( 2002 ), 182--197. Kalyanmoy Deb, Amrit Pratap, Sameer Agarwal, and TAMT Meyarivan. 2002. A fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE transactions on evolutionary computation 6, 2 (2002), 182--197."},{"key":"e_1_3_2_1_9_1","volume-title":"Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems. IFAAMAS, 433--440","author":"Devlin Sam Michael","year":"2012","unstructured":"Sam Michael Devlin and Daniel Kudenko . 2012 . Dynamic potential-based reward shaping . In Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems. IFAAMAS, 433--440 . Sam Michael Devlin and Daniel Kudenko. 2012. Dynamic potential-based reward shaping. In Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems. IFAAMAS, 433--440."},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/TEVC.2008.920677"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/2463372.2463525"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/IROS.2014.6942864"},{"key":"e_1_3_2_1_13_1","volume-title":"Learning tensegrity locomotion using open-loop control signals and coevolutionary algorithms. Artificial life 21, 2","author":"Iscen Atil","year":"2015","unstructured":"Atil Iscen , Ken Caluwaerts , Jonathan Bruce , Adrian Agogino , Vytas SunSpiral , and Kagan Tumer . 2015. Learning tensegrity locomotion using open-loop control signals and coevolutionary algorithms. Artificial life 21, 2 ( 2015 ), 119--140. Atil Iscen, Ken Caluwaerts, Jonathan Bruce, Adrian Agogino, Vytas SunSpiral, and Kagan Tumer. 2015. Learning tensegrity locomotion using open-loop control signals and coevolutionary algorithms. Artificial life 21, 2 (2015), 119--140."},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.ress.2005.11.018"},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P17-1001"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/1273496.1273572"},{"key":"e_1_3_2_1_17_1","volume-title":"Forming neural networks through efficient and adaptive coevolution. Evolutionary computation 5, 4","author":"Moriarty David E","year":"1997","unstructured":"David E Moriarty and Risto Miikkulainen . 1997. Forming neural networks through efficient and adaptive coevolution. Evolutionary computation 5, 4 ( 1997 ), 373--399. David E Moriarty and Risto Miikkulainen. 1997. Forming neural networks through efficient and adaptive coevolution. Evolutionary computation 5, 4 (1997), 373--399."},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/CIG.2006.311682"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMC.2012.67"},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.5555\/3305890.3305958"},{"key":"e_1_3_2_1_21_1","volume-title":"Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems -","volume":"3","author":"Proper Scott","year":"2012","unstructured":"Scott Proper and Kagan Tumer . 2012 . Modeling Difference Rewards for Multiagent Learning . In Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 3 (AAMAS '12). International Foundation for Autonomous Agents and Multiagent Systems, Richland, SC, 1397--1398. event-place: Valencia, Spain. Scott Proper and Kagan Tumer. 2012. Modeling Difference Rewards for Multiagent Learning. In Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 3 (AAMAS '12). International Foundation for Autonomous Agents and Multiagent Systems, Richland, SC, 1397--1398. event-place: Valencia, Spain."},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/IROS.2016.7759651"},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1609\/aimag.v36i4.2577"},{"key":"e_1_3_2_1_24_1","volume-title":"Proceedings of the 20th International Conference on Machine Learning (ICML-03)","author":"Russell Stuart J","year":"2003","unstructured":"Stuart J Russell and Andrew Zimdars . 2003 . Q-decomposition for reinforcement learning agents . In Proceedings of the 20th International Conference on Machine Learning (ICML-03) . 656--663. Stuart J Russell and Andrew Zimdars. 2003. Q-decomposition for reinforcement learning agents. In Proceedings of the 20th International Conference on Machine Learning (ICML-03). 656--663."},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2013.6639012"},{"key":"e_1_3_2_1_26_1","unstructured":"Ozan Sener and Vladlen Koltun. 2018. Multi-task learning as multi-objective optimization. In Advances in Neural Information Processing Systems. 527--538.  Ozan Sener and Vladlen Koltun. 2018. Multi-task learning as multi-objective optimization. In Advances in Neural Information Processing Systems. 527--538."},{"key":"e_1_3_2_1_27_1","volume-title":"Proceedings of the 12th annual conference on Genetic and evolutionary computation. 1131--1138","author":"Kagan Tumer Jack F","year":"2010","unstructured":"Jack F Shepherd III and Kagan Tumer . 2010 . Robust neuro-control for a micro quadrotor . In Proceedings of the 12th annual conference on Genetic and evolutionary computation. 1131--1138 . Jack F Shepherd III and Kagan Tumer. 2010. Robust neuro-control for a micro quadrotor. In Proceedings of the 12th annual conference on Genetic and evolutionary computation. 1131--1138."},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/CEC.2008.4631247"},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.5555\/3237383.3238080"},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1109\/MIS.2009.10"},{"key":"e_1_3_2_1_31_1","unstructured":"Harm Van Seijen Mehdi Fatemi Joshua Romoff Romain Laroche Tavian Barnes and Jeffrey Tsang. 2017. Hybrid reward architecture for reinforcement learning. In Advances in Neural Information Processing Systems. 5392--5402.  Harm Van Seijen Mehdi Fatemi Joshua Romoff Romain Laroche Tavian Barnes and Jeffrey Tsang. 2017. Hybrid reward architecture for reinforcement learning. In Advances in Neural Information Processing Systems. 5392--5402."},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1177\/0954410011410274"},{"key":"e_1_3_2_1_33_1","volume-title":"Proc. AAAI-08 Workshop on Transfer Learning for Complex Tasks.","author":"Wilson Aaron","year":"2008","unstructured":"Aaron Wilson , Alan Fern , Soumya Ray , and Prasad Tadepalli . 2008 . Learning and transferring roles in multi-agent reinforcement . In Proc. AAAI-08 Workshop on Transfer Learning for Complex Tasks. Aaron Wilson, Alan Fern, Soumya Ray, and Prasad Tadepalli. 2008. Learning and transferring roles in multi-agent reinforcement. In Proc. AAAI-08 Workshop on Transfer Learning for Complex Tasks."},{"key":"e_1_3_2_1_34_1","volume-title":"TIK report 103. Computer Engineering and Networks Laboratory (TIK), ETH Zurich","author":"Zitzler Eckart","year":"2001","unstructured":"Eckart Zitzler , Marco Laumanns , and Lothar Thiele . 2001. SPEA 2: improving the strength Pareto evolutionary algorithm , TIK report 103. Computer Engineering and Networks Laboratory (TIK), ETH Zurich , Zurich, Switzerland 545 ( 2001 ). Eckart Zitzler, Marco Laumanns, and Lothar Thiele. 2001. SPEA 2: improving the strength Pareto evolutionary algorithm, TIK report 103. Computer Engineering and Networks Laboratory (TIK), ETH Zurich, Zurich, Switzerland 545 (2001)."}],"event":{"name":"GECCO '20: Genetic and Evolutionary Computation Conference","sponsor":["SIGEVO ACM Special Interest Group on Genetic and Evolutionary Computation"],"location":"Canc\u00fan Mexico","acronym":"GECCO '20"},"container-title":["Proceedings of the 2020 Genetic and Evolutionary Computation Conference"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3377930.3390220","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3377930.3390220","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3377930.3390220","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T22:40:59Z","timestamp":1750200059000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3377930.3390220"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,6,25]]},"references-count":34,"alternative-id":["10.1145\/3377930.3390220","10.1145\/3377930"],"URL":"https:\/\/doi.org\/10.1145\/3377930.3390220","relation":{},"subject":[],"published":{"date-parts":[[2020,6,25]]},"assertion":[{"value":"2020-06-26","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}