{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,19]],"date-time":"2026-02-19T05:22:38Z","timestamp":1771478558349,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":56,"publisher":"ACM","license":[{"start":{"date-parts":[[2020,6,25]],"date-time":"2020-06-25T00:00:00Z","timestamp":1593043200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2020,6,25]]},"DOI":"10.1145\/3377930.3389842","type":"proceedings-article","created":{"date-parts":[[2020,6,29]],"date-time":"2020-06-29T19:29:12Z","timestamp":1593458952000},"page":"814-822","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":18,"title":["Effective reinforcement learning through evolutionary surrogate-assisted prescription"],"prefix":"10.1145","author":[{"given":"Olivier","family":"Francon","sequence":"first","affiliation":[{"name":"The University of Texas at Austin"}]},{"given":"Santiago","family":"Gonzalez","sequence":"additional","affiliation":[{"name":"The University of Texas at Austin"}]},{"given":"Babak","family":"Hodjat","sequence":"additional","affiliation":[{"name":"The University of Texas at Austin"}]},{"given":"Elliot","family":"Meyerson","sequence":"additional","affiliation":[{"name":"The University of Texas at Austin"}]},{"given":"Risto","family":"Miikkulainen","sequence":"additional","affiliation":[{"name":"The University of Texas at Austin"}]},{"given":"Xin","family":"Qiu","sequence":"additional","affiliation":[{"name":"The University of Texas at Austin"}]},{"given":"Hormoz","family":"Shahrzad","sequence":"additional","affiliation":[{"name":"The University of Texas at Austin"}]}],"member":"320","published-online":{"date-parts":[[2020,6,26]]},"reference":[{"key":"e_1_3_2_1_1_1","first-page":"487","article-title":"Interactions between learning and evolution","author":"Ackley D.","year":"1991","unstructured":"Ackley , D. and Littman , M. 1991 . Interactions between learning and evolution . Artificial Life II 10 (1991), 487 -- 509 . Ackley, D. and Littman, M. 1991. Interactions between learning and evolution. Artificial Life II 10 (1991), 487--509.","journal-title":"Artificial Life"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/TSMC.1983.6313077"},{"key":"e_1_3_2_1_3_1","unstructured":"Brockman G. Cheung V. Pettersson L. Schneider J. Schulman J. Tang J. and Zaremba W. 2016. OpenAI Gym. CoRR abs\/1606.01540 (2016).  Brockman G. Cheung V. Pettersson L. Schneider J. Schulman J. Tang J. and Zaremba W. 2016. OpenAI Gym. CoRR abs\/1606.01540 (2016)."},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1007\/BF03325101"},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1007\/BF00889887"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.ejor.2017.02.015"},{"key":"e_1_3_2_1_7_1","volume-title":"Proceedings of the 28th International Conference on Machine Learning (ICML) (ICML\u00e2\u0102&Zacute;11)","author":"Deisenroth M.","unstructured":"Deisenroth , M. and Rasmussen , C. E . 2011. PILCO: A model-based and data-efficient approach to policy search . In Proceedings of the 28th International Conference on Machine Learning (ICML) (ICML\u00e2\u0102&Zacute;11) . 465--472. Deisenroth, M. and Rasmussen, C. E. 2011. PILCO: A model-based and data-efficient approach to policy search. In Proceedings of the 28th International Conference on Machine Learning (ICML) (ICML\u00e2\u0102&Zacute;11). 465--472."},{"key":"e_1_3_2_1_8_1","unstructured":"Dhariwal P. Hesse C. Klimov O. Nichol A. Plappert M. Radford A. Schulman J. Sidor S. Wu Y. and Zhokhov P. 2017. OpenAI Baselines. https:\/\/github.com\/openai\/baselines. (2017).  Dhariwal P. Hesse C. Klimov O. Nichol A. Plappert M. Radford A. Schulman J. Sidor S. Wu Y. and Zhokhov P. 2017. OpenAI Baselines. https:\/\/github.com\/openai\/baselines. (2017)."},{"key":"e_1_3_2_1_9_1","volume-title":"Behavioral Medicine: Nutrition, Medication Management, and Exercise. In Practical Psychology in Medical Rehabilitation","author":"Dreer L. E.","year":"2017","unstructured":"Dreer , L. E. and Linley , A . 2017 . Behavioral Medicine: Nutrition, Medication Management, and Exercise. In Practical Psychology in Medical Rehabilitation , M. Budd, S. Hough, S. Wegener, and W. Stiers (Eds.). Springer . Dreer, L. E. and Linley, A. 2017. Behavioral Medicine: Nutrition, Medication Management, and Exercise. In Practical Psychology in Medical Rehabilitation, M. Budd, S. Hough, S. Wegener, and W. Stiers (Eds.). Springer."},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"crossref","unstructured":"Emmerich M. and Deutz A. 2018. A tutorial on multiobjective optimization: Fundamentals and evolutionary methods. Natural Computation 17 (2018) 585\u00e2\u0102\u015e--609.  Emmerich M. and Deutz A. 2018. A tutorial on multiobjective optimization: Fundamentals and evolutionary methods. Natural Computation 17 (2018) 585\u00e2\u0102\u015e--609.","DOI":"10.1007\/s11047-018-9685-y"},{"key":"e_1_3_2_1_11_1","volume-title":"Proceedings of the European Conference on Machine Learning. Springer, 654--662","author":"Gomez F.","unstructured":"Gomez , F. , Schmidhuber , J. , and Miikkulainen , R . 2006. Efficient non-linear control through neuroevolution . In Proceedings of the European Conference on Machine Learning. Springer, 654--662 . Gomez, F., Schmidhuber, J., and Miikkulainen, R. 2006. Efficient non-linear control through neuroevolution. In Proceedings of the European Conference on Machine Learning. Springer, 654--662."},{"key":"e_1_3_2_1_12_1","volume-title":"Proceedings of the International Conference on Genetic Algorithms and their Applications. 112--120","author":"Grefenstette J. J.","unstructured":"Grefenstette , J. J. and Fitzpatrick , J. M . 1985. Genetic search with approximate function evaluations . In Proceedings of the International Conference on Genetic Algorithms and their Applications. 112--120 . Grefenstette, J. J. and Fitzpatrick, J. M. 1985. Genetic search with approximate function evaluations. In Proceedings of the International Conference on Genetic Algorithms and their Applications. 112--120."},{"key":"e_1_3_2_1_13_1","unstructured":"Ha D. and Schmidhuber J. 2018. Recurrent World Models Facilitate Policy Evolution. In Advances in Neural Information Processing Systems 32 (NIPS\u00e2\u0102&Zacute;18). Curran Associates Inc. Red Hook NY USA 2455\u00e2\u0102\u015e2467.  Ha D. and Schmidhuber J. 2018. Recurrent World Models Facilitate Policy Evolution. In Advances in Neural Information Processing Systems 32 (NIPS\u00e2\u0102&Zacute;18). Curran Associates Inc. Red Hook NY USA 2455\u00e2\u0102\u015e2467."},{"key":"e_1_3_2_1_14_1","volume-title":"Advances in Neural Information Processing Systems 23","author":"Hasselt H. V.","unstructured":"Hasselt , H. V. 2010. Double Q-learning . In Advances in Neural Information Processing Systems 23 , J. D. Lafferty, C. K. I. Williams, J. Shawe-Taylor, R. S. Zemel, and A. Culotta (Eds.). Curran Associates, Inc. , 2613--2621. Hasselt, H. V. 2010. Double Q-learning. In Advances in Neural Information Processing Systems 23, J. D. Lafferty, C. K. I. Williams, J. Shawe-Taylor, R. S. Zemel, and A. Culotta (Eds.). Curran Associates, Inc., 2613--2621."},{"key":"e_1_3_2_1_15_1","volume-title":"Emergence of locomotion behaviours in rich environments. arXiv:1707.02286","author":"Heess N., TB, D.","year":"2017","unstructured":"Heess , N., TB, D. , Sriram , S. , Lemmon , J. , Merel , J. , Wayne , G. , Tassa , Y. , Erez , T. , Wang , Z. , Eslami , S. , and others. 2017. Emergence of locomotion behaviours in rich environments. arXiv:1707.02286 ( 2017 ). Heess, N., TB, D., Sriram, S., Lemmon, J., Merel, J., Wayne, G., Tassa, Y., Erez, T., Wang, Z., Eslami, S., and others. 2017. Emergence of locomotion behaviours in rich environments. arXiv:1707.02286 (2017)."},{"key":"e_1_3_2_1_16_1","volume-title":"PRETSL: Distributed Probabilistic Rule Evolution for Time-Series Classification","author":"Hodjat B.","year":"2018","unstructured":"Hodjat , B. , Shahrzad , H. , Miikkulainen , R. , Murray , L. , and Holmes , C . 2018 . PRETSL: Distributed Probabilistic Rule Evolution for Time-Series Classification . In Genetic Programming Theory and Practice XIV. Springer , 139--148. Hodjat, B., Shahrzad, H., Miikkulainen, R., Murray, L., and Holmes, C. 2018. PRETSL: Distributed Probabilistic Rule Evolution for Time-Series Classification. In Genetic Programming Theory and Practice XIV. Springer, 139--148."},{"key":"e_1_3_2_1_17_1","unstructured":"Houthooft R. Chen Y. Isola P. Stadie B. Wolski F. Ho O. J. and Abbeel P. 2018. Evolved policy gradients. In Advances in Neural Information Processing Systems 31. Curran Associates Inc. 5400--5409.  Houthooft R. Chen Y. Isola P. Stadie B. Wolski F. Ho O. J. and Abbeel P. 2018. Evolved policy gradients. In Advances in Neural Information Processing Systems 31. Curran Associates Inc. 5400--5409."},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.swevo.2011.05.001"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/TEVC.2005.846356"},{"key":"e_1_3_2_1_20_1","volume-title":"Proceedings of the Bird of a Feather Workshop, Genetic and Evolutionary Computation Conference (GECCO). 170--173","author":"Jin Y.","unstructured":"Jin , Y. , Husken , M. , and Sendhoff , B . 2003. Quality measures for approximate models in evolutionary computation . In Proceedings of the Bird of a Feather Workshop, Genetic and Evolutionary Computation Conference (GECCO). 170--173 . Jin, Y., Husken, M., and Sendhoff, B. 2003. Quality measures for approximate models in evolutionary computation. In Proceedings of the Bird of a Feather Workshop, Genetic and Evolutionary Computation Conference (GECCO). 170--173."},{"key":"e_1_3_2_1_21_1","unstructured":"Jin Y. Olhofer M. and Sendhoff B. 2000. On Evolutionary Optimization with Approximate Fitness Functions. 786--793.  Jin Y. Olhofer M. and Sendhoff B. 2000. On Evolutionary Optimization with Approximate Fitness Functions. 786--793."},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/TEVC.2018.2869001"},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0213918"},{"key":"e_1_3_2_1_24_1","volume-title":"Adam: A method for stochastic optimization. arXiv:1412.6980","author":"Kingma D. P.","year":"2014","unstructured":"Kingma , D. P. and Ba , J . 2014 . Adam: A method for stochastic optimization. arXiv:1412.6980 (2014). Kingma, D. P. and Ba, J. 2014. Adam: A method for stochastic optimization. arXiv:1412.6980 (2014)."},{"key":"e_1_3_2_1_25_1","unstructured":"Lehman J. Clune J. Misevic D. Adami C. Beaulieu J. Bentley P. J. Bernard S. Beslon G. Bryson D. M. Chrabaszcz P. Cheney N. Cully A. Doncieux S. Dyer F. C. Ellefsen K. O. Feldt R. Fischer S. Forrest S. Fr\u00e9noy A. Gagn\u00e9 C. Goff L. K. L. Grabowski L. M. Hodjat B. Hutter F. Keller L. Knibbe C. Krcah P. Lenski R. E. Lipson H. MacCurdy R. Maestre C. Miikkulainen R. Mitri S. Moriarty D. E. Mouret J. Nguyen A. Ofria C. Parizeau M. Parsons D. P. Pennock R. T. Punch W. F. Ray T. S. Schoenauer M. Shulte E. Sims K. Stanley K. O. Taddei F. Tarapore D. Thibault S. Weimer W. Watson R. and Yosinksi J. 2018. The Surprising Creativity of Digital Evolution: A Collection of Anecdotes from the Evolutionary Computation and Artificial Life Research Communities. arXiv:1803.03453 (2018).  Lehman J. Clune J. Misevic D. Adami C. Beaulieu J. Bentley P. J. Bernard S. Beslon G. Bryson D. M. Chrabaszcz P. Cheney N. Cully A. Doncieux S. Dyer F. C. Ellefsen K. O. Feldt R. Fischer S. Forrest S. Fr\u00e9noy A. Gagn\u00e9 C. Goff L. K. L. Grabowski L. M. Hodjat B. Hutter F. Keller L. Knibbe C. Krcah P. Lenski R. E. Lipson H. MacCurdy R. Maestre C. Miikkulainen R. Mitri S. Moriarty D. E. Mouret J. Nguyen A. Ofria C. Parizeau M. Parsons D. P. Pennock R. T. Punch W. F. Ray T. S. Schoenauer M. Shulte E. Sims K. Stanley K. O. Taddei F. Tarapore D. Thibault S. Weimer W. Watson R. and Yosinksi J. 2018. The Surprising Creativity of Digital Evolution: A Collection of Anecdotes from the Evolutionary Computation and Artificial Life Research Communities. arXiv:1803.03453 (2018)."},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/TSMC.2014.2358639"},{"key":"e_1_3_2_1_27_1","volume-title":"Evolution in Action: Past","author":"Miikkulainen R.","unstructured":"Miikkulainen , R. 2019. Creative AI Through Evolutionary Computation . In Evolution in Action: Past , Present and Future, Banzhaf et al. (Ed.). Springer , New York . Miikkulainen, R. 2019. Creative AI Through Evolutionary Computation. In Evolution in Action: Past, Present and Future, Banzhaf et al. (Ed.). Springer, New York."},{"key":"e_1_3_2_1_28_1","volume-title":"Press. Ascend by Evolv: AI-Based Massively Multivariate Conversion Rate Optimization","author":"Miikkulainen R.","unstructured":"Miikkulainen , R. , Brundage , M. , Epstein , J. , Foster , T. , Hodjat , B. , Iscoe , N. , Jiang , J. , Legrand , D. , Nazari , S. , Qiu , X. , Scharff , M. , Schoolland , C. , Severn , R. , and Shagrin , A . In Press. Ascend by Evolv: AI-Based Massively Multivariate Conversion Rate Optimization . AI Magazine (In Press) . Miikkulainen, R., Brundage, M., Epstein, J., Foster, T., Hodjat, B., Iscoe, N., Jiang, J., Legrand, D., Nazari, S., Qiu, X., Scharff, M., Schoolland, C., Severn, R., and Shagrin, A. In Press. Ascend by Evolv: AI-Based Massively Multivariate Conversion Rate Optimization. AI Magazine (In Press)."},{"key":"e_1_3_2_1_29_1","volume-title":"Proceedings of the 33rd International Conference on International Conference on Machine Learning (ICML) (ICML\u00e2\u0102&Zacute;16). 1928","author":"Mnih V.","year":"1937","unstructured":"Mnih , V. , Badia , A. P. , Mirza , M. , Graves , A. , Lillicrap , T. , Harley , T. , Silver , D. , and Kavukcuoglu , K . 2016. Asynchronous methods for deep reinforcement learning . In Proceedings of the 33rd International Conference on International Conference on Machine Learning (ICML) (ICML\u00e2\u0102&Zacute;16). 1928 -- 1937 . Mnih, V., Badia, A. P., Mirza, M., Graves, A., Lillicrap, T., Harley, T., Silver, D., and Kavukcuoglu, K. 2016. Asynchronous methods for deep reinforcement learning. In Proceedings of the 33rd International Conference on International Conference on Machine Learning (ICML) (ICML\u00e2\u0102&Zacute;16). 1928--1937."},{"key":"e_1_3_2_1_30_1","volume-title":"Human-level control through deep reinforcement learning. Nature 518, 7540","author":"Mnih V.","year":"2015","unstructured":"Mnih , V. , Kavukcuoglu , K. , Silver , D. , Rusu , A. A. , Veness , J. , Bellemare , M. G. , Graves , A. , Riedmiller , M. , Fidjeland , A. K. , Ostrovski , G. , and others. 2015. Human-level control through deep reinforcement learning. Nature 518, 7540 ( 2015 ), 529--533. Mnih, V., Kavukcuoglu, K., Silver, D., Rusu, A. A., Veness, J., Bellemare, M. G., Graves, A., Riedmiller, M., Fidjeland, A. K., Ostrovski, G., and others. 2015. Human-level control through deep reinforcement learning. Nature 518, 7540 (2015), 529--533."},{"key":"e_1_3_2_1_31_1","unstructured":"Mossalam H. Assael Y. M. Roijers D. M. and Whiteson S. 2016. Multi-Objective Deep Reinforcement Learning. arXiv:1610.02707 (2016).  Mossalam H. Assael Y. M. Roijers D. M. and Whiteson S. 2016. Multi-Objective Deep Reinforcement Learning. arXiv:1610.02707 (2016)."},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.5555\/2806734.2806739"},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1109\/TAMD.2010.2051436"},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.2514\/2.1999"},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1115\/1.2841318"},{"key":"e_1_3_2_1_36_1","volume-title":"Proceedings of the Eighth International Conference on Learning Representations (ICLR).","author":"Qiu X.","unstructured":"Qiu , X. , Meyerson , E. , and Miikkulainen , R . 2020. Quantifying Point-Prediction Uncertainty in Neural Networks via Residual Estimation with an I\/O Kernel . In Proceedings of the Eighth International Conference on Learning Representations (ICLR). Qiu, X., Meyerson, E., and Miikkulainen, R. 2020. Quantifying Point-Prediction Uncertainty in Neural Networks via Residual Estimation with an I\/O Kernel. In Proceedings of the Eighth International Conference on Learning Representations (ICLR)."},{"key":"e_1_3_2_1_37_1","unstructured":"Ray A. Achiam J. and Amodei D. 2019. Benchmarking Safe Exploration in Deep Reinforcement Learning. (2019). https:\/\/cdn.openai.com\/safexp-short.pdf  Ray A. Achiam J. and Amodei D. 2019. Benchmarking Safe Exploration in Deep Reinforcement Learning. (2019). https:\/\/cdn.openai.com\/safexp-short.pdf"},{"key":"e_1_3_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1007\/11564096_32"},{"key":"e_1_3_2_1_39_1","unstructured":"Salimans T. Ho J. Chen X. Sidor S. and Sutskever I. 2017. Evolution Strategies as a Scalable Alternative to Reinforcement Learning. arXiv:1703.03864 (2017).  Salimans T. Ho J. Chen X. Sidor S. and Sutskever I. 2017. Evolution Strategies as a Scalable Alternative to Reinforcement Learning. arXiv:1703.03864 (2017)."},{"key":"e_1_3_2_1_40_1","volume-title":"Proceedings of the Second International Conference on Learning Representations (ICLR). Citeseer.","author":"Saxe A. M.","unstructured":"Saxe , A. M. , Mcclelland , J. L. , and Ganguli , S . 2014. Exact solutions to the nonlinear dynamics of learning in deep linear neural network . In Proceedings of the Second International Conference on Learning Representations (ICLR). Citeseer. Saxe, A. M., Mcclelland, J. L., and Ganguli, S. 2014. Exact solutions to the nonlinear dynamics of learning in deep linear neural network. In Proceedings of the Second International Conference on Learning Representations (ICLR). Citeseer."},{"key":"e_1_3_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1142\/S012906579100011X"},{"key":"e_1_3_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/10.6.635"},{"key":"e_1_3_2_1_43_1","volume-title":"Proceedings of the Fourth International Conference on Learning Representations (ICLR).","author":"Schulman J.","unstructured":"Schulman , J. , Moritz , P. , Levine , S. , Jordan , M. , and Abbeel , P . 2016. High-dimensional continuous control using generalized advantage estimation . In Proceedings of the Fourth International Conference on Learning Representations (ICLR). Schulman, J., Moritz, P., Levine, S., Jordan, M., and Abbeel, P. 2016. High-dimensional continuous control using generalized advantage estimation. In Proceedings of the Fourth International Conference on Learning Representations (ICLR)."},{"key":"e_1_3_2_1_44_1","unstructured":"Schulman J. Wolski F. Dhariwal P. Radford A. and Klimov O. 2017. Proximal Policy Optimization Algorithms. CoRR abs\/1707.06347 (2017).  Schulman J. Wolski F. Dhariwal P. Radford A. and Klimov O. 2017. Proximal Policy Optimization Algorithms. CoRR abs\/1707.06347 (2017)."},{"key":"e_1_3_2_1_45_1","volume-title":"G., Schrittwieser, J., Antonoglou, I., Panneershelvam, V., Lanctot, M., and others.","author":"Silver D.","year":"2016","unstructured":"Silver , D. , Huang , A. , Maddison , C. J. , Guez , A. , Sifre , L. , Van Den Driessche , G., Schrittwieser, J., Antonoglou, I., Panneershelvam, V., Lanctot, M., and others. 2016 . Mastering the game of Go with deep neural networks and tree search. Nature 529, 7587 (2016), 484. Silver, D., Huang, A., Maddison, C. J., Guez, A., Sifre, L., Van Den Driessche, G., Schrittwieser, J., Antonoglou, I., Panneershelvam, V., Lanctot, M., and others. 2016. Mastering the game of Go with deep neural networks and tree search. Nature 529, 7587 (2016), 484."},{"key":"e_1_3_2_1_46_1","unstructured":"Snoek J. Larochelle H. and Adams R. P. 2012. Practical Bayesian Optimization of Machine Learning Algorithms. In Advances in Neural Information Processing Systems 25 F. Pereira C. J. C. Burges L. Bottou and K. Q. Weinberger (Eds.). Curran Associates Inc. 2951--2959.  Snoek J. Larochelle H. and Adams R. P. 2012. Practical Bayesian Optimization of Machine Learning Algorithms. In Advances in Neural Information Processing Systems 25 F. Pereira C. J. C. Burges L. Bottou and K. Q. Weinberger (Eds.). Curran Associates Inc. 2951--2959."},{"key":"e_1_3_2_1_47_1","volume-title":"Proceedings of the Genetic and Evolutionary Computation Conference (GECCO) (05","author":"Spector L.","year":"2001","unstructured":"Spector , L. , Goodman , E. , Wu , A., B. Langdon , W. , Voigt , m. H. , Gen , M. , Sen , S. , Dorigo , M. , Pezeshk , S. , Garzon , M. , Burke , E. , and Kaufmann Publishers , M. 2001 . Autoconstructive Evolution: Push, PushGP, and Pushpop . Proceedings of the Genetic and Evolutionary Computation Conference (GECCO) (05 2001). Spector, L., Goodman, E., Wu, A., B. Langdon, W., Voigt, m. H., Gen, M., Sen, S., Dorigo, M., Pezeshk, S., Garzon, M., Burke, E., and Kaufmann Publishers, M. 2001. Autoconstructive Evolution: Push, PushGP, and Pushpop. Proceedings of the Genetic and Evolutionary Computation Conference (GECCO) (05 2001)."},{"key":"e_1_3_2_1_48_1","doi-asserted-by":"crossref","unstructured":"Stanley K. O. Clune J. Lehman J. and Miikkulainen R. 2019. Designing Neural Networks through Evolutionary Algorithms. Nature Machine Intelligence 1 1 (2019) 24\u00e2\u0102\u015e35.  Stanley K. O. Clune J. Lehman J. and Miikkulainen R. 2019. Designing Neural Networks through Evolutionary Algorithms. Nature Machine Intelligence 1 1 (2019) 24\u00e2\u0102\u015e35.","DOI":"10.1038\/s42256-018-0006-z"},{"key":"e_1_3_2_1_49_1","unstructured":"Such F. P. Madhavan V. Conti E. Lehman J. Stanley K. O. and Clune J. 2017. Deep neuroevolution: Genetic algorithms are a competitive alternative for training deep neural networks for reinforcement learning. arXiv:1712.06567 (2017).  Such F. P. Madhavan V. Conti E. Lehman J. Stanley K. O. and Clune J. 2017. Deep neuroevolution: Genetic algorithms are a competitive alternative for training deep neural networks for reinforcement learning. arXiv:1712.06567 (2017)."},{"key":"e_1_3_2_1_50_1","unstructured":"Tasfi N. 2016. PyGame Learning Environment. https:\/\/github.com\/ntasfi\/PyGame-Learning-Environment. (2016).  Tasfi N. 2016. PyGame Learning Environment. https:\/\/github.com\/ntasfi\/PyGame-Learning-Environment. (2016)."},{"key":"e_1_3_2_1_51_1","unstructured":"Wahlstr\u00f6m N. Sch\u00f6n T. B. and Deisenroth M. P. 2015. From pixels to torques: Policy learning with deep dynamical models. arXiv:1502.02251 (2015).  Wahlstr\u00f6m N. Sch\u00f6n T. B. and Deisenroth M. P. 2015. From pixels to torques: Policy learning with deep dynamical models. arXiv:1502.02251 (2015)."},{"key":"e_1_3_2_1_52_1","volume-title":"Proceedings of the 33rd International Conference on International Conference on Machine Learning (ICML) (ICML\u00e2\u0102&Zacute;16)","volume":"48","author":"Wang Z.","year":"2016","unstructured":"Wang , Z. , Schaul , T. , Hessel , M. , Van Hasselt , H. , Lanctot , M. , and De Freitas , N. 2016 . Dueling Network Architectures for Deep Reinforcement Learning . In Proceedings of the 33rd International Conference on International Conference on Machine Learning (ICML) (ICML\u00e2\u0102&Zacute;16) , Vol. 48 . JMLR.org, 1995\u00e2\u0102\u015e2003. Wang, Z., Schaul, T., Hessel, M., Van Hasselt, H., Lanctot, M., and De Freitas, N. 2016. Dueling Network Architectures for Deep Reinforcement Learning. In Proceedings of the 33rd International Conference on International Conference on Machine Learning (ICML) (ICML\u00e2\u0102&Zacute;16), Vol. 48. JMLR.org, 1995\u00e2\u0102\u015e2003."},{"key":"e_1_3_2_1_54_1","volume-title":"Proceedings of the IEEE International Conference on Systems, Man and Cybernetics, NY.","author":"Werbos P. J.","year":"1987","unstructured":"Werbos , P. J. 1987 . Learning how the world works: Specifications for predictive networks in robots and brains . In Proceedings of the IEEE International Conference on Systems, Man and Cybernetics, NY. Werbos, P. J. 1987. Learning how the world works: Specifications for predictive networks in robots and brains. In Proceedings of the IEEE International Conference on Systems, Man and Cybernetics, NY."},{"key":"e_1_3_2_1_55_1","volume-title":"Reinforcement Learning","author":"Whiteson S.","unstructured":"Whiteson , S. 2012. Evolutionary computation for reinforcement learning . In Reinforcement Learning . Springer , 325--355. Whiteson, S. 2012. Evolutionary computation for reinforcement learning. In Reinforcement Learning. Springer, 325--355."},{"key":"e_1_3_2_1_56_1","unstructured":"Yang R. Sun X. and Narasimhan K. 2019. A Generalized Algorithm for Multi-Objective Reinforcement Learning and Policy Adaptation. In Advances in Neural Information Processing Systems 32 H. Wallach H. Larochelle A. Beygelzimer F. d'Alch\u00e9-Buc E. Fox and R. Garnett (Eds.). Curran Associates Inc. 14610--14621.  Yang R. Sun X. and Narasimhan K. 2019. A Generalized Algorithm for Multi-Objective Reinforcement Learning and Policy Adaptation. In Advances in Neural Information Processing Systems 32 H. Wallach H. Larochelle A. Beygelzimer F. d'Alch\u00e9-Buc E. Fox and R. Garnett (Eds.). Curran Associates Inc. 14610--14621."},{"key":"e_1_3_2_1_57_1","doi-asserted-by":"publisher","DOI":"10.1109\/CEC.2010.5586024"}],"event":{"name":"GECCO '20: Genetic and Evolutionary Computation Conference","location":"Canc\u00fan Mexico","acronym":"GECCO '20","sponsor":["SIGEVO ACM Special Interest Group on Genetic and Evolutionary Computation"]},"container-title":["Proceedings of the 2020 Genetic and Evolutionary Computation Conference"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3377930.3389842","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3377930.3389842","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T22:41:07Z","timestamp":1750200067000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3377930.3389842"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,6,25]]},"references-count":56,"alternative-id":["10.1145\/3377930.3389842","10.1145\/3377930"],"URL":"https:\/\/doi.org\/10.1145\/3377930.3389842","relation":{},"subject":[],"published":{"date-parts":[[2020,6,25]]},"assertion":[{"value":"2020-06-26","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}