{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,21]],"date-time":"2026-02-21T12:31:53Z","timestamp":1771677113259,"version":"3.50.1"},"reference-count":19,"publisher":"Association for Computing Machinery (ACM)","issue":"3","license":[{"start":{"date-parts":[[2015,3,10]],"date-time":"2015-03-10T00:00:00Z","timestamp":1425945600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["AI Matters"],"published-print":{"date-parts":[[2015,3,10]]},"abstract":"<jats:p>Many robotic motion tasks, such as UAV control, have non-linear and high-dimensional dynamics. Difficult for both human demonstration and explicit solutions, these tasks can be described with opposing preferences. This thesis develops PEARL, a real-time solution for such tasks on acceleration-controlled systems with unknown dynamics, and finds PEARL's safety conditions.<\/jats:p>","DOI":"10.1145\/2735392.2735396","type":"journal-article","created":{"date-parts":[[2015,3,12]],"date-time":"2015-03-12T12:18:05Z","timestamp":1426162685000},"page":"8-12","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["Reinforcement learning and planning for preference balancing tasks"],"prefix":"10.1145","volume":"1","author":[{"given":"Aleksandra","family":"Faust","sequence":"first","affiliation":[{"name":"Sandia National Laboratories"}]}],"member":"320","published-online":{"date-parts":[[2015,3,10]]},"reference":[{"key":"e_1_2_1_1_1","volume-title":"Apprenticeship learning and reinforcement learning with application to robotic control (Unpublished doctoral dissertation)","author":"Abbeel P.","year":"2008"},{"key":"e_1_2_1_2_1","volume-title":"Reinforcement learning and dynamic programming using function approximators","author":"Bu\u015foniu L.","year":"2010","edition":"1"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.2202\/1553-779X.1066"},{"key":"e_1_2_1_4_1","volume-title":"Reinforcement learning and planning for preference balancing tasks (Unpublished doctoral dissertation)","author":"Faust A.","year":"2014"},{"key":"e_1_2_1_5_1","unstructured":"Faust A. Aimone J. James C. and Tapia L. (n.d.). Resilient array sorting agent with reinforcement learning. Under submission.  Faust A. Aimone J. James C. and Tapia L. (n.d.). Resilient array sorting agent with reinforcement learning. Under submission."},{"key":"e_1_2_1_6_1","unstructured":"Faust A. Chiang H.-T. and Tapia L. (n.d.). PEARL: PrEference Appraisal Reinforcement Learning for motion planning. Under submission.  Faust A. Chiang H.-T. and Tapia L. (n.d.). PEARL: PrEference Appraisal Reinforcement Learning for motion planning. Under submission."},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2015.7139692"},{"key":"e_1_2_1_8_1","volume-title":"Machine Learning in Planning and Control of Robot Motion Workshop at IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS).","author":"Faust A.","year":"2014"},{"key":"e_1_2_1_9_1","volume-title":"Proc. IEEE International Conference on Robotics and Automation (ICRA), 4887--4894","author":"Faust A."},{"key":"e_1_2_1_10_1","volume-title":"Autonomous Learning Workshop at IEEE International Conference on Robotics and Automation (ICRA).","author":"Faust A."},{"key":"e_1_2_1_11_1","volume-title":"Automated aerial suspended cargo delivery through reinforcement learning. Artificial Intelligence. doi: http:\/\/dx.doi.org\/10.1016\/j.artint.2014.11.009","author":"Faust A.","year":"2014"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/JAS.2014.7004690"},{"key":"e_1_2_1_13_1","volume-title":"Proceedings of the 11th World Congress on Intelligent Control and Automation.","author":"Figueroa R."},{"key":"e_1_2_1_14_1","doi-asserted-by":"crossref","DOI":"10.1515\/9781400841042","volume-title":"Non-linear dynamical systems and control","author":"Haddad W. M.","year":"2008"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/70.508439"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1177\/0278364913495721"},{"key":"e_1_2_1_17_1","volume-title":"Analysis and control of nonlinear systems: A flatness-based approach","author":"Levine J.","year":"2010"},{"key":"e_1_2_1_18_1","volume-title":"C. Pradalier, R. Siegwart, & G","author":"Peters J.","year":"2011"},{"key":"e_1_2_1_19_1","volume-title":"A reinforcement learning: an introduction","author":"Sutton R.","year":"1998"}],"container-title":["AI Matters"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2735392.2735396","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2735392.2735396","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T06:17:02Z","timestamp":1750227422000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2735392.2735396"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2015,3,10]]},"references-count":19,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2015,3,10]]}},"alternative-id":["10.1145\/2735392.2735396"],"URL":"https:\/\/doi.org\/10.1145\/2735392.2735396","relation":{},"ISSN":["2372-3483"],"issn-type":[{"value":"2372-3483","type":"electronic"}],"subject":[],"published":{"date-parts":[[2015,3,10]]},"assertion":[{"value":"2015-03-10","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}