{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:29:58Z","timestamp":1750220998131,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":13,"publisher":"ACM","license":[{"start":{"date-parts":[[2019,4,16]],"date-time":"2019-04-16T00:00:00Z","timestamp":1555372800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"Toyota Motors North America R&D"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2019,4,16]]},"DOI":"10.1145\/3302504.3313355","type":"proceedings-article","created":{"date-parts":[[2019,4,8]],"date-time":"2019-04-08T13:37:58Z","timestamp":1554730678000},"page":"270-271","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":4,"title":["Structured reward functions using STL"],"prefix":"10.1145","author":[{"given":"Anand","family":"Balakrishnan","sequence":"first","affiliation":[{"name":"University of Southern California"}]},{"given":"Jyotirmoy V.","family":"Deshmukh","sequence":"additional","affiliation":[{"name":"University of Southern California"}]}],"member":"320","published-online":{"date-parts":[[2019,4,16]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"Zhaodan Kong, Mac Schwager, and Calin Belta.","author":"Aksaray Derya","year":"2016","unstructured":"Derya Aksaray , Austin Jones , Zhaodan Kong, Mac Schwager, and Calin Belta. 2016 . Q-Learning for Robust Satisfaction of Signal Temporal Logic Specifications . (Sept. 2016). arXiv:cs\/1609.07409 Derya Aksaray, Austin Jones, Zhaodan Kong, Mac Schwager, and Calin Belta. 2016. Q-Learning for Robust Satisfaction of Signal Temporal Logic Specifications. (Sept. 2016). arXiv:cs\/1609.07409"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10703-017-0286-7"},{"volume-title":"Computer Aided Verification (Lecture Notes in Computer Science)","author":"Donz\u00e9 Alexandre","key":"e_1_3_2_1_3_1","unstructured":"Alexandre Donz\u00e9 , Thomas Ferr\u00e8 , and Oded Maler . 2013. Efficient Robust Monitoring for STL . In Computer Aided Verification (Lecture Notes in Computer Science) , Natasha Sharygina and Helmut Veith (Eds.). Springer Berlin Heidelberg , 264--279. Alexandre Donz\u00e9, Thomas Ferr\u00e8, and Oded Maler. 2013. Efficient Robust Monitoring for STL. In Computer Aided Verification (Lecture Notes in Computer Science), Natasha Sharygina and Helmut Veith (Eds.). Springer Berlin Heidelberg, 264--279."},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.5555\/1885174.1885183"},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.5555\/3091125.3091208"},{"key":"e_1_3_2_1_6_1","volume-title":"Continuous Deep Q-Learning with Model-Based Acceleration. (March","author":"Gu Shixiang","year":"2016","unstructured":"Shixiang Gu , Timothy Lillicrap , Ilya Sutskever , and Sergey Levine . 2016. Continuous Deep Q-Learning with Model-Based Acceleration. (March 2016 ). arXiv:cs\/1603.00748 Shixiang Gu, Timothy Lillicrap, Ilya Sutskever, and Sergey Levine. 2016. Continuous Deep Q-Learning with Model-Based Acceleration. (March 2016). arXiv:cs\/1603.00748"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10703-018-0319-x"},{"volume-title":"Reinforcement Learning with Temporal Logic Rewards. In 2017 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS). 3834--3839","author":"Li X.","key":"e_1_3_2_1_8_1","unstructured":"X. Li , C. Vasile , and C. Belta . 2017 . Reinforcement Learning with Temporal Logic Rewards. In 2017 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS). 3834--3839 . X. Li, C. Vasile, and C. Belta. 2017. Reinforcement Learning with Temporal Logic Rewards. In 2017 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS). 3834--3839."},{"key":"e_1_3_2_1_9_1","volume-title":"Mehdi Mirza, Alex Graves, Timothy P. Lillicrap, Tim Harley, David Silver, and Koray Kavukcuoglu.","author":"Mnih Volodymyr","year":"2016","unstructured":"Volodymyr Mnih , Adri\u00e0 Puigdom\u00e8nech Badia , Mehdi Mirza, Alex Graves, Timothy P. Lillicrap, Tim Harley, David Silver, and Koray Kavukcuoglu. 2016 . Asynchronous Methods for Deep Reinforcement Learning . (Feb. 2016). arXiv:cs\/1602.01783 Volodymyr Mnih, Adri\u00e0 Puigdom\u00e8nech Badia, Mehdi Mirza, Alex Graves, Timothy P. Lillicrap, Tim Harley, David Silver, and Koray Kavukcuoglu. 2016. Asynchronous Methods for Deep Reinforcement Learning. (Feb. 2016). arXiv:cs\/1602.01783"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1038\/nature14236"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/2883817.2883839"},{"key":"e_1_3_2_1_12_1","volume-title":"Proximal Policy Optimization Algorithms. (July","author":"Schulman John","year":"2017","unstructured":"John Schulman , Filip Wolski , Prafulla Dhariwal , Alec Radford , and Oleg Klimov . 2017. Proximal Policy Optimization Algorithms. (July 2017 ). arXiv:cs\/1707.06347 John Schulman, Filip Wolski, Prafulla Dhariwal, Alec Radford, and Oleg Klimov. 2017. Proximal Policy Optimization Algorithms. (July 2017). arXiv:cs\/1707.06347"},{"key":"e_1_3_2_1_13_1","volume-title":"Barto","author":"Sutton Richard S.","year":"2018","unstructured":"Richard S. Sutton and Andrew G . Barto . 2018 . Reinforcement Learning : An Introduction (second edition ed.). The MIT Press , Cambridge, MA. Richard S. Sutton and Andrew G. Barto. 2018. Reinforcement Learning: An Introduction (second edition ed.). The MIT Press, Cambridge, MA."}],"event":{"name":"HSCC '19: 22nd ACM International Conference on Hybrid Systems: Computation and Control","sponsor":["SIGBED ACM Special Interest Group on Embedded Systems"],"location":"Montreal Quebec Canada","acronym":"HSCC '19"},"container-title":["Proceedings of the 22nd ACM International Conference on Hybrid Systems: Computation and Control"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3302504.3313355","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3302504.3313355","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T00:25:37Z","timestamp":1750206337000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3302504.3313355"}},"subtitle":["poster abstract"],"short-title":[],"issued":{"date-parts":[[2019,4,16]]},"references-count":13,"alternative-id":["10.1145\/3302504.3313355","10.1145\/3302504"],"URL":"https:\/\/doi.org\/10.1145\/3302504.3313355","relation":{},"subject":[],"published":{"date-parts":[[2019,4,16]]},"assertion":[{"value":"2019-04-16","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}