{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:25:32Z","timestamp":1750220732895,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":22,"publisher":"ACM","license":[{"start":{"date-parts":[[2020,9,7]],"date-time":"2020-09-07T00:00:00Z","timestamp":1599436800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/100010198","name":"Army Research Laboratory","doi-asserted-by":"publisher","award":["W911NF-10-2-0022"],"award-info":[{"award-number":["W911NF-10-2-0022"]}],"id":[{"id":"10.13039\/100010198","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2020,9,7]]},"DOI":"10.1145\/3386263.3407652","type":"proceedings-article","created":{"date-parts":[[2020,9,4]],"date-time":"2020-09-04T21:34:20Z","timestamp":1599255260000},"page":"131-136","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":11,"title":["Energy-Efficient Hardware for Language Guided Reinforcement Learning"],"prefix":"10.1145","author":[{"given":"Aidin","family":"Shiri","sequence":"first","affiliation":[{"name":"University of Maryland, Baltimore County, Baltimore, MD, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Arnab Neelim","family":"Mazumder","sequence":"additional","affiliation":[{"name":"University of Maryland, Baltimore County, Baltimore, MD, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Bharat","family":"Prakash","sequence":"additional","affiliation":[{"name":"University of Maryland, Baltimore County, Baltimore, MD, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Nitheesh Kumar","family":"Manjunath","sequence":"additional","affiliation":[{"name":"University of Maryland, Baltimore County, Baltimore, MD, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Houman","family":"Homayoun","sequence":"additional","affiliation":[{"name":"University of California, Davis, Davis, CA, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Avesta","family":"Sasan","sequence":"additional","affiliation":[{"name":"George Mason University, Fairfax, VA, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Nicholas R.","family":"Waytowich","sequence":"additional","affiliation":[{"name":"US Army Research Laboratory, New york, NY, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Tinoosh","family":"Mohsenin","sequence":"additional","affiliation":[{"name":"University of Maryland, Baltimore County, Baltimore, MD, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2020,9,7]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"Introduction to reinforcement learning","author":"Sutton Richard S","year":"1998","unstructured":"Richard S Sutton , Andrew G Barto , Introduction to reinforcement learning , volume 2 . MIT press Cambridge , 1998 . Richard S Sutton, Andrew G Barto, et al. Introduction to reinforcement learning, volume 2. MIT press Cambridge, 1998."},{"key":"e_1_3_2_1_2_1","first-page":"4299","volume-title":"Advances in Neural Information Processing Systems","author":"Christiano Paul F","year":"2017","unstructured":"Paul F Christiano , Jan Leike , Tom Brown , Miljan Martic , Shane Legg , and Dario Amodei . Deep reinforcement learning from human preferences . In Advances in Neural Information Processing Systems , pages 4299 -- 4307 , 2017 . Paul F Christiano, Jan Leike, Tom Brown, Miljan Martic, Shane Legg, and Dario Amodei. Deep reinforcement learning from human preferences. In Advances in Neural Information Processing Systems, pages 4299--4307, 2017."},{"key":"e_1_3_2_1_3_1","volume-title":"Learning behaviors from a single video demonstration using human feedback","author":"Gandhi Sunil","year":"2019","unstructured":"Sunil Gandhi , Tim Oates , Tinoosh Mohsenin , and Nicholas R Waytowich . Learning behaviors from a single video demonstration using human feedback . 2019 . Sunil Gandhi, Tim Oates, Tinoosh Mohsenin, and Nicholas R Waytowich. Learning behaviors from a single video demonstration using human feedback. 2019."},{"key":"e_1_3_2_1_4_1","volume-title":"Learning to understand goal specifications by modelling reward. arXiv preprint arXiv:1806.01946","author":"Bahdanau Dzmitry","year":"2018","unstructured":"Dzmitry Bahdanau , Felix Hill , Jan Leike , Edward Hughes , Arian Hosseini , Pushmeet Kohli , and Edward Grefenstette . Learning to understand goal specifications by modelling reward. arXiv preprint arXiv:1806.01946 , 2018 . Dzmitry Bahdanau, Felix Hill, Jan Leike, Edward Hughes, Arian Hosseini, Pushmeet Kohli, and Edward Grefenstette. Learning to understand goal specifications by modelling reward. arXiv preprint arXiv:1806.01946, 2018."},{"key":"e_1_3_2_1_5_1","volume-title":"Using natural language for reward shaping in reinforcement learning. arXiv preprint arXiv:1903.02020","author":"Goyal Prasoon","year":"2019","unstructured":"Prasoon Goyal , Scott Niekum , and Raymond J Mooney . Using natural language for reward shaping in reinforcement learning. arXiv preprint arXiv:1903.02020 , 2019 . Prasoon Goyal, Scott Niekum, and Raymond J Mooney. Using natural language for reward shaping in reinforcement learning. arXiv preprint arXiv:1903.02020, 2019."},{"key":"e_1_3_2_1_6_1","volume-title":"Max Jaderberg, Denis Teplyashin, et al. Grounded language learning in a simulated 3d world. arXiv preprint arXiv:1706.06551","author":"Hermann Karl Moritz","year":"2017","unstructured":"Karl Moritz Hermann , Felix Hill , Simon Green , Fumin Wang , Ryan Faulkner , Hubert Soyer , David Szepesvari , Wojciech Marian Czarnecki , Max Jaderberg, Denis Teplyashin, et al. Grounded language learning in a simulated 3d world. arXiv preprint arXiv:1706.06551 , 2017 . Karl Moritz Hermann, Felix Hill, Simon Green, Fumin Wang, Ryan Faulkner, Hubert Soyer, David Szepesvari, Wojciech Marian Czarnecki, Max Jaderberg, Denis Teplyashin, et al. Grounded language learning in a simulated 3d world. arXiv preprint arXiv:1706.06551, 2017."},{"issue":"6","key":"e_1_3_2_1_7_1","first-page":"4","article-title":"Walk the talk: Connecting language, knowledge, and action in route instructions","volume":"2","author":"MacMahon Matt","year":"2006","unstructured":"Matt MacMahon , Brian Stankiewicz , and Benjamin Kuipers . Walk the talk: Connecting language, knowledge, and action in route instructions . Def , 2 ( 6 ): 4 , 2006 . Matt MacMahon, Brian Stankiewicz, and Benjamin Kuipers. Walk the talk: Connecting language, knowledge, and action in route instructions. Def, 2(6):4, 2006.","journal-title":"Def"},{"unstructured":"Bharat Prakash Nicholas Waytowich Ashwinkumar Ganesan Tim Oates and Tinoosh Mohsenin. Guiding safe reinforcement learning policies using structured language constraints.  Bharat Prakash Nicholas Waytowich Ashwinkumar Ganesan Tim Oates and Tinoosh Mohsenin. Guiding safe reinforcement learning policies using structured language constraints.","key":"e_1_3_2_1_8_1"},{"key":"e_1_3_2_1_9_1","article-title":"A scalable and low-power deep convolutional neural network for multimodal data classification","author":"Jafari Ali","year":"2018","unstructured":"Ali Jafari , Ashwinkumar Ganesan , Chetan Sai Kumar Thalisetty , Varun Sivasubramanian , Tim Oates , and Tinoosh Mohsenin . Sensornet : A scalable and low-power deep convolutional neural network for multimodal data classification . IEEE Transactions on Circuits and Systems I: Regular Papers, (99):1--14 , 2018 . Ali Jafari, Ashwinkumar Ganesan, Chetan Sai Kumar Thalisetty, Varun Sivasubramanian, Tim Oates, and Tinoosh Mohsenin. Sensornet: A scalable and low-power deep convolutional neural network for multimodal data classification. IEEE Transactions on Circuits and Systems I: Regular Papers, (99):1--14, 2018.","journal-title":"IEEE Transactions on Circuits and Systems I: Regular Papers, (99):1--14"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_10_1","DOI":"10.1109\/RECONFIG.2018.8641702"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_11_1","DOI":"10.1109\/ISQED.2019.8697574"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_12_1","DOI":"10.1109\/ISCAS.2018.8351525"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_13_1","DOI":"10.1109\/ISCAS.2016.7527445"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_14_1","DOI":"10.1145\/3299874.3319493"},{"key":"e_1_3_2_1_15_1","volume-title":"Proximal policy optimization algorithms. arXiv preprint arXiv:1707.06347","author":"Schulman John","year":"2017","unstructured":"John Schulman , Filip Wolski , Prafulla Dhariwal , Alec Radford , and Oleg Klimov . Proximal policy optimization algorithms. arXiv preprint arXiv:1707.06347 , 2017 . John Schulman, Filip Wolski, Prafulla Dhariwal, Alec Radford, and Oleg Klimov. Proximal policy optimization algorithms. arXiv preprint arXiv:1707.06347, 2017."},{"key":"e_1_3_2_1_16_1","volume-title":"gym-miniworld environment for open ai gym. https:\/\/github.com\/maximecb\/gym-miniworld","author":"Chevalier-Boisvert Maxime","year":"2018","unstructured":"Maxime Chevalier-Boisvert . gym-miniworld environment for open ai gym. https:\/\/github.com\/maximecb\/gym-miniworld , 2018 . Maxime Chevalier-Boisvert. gym-miniworld environment for open ai gym. https:\/\/github.com\/maximecb\/gym-miniworld, 2018."},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_17_1","DOI":"10.1145\/3316781.3317873"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_18_1","DOI":"10.1145\/2897937.2898003"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_19_1","DOI":"10.1145\/3301278"},{"key":"e_1_3_2_1_20_1","first-page":"1","volume-title":"2016 International Conference on Compliers, Architectures, and Sythesis of Embedded Systems (CASES)","author":"Hegde Gopalakrishna","year":"2016","unstructured":"Gopalakrishna Hegde , Nachiappan Ramasamy , Nachiket Kapre , : an optimized library for deep learning on embedded accelerator-based platforms . In 2016 International Conference on Compliers, Architectures, and Sythesis of Embedded Systems (CASES) , pages 1 -- 10 . IEEE, 2016 . Gopalakrishna Hegde, Nachiappan Ramasamy, Nachiket Kapre, et al. Caffepresso: an optimized library for deep learning on embedded accelerator-based platforms. In 2016 International Conference on Compliers, Architectures, and Sythesis of Embedded Systems (CASES), pages 1--10. IEEE, 2016."},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_21_1","DOI":"10.1109\/FCCM.2018.00055"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_22_1","DOI":"10.1145\/3229631.3229639"}],"event":{"acronym":"GLSVLSI '20","name":"GLSVLSI '20: Great Lakes Symposium on VLSI 2020","location":"Virtual Event China"},"container-title":["Proceedings of the 2020 on Great Lakes Symposium on VLSI"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3386263.3407652","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/abs\/10.1145\/3386263.3407652","content-type":"text\/html","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3386263.3407652","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3386263.3407652","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T22:38:25Z","timestamp":1750199905000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3386263.3407652"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,9,7]]},"references-count":22,"alternative-id":["10.1145\/3386263.3407652","10.1145\/3386263"],"URL":"https:\/\/doi.org\/10.1145\/3386263.3407652","relation":{},"subject":[],"published":{"date-parts":[[2020,9,7]]},"assertion":[{"value":"2020-09-07","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}