{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,13]],"date-time":"2026-06-13T12:47:36Z","timestamp":1781354856504,"version":"3.54.1"},"publisher-location":"New York, NY, USA","reference-count":13,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,7,29]],"date-time":"2022-07-29T00:00:00Z","timestamp":1659052800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2022,7,29]]},"DOI":"10.1145\/3549179.3549181","type":"proceedings-article","created":{"date-parts":[[2022,8,20]],"date-time":"2022-08-20T22:07:10Z","timestamp":1661033230000},"page":"8-13","source":"Crossref","is-referenced-by-count":1,"title":["FPGA hardware implementation of Q-learning algorithm with low resource consumption"],"prefix":"10.1145","author":[{"given":"XIAOJUAN","family":"LIU","sequence":"first","affiliation":[{"name":"College of Electronic Science and Engineering, National University of Defense Technology, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"JIETAO","family":"DIAO","sequence":"additional","affiliation":[{"name":"College of Electronic Science and Engineering, National University of Defense Technology, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"NAN","family":"LI","sequence":"additional","affiliation":[{"name":"College of Electronic Science and Engineering, National University of Defense Technology, China"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"320","published-online":{"date-parts":[[2022,8,20]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"Luca Di Nunzio, Rocco Fazzolari, Daniele Giardino, Marco Matta, Alberto Nannarelli, and Marco Re.","author":"Spano Sergio","year":"2019","unstructured":"Sergio Spano , Gian Carlo Cardarilli , Luca Di Nunzio, Rocco Fazzolari, Daniele Giardino, Marco Matta, Alberto Nannarelli, and Marco Re. 2019 . An Efficient Hardware Implementation of Reinforcement Learning: The Q-Learning Algorithm. IEEE Access ( 2019), 186340-186351. DOI 10.1109\/ACCESS.2019.2961174 Sergio Spano, Gian Carlo Cardarilli, Luca Di Nunzio, Rocco Fazzolari, Daniele Giardino, Marco Matta, Alberto Nannarelli, and Marco Re. 2019. An Efficient Hardware Implementation of Reinforcement Learning: The Q-Learning Algorithm. IEEE Access (2019), 186340-186351. DOI 10.1109\/ACCESS.2019.2961174"},{"key":"e_1_3_2_1_2_1","volume-title":"Barto Et Richard S Sutton","author":"Andrew","year":"1998","unstructured":"Andrew G. Barto Et Richard S Sutton . 1998 . Reinforcement learning: An introduction. MIT Press . Andrew G. Barto Et Richard S Sutton. 1998. Reinforcement learning: An introduction. MIT Press."},{"key":"e_1_3_2_1_3_1","volume-title":"Fazzolari Rocco, Daniele Giardino, Marco Matta, Marco Re, and Span\u00f2 Sergio.","author":"Cardarilli Gian Carlo","year":"2021","unstructured":"Gian Carlo Cardarilli , Luca Di Nunzio , Fazzolari Rocco, Daniele Giardino, Marco Matta, Marco Re, and Span\u00f2 Sergio. 2021 . An Action-Selection Policy Generator for Reinforcement Learning Hardware Accelerators(Conference Paper). Lecture Notes in Electrical Engineering ( 2021), 267-272. Gian Carlo Cardarilli, Luca Di Nunzio, Fazzolari Rocco, Daniele Giardino, Marco Matta, Marco Re, and Span\u00f2 Sergio. 2021. An Action-Selection Policy Generator for Reinforcement Learning Hardware Accelerators(Conference Paper). Lecture Notes in Electrical Engineering (2021), 267-272."},{"key":"e_1_3_2_1_4_1","volume-title":"Control of HVAC-Systems Using Reinforcement Learning With Hysteresis and Tolerance Control","author":"Blad C., S.","year":"2020","unstructured":"Blad C., S. Kalles\u00f8e C., and B\u00f8gh S . Control of HVAC-Systems Using Reinforcement Learning With Hysteresis and Tolerance Control ., 2020 . Blad C., S. Kalles\u00f8e C., and B\u00f8gh S. Control of HVAC-Systems Using Reinforcement Learning With Hysteresis and Tolerance Control., 2020."},{"key":"e_1_3_2_1_5_1","volume-title":"Online Vehicle Routing With Neural Combinatorial Optimization and Deep Reinforcement Learning","author":"Yu James J. Q.","year":"2019","unstructured":"James J. Q. Yu , Wen Yu , and Jiatao Gu. 2019. Online Vehicle Routing With Neural Combinatorial Optimization and Deep Reinforcement Learning . IEEE T. Intell. Transp . ( 2019 ), 3806-3817. James J. Q. Yu, Wen Yu, and Jiatao Gu. 2019. Online Vehicle Routing With Neural Combinatorial Optimization and Deep Reinforcement Learning. IEEE T. Intell. Transp. (2019), 3806-3817."},{"key":"e_1_3_2_1_6_1","volume-title":"Towards Hardware Accelerated Reinforcement Learning for Application-Specific Robotic Control","author":"Michal Mysior Shengjia Shao Jason Tsai","year":"2018","unstructured":"Jason Tsai Michal Mysior Shengjia Shao . 2018. Towards Hardware Accelerated Reinforcement Learning for Application-Specific Robotic Control . IEEE ( 2018 ). Jason Tsai Michal Mysior Shengjia Shao. 2018. Towards Hardware Accelerated Reinforcement Learning for Application-Specific Robotic Control. IEEE (2018)."},{"key":"e_1_3_2_1_7_1","volume-title":"A New Deep-Q-Learning-Based Transmission Scheduling Mechanism for the Cognitive Internet of Things","author":"Zhu Jiang","year":"2018","unstructured":"Jiang Zhu , Yonghui Song , and Dingde Jiang . 2018. A New Deep-Q-Learning-Based Transmission Scheduling Mechanism for the Cognitive Internet of Things . IEEE Internet of Things Journal ( 2018 ). Jiang Zhu, Yonghui Song, and Dingde Jiang. 2018. A New Deep-Q-Learning-Based Transmission Scheduling Mechanism for the Cognitive Internet of Things. IEEE Internet of Things Journal (2018)."},{"key":"e_1_3_2_1_8_1","volume-title":"The Safety Management System Using Q-Learning Algorithm in IoT Environment","author":"A. Dolas S., A.","year":"2021","unstructured":"A. Dolas S., A. Jain S., and N. Bhute A. The Safety Management System Using Q-Learning Algorithm in IoT Environment ., 2021 . A. Dolas S., A. Jain S., and N. Bhute A. The Safety Management System Using Q-Learning Algorithm in IoT Environment., 2021."},{"key":"e_1_3_2_1_9_1","volume-title":"Watkins and Peter Dayan","author":"Christopher J. C.","year":"1992","unstructured":"Christopher J. C. H. Watkins and Peter Dayan . 1992 . Technical Note : Q-Learning. Mach. Learn . (1992), 279-292. Christopher J. C. H. Watkins and Peter Dayan. 1992. Technical Note: Q-Learning. Mach. Learn. (1992), 279-292."},{"key":"e_1_3_2_1_10_1","volume-title":"The Experience-Memory Q-Learning Algorithm for Robot Path Planning in Unknown Environment","author":"Zhao Meng","year":"2020","unstructured":"Meng Zhao , Hui Lu , Siyi Yang , and Fengjuan Guo . 2020. The Experience-Memory Q-Learning Algorithm for Robot Path Planning in Unknown Environment . IEEE Access ( 2020 ), 47824-47844. Meng Zhao, Hui Lu, Siyi Yang, and Fengjuan Guo. 2020. The Experience-Memory Q-Learning Algorithm for Robot Path Planning in Unknown Environment. IEEE Access (2020), 47824-47844."},{"key":"e_1_3_2_1_11_1","volume-title":"SJ Singh SapamJitu Singh, LC Jain LakhmiC. Jain, and AK Nagar AtulyaK. Nagar.","author":"Konar A. Konar Amit","year":"2013","unstructured":"A. Konar Amit Konar , IG Chakraborty IndraniGoswami Chakraborty , SJ Singh SapamJitu Singh, LC Jain LakhmiC. Jain, and AK Nagar AtulyaK. Nagar. 2013 . A Deterministic Improved Q-Learning for Path Planning of a Mobile Robot. IEEE T. Syst. Man Cy .-S. (2013), 1141-1153. A. Konar Amit Konar, IG Chakraborty IndraniGoswami Chakraborty, SJ Singh SapamJitu Singh, LC Jain LakhmiC. Jain, and AK Nagar AtulyaK. Nagar. 2013. A Deterministic Improved Q-Learning for Path Planning of a Mobile Robot. IEEE T. Syst. Man Cy.-S. (2013), 1141-1153."},{"key":"e_1_3_2_1_12_1","volume-title":"Pauline AUTHOR Ongp Uthm. Ong, and Kah Chun AUTHOR Cheah","author":"Low Ee Soong AUTHOR","year":"2019","unstructured":"Ee Soong AUTHOR Low , Pauline AUTHOR Ongp Uthm. Ong, and Kah Chun AUTHOR Cheah . 2019 . Solving the optimal path planning of a mobile robot using improved Q-learning. Robotics & Autonomous Systems ( 2019), 143-161. Ee Soong AUTHOR Low, Pauline AUTHOR Ongp Uthm. Ong, and Kah Chun AUTHOR Cheah. 2019. Solving the optimal path planning of a mobile robot using improved Q-learning. Robotics & Autonomous Systems (2019), 143-161."},{"key":"e_1_3_2_1_13_1","volume-title":"Fernandes","author":"Da Silva Lucileide M. D.","year":"2019","unstructured":"Lucileide M. D. Da Silva , Matheus F. Torquato , and Marcelo A. C . Fernandes . 2019 . Parallel Implementation of Reinforcement Learning Q-Learning Technique for FPGA. IEEE Access ( 2019), 2782-2798. DOI 10.1109\/ACCESS.2018.2885950 Lucileide M. D. Da Silva, Matheus F. Torquato, and Marcelo A. C. Fernandes. 2019. Parallel Implementation of Reinforcement Learning Q-Learning Technique for FPGA. IEEE Access (2019), 2782-2798. DOI 10.1109\/ACCESS.2018.2885950"}],"event":{"name":"PRIS 2022: 2022 4th International Conference on Pattern Recognition and Intelligent Systems","location":"Wuhan China","acronym":"PRIS 2022"},"container-title":["2022 4th International Conference on Pattern Recognition and Intelligent Systems"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3549179.3549181","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3549179.3549181","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T19:00:22Z","timestamp":1750186822000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3549179.3549181"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,7,29]]},"references-count":13,"alternative-id":["10.1145\/3549179.3549181","10.1145\/3549179"],"URL":"https:\/\/doi.org\/10.1145\/3549179.3549181","relation":{},"subject":[],"published":{"date-parts":[[2022,7,29]]}}}