{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,10]],"date-time":"2026-01-10T19:31:08Z","timestamp":1768073468258,"version":"3.49.0"},"reference-count":24,"publisher":"MDPI AG","issue":"5","license":[{"start":{"date-parts":[[2025,5,16]],"date-time":"2025-05-16T00:00:00Z","timestamp":1747353600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"National Natural Science Foundation of China","award":["62173134"],"award-info":[{"award-number":["62173134"]}]},{"name":"National Natural Science Foundation of China","award":["62473140"],"award-info":[{"award-number":["62473140"]}]},{"name":"National Natural Science Foundation of China","award":["62473145"],"award-info":[{"award-number":["62473145"]}]},{"name":"National Natural Science Foundation of China","award":["2024JK2029"],"award-info":[{"award-number":["2024JK2029"]}]},{"name":"National Natural Science Foundation of China","award":["2023JJ60151"],"award-info":[{"award-number":["2023JJ60151"]}]},{"name":"National Natural Science Foundation of China","award":["2023JJ50029"],"award-info":[{"award-number":["2023JJ50029"]}]},{"name":"National Natural Science Foundation of China","award":["22B0735"],"award-info":[{"award-number":["22B0735"]}]},{"name":"Key R&amp;D Program Project of Hunan Province","award":["62173134"],"award-info":[{"award-number":["62173134"]}]},{"name":"Key R&amp;D Program Project of Hunan Province","award":["62473140"],"award-info":[{"award-number":["62473140"]}]},{"name":"Key R&amp;D Program Project of Hunan Province","award":["62473145"],"award-info":[{"award-number":["62473145"]}]},{"name":"Key R&amp;D Program Project of Hunan Province","award":["2024JK2029"],"award-info":[{"award-number":["2024JK2029"]}]},{"name":"Key R&amp;D Program Project of Hunan Province","award":["2023JJ60151"],"award-info":[{"award-number":["2023JJ60151"]}]},{"name":"Key R&amp;D Program Project of Hunan Province","award":["2023JJ50029"],"award-info":[{"award-number":["2023JJ50029"]}]},{"name":"Key R&amp;D Program Project of Hunan Province","award":["22B0735"],"award-info":[{"award-number":["22B0735"]}]},{"name":"Natural Science Foundation of Hunan Province","award":["62173134"],"award-info":[{"award-number":["62173134"]}]},{"name":"Natural Science Foundation of Hunan Province","award":["62473140"],"award-info":[{"award-number":["62473140"]}]},{"name":"Natural Science Foundation of Hunan Province","award":["62473145"],"award-info":[{"award-number":["62473145"]}]},{"name":"Natural Science Foundation of Hunan Province","award":["2024JK2029"],"award-info":[{"award-number":["2024JK2029"]}]},{"name":"Natural Science Foundation of Hunan Province","award":["2023JJ60151"],"award-info":[{"award-number":["2023JJ60151"]}]},{"name":"Natural Science Foundation of Hunan Province","award":["2023JJ50029"],"award-info":[{"award-number":["2023JJ50029"]}]},{"name":"Natural Science Foundation of Hunan Province","award":["22B0735"],"award-info":[{"award-number":["22B0735"]}]},{"name":"Excellent Youth Project of Hunan Provincial Department of Education","award":["62173134"],"award-info":[{"award-number":["62173134"]}]},{"name":"Excellent Youth Project of Hunan Provincial Department of Education","award":["62473140"],"award-info":[{"award-number":["62473140"]}]},{"name":"Excellent Youth Project of Hunan Provincial Department of Education","award":["62473145"],"award-info":[{"award-number":["62473145"]}]},{"name":"Excellent Youth Project of Hunan Provincial Department of Education","award":["2024JK2029"],"award-info":[{"award-number":["2024JK2029"]}]},{"name":"Excellent Youth Project of Hunan Provincial Department of Education","award":["2023JJ60151"],"award-info":[{"award-number":["2023JJ60151"]}]},{"name":"Excellent Youth Project of Hunan Provincial Department of Education","award":["2023JJ50029"],"award-info":[{"award-number":["2023JJ50029"]}]},{"name":"Excellent Youth Project of Hunan Provincial Department of Education","award":["22B0735"],"award-info":[{"award-number":["22B0735"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Systems"],"abstract":"<jats:p>For mobile agent path planning, traditional path planning algorithms frequently induce abrupt variations in path curvature and steering angles, increasing the risk of lateral tire slippage and undermining operational safety. Concurrently, conventional reinforcement learning methods struggle to converge rapidly, leading to an insufficient efficiency in planning to meet the demand for energy economy. This study proposes LSTM B\u00e9zier\u2013Double Deep Q-Network (LB-DDQN), an advanced path-planning framework for mobile agents based on deep reinforcement learning. The architecture first enables mapless navigation through a DDQN foundation, subsequently integrates long short-term memory (LSTM) networks for the fusion of environmental features and preservation of training information, and ultimately enhances the path\u2019s quality through redundant node elimination via an obstacle\u2013path relationship analysis, combined with B\u00e9zier curve-based trajectory smoothing. A sensor-driven three-dimensional simulation environment featuring static obstacles was constructed using the ROS and Gazebo platforms, where LiDAR-equipped mobile agent models were trained for real-time environmental perception and strategy optimization prior to deployment on experimental vehicles. The simulation and physical implementation results reveal that LB-DDQN achieves effective collision avoidance, while demonstrating marked enhancements in critical metrics: the path\u2019s smoothness, energy efficiency, and motion stability exhibit average improvements exceeding 50%. The framework further maintains superior safety standards and operational efficiency across diverse scenarios.<\/jats:p>","DOI":"10.3390\/systems13050385","type":"journal-article","created":{"date-parts":[[2025,5,19]],"date-time":"2025-05-19T06:31:29Z","timestamp":1747636289000},"page":"385","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":1,"title":["Research on Mobile Agent Path Planning Based on Deep Reinforcement Learning"],"prefix":"10.3390","volume":"13","author":[{"given":"Shengwei","family":"Jin","sequence":"first","affiliation":[{"name":"College of Information Science and Engineering, Hunan Institute of Engineering, Xiangtan 411104, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2261-5973","authenticated-orcid":false,"given":"Xizheng","family":"Zhang","sequence":"additional","affiliation":[{"name":"College of Information Science and Engineering, Hunan Institute of Engineering, Xiangtan 411104, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ying","family":"Hu","sequence":"additional","affiliation":[{"name":"College of Information Science and Engineering, Hunan Institute of Engineering, Xiangtan 411104, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ruoyuan","family":"Liu","sequence":"additional","affiliation":[{"name":"College of Information Science and Engineering, Hunan Institute of Engineering, Xiangtan 411104, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Qing","family":"Wang","sequence":"additional","affiliation":[{"name":"College of Information Science and Engineering, Hunan Institute of Engineering, Xiangtan 411104, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Haihua","family":"He","sequence":"additional","affiliation":[{"name":"College of Information Science and Engineering, Hunan Institute of Engineering, Xiangtan 411104, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Junyu","family":"Liao","sequence":"additional","affiliation":[{"name":"College of Information Science and Engineering, Hunan Institute of Engineering, Xiangtan 411104, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Lijing","family":"Zeng","sequence":"additional","affiliation":[{"name":"College of Information Science and Engineering, Hunan Institute of Engineering, Xiangtan 411104, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2025,5,16]]},"reference":[{"key":"ref_1","first-page":"9","article-title":"A review of path planning algorithms for mobile robots","volume":"30","author":"Li","year":"2022","journal-title":"Comput. Meas. Control"},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Zhu, Z., Lyu, H., Zhang, J., and Yin, Y. (2021). An efficient ship automatic collision avoidance method based on modified artificial potential field. J. Mar. Sci. Eng., 10.","DOI":"10.3390\/jmse10010003"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"122228","DOI":"10.1016\/j.apenergy.2023.122228","article-title":"Learning-based model predictive energy management for fuel cell hybrid electric bus with health-aware control","volume":"355","author":"Jia","year":"2024","journal-title":"Appl. Energy"},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"119032","DOI":"10.1016\/j.enconman.2024.119032","article-title":"Deep reinforcement learning-based energy management strategy for fuel cell buses integrating future road information and cabin comfort control","volume":"321","author":"Jia","year":"2024","journal-title":"Energy Convers. Manag."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"821","DOI":"10.1016\/j.ijhydene.2023.05.311","article-title":"Energy sources durability energy management for fuel cell hybrid electric bus based on deep reinforcement learning considering future terrain information","volume":"52","author":"Li","year":"2024","journal-title":"Int. J. Hydrogen Energy"},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"59196","DOI":"10.1109\/ACCESS.2021.3070054","article-title":"Geometric A-star algorithm: An improved A-star algorithm for AGV path planning in a port environment","volume":"9","author":"Tang","year":"2021","journal-title":"IEEE Access"},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"107230","DOI":"10.1016\/j.cie.2021.107230","article-title":"Path planning optimization of indoor mobile robot based on adaptive ant colony algorithm","volume":"156","author":"Miao","year":"2021","journal-title":"Comput. Ind. Eng."},{"key":"ref_8","unstructured":"Chen, S., Zhao, C., Wang, C., and Zhengbing, Y. (2021, January 30). Reinforcement learning based multi-intelligence path planning algorithm. Proceedings of the 32nd Chinese Conference on Program Control, Taiyuan, China."},{"key":"ref_9","first-page":"1406","article-title":"Survey of deep reinforcement learning based on value function and policy gradient","volume":"42","author":"Liu","year":"2019","journal-title":"J. Comput."},{"key":"ref_10","first-page":"1","article-title":"A review of deep reinforcement learning","volume":"41","author":"Liu","year":"2018","journal-title":"Chin. J. Comput."},{"key":"ref_11","unstructured":"Mnih, V., Kavukcuoglu, K., Silver, D., Graves, A., Antonoglou, I., Wierstra, D., and Riedmiller, M. (2012, January 3\u20136). Playing Atari with Deep Reinforcement Learning. Proceedings of the Workshops at the 26th Neural Information Processing Systems, Lake Tahoe, NV, USA."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"279","DOI":"10.1007\/BF00992698","article-title":"Q-learning","volume":"8","author":"Watkins","year":"2012","journal-title":"Mach. Learn."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Van Hasselt, H., Guez, A., and Silver, D. (2016, January 12\u201317). Deep reinforcement learning with double Q-learning. Proceedings of the AAAI Conference on Artificial Intelligence, Philadelphia, PA, USA.","DOI":"10.1609\/aaai.v30i1.10295"},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Xin, J., Zhao, H., Liu, D., and Li, M. (2017, January 20\u201322). Application of deep reinforcement learning in mobile robot path planning. Proceedings of the Chinese Automation Congress, Jinan, China.","DOI":"10.1109\/CAC.2017.8244061"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Zhang, X., Shi, X., Zhang, Z., Wang, Z., and Zhang, L. (2022). A DDQN path planning algorithm based on experience classification and multi steps for mobile robots. Electronics, 11.","DOI":"10.3390\/electronics11142120"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Zhu, Z., Hu, C., Zhu, C., Zhu, Y., and Sheng, Y. (2021). An improved dueling deep double-Q network based on prioritized experience replay for path planning of unmanned surface vehicles. J. Mar. Sci. Eng., 9.","DOI":"10.3390\/jmse9111267"},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"228","DOI":"10.1016\/j.ijhydene.2025.02.302","article-title":"Remaining useful life prediction of PEMFC based on matrix long short-term memory","volume":"111","author":"Yi","year":"2025","journal-title":"Int. J. Hydrogen Energy"},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"4378","DOI":"10.1016\/j.jfranklin.2023.01.033","article-title":"Dynamic path planning of mobile robot based on improved simulated annealing algorithm","volume":"360","author":"Shi","year":"2023","journal-title":"J. Frankl. Inst."},{"key":"ref_19","first-page":"205","article-title":"Research on control strategy of agricultural robot","volume":"47","author":"Wang","year":"2025","journal-title":"J. Agric. Mech. Res."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"33","DOI":"10.1109\/TIV.2016.2578706","article-title":"A Survey of Motion Planning and Control Techniques for Self-Driving Urban Vehicles","volume":"1","author":"Pade","year":"2016","journal-title":"IEEE Trans. Intell. Veh."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Rajamani, R. (2011). Vehicle Dynamics and Control, Springer.","DOI":"10.1007\/978-1-4614-1433-9"},{"key":"ref_22","unstructured":"NHTSA (2016). Federal Automated Vehicles Policy: Accelerating the Next Revolution in Roadway Safety, Springer."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Levinson, J., Askeland, J., Becker, J., Dolson, J., Held, D., Kammel, S., Kolter, J.Z., Langer, D., Pink, O., and Pratt, V. (2011, January 5\u20139). Towards Fully Autonomous Driving: Systems and Algorithms. Proceedings of the 2011 IEEE Intelligent Vehicles Symposium (IV), Baden-Baden, Germany.","DOI":"10.1109\/IVS.2011.5940562"},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"485","DOI":"10.1177\/0278364909359210","article-title":"Path Planning for Autonomous Vehicles in Unknown Semi-structured Environments","volume":"29","author":"Dolgov","year":"2010","journal-title":"Int. J. Robot. Res."}],"container-title":["Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2079-8954\/13\/5\/385\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,9]],"date-time":"2025-10-09T17:34:13Z","timestamp":1760031253000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2079-8954\/13\/5\/385"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,5,16]]},"references-count":24,"journal-issue":{"issue":"5","published-online":{"date-parts":[[2025,5]]}},"alternative-id":["systems13050385"],"URL":"https:\/\/doi.org\/10.3390\/systems13050385","relation":{},"ISSN":["2079-8954"],"issn-type":[{"value":"2079-8954","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,5,16]]}}}