{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,25]],"date-time":"2026-06-25T16:42:11Z","timestamp":1782405731894,"version":"3.54.5"},"reference-count":25,"publisher":"World Scientific Pub Co Pte Ltd","issue":"06","funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["92267201"],"award-info":[{"award-number":["92267201"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["92367301"],"award-info":[{"award-number":["92367301"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100011259","name":"State Key Laboratory of Robotics","doi-asserted-by":"publisher","award":["2024-Z19"],"award-info":[{"award-number":["2024-Z19"]}],"id":[{"id":"10.13039\/501100011259","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100011259","name":"State Key Laboratory of Robotics","doi-asserted-by":"publisher","award":["2024-Z22"],"award-info":[{"award-number":["2024-Z22"]}],"id":[{"id":"10.13039\/501100011259","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Un. Sys."],"published-print":{"date-parts":[[2025,11]]},"abstract":"<jats:p>The automated guided vehicle (AGV) has been widely used in the realm of intelligent logistics, and path planning has become a key challenge in AGV research. In large and complex dynamic environments, multi-AGV unmanned systems have the problems of low search efficiency, slow convergence speed, and even impossible convergence. To accelerate the convergence of AGVs during the learning process, a new deep reinforcement learning method heuristic soft action-multi-agent twin delayed deep deterministic policy gradient (HA-MATD3) algorithm is proposed in this paper. Specifically, a dynamic reward function utilizing an artificial potential field method is introduced to score the actions of the AGVs, and the heuristic soft action and reward network are introduced to optimize the multi-agent twin delayed deep deterministic policy gradient (MATD3) algorithm. First, the AGV generates the ideal heuristic soft action through its state and target information, and the AGV can effectively solve the problem of low search efficiency through heuristic soft action learning. Furthermore, the reward network is used to judge the reward value of the action taken by the AGV, ensuring that the generated path is efficient, collision-free and safer. These improvements enrich the decision-making process and improve the adaptability and responsiveness of AGVs to various environmental conditions. Finally, experimental results demonstrate that the proposed HA-MATD3 algorithm is effective in solving the multi-AGV path planning problem in complex environments. This research contributes to the development of unmanned systems, especially in the multi-AGV path planning problem.<\/jats:p>","DOI":"10.1142\/s2301385025410067","type":"journal-article","created":{"date-parts":[[2025,3,13]],"date-time":"2025-03-13T06:01:47Z","timestamp":1741845707000},"page":"1531-1544","source":"Crossref","is-referenced-by-count":2,"title":["A Deep Reinforcement Learning Method for Multiple AGV Path Planning Based on MATD3 Algorithm"],"prefix":"10.1142","volume":"13","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-1094-9335","authenticated-orcid":false,"given":"Yukai","family":"Fu","sequence":"first","affiliation":[{"name":"State Key Laboratory of Robotics, Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang 110016, P. R. China"},{"name":"Key Laboratory of Networked Control Systems, Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang 110016, P. R. China"},{"name":"Institutes for Robotics and Intelligent Manufacturing, Chinese Academy of Sciences, Shenyang 110169, P. R. China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0009-1718-6298","authenticated-orcid":false,"given":"Ao","family":"Xu","sequence":"additional","affiliation":[{"name":"State Key Laboratory of Robotics, Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang 110016, P. R. China"},{"name":"Key Laboratory of Networked Control Systems, Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang 110016, P. R. China"},{"name":"Institutes for Robotics and Intelligent Manufacturing, Chinese Academy of Sciences, Shenyang 110169, P. R. China"},{"name":"School of Automation and Electrical Engineering, Shenyang Ligong University, Shenyang 110159, P. R. China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0000-4380-6379","authenticated-orcid":false,"given":"Yiyang","family":"Liu","sequence":"additional","affiliation":[{"name":"State Key Laboratory of Robotics, Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang 110016, P. R. China"},{"name":"Key Laboratory of Networked Control Systems, Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang 110016, P. R. China"},{"name":"Institutes for Robotics and Intelligent Manufacturing, Chinese Academy of Sciences, Shenyang 110169, P. R. China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Hongfei","family":"Bai","sequence":"additional","affiliation":[{"name":"State Key Laboratory of Robotics, Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang 110016, P. R. China"},{"name":"Key Laboratory of Networked Control Systems, Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang 110016, P. R. China"},{"name":"Institutes for Robotics and Intelligent Manufacturing, Chinese Academy of Sciences, Shenyang 110169, P. R. China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6148-1034","authenticated-orcid":false,"given":"Chao","family":"Deng","sequence":"additional","affiliation":[{"name":"Institute of Advanced Technology, Nanjing University of Posts and Telecommunications, Nanjing 210023, P. R. China"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"219","published-online":{"date-parts":[[2025,6,19]]},"reference":[{"key":"S2301385025410067BIB001","doi-asserted-by":"publisher","DOI":"10.1016\/j.jclepro.2016.10.057"},{"key":"S2301385025410067BIB002","doi-asserted-by":"publisher","DOI":"10.1007\/s43154-020-00007-4"},{"key":"S2301385025410067BIB003","doi-asserted-by":"publisher","DOI":"10.1088\/1742-6596\/1746\/1\/012052"},{"key":"S2301385025410067BIB004","doi-asserted-by":"publisher","DOI":"10.1109\/IMCEC46724.2019.8983841"},{"issue":"3","key":"S2301385025410067BIB005","first-page":"346","volume":"42","author":"Wang H.","year":"2020","journal-title":"Robot"},{"key":"S2301385025410067BIB006","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-41968-3_55"},{"key":"S2301385025410067BIB007","doi-asserted-by":"publisher","DOI":"10.1142\/S2301385017500042"},{"key":"S2301385025410067BIB008","doi-asserted-by":"publisher","DOI":"10.1142\/S2301385024500225"},{"issue":"2","key":"S2301385025410067BIB009","first-page":"132","volume":"27","author":"Qingbao Z.","year":"2005","journal-title":"Robot"},{"issue":"1","key":"S2301385025410067BIB010","first-page":"120","volume":"42","author":"Zhang Y.","year":"2020","journal-title":"Robot"},{"key":"S2301385025410067BIB011","doi-asserted-by":"publisher","DOI":"10.1142\/S230138502441005X"},{"key":"S2301385025410067BIB012","doi-asserted-by":"publisher","DOI":"10.1016\/j.procs.2018.01.113"},{"key":"S2301385025410067BIB013","doi-asserted-by":"publisher","DOI":"10.1016\/j.artint.2014.11.006"},{"key":"S2301385025410067BIB014","doi-asserted-by":"publisher","DOI":"10.1016\/j.oceaneng.2021.108709"},{"key":"S2301385025410067BIB015","doi-asserted-by":"publisher","DOI":"10.1142\/S0218001422520140"},{"key":"S2301385025410067BIB016","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2018.8461113"},{"key":"S2301385025410067BIB017","doi-asserted-by":"publisher","DOI":"10.1142\/S2301385025500669"},{"key":"S2301385025410067BIB018","doi-asserted-by":"publisher","DOI":"10.1142\/S2301385024420044"},{"key":"S2301385025410067BIB019","doi-asserted-by":"publisher","DOI":"10.1109\/NetCIT54147.2021.00090"},{"key":"S2301385025410067BIB020","doi-asserted-by":"publisher","DOI":"10.1109\/IROS45743.2020.9340876"},{"key":"S2301385025410067BIB021","doi-asserted-by":"publisher","DOI":"10.1109\/LRA.2019.2903261"},{"key":"S2301385025410067BIB022","doi-asserted-by":"publisher","DOI":"10.1109\/LRA.2021.3062803"},{"key":"S2301385025410067BIB023","doi-asserted-by":"publisher","DOI":"10.1007\/s10957-024-02453-y"},{"key":"S2301385025410067BIB026","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2022.117191"},{"key":"S2301385025410067BIB029","doi-asserted-by":"publisher","DOI":"10.3390\/drones6110365"}],"container-title":["Unmanned Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S2301385025410067","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,29]],"date-time":"2025-10-29T06:57:06Z","timestamp":1761721026000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/10.1142\/S2301385025410067"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,6,19]]},"references-count":25,"journal-issue":{"issue":"06","published-print":{"date-parts":[[2025,11]]}},"alternative-id":["10.1142\/S2301385025410067"],"URL":"https:\/\/doi.org\/10.1142\/s2301385025410067","relation":{},"ISSN":["2301-3850","2301-3869"],"issn-type":[{"value":"2301-3850","type":"print"},{"value":"2301-3869","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,6,19]]}}}