{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,15]],"date-time":"2026-01-15T12:51:15Z","timestamp":1768481475417,"version":"3.49.0"},"reference-count":46,"publisher":"World Scientific Pub Co Pte Ltd","issue":"07","funder":[{"name":"young and middle-aged teachers in Fujian Province","award":["JAT210674"],"award-info":[{"award-number":["JAT210674"]}]},{"name":"Department of Education of Fujian Province","award":["JAT220194"],"award-info":[{"award-number":["JAT220194"]}]},{"name":"Department of Education of Fujian Province","award":["JAT210232"],"award-info":[{"award-number":["JAT210232"]}]},{"name":"National Natural Science Foundation Cultivation Program of Jimei University","award":["ZP2020036"],"award-info":[{"award-number":["ZP2020036"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Int. J. Artif. Intell. Tools"],"published-print":{"date-parts":[[2024,11]]},"abstract":"<jats:p> The path planning of mobile robots helps robots to perceive environment using the information obtained from sensors and plan a route to reach the target. With the increasing difficulty of task, the environment the mobile robots face becomes more and more complex. Traditional path planning methods can no longer meet the requirements of mobile robot navigation in complex environment. Deep reinforcement learning (DRL) is introduced into robot navigation However, it may be time-consuming to train DRL model when the environment is very complex and the existing environment may differ from the unknown environment. In order to handle the robot navigation in heterogeneous environment, this paper utilizes deep transfer reinforcement learning (DTRL) for mobile robot path planning. Compared with DRL, DTRL does not require the distribution of the existing environment is the same as that of the unknown environment. Additionally, DTRL can transfer the knowledge of existing model to new scenario to reduce the training time. The simulations show that DTRL can reach higher success rate than DRL for heterogeneous environment robot navigation. By using local policy, it costs less time to train DTRL than DRL for a complex environment and DTRL can consume less navigation time. <\/jats:p>","DOI":"10.1142\/s0218213024400050","type":"journal-article","created":{"date-parts":[[2024,7,12]],"date-time":"2024-07-12T14:58:40Z","timestamp":1720796320000},"source":"Crossref","is-referenced-by-count":3,"title":["Path Planning for Mobile Robots Using Transfer Reinforcement Learning"],"prefix":"10.1142","volume":"33","author":[{"ORCID":"https:\/\/orcid.org\/0009-0007-3541-2418","authenticated-orcid":false,"given":"Xinwang","family":"Zheng","sequence":"first","affiliation":[{"name":"Chengyi College, Jimei University, Xiamen, 361021 Fujian, P. R. China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2749-419X","authenticated-orcid":false,"given":"Wenjie","family":"Zheng","sequence":"additional","affiliation":[{"name":"School of Ocean Information Engineering, Jimei University, Xiamen, 361021 Fujian, P. R. China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2242-9476","authenticated-orcid":false,"given":"Yong","family":"Du","sequence":"additional","affiliation":[{"name":"School of Ocean Information Engineering, Jimei University, Xiamen, 361021 Fujian, P. R. China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5400-4395","authenticated-orcid":false,"given":"Tiejun","family":"Li","sequence":"additional","affiliation":[{"name":"School of Ocean Information Engineering, Jimei University, Xiamen, 361021 Fujian, P. R. China"}]},{"ORCID":"https:\/\/orcid.org\/0009-0006-0755-3304","authenticated-orcid":false,"given":"Zhansheng","family":"Yuan","sequence":"additional","affiliation":[{"name":"School of Ocean Information Engineering, Jimei University, Xiamen, 361021 Fujian, P. R. China"}]}],"member":"219","published-online":{"date-parts":[[2024,9,14]]},"reference":[{"key":"S0218213024400050BIB001","doi-asserted-by":"publisher","DOI":"10.3390\/app12010135"},{"key":"S0218213024400050BIB002","doi-asserted-by":"publisher","DOI":"10.1109\/MRA.2010.937861"},{"key":"S0218213024400050BIB003","doi-asserted-by":"publisher","DOI":"10.1016\/j.jmsy.2023.05.026"},{"key":"S0218213024400050BIB004","doi-asserted-by":"publisher","DOI":"10.1016\/j.measurement.2023.112821"},{"key":"S0218213024400050BIB005","doi-asserted-by":"publisher","DOI":"10.1109\/HPCC\/SmartCity\/DSS.2018.00243"},{"key":"S0218213024400050BIB006","doi-asserted-by":"publisher","DOI":"10.1016\/j.engappai.2023.107798"},{"key":"S0218213024400050BIB007","doi-asserted-by":"publisher","DOI":"10.1016\/j.dt.2019.04.011"},{"key":"S0218213024400050BIB008","doi-asserted-by":"publisher","DOI":"10.5897\/IJPS11.1745"},{"key":"S0218213024400050BIB009","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2023.120254"},{"issue":"2","key":"S0218213024400050BIB010","first-page":"151","volume":"5","author":"Buniyamin N.","year":"2011","journal-title":"Int. J. Syst. Appl. Eng. Dev."},{"key":"S0218213024400050BIB011","doi-asserted-by":"publisher","DOI":"10.1007\/978-981-15-4095-0_2"},{"key":"S0218213024400050BIB012","doi-asserted-by":"publisher","DOI":"10.1007\/s11432-011-4332-6"},{"key":"S0218213024400050BIB013","volume-title":"Adaptive Control Processes: A Guided Tour","author":"Hammer P. C.","year":"1962"},{"key":"S0218213024400050BIB014","doi-asserted-by":"publisher","DOI":"10.26599\/TST.2021.9010012"},{"key":"S0218213024400050BIB015","doi-asserted-by":"publisher","DOI":"10.1080\/0305215X.2022.2104840"},{"key":"S0218213024400050BIB016","doi-asserted-by":"publisher","DOI":"10.1109\/TSSC.1968.300136"},{"key":"S0218213024400050BIB017","doi-asserted-by":"publisher","DOI":"10.1109\/ITAIC58329.2023.10408799"},{"key":"S0218213024400050BIB018","doi-asserted-by":"publisher","DOI":"10.1145\/3544585.3544600"},{"key":"S0218213024400050BIB019","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2023.3302698"},{"key":"S0218213024400050BIB020","doi-asserted-by":"publisher","DOI":"10.1007\/BF01840369"},{"key":"S0218213024400050BIB021","doi-asserted-by":"publisher","DOI":"10.1109\/ICIEA.2018.8397948"},{"key":"S0218213024400050BIB022","doi-asserted-by":"publisher","DOI":"10.1177\/027836498600500106"},{"key":"S0218213024400050BIB023","doi-asserted-by":"publisher","DOI":"10.1017\/S0263574723001546"},{"key":"S0218213024400050BIB024","doi-asserted-by":"publisher","DOI":"10.1177\/09544062241230171"},{"key":"S0218213024400050BIB025","doi-asserted-by":"publisher","DOI":"10.1109\/ICIINFS.2010.5578632"},{"key":"S0218213024400050BIB026","doi-asserted-by":"publisher","DOI":"10.1109\/CEC.2016.7744104"},{"key":"S0218213024400050BIB027","doi-asserted-by":"publisher","DOI":"10.1109\/JIOT.2024.3379361"},{"key":"S0218213024400050BIB028","doi-asserted-by":"publisher","DOI":"10.1016\/j.artint.2015.04.001"},{"key":"S0218213024400050BIB029","doi-asserted-by":"publisher","DOI":"10.1016\/j.cirp.2017.04.095"},{"key":"S0218213024400050BIB030","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2016.08.108"},{"key":"S0218213024400050BIB031","doi-asserted-by":"publisher","DOI":"10.1109\/IAEAC.2018.8577599"},{"key":"S0218213024400050BIB032","doi-asserted-by":"publisher","DOI":"10.1177\/0278364915619772"},{"key":"S0218213024400050BIB035","doi-asserted-by":"publisher","DOI":"10.1038\/nature14236"},{"key":"S0218213024400050BIB036","doi-asserted-by":"publisher","DOI":"10.1109\/CAC.2017.8244061"},{"key":"S0218213024400050BIB037","volume-title":"Advances in Neural Information Processing Systems","volume":"18","author":"Muller U.","year":"2005"},{"key":"S0218213024400050BIB038","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2017.7989182"},{"key":"S0218213024400050BIB039","doi-asserted-by":"publisher","DOI":"10.1109\/IROS.2017.8202134"},{"key":"S0218213024400050BIB040","first-page":"42","volume-title":"2018 Int. Conf. Big Data and Artificial Intelligence (BDAI)","author":"Yan T.","year":"2018"},{"key":"S0218213024400050BIB041","doi-asserted-by":"publisher","DOI":"10.1007\/s10462-023-10547-8"},{"key":"S0218213024400050BIB042","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2024.123673"},{"key":"S0218213024400050BIB043","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01424-7_27"},{"key":"S0218213024400050BIB044","doi-asserted-by":"publisher","DOI":"10.1109\/TITS.2022.3161939"},{"key":"S0218213024400050BIB045","doi-asserted-by":"publisher","DOI":"10.1016\/j.conbuildmat.2017.09.110"},{"key":"S0218213024400050BIB046","doi-asserted-by":"publisher","DOI":"10.1088\/1361-6501\/ac57ef"},{"key":"S0218213024400050BIB047","doi-asserted-by":"publisher","DOI":"10.1007\/s10489-022-04030-0"},{"key":"S0218213024400050BIB048","doi-asserted-by":"publisher","DOI":"10.1109\/ITOEC53115.2022.9734653"}],"container-title":["International Journal on Artificial Intelligence Tools"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S0218213024400050","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,12,18]],"date-time":"2024-12-18T01:15:31Z","timestamp":1734484531000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/10.1142\/S0218213024400050"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,9,14]]},"references-count":46,"journal-issue":{"issue":"07","published-print":{"date-parts":[[2024,11]]}},"alternative-id":["10.1142\/S0218213024400050"],"URL":"https:\/\/doi.org\/10.1142\/s0218213024400050","relation":{},"ISSN":["0218-2130","1793-6349"],"issn-type":[{"value":"0218-2130","type":"print"},{"value":"1793-6349","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,9,14]]},"article-number":"2440005"}}