{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,19]],"date-time":"2026-05-19T12:42:40Z","timestamp":1779194560738,"version":"3.51.4"},"reference-count":132,"publisher":"Wiley","issue":"1","license":[{"start":{"date-parts":[[2024,12,24]],"date-time":"2024-12-24T00:00:00Z","timestamp":1734998400000},"content-version":"vor","delay-in-days":358,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["International Journal of Intelligent Systems"],"published-print":{"date-parts":[[2024,1]]},"abstract":"<jats:p>Effective motion planning is an indispensable prerequisite for the optimal performance of robotic manipulators in any task. In this regard, the research and application of reinforcement learning in robotic manipulators for motion planning have gained great relevance in recent years. The ability of reinforcement learning agents to adapt to variable environments, especially those featuring dynamic obstacles, has propelled their increasing application in this domain. Notwithstanding, a clear need remains for a resource that critically examines the progress, challenges, and future directions of this machine learning control technique in motion planning. This article undertakes a comprehensive review of the landscape of reinforcement learning, offering a retrospective analysis of its application in motion planning from 2018 to the present. The exploration extends to the trends associated with reinforcement learning in the context of serial manipulators and motion planning, as well as the various technological challenges currently presented by this machine learning control technique. The overarching objective of this review is to serve as a valuable resource for the robotics community, facilitating the ongoing development of systems controlled by reinforcement learning. By delving into the primary challenges intrinsic to this technology, the review seeks to enhance the understanding of reinforcement learning\u2019s role in motion planning and provides insights that may suggest future research directions in this domain.<\/jats:p>","DOI":"10.1155\/int\/1636497","type":"journal-article","created":{"date-parts":[[2024,12,24]],"date-time":"2024-12-24T14:17:06Z","timestamp":1735049826000},"source":"Crossref","is-referenced-by-count":10,"title":["A Review on Reinforcement Learning for Motion Planning of Robotic Manipulators"],"prefix":"10.1155","volume":"2024","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-6511-8542","authenticated-orcid":false,"given":"\u00cd\u00f1igo","family":"Elguea-Aguinaco","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9668-1346","authenticated-orcid":false,"given":"Ibai","family":"Inziarte-Hidalgo","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5960-4365","authenticated-orcid":false,"given":"Simon","family":"B\u00f8gh","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3305-8108","authenticated-orcid":false,"given":"Nestor","family":"Arana-Arexolaleiba","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"311","published-online":{"date-parts":[[2024,12,24]]},"reference":[{"key":"e_1_2_12_1_2","doi-asserted-by":"publisher","DOI":"10.1049\/csy2.12020"},{"key":"e_1_2_12_2_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10845-021-01867-z"},{"key":"e_1_2_12_3_2","doi-asserted-by":"publisher","DOI":"10.23919\/jsee.2023.000051"},{"key":"e_1_2_12_4_2","doi-asserted-by":"publisher","DOI":"10.1109\/CCTA.2017.8062637"},{"key":"e_1_2_12_5_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2017.7989463"},{"key":"e_1_2_12_6_2","doi-asserted-by":"crossref","unstructured":"WangY. YeX. YangY. andZhangW. Collision-free Trajectory Planning in Human-Robot Interaction through Hand Movement Prediction from Vision 2017 IEEE-RAS 17th International Conference on Humanoid Robotics (Humanoids) 2017 305\u2013310 https:\/\/doi.org\/10.1109\/HUMANOIDS.2017.8246890 2-s2.0-85044448781.","DOI":"10.1109\/HUMANOIDS.2017.8246890"},{"key":"e_1_2_12_7_2","doi-asserted-by":"publisher","DOI":"10.3390\/robotics10010022"},{"key":"e_1_2_12_8_2","doi-asserted-by":"publisher","DOI":"10.1177\/0278364913495721"},{"key":"e_1_2_12_9_2","doi-asserted-by":"crossref","unstructured":"JiangH. WangH. YauW.-Y. andWanK.-W. A Brief Survey: Deep Reinforcement Learning in Mobile Robot Navigation 2020 15th IEEE Conference on Industrial Electronics and Applications (ICIEA) 2020 https:\/\/doi.org\/10.1109\/ICIEA48937.2020.9248288.","DOI":"10.1109\/ICIEA48937.2020.9248288"},{"key":"e_1_2_12_10_2","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2021.3076530"},{"key":"e_1_2_12_11_2","doi-asserted-by":"publisher","DOI":"10.26599\/TST.2021.9010012"},{"key":"e_1_2_12_12_2","doi-asserted-by":"publisher","DOI":"10.3390\/app13148174"},{"key":"e_1_2_12_13_2","doi-asserted-by":"publisher","DOI":"10.3390\/electronics10090999"},{"key":"e_1_2_12_14_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.engappai.2022.105321"},{"key":"e_1_2_12_15_2","doi-asserted-by":"publisher","DOI":"10.1007\/s41315-023-00274-2"},{"key":"e_1_2_12_16_2","article-title":"Reinforcement Learning. An Introduction","author":"Sutton R. S.","year":"2018","journal-title":"Second Edi"},{"key":"e_1_2_12_17_2","doi-asserted-by":"publisher","DOI":"10.15302\/J-ENG-2015009"},{"key":"e_1_2_12_18_2","article-title":"Path Planning and Trajectory Planning Algorithms: A General Overview","volume":"29","author":"Gasparetto A.","year":"2015","journal-title":"Springer"},{"key":"e_1_2_12_19_2","first-page":"1071","article-title":"Learning Neural Network Policies with Guided Policy Search under Unknown Dynamics","volume":"2","author":"Levine S.","year":"2014","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_2_12_20_2","doi-asserted-by":"crossref","DOI":"10.1109\/ICRA.2019.8793572","volume-title":"State Estimation in Contact-Rich Manipulation","author":"Wirnshofer F.","year":"2019"},{"key":"e_1_2_12_21_2","article-title":"Probabilistic Model Learning and Long-Term Prediction Forcontact-Rich Manipulation Tasks","volume":"1","author":"Khader P.","year":"2019","journal-title":"CoRR"},{"key":"e_1_2_12_22_2","doi-asserted-by":"crossref","unstructured":"WeberJ.andSchmidtM. An Improved Approach for Inverse Kinematics and Motion Planning of an Industrial Robot Manipulator with Reinforcement Learning 2021 Fifth IEEE International Conference on Robotic Computing (IRC) 2021 10\u201317 https:\/\/doi.org\/10.1109\/IRC52146.2021.00009.","DOI":"10.1109\/IRC52146.2021.00009"},{"key":"e_1_2_12_23_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-981-13-2375-1_44"},{"key":"e_1_2_12_24_2","doi-asserted-by":"publisher","DOI":"10.1109\/IJCNN.2018.8489273"},{"key":"e_1_2_12_25_2","doi-asserted-by":"publisher","DOI":"10.23919\/CCC52363.2021.9550010"},{"key":"e_1_2_12_26_2","doi-asserted-by":"crossref","unstructured":"XinlanG. TaoL. andJianZ. Trajectory Planning and Obstacle Avoidance Behavior of Manipulator Based on Q-Learning Algorithm 2023 IEEE International Conference on Control Electronics and Computer Technology (ICCECT) 2023 1235\u20131241 https:\/\/doi.org\/10.1109\/ICCECT57938.2023.10140814.","DOI":"10.1109\/ICCECT57938.2023.10140814"},{"key":"e_1_2_12_27_2","doi-asserted-by":"crossref","unstructured":"RibeiroF. M.andPintoV. H. Reinforcement Learning Techniques Applied to the Motion Planning of a Robotic Manipulator 2022 IEEE International Conference on Autonomous Robot Systems and Competitions (ICARSC) 2022 173\u2013178 https:\/\/doi.org\/10.1109\/ICARSC55462.2022.9784814.","DOI":"10.1109\/ICARSC55462.2022.9784814"},{"key":"e_1_2_12_28_2","doi-asserted-by":"publisher","DOI":"10.1109\/CCDC55256.2022.10033563"},{"key":"e_1_2_12_29_2","doi-asserted-by":"publisher","DOI":"10.1007\/s12369-008-0001-3"},{"key":"e_1_2_12_30_2","doi-asserted-by":"publisher","DOI":"10.3390\/app10020575"},{"key":"e_1_2_12_31_2","doi-asserted-by":"publisher","DOI":"10.3390\/s23135974"},{"key":"e_1_2_12_32_2","first-page":"5049","article-title":"Hindsight Experience Replay","volume":"2017","author":"Andrychowicz M.","year":"2017","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_2_12_33_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.mechatronics.2021.102630"},{"key":"e_1_2_12_34_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA40945.2020.9197102"},{"key":"e_1_2_12_35_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICRAS.2019.8809005"},{"key":"e_1_2_12_36_2","doi-asserted-by":"crossref","unstructured":"HuangZ. ChenG. ShenY. LiuY. YouH. andLiT. LNOA: A Real-Time Obstacle Avoidance Motion Planning Method for Redundant Manipulator Based on Reinforcement Learning 2022 International Conference on Service Robotics (ICoSR) 2022 1\u20136 https:\/\/doi.org\/10.1109\/ICoSR57188.2022.00019.","DOI":"10.1109\/ICoSR57188.2022.00019"},{"key":"e_1_2_12_37_2","first-page":"5020","article-title":"Sub-goal Trees \u2013 A Framework for Goal-Based Reinforcement Learning","author":"Jurgenson T.","year":"2020","journal-title":"Proc. 37th Int. Conf. Mach. Learn."},{"key":"e_1_2_12_38_2","doi-asserted-by":"publisher","DOI":"10.3390\/electronics11213636"},{"key":"e_1_2_12_39_2","doi-asserted-by":"publisher","DOI":"10.1109\/Humanoids53995.2022.10000077"},{"key":"e_1_2_12_40_2","doi-asserted-by":"crossref","unstructured":"ChengX.andLiuS. Dynamic Obstacle Avoidance Algorithm for Robot Arm Based on Deep Reinforcement Learning 2022 IEEE 11th Data Driven Control and Learning Systems Conference (DDCLS) 2022 1136\u20131141 https:\/\/doi.org\/10.1109\/DDCLS55054.2022.9858561.","DOI":"10.1109\/DDCLS55054.2022.9858561"},{"key":"e_1_2_12_41_2","doi-asserted-by":"publisher","DOI":"10.1108\/IR-06-2021-0127"},{"key":"e_1_2_12_42_2","doi-asserted-by":"publisher","DOI":"10.3389\/fnbot.2022.883562"},{"key":"e_1_2_12_43_2","doi-asserted-by":"crossref","unstructured":"El-ShamoutyM. WuX. YangS. AlbusM. andHuberM. F. Towards Safe Human-Robot Collaboration Using Deep Reinforcement Learning 2020 IEEE International Conference on Robotics and Automation (ICRA) 2020 4899\u20134905 https:\/\/doi.org\/10.1109\/ICRA40945.2020.9196924.","DOI":"10.1109\/ICRA40945.2020.9196924"},{"key":"e_1_2_12_44_2","doi-asserted-by":"crossref","unstructured":"SangiovanniB. RendinielloA. IncremonaG. P. FerraraA. andPiastraM. Deep Reinforcement Learning for Collision Avoidance of Robotic Manipulators 2018 European Control Conference (ECC) 2018 2063\u20132068 https:\/\/doi.org\/10.23919\/ECC.2018.8550363 2-s2.0-85059803335.","DOI":"10.23919\/ECC.2018.8550363"},{"key":"e_1_2_12_45_2","doi-asserted-by":"publisher","DOI":"10.1109\/LCSYS.2020.3002852"},{"key":"e_1_2_12_46_2","unstructured":"XiongB. LiuQ. YaoB. LiuZ. andZhouZ. Deep Reinforcement Learning-Based Safe Interaction for Industrial Human-Robot Collaboration 49th International Conference on Computers & Industrial Engineering 2019 1\u201313."},{"key":"e_1_2_12_47_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.aei.2021.101360"},{"key":"e_1_2_12_48_2","doi-asserted-by":"publisher","DOI":"10.1109\/SMC42975.2020.9283018"},{"key":"e_1_2_12_49_2","doi-asserted-by":"publisher","DOI":"10.3390\/app12199837"},{"key":"e_1_2_12_50_2","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2021.3056903"},{"key":"e_1_2_12_51_2","doi-asserted-by":"publisher","DOI":"10.1109\/LRA.2021.3088091"},{"key":"e_1_2_12_52_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2022.05.006"},{"key":"e_1_2_12_53_2","unstructured":"SchaulT. QuanJ. AntonoglouI. andSilverD. Prioritized Experience Replay 2015."},{"key":"e_1_2_12_54_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.rcim.2019.101863"},{"key":"e_1_2_12_55_2","doi-asserted-by":"publisher","DOI":"10.1109\/access.2021.3073711"},{"key":"e_1_2_12_56_2","doi-asserted-by":"crossref","first-page":"301","DOI":"10.1007\/978-981-16-2336-3_27","volume-title":"Cognitive Systems and Signal Processing","author":"Shen Y.","year":"2021"},{"key":"e_1_2_12_57_2","doi-asserted-by":"crossref","unstructured":"AkinolaI. WangZ. andAllenP. CLAMGen: Closed-Loop Arm Motion Generation via Multi-View Vision-Based RL 2021 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS) 2021 2376\u20132382 https:\/\/doi.org\/10.1109\/IROS51168.2021.9636369.","DOI":"10.1109\/IROS51168.2021.9636369"},{"key":"e_1_2_12_58_2","doi-asserted-by":"publisher","DOI":"10.3390\/app11156770"},{"key":"e_1_2_12_59_2","doi-asserted-by":"publisher","DOI":"10.3390\/s22051697"},{"key":"e_1_2_12_60_2","doi-asserted-by":"crossref","unstructured":"ZhouD. JiaR. andYaoH. Robotic Arm Motion Planning Based on Curriculum Reinforcement Learning 2021 6th International Conference on Control and Robotics Engineering (ICCRE) 2021 44\u201349 https:\/\/doi.org\/10.1109\/ICCRE51898.2021.9435700.","DOI":"10.1109\/ICCRE51898.2021.9435700"},{"key":"e_1_2_12_61_2","doi-asserted-by":"publisher","DOI":"10.15607\/RSS.2018.XIV.048"},{"key":"e_1_2_12_62_2","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2019.2932257"},{"key":"e_1_2_12_63_2","doi-asserted-by":"publisher","DOI":"10.23919\/CCC55666.2022.9902722"},{"key":"e_1_2_12_64_2","doi-asserted-by":"publisher","DOI":"10.1109\/TSMC.2022.3228901"},{"key":"e_1_2_12_65_2","doi-asserted-by":"crossref","unstructured":"BhuiyanT. KastnerL. HuY. KutschankB. andLambrechtJ. Deep-Reinforcement-Learning-Based Path Planning for Industrial Robots Using Distance Sensors as Observation 2023 8th International Conference on Control and Robotics Engineering (ICCRE) 2023 204\u2013210 https:\/\/doi.org\/10.1109\/ICCRE57112.2023.10155608.","DOI":"10.1109\/ICCRE57112.2023.10155608"},{"key":"e_1_2_12_66_2","doi-asserted-by":"publisher","DOI":"10.3390\/s20205911"},{"key":"e_1_2_12_67_2","doi-asserted-by":"publisher","DOI":"10.3390\/app11062587"},{"key":"e_1_2_12_68_2","doi-asserted-by":"crossref","unstructured":"QiaoD. ZhongZ. ZhangH. andZhaoY. Trajectory Planning of Manipulator Based on DQN Algorithm Guided by MPC Sampling 2021 3rd International Symposium on Robotics & Intelligent Manufacturing Technology (ISRIMT) 2021 319\u2013323 https:\/\/doi.org\/10.1109\/ISRIMT53730.2021.9597010.","DOI":"10.1109\/ISRIMT53730.2021.9597010"},{"key":"e_1_2_12_69_2","doi-asserted-by":"publisher","DOI":"10.1109\/SMC53654.2022.9945504"},{"key":"e_1_2_12_70_2","doi-asserted-by":"publisher","DOI":"10.1016\/0005-1098(89)90002-2"},{"key":"e_1_2_12_71_2","doi-asserted-by":"publisher","DOI":"10.3389\/fenrg.2022.957869"},{"key":"e_1_2_12_72_2","doi-asserted-by":"publisher","DOI":"10.1109\/icnn.1995.488968"},{"key":"e_1_2_12_73_2","doi-asserted-by":"crossref","unstructured":"VahrensL. \u00c1lvarezD. D. BergerU. andB\u00f8ghS. Learning Task-independent Joint Control for Robotic Manipulators with Reinforcement Learning and Curriculum Learning 2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA 2022) 2022 1250\u20131257 https:\/\/doi.org\/10.1109\/ICMLA55696.2022.00201.","DOI":"10.1109\/ICMLA55696.2022.00201"},{"key":"e_1_2_12_74_2","doi-asserted-by":"publisher","DOI":"10.1109\/ISIE51358.2023.10227978"},{"key":"e_1_2_12_75_2","doi-asserted-by":"publisher","DOI":"10.1109\/LRA.2021.3116700"},{"key":"e_1_2_12_76_2","doi-asserted-by":"crossref","unstructured":"KamaliK. BonevI. A. andDesrosiersC. Real-time Motion Planning for Robotic Teleoperation Using Dynamic-Goal Deep Reinforcement Learning 2020 17th Conference on Computer and Robot Vision (CRV) 2020 182\u2013189 https:\/\/doi.org\/10.1109\/CRV50864.2020.00032.","DOI":"10.1109\/CRV50864.2020.00032"},{"key":"e_1_2_12_77_2","doi-asserted-by":"publisher","DOI":"10.23919\/ICCAS52745.2021.9649802"},{"key":"e_1_2_12_78_2","doi-asserted-by":"publisher","DOI":"10.1109\/IROS40897.2019.8968452"},{"key":"e_1_2_12_79_2","doi-asserted-by":"crossref","unstructured":"GolluccioG. Di VitoD. MarinoA. BriaA. andAntonelliG. Task-motion Planning via Tree-Based Q-Learning Approach for Robotic Object Displacement in Cluttered Spaces Proceedings of the 18th International Conference on Informatics in Control Automation and Robotics 2021 130\u2013137 https:\/\/doi.org\/10.5220\/0010542601300137.","DOI":"10.5220\/0010542600002994"},{"key":"e_1_2_12_80_2","doi-asserted-by":"publisher","DOI":"10.1109\/ROBIO.2018.8665248"},{"key":"e_1_2_12_81_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-031-18326-3_18"},{"key":"e_1_2_12_82_2","doi-asserted-by":"crossref","unstructured":"HeatonJ.andGivigiS. A Deep Reinforcement Learning Solution for the Low Level Motion Control of a Robot Manipulator System 2023 IEEE International Systems Conference (SysCon) 2023 https:\/\/doi.org\/10.1109\/SysCon53073.2023.10131174.","DOI":"10.1109\/SysCon53073.2023.10131174"},{"key":"e_1_2_12_83_2","doi-asserted-by":"crossref","unstructured":"XieS. GongL. ChenZ. andChenB. Simulation of Real-Time Collision-free Path Planning Method with Deep Policy Network in Human-Robot Interaction Scenario 2023 International Conference on Advanced Robotics and Mechatronics (ICARM) 2023 360\u2013365 https:\/\/doi.org\/10.1109\/ICARM58088.2023.10218854.","DOI":"10.1109\/ICARM58088.2023.10218854"},{"key":"e_1_2_12_84_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.01225"},{"key":"e_1_2_12_85_2","doi-asserted-by":"crossref","unstructured":"NicolaG.andGhidoniS. Deep Reinforcement Learning for Motion Planning in Human Robot Cooperative Scenarios 2021 26th IEEE International Conference on Emerging Technologies and Factory Automation (ETFA) 2021 https:\/\/doi.org\/10.1109\/ETFA45728.2021.9613505.","DOI":"10.1109\/ETFA45728.2021.9613505"},{"key":"e_1_2_12_86_2","doi-asserted-by":"publisher","DOI":"10.1109\/CASE49997.2022.9926592"},{"key":"e_1_2_12_87_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICAC55051.2022.9911177"},{"key":"e_1_2_12_88_2","doi-asserted-by":"publisher","DOI":"10.1007\/s40747-021-00366-1"},{"key":"e_1_2_12_89_2","doi-asserted-by":"publisher","DOI":"10.3390\/robotics12020061"},{"key":"e_1_2_12_90_2","doi-asserted-by":"crossref","unstructured":"StrudelR. PashevichA. KalevatykhI. LaptevI. SivicJ. andSchmidC. Learning to Combine Primitive Skills: A Step towards Versatile Robotic Manipulation Proceedings of International Conference on Robotics and Automation 2020 4637\u20134643 https:\/\/doi.org\/10.1109\/ICRA40945.2020.9196619.","DOI":"10.1109\/ICRA40945.2020.9196619"},{"key":"e_1_2_12_91_2","article-title":"Dynamic Trajectory Planning of a 7-DOF Surgical Robot Based on HER-DDPG Algorithm","volume":"85598","author":"Hou Q.","year":"2021","journal-title":"ASME International Mechanical Engineering Congress and Exposition"},{"key":"e_1_2_12_92_2","doi-asserted-by":"publisher","DOI":"10.1007\/s40747-021-00499-3"},{"key":"e_1_2_12_93_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.procs.2022.09.292"},{"key":"e_1_2_12_94_2","doi-asserted-by":"publisher","DOI":"10.3390\/app122211610"},{"key":"e_1_2_12_95_2","doi-asserted-by":"publisher","DOI":"10.1002\/adc2.79"},{"key":"e_1_2_12_96_2","doi-asserted-by":"publisher","DOI":"10.3390\/app11041816"},{"key":"e_1_2_12_97_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10845-020-01582-1"},{"key":"e_1_2_12_98_2","unstructured":"WarrenC. W. Global Path Planning Using Artificial Potential Fields IEEE International Conference on Robotics and Automation 1989 316\u2013317."},{"key":"e_1_2_12_99_2","doi-asserted-by":"publisher","DOI":"10.1007\/s41315-021-00172-5"},{"key":"e_1_2_12_100_2","doi-asserted-by":"publisher","DOI":"10.1108\/IR-09-2021-0194"},{"key":"e_1_2_12_101_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10846-023-01822-5"},{"key":"e_1_2_12_102_2","doi-asserted-by":"publisher","DOI":"10.1109\/TNNLS.2023.3262109"},{"key":"e_1_2_12_103_2","first-page":"303","article-title":"Rapidly-Exploring Random Trees: Progress and Prospects","author":"LaValle S. M.","year":"2001","journal-title":"Algorithmic and Computational Robotics"},{"key":"e_1_2_12_104_2","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2018.2872693"},{"key":"e_1_2_12_105_2","doi-asserted-by":"publisher","DOI":"10.1109\/CASE49997.2022.9926603"},{"key":"e_1_2_12_106_2","doi-asserted-by":"publisher","DOI":"10.1109\/TII.2021.3125447"},{"key":"e_1_2_12_107_2","doi-asserted-by":"crossref","unstructured":"ZhouD. JiaR. YaoH. andXieM. Robotic Arm Motion Planning Based on Residual Reinforcement Learning 2021 13th International Conference on Computer and Automation Engineering (ICCAE) 2021 89\u201394 https:\/\/doi.org\/10.1109\/ICCAE51876.2021.9426160.","DOI":"10.1109\/ICCAE51876.2021.9426160"},{"key":"e_1_2_12_108_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICUS55513.2022.9986816"},{"key":"e_1_2_12_109_2","unstructured":"YamadaJ. LeeY. SalhotraG.et al. Motion Planner Augmented Reinforcement Learning for Robot Manipulation in Obstructed Environments 2020 1\u201315 http:\/\/arxiv.org\/abs\/2010.11940."},{"key":"e_1_2_12_110_2","doi-asserted-by":"crossref","unstructured":"OtaK. JhaD. K. OikiT.et al. Trajectory Optimization for Unknown Constrained Systems Using Reinforcement Learning 2019 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS) 2019 3487\u20133494 https:\/\/doi.org\/10.1109\/IROS40897.2019.8968010.","DOI":"10.1109\/IROS40897.2019.8968010"},{"key":"e_1_2_12_111_2","doi-asserted-by":"publisher","DOI":"10.1109\/CDC45484.2021.9683056"},{"key":"e_1_2_12_112_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.rcim.2022.102517"},{"key":"e_1_2_12_113_2","volume-title":"Planning with Goal-Conditioned Policies","author":"Nasiriany S.","year":"2019"},{"key":"e_1_2_12_114_2","doi-asserted-by":"publisher","DOI":"10.1145\/3453160"},{"key":"e_1_2_12_115_2","first-page":"2976","article-title":"Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor","volume":"5","author":"Haarnoja T.","year":"2018","journal-title":"35th International Conference on Machine Learning"},{"key":"e_1_2_12_116_2","unstructured":"LillicrapT. P. HuntJ. J. PritzelA.et al. Continuous Control with Deep Reinforcement Learning 2016 https:\/\/arxiv.org\/abs\/1509.02971."},{"key":"e_1_2_12_117_2","unstructured":"SchulmanJ. WolskiF. DhariwalP. RadfordA. andKlimovO. Proximal Policy Optimization Algorithms 2017 1\u201312 http:\/\/arxiv.org\/abs\/1707.06347."},{"key":"e_1_2_12_118_2","unstructured":"RayA. AchiamJ. andAmodeiD. Benchmarking Safe Exploration in Deep Reinforcement Learning 2019."},{"key":"e_1_2_12_119_2","doi-asserted-by":"publisher","DOI":"10.1146\/annurev-control-042920-020211"},{"key":"e_1_2_12_120_2","first-page":"8092","article-title":"A Lyapunov-Based Approach to Safe Reinforcement Learning","volume":"2018","author":"Chow Y.","year":"2018","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_2_12_121_2","unstructured":"DalalG. DvijothamK. VecerikM. HesterT. PaduraruC. andTassaY. Safe Exploration in Continuous Action Spaces 2018."},{"key":"e_1_2_12_122_2","first-page":"1","article-title":"Fear Field: Adaptive Constraints for Safe Environment Transitions in Shielded Reinforcement Learning","volume":"3505","author":"Odriozola-Olalde H.","year":"2023","journal-title":"CEUR Workshop Proceedings"},{"key":"e_1_2_12_123_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA46639.2022.9811698"},{"key":"e_1_2_12_124_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.iswa.2022.200105"},{"key":"e_1_2_12_125_2","doi-asserted-by":"publisher","DOI":"10.1145\/3544585.3544606"},{"key":"e_1_2_12_126_2","article-title":"ISO 10218-1. Robots and Robotic Devices - Safety Requirements for Industrial Robots","author":"International Organization of Standardization","year":"2014","journal-title":"Part 1: Robot"},{"key":"e_1_2_12_127_2","article-title":"ISO 10218-2. Robots and Robotic Devices - Safety Requirements for Industrial Robots","author":"International Organization of Standardization","year":"2016","journal-title":"Part 2: Robot systems and integration"},{"key":"e_1_2_12_128_2","article-title":"ISO\/TS 15066","author":"International Organization of Standardization","year":"2016","journal-title":"Robots and Robotic Devices - Collaborative Robots"},{"key":"e_1_2_12_129_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.ijhcs.2021.102744"},{"key":"e_1_2_12_130_2","doi-asserted-by":"publisher","DOI":"10.1109\/ROMAN.2011.6005223"},{"key":"e_1_2_12_131_2","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2020.3006254"},{"key":"e_1_2_12_132_2","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2021.3068769"}],"container-title":["International Journal of Intelligent Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/pdf\/10.1155\/int\/1636497","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,12,24]],"date-time":"2024-12-24T14:17:34Z","timestamp":1735049854000},"score":1,"resource":{"primary":{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/10.1155\/int\/1636497"}},"subtitle":[],"editor":[{"given":"Mohamadreza (Mohammad)","family":"Khosravi","sequence":"additional","affiliation":[],"role":[{"role":"editor","vocabulary":"crossref"}]}],"short-title":[],"issued":{"date-parts":[[2024,1]]},"references-count":132,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2024,1]]}},"alternative-id":["10.1155\/int\/1636497"],"URL":"https:\/\/doi.org\/10.1155\/int\/1636497","archive":["Portico"],"relation":{},"ISSN":["0884-8173","1098-111X"],"issn-type":[{"value":"0884-8173","type":"print"},{"value":"1098-111X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,1]]},"article-number":"1636497"}}