{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,7,23]],"date-time":"2026-07-23T15:39:07Z","timestamp":1784821147144,"version":"3.55.0"},"reference-count":31,"publisher":"MDPI AG","issue":"12","license":[{"start":{"date-parts":[[2024,6,16]],"date-time":"2024-06-16T00:00:00Z","timestamp":1718496000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Guangxi Key Research and Development Plan Project","award":["No. AB24010274"],"award-info":[{"award-number":["No. AB24010274"]}]},{"name":"Guangxi Key Research and Development Plan Project","award":["No. AD24010061"],"award-info":[{"award-number":["No. AD24010061"]}]}],"content-domain":{"domain":["www.mdpi.com"],"crossmark-restriction":true},"short-container-title":["Sensors"],"abstract":"<jats:p>In the domain of mobile robot navigation, conventional path-planning algorithms typically rely on predefined rules and prior map information, which exhibit significant limitations when confronting unknown, intricate environments. With the rapid evolution of artificial intelligence technology, deep reinforcement learning (DRL) algorithms have demonstrated considerable effectiveness across various application scenarios. In this investigation, we introduce a self-exploration and navigation approach based on a deep reinforcement learning framework, aimed at resolving the navigation challenges of mobile robots in unfamiliar environments. Firstly, we fuse data from the robot\u2019s onboard lidar sensors and camera and integrate odometer readings with target coordinates to establish the instantaneous state of the decision environment. Subsequently, a deep neural network processes these composite inputs to generate motion control strategies, which are then integrated into the local planning component of the robot\u2019s navigation stack. Finally, we employ an innovative heuristic function capable of synthesizing map information and global objectives to select the optimal local navigation points, thereby guiding the robot progressively toward its global target point. In practical experiments, our methodology demonstrates superior performance compared to similar navigation methods in complex, unknown environments devoid of predefined map information.<\/jats:p>","DOI":"10.3390\/s24123895","type":"journal-article","created":{"date-parts":[[2024,6,17]],"date-time":"2024-06-17T06:29:43Z","timestamp":1718605783000},"page":"3895","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":30,"title":["Autonomous Navigation by Mobile Robot with Sensor Fusion Based on Deep Reinforcement Learning"],"prefix":"10.3390","volume":"24","author":[{"ORCID":"https:\/\/orcid.org\/0009-0002-0838-0389","authenticated-orcid":false,"given":"Yang","family":"Ou","sequence":"first","affiliation":[{"name":"School of Computer and Electronic Information, Guangxi University, Nanning 530004, China"},{"name":"The Guangxi Key Laboratory of Multimedia Communications and Network Technology, Guangxi University, Nanning 530004, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4237-9330","authenticated-orcid":false,"given":"Yiyi","family":"Cai","sequence":"additional","affiliation":[{"name":"School of Computer and Electronic Information, Guangxi University, Nanning 530004, China"},{"name":"The Guangxi Key Laboratory of Multimedia Communications and Network Technology, Guangxi University, Nanning 530004, China"},{"name":"School of Electronic and Information Engineering, South China University of Technology, Guangzhou 510641, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Youming","family":"Sun","sequence":"additional","affiliation":[{"name":"School of Computer and Electronic Information, Guangxi University, Nanning 530004, China"},{"name":"The Guangxi Key Laboratory of Multimedia Communications and Network Technology, Guangxi University, Nanning 530004, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Tuanfa","family":"Qin","sequence":"additional","affiliation":[{"name":"School of Computer and Electronic Information, Guangxi University, Nanning 530004, China"},{"name":"The Guangxi Key Laboratory of Multimedia Communications and Network Technology, Guangxi University, Nanning 530004, China"},{"name":"School of Electronic and Information Engineering, South China University of Technology, Guangzhou 510641, China"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"1968","published-online":{"date-parts":[[2024,6,16]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Singandhupe, A., and La, H.M. (2019, January 25\u201327). A review of slam techniques and security in autonomous driving. Proceedings of the 2019 Third IEEE International Conference on Robotic Computing (IRC), Naples, Italy.","DOI":"10.1109\/IRC.2019.00122"},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"270","DOI":"10.1016\/j.comcom.2019.10.014","article-title":"Path planning techniques for unmanned aerial vehicles: A review, solutions, and challenges","volume":"149","author":"Aggarwal","year":"2020","journal-title":"Comput. Commun."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"75","DOI":"10.14257\/ijsh.2014.8.3.07","article-title":"A multiple mobile robots path planning algorithm based on A-star and Dijkstra algorithm","volume":"8","author":"Zhang","year":"2014","journal-title":"Int. J. Smart Home"},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Naderi, K., Rajam\u00e4ki, J., and H\u00e4m\u00e4l\u00e4inen, P. (2015, January 16\u201318). RT-RRT* a real-time path planning algorithm based on RRT. Proceedings of the 8th ACM SIGGRAPH Conference on Motion in Games, Paris, France.","DOI":"10.1145\/2822013.2822036"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"582","DOI":"10.1016\/j.dt.2019.04.011","article-title":"A review: On path planning strategies for navigation of mobile robot","volume":"15","author":"Patle","year":"2019","journal-title":"Def. Technol."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"486","DOI":"10.26599\/TST.2022.9010022","article-title":"Survey and Tutorial on Hybrid Human-Artificial Intelligence","volume":"28","author":"Shi","year":"2022","journal-title":"Tsinghua Sci. Technol."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"939","DOI":"10.26599\/TST.2021.9010048","article-title":"Optimizing the perceptual quality of time-domain speech enhancement with reinforcement learning","volume":"27","author":"Hao","year":"2022","journal-title":"Tsinghua Sci. Technol."},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Bouhamed, O., Ghazzai, H., Besbes, H., and Massoud, Y. (2020, January 12\u201314). Autonomous UAV navigation: A DDPG-based deep reinforcement learning approach. Proceedings of the 2020 IEEE International Symposium on Circuits and Systems (ISCAS), Seville, Spain.","DOI":"10.1109\/ISCAS45731.2020.9181245"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Gao, J., Ye, W., Guo, J., and Li, Z. (2020). Deep reinforcement learning for indoor mobile robot path planning. Sensors, 20.","DOI":"10.3390\/s20195493"},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"730","DOI":"10.1109\/LRA.2021.3133591","article-title":"Goal-driven autonomous exploration through deep reinforcement learning","volume":"7","author":"Cimurs","year":"2021","journal-title":"IEEE Robot. Autom. Lett."},{"key":"ref_11","first-page":"1726","article-title":"Intelligent Path Planning for Mobile Robots Based on SAC Algorithm","volume":"35","author":"Yang","year":"2023","journal-title":"J. Syst. Simul."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"773","DOI":"10.1007\/s11370-021-00398-z","article-title":"A survey on deep learning and deep reinforcement learning in robotics with a tutorial on deep reinforcement learning","volume":"14","author":"Morales","year":"2021","journal-title":"Intell. Serv. Robot."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Cimurs, R., Suh, I.H., and Lee, J.H. (2021, January 12\u201314). Information-based heuristics for learned goal-driven exploration and mapping. Proceedings of the 2021 18th International Conference on Ubiquitous Robots (UR), Gangneung, Republic of Korea.","DOI":"10.1109\/UR52253.2021.9494668"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"815218","DOI":"10.3389\/fpls.2022.815218","article-title":"Autonomous navigation system of greenhouse mobile robot based on 3D lidar and 2D lidar SLAM","volume":"13","author":"Jiang","year":"2022","journal-title":"Front. Plant Sci."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"4901","DOI":"10.1109\/JSEN.2020.2966034","article-title":"Fusion of 3D LIDAR and camera data for object detection in autonomous vehicle applications","volume":"20","author":"Zhao","year":"2020","journal-title":"IEEE Sens. J."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Gatesichapakorn, S., Takamatsu, J., and Ruchanurucks, M. (2019, January 16\u201318). ROS based autonomous mobile robot navigation using 2D LiDAR and RGB-D camera. Proceedings of the 2019 First International Symposium on Instrumentation, Control, Artificial Intelligence, and Robotics (ICA-SYMP), Bangkok, Thailand.","DOI":"10.1109\/ICA-SYMP.2019.8645984"},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"3803","DOI":"10.1109\/TNNLS.2019.2899311","article-title":"Kinodynamic motion planning with continuous-time Q-learning: An online, model-free, and safe navigation framework","volume":"30","author":"Kontoudis","year":"2019","journal-title":"IEEE Trans. Neural Netw. Learn. Syst."},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Marchesini, E., and Farinelli, A. (August, January 31). Discrete deep reinforcement learning for mapless navigation. Proceedings of the 2020 IEEE International Conference on Robotics and Automation (ICRA), Paris, France.","DOI":"10.1109\/ICRA40945.2020.9196739"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Dong, Y., and Zou, X. (2020, January 16\u201318). Mobile robot path planning based on improved DDPG reinforcement learning algorithm. Proceedings of the 2020 IEEE 11th International Conference on Software Engineering and Service Science (ICSESS), Beijing, China.","DOI":"10.1109\/ICSESS49938.2020.9237641"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Li, P., Wang, Y., and Gao, Z. (2022, January 7\u201310). Path planning of mobile robot based on improved td3 algorithm. Proceedings of the 2022 IEEE International Conference on Mechatronics and Automation (ICMA), Guilin, China.","DOI":"10.1109\/ICMA54519.2022.9856399"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Aman, M.S., Mahmud, M.A., Jiang, H., Abdelgawad, A., and Yelamarthi, K. (2016, January 19\u201321). A sensor fusion methodology for obstacle avoidance robot. Proceedings of the 2016 IEEE International Conference on Electro Information Technology (EIT), Grand Forks, ND, USA.","DOI":"10.1109\/EIT.2016.7535284"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Forouher, D., Besselmann, M.G., and Maehle, E. (2016, January 13\u201315). Sensor fusion of depth camera and ultrasound data for obstacle detection and robot navigation. Proceedings of the 2016 14th International Conference on Control, Automation, Robotics and vision (ICARCV), Phuket, Thailand.","DOI":"10.1109\/ICARCV.2016.7838832"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Zhang, B., and Zhang, J. (2021, January 28\u201331). Robot Mapping and Navigation System Based on Multi-sensor Fusion. Proceedings of the 2021 4th International Conference on Artificial Intelligence and Big Data (ICAIBD), Chengdu, China.","DOI":"10.1109\/ICAIBD51990.2021.9459053"},{"key":"ref_24","unstructured":"Theodoridou, C., Antonopoulos, D., Kargakos, A., Kostavelis, I., Giakoumis, D., and Tzovaras, D. (July, January 29). Robot Navigation in Human Populated Unknown Environments based on Visual-Laser Sensor Fusion. Proceedings of the 15th International Conference on PErvasive Technologies Related to Assistive Environments, Corfu, Greece."},{"key":"ref_25","unstructured":"Surmann, H., Jestel, C., Marchel, R., Musberg, F., Elhadj, H., and Ardani, M. (2020). Deep reinforcement learning for real autonomous mobile robot navigation in indoor environments. arXiv."},{"key":"ref_26","unstructured":"Haarnoja, T., Zhou, A., Hartikainen, K., Tucker, G., Ha, S., Tan, J., Kumar, V., Zhu, H., Gupta, A., and Abbeel, P. (2018). Soft actor-critic algorithms and applications. arXiv."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"69061","DOI":"10.1109\/ACCESS.2021.3076530","article-title":"Motion planning for mobile robots\u2014Focusing on deep reinforcement learning: A systematic review","volume":"9","author":"Sun","year":"2021","journal-title":"IEEE Access"},{"key":"ref_28","unstructured":"Haarnoja, T., Zhou, A., Abbeel, P., and Levine, S. (2018, January 10\u201315). Soft actor-critic: Off-policy maximum entropy deep reinforcement learning with a stochastic actor. Proceedings of the International Conference on Machine Learning, PMLR, Stockholm, Sweden."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"173","DOI":"10.1613\/jair.1.12440","article-title":"Reward machines: Exploiting reward function structure in reinforcement learning","volume":"73","author":"Icarte","year":"2022","journal-title":"J. Artif. Intell. Res."},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Durrant-Whyte, H., and Henderson, T.C. (2016). Multisensor data fusion. Springer Handbook of Robotics, Springer.","DOI":"10.1007\/978-3-319-32552-1_35"},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Zhu, Z., Zhang, Y., Chen, H., Dong, Y., Zhao, S., Ding, W., Zhong, J., and Zheng, S. (2023, January 17\u201324). Understanding the Robustness of 3D Object Detection With Bird\u2019s-Eye-View Representations in Autonomous Driving. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada.","DOI":"10.1109\/CVPR52729.2023.02069"}],"updated-by":[{"DOI":"10.3390\/s25092780","type":"correction","label":"Correction","source":"publisher","updated":{"date-parts":[[2024,6,16]],"date-time":"2024-06-16T00:00:00Z","timestamp":1718496000000}}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/24\/12\/3895\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,8,3]],"date-time":"2025-08-03T14:35:36Z","timestamp":1754231736000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/24\/12\/3895"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,6,16]]},"references-count":31,"journal-issue":{"issue":"12","published-online":{"date-parts":[[2024,6]]}},"alternative-id":["s24123895"],"URL":"https:\/\/doi.org\/10.3390\/s24123895","relation":{"correction":[{"id-type":"doi","id":"10.3390\/s25092780","asserted-by":"object"}]},"ISSN":["1424-8220"],"issn-type":[{"value":"1424-8220","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,6,16]]}}}