{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,1]],"date-time":"2025-11-01T05:46:41Z","timestamp":1761976001713,"version":"build-2065373602"},"reference-count":60,"publisher":"MDPI AG","issue":"14","license":[{"start":{"date-parts":[[2022,7,10]],"date-time":"2022-07-10T00:00:00Z","timestamp":1657411200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>The ability to navigate unstructured environments is an essential task for intelligent systems. Autonomous navigation by ground vehicles requires developing an internal representation of space, trained by recognizable landmarks, robust visual processing, computer vision and image processing. A mobile robot needs a platform enabling it to operate in an environment autonomously, recognize the objects, and avoid obstacles in its path. In this study, an open-source ground robot called SROBO was designed to accurately identify its position and navigate certain areas using a deep convolutional neural network and transfer learning. The framework uses an RGB-D MYNTEYE camera, a 2D laser scanner and inertial measurement units (IMU) operating through an embedded system capable of deep learning. The real-time decision-making process and experiments were conducted while the onboard signal processing and image capturing system enabled continuous information analysis. State-of-the-art Real-Time Graph-Based SLAM (RTAB-Map) was adopted to create a map of indoor environments while benefiting from deep convolutional neural network (Deep-CNN) capability. Enforcing Deep-CNN improved the performance quality of the RTAB-Map SLAM. The proposed setting equipped the robot with more insight into its surroundings. The robustness of the SROBO increased by 35% using the proposed system compared to the conventional RTAB-Map SLAM.<\/jats:p>","DOI":"10.3390\/rs14143324","type":"journal-article","created":{"date-parts":[[2022,7,11]],"date-time":"2022-07-11T00:06:21Z","timestamp":1657497981000},"page":"3324","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":18,"title":["The Deep Convolutional Neural Network Role in the Autonomous Navigation of Mobile Robots (SROBO)"],"prefix":"10.3390","volume":"14","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-1443-0330","authenticated-orcid":false,"given":"Shabnam","family":"Sadeghi Esfahlani","sequence":"first","affiliation":[{"name":"Medical Technology Research Centre (MTRC), School of Engineering and the Built Environment, Anglia Ruskin University, Essex CM1 1SQ, UK"}]},{"given":"Alireza","family":"Sanaei","sequence":"additional","affiliation":[{"name":"School of Engineering and the Built Environment, Anglia Ruskin University, Essex CM1 1SQ, UK"}]},{"given":"Mohammad","family":"Ghorabian","sequence":"additional","affiliation":[{"name":"School of Engineering and the Built Environment, Anglia Ruskin University, Essex CM1 1SQ, UK"}]},{"given":"Hassan","family":"Shirvani","sequence":"additional","affiliation":[{"name":"Medical Technology Research Centre (MTRC), School of Engineering and the Built Environment, Anglia Ruskin University, Essex CM1 1SQ, UK"}]}],"member":"1968","published-online":{"date-parts":[[2022,7,10]]},"reference":[{"key":"ref_1","unstructured":"Crook, C. (2018). Computers and the Collaborative Experience of Learning, Routledge."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"869","DOI":"10.1016\/S1574-6526(07)03023-4","article-title":"Cognitive robotics","volume":"3","author":"Levesque","year":"2008","journal-title":"Found. Artif. Intell."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Samani, H. (2015). Cognitive Robotics, CRC Press.","DOI":"10.1201\/b19171"},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"1147","DOI":"10.1109\/TRO.2015.2463671","article-title":"ORB-SLAM: A versatile and accurate monocular SLAM system","volume":"31","author":"Montiel","year":"2015","journal-title":"IEEE Trans. Robot."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"1255","DOI":"10.1109\/TRO.2017.2705103","article-title":"ORB-SLAM2: An Open-Source SLAM System for Monocular, Stereo and RGB-D Cameras","volume":"33","year":"2017","journal-title":"IEEE Trans. Robot."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"1004","DOI":"10.1109\/TRO.2018.2853729","article-title":"Vins-mono: A robust and versatile monocular visual-inertial state estimator","volume":"34","author":"Qin","year":"2018","journal-title":"IEEE Trans. Robot."},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Fontan, A., Civera, J., and Triebel, R. (2020, January 13\u201319). Information-Driven Direct RGB-D Odometry. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.00498"},{"key":"ref_8","first-page":"1995","article-title":"Convolutional networks for images, speech, and time series","volume":"3361","author":"LeCun","year":"1995","journal-title":"Handb. Brain Theory Neural Netw."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"1735","DOI":"10.1162\/neco.1997.9.8.1735","article-title":"Long short-term memory","volume":"9","author":"Hochreiter","year":"1997","journal-title":"Neural Comput."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1561\/2200000006","article-title":"Learning deep architectures for AI","volume":"2","author":"Bengio","year":"2009","journal-title":"Found. Trends\u00ae Mach. Learn."},{"key":"ref_11","unstructured":"Irsoy, O., and Cardie, C. (2014). Deep recursive neural networks for compositionality in language. Adv. Neural Inf. Process. Syst., 27."},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Varsamou, M., and Antonakopoulos, T. (2019, January 19\u201321). Classification using Discriminative Restricted Boltzmann Machines on Spark. Proceedings of the 2019 International Conference on Software, Telecommunications and Computer Networks (SoftCOM), Split, Croatia.","DOI":"10.23919\/SOFTCOM.2019.8903859"},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"1089","DOI":"10.1007\/s10462-018-9641-3","article-title":"Recent progress in semantic image segmentation","volume":"52","author":"Liu","year":"2019","journal-title":"Artif. Intell. Rev."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Jarrett, K., Kavukcuoglu, K., Ranzato, M., and LeCun, Y. (October, January 29). What is the best multi-stage architecture for object recognition?. Proceedings of the 2009 IEEE 12th International Conference on Computer Vision, Kyoto, Japan.","DOI":"10.1109\/ICCV.2009.5459469"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"LeCun, Y., Kavukcuoglu, K., and Farabet, C. (June, January 30). Convolutional networks and applications in vision. Proceedings of the 2010 IEEE International Symposium on Circuits and Systems, Paris, France.","DOI":"10.1109\/ISCAS.2010.5537907"},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s40537-014-0007-7","article-title":"Deep learning applications and challenges in big data analytics","volume":"2","author":"Najafabadi","year":"2015","journal-title":"J. Big Data"},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"5455","DOI":"10.1007\/s10462-020-09825-6","article-title":"A survey of the recent architectures of deep convolutional neural networks","volume":"53","author":"Khan","year":"2020","journal-title":"Artif. Intell. Rev."},{"key":"ref_18","unstructured":"Krizhevsky, A., Sutskever, I., and Hinton, G.E. (2012). Imagenet classification with deep convolutional neural networks. Adv. Neural Inf. Process. Syst., 25."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"289","DOI":"10.1007\/s40903-015-0032-7","article-title":"An overview to visual odometry and visual SLAM: Applications to mobile robotics","volume":"1","author":"Yousif","year":"2015","journal-title":"Intell. Ind. Syst."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"177","DOI":"10.1109\/TRO.2013.2279412","article-title":"3-D mapping with an RGB-D camera","volume":"30","author":"Endres","year":"2013","journal-title":"IEEE Trans. Robot."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Kohlbrecher, S., Von Stryk, O., Meyer, J., and Klingauf, U. (2011, January 1\u20135). A flexible and scalable SLAM system with full 3D motion estimation. Proceedings of the 2011 IEEE International Symposium on Safety, Security, and Rescue Robotics, Kyoto, Japan.","DOI":"10.1109\/SSRR.2011.6106777"},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"80","DOI":"10.1109\/MRA.2011.943233","article-title":"Visual odometry [tutorial]","volume":"18","author":"Scaramuzza","year":"2011","journal-title":"IEEE Robot. Autom. Mag."},{"key":"ref_23","unstructured":"Hackett, J.K., and Shah, M. (1990, January 13\u201318). Multi-sensor fusion: A perspective. Proceedings of the IEEE International Conference on Robotics and Automation, Cincinnati, OH, USA."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"16","DOI":"10.1186\/s41074-017-0027-2","article-title":"Visual SLAM algorithms: A survey from 2010 to 2016","volume":"9","author":"Taketomi","year":"2017","journal-title":"IPSJ Trans. Comput. Vis. Appl."},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Chekhlov, D., Gee, A.P., Calway, A., and Mayol-Cuevas, W. (2007, January 13\u201316). Ninja on a plane: Automatic discovery of physical planes for augmented reality using visual slam. Proceedings of the 2007 6th IEEE and ACM International Symposium on Mixed and Augmented Reality, Nara, Japan.","DOI":"10.1109\/ISMAR.2007.4538840"},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Lin, G., Milan, A., Shen, C., and Reid, I. (2017, January 21\u201326). Refinenet: Multi-path refinement networks for high-resolution semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.549"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.Y., and Berg, A.C. (2016, January 11\u201314). Ssd: Single shot multibox detector. Proceedings of the European Conference on Computer Vision, Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46448-0_2"},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Li, Y., Huang, H., Xie, Q., Yao, L., and Chen, Q. (2018). Research on a surface defect detection algorithm based on MobileNet-SSD. Appl. Sci., 8.","DOI":"10.3390\/app8091678"},{"key":"ref_30","unstructured":"Ning, C., Zhou, H., Song, Y., and Tang, J. (2017, January 10\u201314). Inception single shot multibox detector for object detection. Proceedings of the 2017 IEEE International Conference on Multimedia & Expo Workshops (ICMEW), Hong Kong, China."},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Szegedy, C., Vanhoucke, V., Ioffe, S., Shlens, J., and Wojna, Z. (2016, January 27\u201330). Rethinking the inception architecture for computer vision. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.308"},{"key":"ref_32","first-page":"46","article-title":"Kalman and extended kalman filters: Concept, derivation and properties","volume":"43","author":"Ribeiro","year":"2004","journal-title":"Inst. Syst. Robot."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"416","DOI":"10.1002\/rob.21831","article-title":"RTAB-Map as an open-source lidar and visual simultaneous localization and mapping library for large-scale and long-term online operation","volume":"36","author":"Michaud","year":"2019","journal-title":"J. Field Robot."},{"key":"ref_34","first-page":"4244","article-title":"Optimization and acceleration of convolutional neural networks: A survey","volume":"34","author":"Habib","year":"2020","journal-title":"J. King Saud Univ.-Comput. Inf. Sci."},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"31","DOI":"10.1109\/MITS.2010.939925","article-title":"A tutorial on graph-based SLAM","volume":"2","author":"Grisetti","year":"2010","journal-title":"IEEE Intell. Transp. Syst. Mag."},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"333","DOI":"10.1023\/A:1008854305733","article-title":"Globally consistent range scan alignment for environment mapping","volume":"4","author":"Lu","year":"1997","journal-title":"Auton. Robot."},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"1133","DOI":"10.1007\/s10514-017-9682-5","article-title":"Long-term online multi-session graph-based SPLAM with memory management","volume":"42","author":"Michaud","year":"2018","journal-title":"Auton. Robot."},{"key":"ref_38","unstructured":"Shi, J. (1994, January 21\u201323). Good features to track. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA."},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Zhang, G., and Vela, P.A. (2015, January 7\u201312). Good features to track for visual slam. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298743"},{"key":"ref_40","first-page":"2","article-title":"Fast approximate nearest neighbors with automatic algorithm configuration","volume":"2","author":"Muja","year":"2009","journal-title":"VISAPP"},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"381","DOI":"10.1145\/358669.358692","article-title":"Random sample consensus: A paradigm for model fitting with applications to image analysis and automated cartography","volume":"24","author":"Fischler","year":"1981","journal-title":"Commun. ACM"},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Zeiler, M.D., and Fergus, R. (2014, January 6\u201312). Visualizing and understanding convolutional networks. Proceedings of the European Conference on Computer Vision, Zurich, Switzerland.","DOI":"10.1007\/978-3-319-10590-1_53"},{"key":"ref_43","unstructured":"Goodfellow, I., Bengio, Y., and Courville, A. (2016). Deep Learning, MIT Press."},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Wofk, D., Ma, F., Yang, T.J., Karaman, S., and Sze, V. (2019, January 20\u201324). Fastdepth: Fast monocular depth estimation on embedded systems. Proceedings of the 2019 International Conference on Robotics and Automation (ICRA), Montreal, QC, Canada.","DOI":"10.1109\/ICRA.2019.8794182"},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Garcia-Garcia, A., Orts-Escolano, S., Oprea, S., Villena-Martinez, V., and Garcia-Rodriguez, J. (2017). A review on deep learning techniques applied to semantic segmentation. arXiv.","DOI":"10.1016\/j.asoc.2018.05.018"},{"key":"ref_46","doi-asserted-by":"crossref","unstructured":"Tan, C., Sun, F., Kong, T., Zhang, W., Yang, C., and Liu, C. (2018, January 4\u20137). A survey on deep transfer learning. Proceedings of the International Conference on Artificial Neural Networks, Rhodes, Greece.","DOI":"10.1007\/978-3-030-01424-7_27"},{"key":"ref_47","doi-asserted-by":"crossref","unstructured":"Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., and Rabinovich, A. (2015, January 7\u201312). Going deeper with convolutions. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298594"},{"key":"ref_48","unstructured":"Howard, A.G., Zhu, M., Chen, B., Kalenichenko, D., Wang, W., Weyand, T., Andreetto, M., and Adam, H. (2017). Mobilenets: Efficient convolutional neural networks for mobile vision applications. arXiv."},{"key":"ref_49","unstructured":"Guo, Y., Yao, A., and Chen, Y. (2016). Dynamic network surgery for efficient dnns. Adv. Neural Inf. Process. Syst., 29."},{"key":"ref_50","unstructured":"Zoph, B., and Le, Q.V. (2016). Neural architecture search with reinforcement learning. arXiv."},{"key":"ref_51","first-page":"281","article-title":"Random search for hyper-parameter optimization","volume":"13","author":"Bergstra","year":"2012","journal-title":"J. Mach. Learn. Res."},{"key":"ref_52","doi-asserted-by":"crossref","unstructured":"Zhang, X., Zhou, X., Lin, M., and Sun, J. (2018, January 18\u201323). Shufflenet: An extremely efficient convolutional neural network for mobile devices. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00716"},{"key":"ref_53","unstructured":"Real, E., Moore, S., Selle, A., Saxena, S., Suematsu, Y.L., Tan, J., Le, Q.V., and Kurakin, A. (2017, January 6\u201311). Large-scale evolution of image classifiers. Proceedings of the International Conference on Machine Learning, PMLR, Sydney, NSW, Australia."},{"key":"ref_54","unstructured":"Bouguet, J.Y. (2022, May 18). Camera Calibration Toolbox for Matlab. Available online: http:\/\/www.vision.caltech.edu\/bouguetj\/calib_doc\/index.html."},{"key":"ref_55","doi-asserted-by":"crossref","unstructured":"Foote, T. (2013, January 22\u201323). tf: The transform library. Proceedings of the 2013 IEEE International Conference on Technologies for Practical Robot Applications (TePRA), Woburn, MA, USA.","DOI":"10.1109\/TePRA.2013.6556373"},{"key":"ref_56","doi-asserted-by":"crossref","unstructured":"Song, S., Lichtenberg, S.P., and Xiao, J. (2015, January 7\u201312). Sun rgb-d: A rgb-d scene understanding benchmark suite. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298655"},{"key":"ref_57","doi-asserted-by":"crossref","first-page":"98","DOI":"10.1007\/s11263-014-0733-5","article-title":"The pascal visual object classes challenge: A retrospective","volume":"111","author":"Everingham","year":"2015","journal-title":"Int. J. Comput. Vis."},{"key":"ref_58","doi-asserted-by":"crossref","unstructured":"Sandler, M., Howard, A., Zhu, M., Zhmoginov, A., and Chen, L.C. (2018, January 18\u201323). Mobilenetv2: Inverted residuals and linear bottlenecks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00474"},{"key":"ref_59","doi-asserted-by":"crossref","unstructured":"Long, J., Shelhamer, E., and Darrell, T. (2015, January 7\u201312). Fully convolutional networks for semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298965"},{"key":"ref_60","doi-asserted-by":"crossref","first-page":"339","DOI":"10.1109\/TRO.2016.2523542","article-title":"Vision-based distributed formation control without an external positioning system","volume":"32","author":"Montijano","year":"2016","journal-title":"IEEE Trans. Robot."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/14\/14\/3324\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T23:47:38Z","timestamp":1760140058000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/14\/14\/3324"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,7,10]]},"references-count":60,"journal-issue":{"issue":"14","published-online":{"date-parts":[[2022,7]]}},"alternative-id":["rs14143324"],"URL":"https:\/\/doi.org\/10.3390\/rs14143324","relation":{},"ISSN":["2072-4292"],"issn-type":[{"type":"electronic","value":"2072-4292"}],"subject":[],"published":{"date-parts":[[2022,7,10]]}}}