{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,7,9]],"date-time":"2026-07-09T23:55:26Z","timestamp":1783641326424,"version":"3.55.0"},"reference-count":47,"publisher":"MDPI AG","issue":"9","license":[{"start":{"date-parts":[[2025,9,19]],"date-time":"2025-09-19T00:00:00Z","timestamp":1758240000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["42301509"],"award-info":[{"award-number":["42301509"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["2025T180082"],"award-info":[{"award-number":["2025T180082"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["2024M761474"],"award-info":[{"award-number":["2024M761474"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["KYCX25_2107"],"award-info":[{"award-number":["KYCX25_2107"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"name":"China Postdoctoral Science Foundation","award":["42301509"],"award-info":[{"award-number":["42301509"]}]},{"name":"China Postdoctoral Science Foundation","award":["2025T180082"],"award-info":[{"award-number":["2025T180082"]}]},{"name":"China Postdoctoral Science Foundation","award":["2024M761474"],"award-info":[{"award-number":["2024M761474"]}]},{"name":"China Postdoctoral Science Foundation","award":["KYCX25_2107"],"award-info":[{"award-number":["KYCX25_2107"]}]},{"name":"Postgraduate Research &amp; Practice Innovation Program of Jiangsu Province","award":["42301509"],"award-info":[{"award-number":["42301509"]}]},{"name":"Postgraduate Research &amp; Practice Innovation Program of Jiangsu Province","award":["2025T180082"],"award-info":[{"award-number":["2025T180082"]}]},{"name":"Postgraduate Research &amp; Practice Innovation Program of Jiangsu Province","award":["2024M761474"],"award-info":[{"award-number":["2024M761474"]}]},{"name":"Postgraduate Research &amp; Practice Innovation Program of Jiangsu Province","award":["KYCX25_2107"],"award-info":[{"award-number":["KYCX25_2107"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["IJGI"],"abstract":"<jats:p>Precise object detection is fundamental to robust indoor navigation and localization. However, the practical deployment of deep learning-based detectors on mobile platforms is frequently impeded by their extensive parameter counts, substantial computational overhead, and prolonged inference latency, rendering them impractical for real-time and GPU-independent applications. To overcome these limitations, this paper presents Nav-YOLO, a highly optimized and lightweight architecture derived from YOLOv8n, specifically engineered for navigational tasks. The model\u2019s efficiency stems from several key improvements: a ShuffleNetv2-based backbone significantly reduces model parameters; a Slim-Neck structure incorporating GSConv and GSbottleneck modules streamlines the feature fusion process; the VoV-GSCSP hierarchical network aggregates features with minimal computational cost; and a compact detection head is designed using Hybrid Convolutional Transformer Architecture Search (HyCTAS). Furthermore, the adoption of Inner-IoU as the bounding box regression loss accelerates the convergence of the training process. The model\u2019s efficacy is demonstrated through a purpose-built Android application. Experimental evaluations on the VOC2007 and VOC2012 datasets reveal that Nav-YOLO substantially outperforms the baseline YOLOv8n, achieving mAP50 improvements of 10.3% and 5.0%, respectively, while maintaining a comparable parameter footprint. Consequently, Nav-YOLO demonstrates a superior balance of accuracy, model compactness, and inference speed, presenting a compelling alternative to existing object detection algorithms for mobile systems.<\/jats:p>","DOI":"10.3390\/ijgi14090364","type":"journal-article","created":{"date-parts":[[2025,9,19]],"date-time":"2025-09-19T12:33:32Z","timestamp":1758285212000},"page":"364","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":6,"title":["Nav-YOLO: A Lightweight and Efficient Object Detection Model for Real-Time Indoor Navigation on Mobile Platforms"],"prefix":"10.3390","volume":"14","author":[{"given":"Cheng","family":"Su","sequence":"first","affiliation":[{"name":"School of Computer and Artificial Intelligence, Nanjing University of Finance and Economics, Nanjing 210023, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Litao","family":"Zhu","sequence":"additional","affiliation":[{"name":"School of Computer and Artificial Intelligence, Nanjing University of Finance and Economics, Nanjing 210023, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Wen","family":"Dai","sequence":"additional","affiliation":[{"name":"School of Geographical Sciences, Nanjing University of Information Science and Technology, Nanjing 210044, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Jin","family":"Zhou","sequence":"additional","affiliation":[{"name":"School of Computer and Artificial Intelligence, Nanjing University of Finance and Economics, Nanjing 210023, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Jialiang","family":"Wang","sequence":"additional","affiliation":[{"name":"School of Computer and Artificial Intelligence, Nanjing University of Finance and Economics, Nanjing 210023, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Yucheng","family":"Mao","sequence":"additional","affiliation":[{"name":"School of Computer and Artificial Intelligence, Nanjing University of Finance and Economics, Nanjing 210023, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Jiangbing","family":"Sun","sequence":"additional","affiliation":[{"name":"School of Earth Science and Engineering, Hohai University, Nanjing 210023, China"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"1968","published-online":{"date-parts":[[2025,9,19]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Alahmadi, T.J., Rahman, A.U., Alkahtani, H.K., and Kholidy, H. (2023). Enhancing object detection for VIPs using YOLOv4_Resnet101 and text-to-speech conversion model. Multimodal Technol. Interact., 7.","DOI":"10.3390\/mti7080077"},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"26712","DOI":"10.1109\/ACCESS.2021.3052415","article-title":"Analysis of navigation assistants for blind and visually impaired people: A systematic review","volume":"9","author":"Khan","year":"2021","journal-title":"IEEE Access"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"884","DOI":"10.1080\/13682199.2023.2230419","article-title":"An integrated region proposal and spatial information guided convolution network based object recognition for visually impaired persons\u2019 indoor assistive navigation","volume":"72","author":"Masal","year":"2024","journal-title":"Imaging Sci. J."},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Said, Y., Atri, M., Albahar, M.A., Ben Atitallah, A., and Alsariera, Y.A. (2023). Obstacle detection system for navigation assistance of visually impaired people based on deep learning techniques. Sensors, 23.","DOI":"10.3390\/s23115262"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"103661","DOI":"10.1016\/j.compind.2022.103661","article-title":"Deep learning-based object detection in augmented reality: A systematic review","volume":"139","author":"Ghasemi","year":"2022","journal-title":"Comput. Ind."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"257","DOI":"10.1109\/JPROC.2023.3238524","article-title":"Object detection in 20 years: A survey","volume":"111","author":"Zou","year":"2023","journal-title":"Proc. IEEE"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Dosi, S., Sambare, S., Singh, S., Lokhande, N., and Garware, B. (2018, January 16\u201318). Android application for object recognition using neural networks for the visually impaired. Proceedings of the 2018 Fourth International Conference on Computing Communication Control and Automation (ICCUBEA), Pune, India.","DOI":"10.1109\/ICCUBEA.2018.8697886"},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Vaidya, S., Shah, N., Shah, N., and Shankarmani, R. (2020, January 13\u201315). Real-time object detection for visually challenged people. Proceedings of the 2020 4th International Conference on Intelligent Computing and Control Systems (ICICCS), Madurai, India.","DOI":"10.1109\/ICICCS48265.2020.9121085"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"24521","DOI":"10.1007\/s11042-024-20521-3","article-title":"Outdoor navigation for visually impaired people using YOLOv5 and Transfer learning: An analytical study","volume":"84","author":"Shariatinezhad","year":"2024","journal-title":"Multimed. Tools Appl."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"14819","DOI":"10.1109\/ACCESS.2022.3148036","article-title":"CNN-based object recognition and tracking system to assist visually impaired people","volume":"10","author":"Ashiq","year":"2022","journal-title":"IEEE Access"},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"118720","DOI":"10.1016\/j.eswa.2022.118720","article-title":"DeepNAVI: A deep learning based smartphone navigation assistant for people with visual impairments","volume":"212","author":"Kuriakose","year":"2023","journal-title":"Expert Syst. Appl."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"105085","DOI":"10.1016\/j.jpdc.2025.105085","article-title":"Deep embedded lightweight CNN network for indoor objects detection on FPGA","volume":"201","author":"Afif","year":"2025","journal-title":"J. Parallel Distrib. Comput."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Gladis, K.A., Madavarapu, J.B., Kumar, R.R., and Sugashini, T. (2024). In-out YOLO glass: Indoor-outdoor object detection using adaptive spatial pooling squeeze and attention YOLO network. Biomed. Signal Process. Control, 91.","DOI":"10.1016\/j.bspc.2023.105925"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"1144","DOI":"10.1016\/j.procs.2023.01.093","article-title":"Third eye: Object recognition and speech generation for visually impaired","volume":"218","author":"Guravaiah","year":"2023","journal-title":"Procedia Comput. Sci."},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"He, K., Gkioxari, G., Doll\u00e1r, P., and Girshick, R. (2017, January 22\u201329). Mask r-cnn. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.322"},{"key":"ref_16","first-page":"1137","article-title":"Faster r-cnn: Towards real-time object detection with region proposal networks","volume":"28","author":"Ren","year":"2015","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.-Y., and Berg, A.C. (2016, January 11\u201314). Ssd: Single shot multibox detector. Proceedings of the Computer Vision\u2013ECCV 2016: 14th European Conference, Amsterdam, The Netherlands. Proceedings, Part I 14.","DOI":"10.1007\/978-3-319-46448-0_2"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Redmon, J., Divvala, S., Girshick, R., and Farhadi, A. (2016, January 27\u201330). You only look once: Unified, real-time object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.91"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Badrloo, S., Varshosaz, M., Pirasteh, S., and Li, J. (2022). Image-based obstacle detection methods for the safe navigation of unmanned vehicles: A review. Remote Sens., 14.","DOI":"10.3390\/rs14153824"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Kuriakose, B., Shrestha, R., and Sandnes, F.E. (2020, January 19\u201324). Smartphone navigation support for blind and visually impaired people-a comprehensive analysis of potentials and opportunities. Proceedings of the International Conference on Human-Computer Interaction, Copenhagen, Denmark.","DOI":"10.1007\/978-3-030-49108-6_41"},{"key":"ref_21","unstructured":"Iandola, F.N., Han, S., Moskewicz, M.W., Ashraf, K., Dally, W.J., and Keutzer, K. (2016). SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <0.5 MB model size. arXiv."},{"key":"ref_22","unstructured":"Howard, A.G., Zhu, M., Chen, B., Kalenichenko, D., Wang, W., Weyand, T., Andreetto, M., and Adam, H. (2017). Mobilenets: Efficient convolutional neural networks for mobile vision applications. arXiv."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Ma, N., Zhang, X., Zheng, H.-T., and Sun, J. (2018, January 8\u201314). Shufflenet v2: Practical guidelines for efficient cnn architecture design. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01264-9_8"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Han, K., Wang, Y., Tian, Q., Guo, J., Xu, C., and Xu, C. (2020, January 14\u201319). Ghostnet: More features from cheap operations. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.00165"},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"119108","DOI":"10.1016\/j.eswa.2022.119108","article-title":"Real-time vehicle detection algorithm based on a lightweight You-Only-Look-Once (YOLOv5n-L) approach","volume":"213","author":"Bie","year":"2023","journal-title":"Expert Syst. Appl."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"121036","DOI":"10.1016\/j.eswa.2023.121036","article-title":"An improved lightweight small object detection framework applied to real-time autonomous driving","volume":"234","author":"Mahaur","year":"2023","journal-title":"Expert Syst. Appl."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Zhong, H., Zhang, Y., Shi, Z., Zhang, Y., and Zhao, L. (2025). PS-YOLO: A Lighter and Faster Network for UAV Object Detection. Remote Sens., 17.","DOI":"10.3390\/rs17091641"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Hong, J., Ye, K., and Qiu, S. (2025). Study on lightweight strategies for L-YOLO algorithm in road object detection. Sci. Rep., 15.","DOI":"10.1038\/s41598-025-92148-9"},{"key":"ref_29","unstructured":"Redmon, J., and Farhadi, A. (2018). Yolov3: An incremental improvement. arXiv."},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Lin, T.-Y., Doll\u00e1r, P., Girshick, R., He, K., Hariharan, B., and Belongie, S. (2017, January 21\u201326). Feature pyramid networks for object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.106"},{"key":"ref_31","unstructured":"Li, H., Xiong, P., An, J., and Wang, L. (2018). Pyramid attention network for semantic segmentation. arXiv."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"303","DOI":"10.1007\/s11263-009-0275-4","article-title":"The pascal visual object classes (voc) challenge","volume":"88","author":"Everingham","year":"2010","journal-title":"Int. J. Comput. Vis."},{"key":"ref_33","unstructured":"Li, H., Li, J., Wei, H., Liu, Z., Zhan, Z., and Ren, Q. (2022). Slim-neck by GSConv: A better design paradigm of detector architectures for autonomous vehicles. arXiv."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Huang, G., Liu, S., Van der Maaten, L., and Weinberger, K.Q. (2018, January 18\u201322). Condensenet: An efficient densenet using learned group convolutions. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00291"},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Wang, C.-Y., Liao, H.-Y.M., Wu, Y.-H., Chen, P.-Y., Hsieh, J.-W., and Yeh, I.-H. (2020, January 14\u201319). CSPNet: A new backbone that can enhance learning capability of CNN. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition Workshops, Seattle, WA, USA.","DOI":"10.1109\/CVPRW50498.2020.00203"},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Yu, H., Wan, C., Dai, X., Liu, M., Chen, D., Xiao, B., Huang, Y., Lu, Y., and Wang, L. (2024). Real-time image segmentation via hybrid convolutional-transformer architecture search. arXiv.","DOI":"10.2139\/ssrn.5221286"},{"key":"ref_37","first-page":"21002","article-title":"Generalized focal loss: Learning qualified and distributed bounding boxes for dense object detection","volume":"33","author":"Li","year":"2020","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Zheng, Z., Wang, P., Liu, W., Li, J., Ye, R., and Ren, D. (2020, January 7\u201312). Distance-IoU loss: Faster and better learning for bounding box regression. Proceedings of the AAAI Conference on Artificial Intelligence, New York, NY, USA.","DOI":"10.1609\/aaai.v34i07.6999"},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Feng, C., Zhong, Y., Gao, Y., Scott, M.R., and Huang, W. (2021, January 11\u201317). Tood: Task-aligned one-stage object detection. Proceedings of the 2021 IEEE\/CVF International Conference on Computer Vision (ICCV), Montreal, QC, Canada.","DOI":"10.1109\/ICCV48922.2021.00349"},{"key":"ref_40","unstructured":"Zhang, H., Xu, C., and Zhang, S. (2023). Inner-iou: More effective intersection over union loss with auxiliary bounding box. arXiv."},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Chen, J., Kao, S.-h., He, H., Zhuo, W., Wen, S., Lee, C.-H., and Chan, S.-H.G. (2023, January 17\u201324). Run, don\u2019t walk: Chasing higher FLOPS for faster neural networks. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada.","DOI":"10.1109\/CVPR52729.2023.01157"},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Yang, G., Lei, J., Zhu, Z., Cheng, S., Feng, Z., and Liang, R. (2023, January 1\u20134). AFPN: Asymptotic feature pyramid network for object detection. Proceedings of the 2023 IEEE International Conference on Systems, Man, and Cybernetics (SMC), Honolulu, HI, USA.","DOI":"10.1109\/SMC53992.2023.10394415"},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Tan, M., Pang, R., and Le, Q.V. (2020, January 14\u201319). Efficientdet: Scalable and efficient object detection. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.01079"},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Zhao, Y., Lv, W., Xu, S., Wei, J., Wang, G., Dang, Q., Liu, Y., and Chen, J. (2024, January 16\u201322). Detrs beat yolos on real-time object detection. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR52733.2024.01605"},{"key":"ref_45","unstructured":"Zhang, X., Liu, C., Yang, D., Song, T., Ye, Y., Li, K., and Song, Y. (2023). RFAConv: Innovating spatial attention and standard convolutional operation. arXiv."},{"key":"ref_46","doi-asserted-by":"crossref","unstructured":"Ding, X., Zhang, X., Ma, N., Han, J., Ding, G., and Sun, J. (2021, January 20\u201325). Repvgg: Making vgg-style convnets great again. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.01352"},{"key":"ref_47","doi-asserted-by":"crossref","unstructured":"Ding, X., Zhang, X., Han, J., and Ding, G. (2021, January 20\u201325). Diverse branch block: Building a convolution as an inception-like unit. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.01074"}],"container-title":["ISPRS International Journal of Geo-Information"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2220-9964\/14\/9\/364\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,9]],"date-time":"2025-10-09T18:49:06Z","timestamp":1760035746000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2220-9964\/14\/9\/364"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,9,19]]},"references-count":47,"journal-issue":{"issue":"9","published-online":{"date-parts":[[2025,9]]}},"alternative-id":["ijgi14090364"],"URL":"https:\/\/doi.org\/10.3390\/ijgi14090364","relation":{},"ISSN":["2220-9964"],"issn-type":[{"value":"2220-9964","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,9,19]]}}}