{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,6]],"date-time":"2026-04-06T12:26:01Z","timestamp":1775478361353,"version":"3.50.1"},"reference-count":33,"publisher":"MDPI AG","issue":"18","license":[{"start":{"date-parts":[[2021,9,7]],"date-time":"2021-09-07T00:00:00Z","timestamp":1630972800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["41576107"],"award-info":[{"award-number":["41576107"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["41376109"],"award-info":[{"award-number":["41376109"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"name":"National Key R&amp;D Program of China","award":["2016YFB0501703"],"award-info":[{"award-number":["2016YFB0501703"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>To overcome the shortcomings of the traditional manual detection of underwater targets in side-scan sonar (SSS) images, a real-time automatic target recognition (ATR) method is proposed in this paper. This method consists of image preprocessing, sampling, ATR by integration of the transformer module and YOLOv5s (that is, TR\u2013YOLOv5s), and target localization. By considering the target-sparse and feature-barren characteristics of SSS images, a novel TR\u2013YOLOv5s network and a down-sampling principle are put forward, and the attention mechanism is introduced in the method to meet the requirements of accuracy and efficiency for underwater target recognition. Experiments verified the proposed method achieved 85.6% mean average precision (mAP) and 87.8% macro-F2 score, and brought 12.5% and 10.6% gains compared with the YOLOv5s network trained from scratch, and had the real-time recognition speed of about 0.068 s per image.<\/jats:p>","DOI":"10.3390\/rs13183555","type":"journal-article","created":{"date-parts":[[2021,9,8]],"date-time":"2021-09-08T02:41:07Z","timestamp":1631068867000},"page":"3555","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":214,"title":["Real-Time Underwater Maritime Object Detection in Side-Scan Sonar Images Based on Transformer-YOLOv5"],"prefix":"10.3390","volume":"13","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-8822-7781","authenticated-orcid":false,"given":"Yongcan","family":"Yu","sequence":"first","affiliation":[{"name":"School of Geodesy and Geomatics, Wuhan University, Wuhan 430079, China"},{"name":"Institute of Marine Science and Technology, Wuhan University, Wuhan 430079, China"}]},{"given":"Jianhu","family":"Zhao","sequence":"additional","affiliation":[{"name":"School of Geodesy and Geomatics, Wuhan University, Wuhan 430079, China"},{"name":"Institute of Marine Science and Technology, Wuhan University, Wuhan 430079, China"}]},{"given":"Quanhua","family":"Gong","sequence":"additional","affiliation":[{"name":"New Energy Engineering Limited Company of China Communications Construction Company Third Harbor Engineering Limited Company, Shanghai 200137, China"}]},{"given":"Chao","family":"Huang","sequence":"additional","affiliation":[{"name":"School of Geodesy and Geomatics, Wuhan University, Wuhan 430079, China"},{"name":"Institute of Marine Science and Technology, Wuhan University, Wuhan 430079, China"}]},{"given":"Gen","family":"Zheng","sequence":"additional","affiliation":[{"name":"School of Geodesy and Geomatics, Wuhan University, Wuhan 430079, China"},{"name":"Institute of Marine Science and Technology, Wuhan University, Wuhan 430079, China"}]},{"given":"Jinye","family":"Ma","sequence":"additional","affiliation":[{"name":"School of Geodesy and Geomatics, Wuhan University, Wuhan 430079, China"},{"name":"Institute of Marine Science and Technology, Wuhan University, Wuhan 430079, China"}]}],"member":"1968","published-online":{"date-parts":[[2021,9,7]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"186","DOI":"10.1109\/JOE.2011.2107250","article-title":"A Novel Method for Sidescan Sonar Image Segmentation","volume":"36","author":"Celik","year":"2011","journal-title":"IEEE J. Ocean. Eng."},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Zheng, G., Zhang, H., Li, Y., and Zhao, J. (2021). A Universal Automatic Bottom Tracking Method of Side Scan Sonar Data Based on Semantic Segmentation. Remote Sens., 13.","DOI":"10.3390\/rs13101945"},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Li, S., Zhao, J., Zhang, H., Bi, Z., and Qu, S. (2020). A Novel Horizon Picking Method on Sub-Bottom Profiler Sonar Images. Remote Sens., 12.","DOI":"10.3390\/rs12203322"},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"123","DOI":"10.1109\/JOE.2015.2408471","article-title":"A brain computer interface (BCI) for the detection of mine-like objects in sidescan sonar imagery","volume":"41","author":"Barngrovver","year":"2016","journal-title":"IEEE J. Ocean. Eng."},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Lehardy, P.K., and Moore, C. (2014). Deep ocean search for Malaysia airlines flight 370. 2014 Oceans\u2014St. Jhons\u2019s, St. John\u2019s, IEEE.","DOI":"10.1109\/OCEANS.2014.7003292"},{"key":"ref_6","first-page":"1972","article-title":"A Review on Deep Learning-Based Approaches for Automatic Sonar Target Recognition","volume":"9","author":"Neupane","year":"2020","journal-title":"Elecronics"},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"168","DOI":"10.1016\/j.sigpro.2017.07.022","article-title":"Detection of Small Objects in Sidescan Sonar Images Based on POHMT and Tsallis Entropy","volume":"142","author":"Zheng","year":"2017","journal-title":"Signal. Process."},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Fakiris, E., Papatheodorou, G., Geraga, M., and Ferentinos, G. (2016). An Automatic Target Detection Algorithm for Swath Sonar Backscatter Imagery, Using Image Texture and Independent Component Analysis. Remote Sens., 8.","DOI":"10.3390\/rs8050373"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"295","DOI":"10.3390\/rs10020295","article-title":"A Side Scan Sonar Image Target Detection Algorithm Based on a Neutrosophic Set and Diffusion Maps","volume":"10","author":"Xiao","year":"2018","journal-title":"Remote Sens."},{"key":"ref_10","first-page":"95","article-title":"Unsupervised extraction of underwater regions of interest in sidescan sonar imagery","volume":"15","author":"Guillaume","year":"2020","journal-title":"J. Ocean. Eng."},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Zhu, B., Wang, X., Chu, Z., Yang, Y., and Shi, J. (2019). Active Learning for Recognition of Shipwreck Target in Side-Scan Sonar Image. Remote Sens., 11.","DOI":"10.3390\/rs11030243"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Nargesian, F., Samulowitz, H., Khurana, U., Khalil, E.B., and Turaga, D. (2017, January 19\u201325). Learning Feature Engineering for Classification. Proceedings of the 26th International Joint Conference on Artificial Intelligence, Melbourne, Australia.","DOI":"10.24963\/ijcai.2017\/352"},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Nguyen, H., Lee, E., and Lee, S. (2019). Study on the Classification Performance of Underwater Sonar Image Classification Based on Convolutional Neural Networks for Detecting a Submerged Human Body. Sensors, 20.","DOI":"10.3390\/s20010094"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"195","DOI":"10.1109\/JOE.2020.2980456","article-title":"Modeling and Simulation of Sidescan Using Conditional Generative Adversarial Network","volume":"46","author":"Bore","year":"2020","journal-title":"IEEE J. Ocean. Eng."},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Steiniger, Y., Kraus, D., and Meisen, T. (2021). Generating Synthetic Sidescan Sonar Snippets Using Transfer-Learning in Generative Adversarial Networks. J. Mar. Sci. Eng., 9.","DOI":"10.3390\/jmse9030239"},{"key":"ref_16","unstructured":"Lee, S., Park, B., and Kim, A. (2019, January 18\u201320). Deep Learning based Object Detection via Style-transferred Underwater Sonar Images. Proceedings of the 12th IFAC Conference on Control Applications in Marine Systems, Robotics, and Vehicles CAMS 2019, Daejeon, Korea."},{"key":"ref_17","first-page":"118","article-title":"The application of convolutional neural networks for automatic detection of underwater object in side scan sonar images","volume":"37","author":"Kim","year":"2018","journal-title":"J. Acoust. Soc. Korea"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Einsidler, D., Dhanak, M., and Beaujean, P. (2018, January 22\u201325). A Deep Learning Approach to Target Recognition in Side-Scan Sonar Imagery. Proceedings of the MTS\/IEEE Charleston OCEANS Conference, Charleston, SC, USA.","DOI":"10.1109\/OCEANS.2018.8604879"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"219","DOI":"10.1007\/978-981-15-9460-1_16","article-title":"Machine learning techniques for AUV side scan sonar data feature extraction as applied to intelligent search for underwater archaeology sites","volume":"16","author":"Nayak","year":"2021","journal-title":"Field Serv. Robot."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"47407","DOI":"10.1109\/ACCESS.2020.2978880","article-title":"Underwater Object Classification in Sidescan Sonar Images Using Deep Transfer Learning and Semisynthetic Training Data","volume":"8","author":"Huo","year":"2020","journal-title":"IEEE Access"},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"107691","DOI":"10.1016\/j.apacoust.2020.107691","article-title":"Zero shot objects classification method of side scan sonar image based on synthesis of pseudo samples","volume":"173","author":"Li","year":"2021","journal-title":"Appl. Acoust."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"56","DOI":"10.1109\/JOE.2019.2950974","article-title":"Real-Time Object Detection for AUVs Using Self-Cascaded Convolutional Neural Networks","volume":"46","author":"Song","year":"2019","journal-title":"IEEE J. Ocean. Eng."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Wu, M., Wang, Q., Rigall, E., Li, K., Zhu, W., He, B., and Yan, T. (2019). ECNet: Efficient Convolutional Networks for Side Scan Sonar Image Segmentation. Sensors, 19.","DOI":"10.3390\/s19092009"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Burguera, A., and Bonin-Font, F. (2020). On-Line Multi-Class Segmentation of Side-Scan Sonar Imagery Using an Autonomous Underwater Vehicle. J. Mar. Sci. Eng., 8.","DOI":"10.3390\/jmse8080557"},{"key":"ref_25","unstructured":"(2021, January 01). Ultralytics-Yolov5. Available online: https:\/\/github.com\/ultralytics\/yolov5."},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Xu, R., Lin, H., Lu, K., Cao, L., and Liu, Y. (2021). A Forest Fire Detection System Based on Ensemble Learning. Forests, 12.","DOI":"10.3390\/f12020217"},{"key":"ref_27","unstructured":"Loffe, S., and Szegedy, C. (2015, January 6\u201311). Batch normalization: Accelerating deep network training by reducing internal covariate shift. Proceedings of the 32nd International Conference on International Conference on Machine Learning (ICML), Lile, France."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"3","DOI":"10.1016\/j.neunet.2017.12.012","article-title":"Sigmoid-weighted linear units for neural network function approximation in reinforcement learning","volume":"107","author":"Elfwing","year":"2018","journal-title":"Neural Netw."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Wang, C., Liao, H.M., Wu, Y., Chen, P., Hsieh, J., and Yeh, I. (2020, January 14\u201319). CSPNet: A New Backbone that can Enhance Learning Capability of CNN. Proceedings of the 2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Seattle, WA, USA.","DOI":"10.1109\/CVPRW50498.2020.00203"},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Lin, T., Doll\u00e1r, P., Girshick, R., He, K., Hariharan, B., and Belongie, S. (2017, January 21\u201326). Feature Pyramid Networks for Object Detection. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.106"},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Liu, S., Qi, L., Qin, H., Shi, J., and Jia, J. (2018, January 18\u201323). Path Aggregation Network for Instance Segmentation. Proceedings of the 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00913"},{"key":"ref_32","unstructured":"Redmon, J., and Farhadi, A. (2018). YOLOv3: An incremental improvement. arXiv."},{"key":"ref_33","unstructured":"Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, L., and Polosukhin, I. (2017, January 4\u20139). Attention is all you need. Proceedings of the 31st International Conference on Neural Information Processing Systems (NIPS), Long Beach, CA, USA."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/13\/18\/3555\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T06:58:05Z","timestamp":1760165885000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/13\/18\/3555"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,9,7]]},"references-count":33,"journal-issue":{"issue":"18","published-online":{"date-parts":[[2021,9]]}},"alternative-id":["rs13183555"],"URL":"https:\/\/doi.org\/10.3390\/rs13183555","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,9,7]]}}}