{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,11]],"date-time":"2026-03-11T16:38:13Z","timestamp":1773247093530,"version":"3.50.1"},"reference-count":25,"publisher":"MDPI AG","issue":"11","license":[{"start":{"date-parts":[[2024,5,23]],"date-time":"2024-05-23T00:00:00Z","timestamp":1716422400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Guangdong Basic and Applied Basic Research Foundation","award":["2023B1515120066"],"award-info":[{"award-number":["2023B1515120066"]}]},{"name":"Guangdong Basic and Applied Basic Research Foundation","award":["2023RC1039"],"award-info":[{"award-number":["2023RC1039"]}]},{"name":"Hunan Province Science and Technology Innovation Leader","award":["2023B1515120066"],"award-info":[{"award-number":["2023B1515120066"]}]},{"name":"Hunan Province Science and Technology Innovation Leader","award":["2023RC1039"],"award-info":[{"award-number":["2023RC1039"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>This study presents a novel method for the nighttime detection of waterborne individuals using an enhanced YOLOv5s algorithm tailored for infrared thermal imaging. To address the unique challenges of nighttime water rescue operations, we have constructed a specialized dataset comprising 5736 thermal images collected from diverse aquatic environments. This dataset was further expanded through synthetic image generation using CycleGAN and a newly developed color gamut transformation technique, which significantly improves the data variance and model training effectiveness. Furthermore, we integrated the Convolutional Block Attention Module (CBAM) at the end of the last encoder\u2019s feedforward network. This integration maximizes the utilization of channel and spatial information to capture more intricate details in the feature maps. To decrease the computational demands of the network while maintaining model accuracy, Ghost convolution was employed, thereby boosting the inference speed as much as possible. Additionally, we applied hyperparameter evolution to refine the training parameters. The improved algorithm achieved an average detection accuracy of 85.49% on our proprietary dataset, significantly outperforming its predecessor, with a prediction speed of 23.51 FPS. The experimental outcomes demonstrate the proposed solution\u2019s high recognition capabilities and robustness, fulfilling the demands of intelligent lifesaving missions.<\/jats:p>","DOI":"10.3390\/s24113321","type":"journal-article","created":{"date-parts":[[2024,5,23]],"date-time":"2024-05-23T05:35:20Z","timestamp":1716442520000},"page":"3321","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":11,"title":["Personnel Detection in Dark Aquatic Environments Based on Infrared Thermal Imaging Technology and an Improved YOLOv5s Model"],"prefix":"10.3390","volume":"24","author":[{"given":"Liang","family":"Cheng","sequence":"first","affiliation":[{"name":"School of Ocean Engineering, Jiangsu Ocean University, Lianyungang 222005, China"},{"name":"Zhuhai Yunzhou Intelligence Technology Co., Ltd., Zhuhai 519085, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yunze","family":"He","sequence":"additional","affiliation":[{"name":"College of Electrical and Information Engineering, Hunan University, Changsha 410082, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yankai","family":"Mao","sequence":"additional","affiliation":[{"name":"College of Electrical and Information Engineering, Hunan University, Changsha 410082, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Zhenkang","family":"Liu","sequence":"additional","affiliation":[{"name":"College of Electrical and Information Engineering, Hunan University, Changsha 410082, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Xiangzhao","family":"Dang","sequence":"additional","affiliation":[{"name":"College of Electrical and Information Engineering, Hunan University, Changsha 410082, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yilong","family":"Dong","sequence":"additional","affiliation":[{"name":"College of Electrical and Information Engineering, Hunan University, Changsha 410082, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Liangliang","family":"Wu","sequence":"additional","affiliation":[{"name":"College of Electrical and Information Engineering, Hunan University, Changsha 410082, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2024,5,23]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"93","DOI":"10.1016\/j.infrared.2018.10.020","article-title":"High-sensitivity short-wave infrared technology for thermal imaging","volume":"95","author":"Wen","year":"2018","journal-title":"Infrared Phys. Technol."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"61","DOI":"10.1016\/S0165-1684(97)00038-8","article-title":"Classification of EEG signals using the wavelet transform","volume":"59","author":"Hazarika","year":"1997","journal-title":"Signal Process."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Sun, Q., Xiong, J., Li, L., Yang, J., and Wang, Z. (2010, January 5\u20137). Research on a coupled shock-diffusion filter for passive millimeter-wave image enhancement. Proceedings of the 2010 2nd International Conference on Signal Processing Systems, Dalian, China.","DOI":"10.1109\/ICSPS.2010.5555428"},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"7147","DOI":"10.1109\/TGRS.2018.2848901","article-title":"HSF-Net: Multiscale deep feature embedding for ship detection in optical remote sensing imagery","volume":"56","author":"Li","year":"2018","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"012166","DOI":"10.1088\/1742-6596\/1820\/1\/012166","article-title":"Recognition and classification of water surface targets based on deep learning","volume":"1820","author":"Peng","year":"2021","journal-title":"J. Phys. Conf. Ser."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"421","DOI":"10.1109\/ACCESS.2021.3138983","article-title":"Water target recognition method and application for unmanned surface vessels","volume":"10","author":"Cheng","year":"2021","journal-title":"IEEE Access"},{"key":"ref_7","unstructured":"Bochkovskiy, A., Wang, C.-Y., and Liao, H.-Y.M. (2020). Yolov4: Optimal speed and accuracy of object detection. arXiv."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"724","DOI":"10.1109\/83.568929","article-title":"Wavelet transform methods for object detection and recovery","volume":"6","author":"Strickland","year":"1997","journal-title":"IEEE Trans. Image Process."},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Withagen, P.J., Schutte, K., Vossepoel, A.M., and Breuers, M.G. (1999, January 5\u20137). Automatic classification of ships from infrared (FLIR) images. Proceedings of the Signal Processing, Sensor Fusion, and Target Recognition VIII, Orlando, FL, USA.","DOI":"10.1117\/12.357157"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Altay, F., and Velipasalar, S. (2020, January 1\u20135). Pedestrian detection from thermal images incorporating saliency features. Proceedings of the 2020 54th Asilomar Conference on Signals, Systems, and Computers, Pacific Grove, CA, USA.","DOI":"10.1109\/IEEECONF51394.2020.9443411"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Shimoda, M., Sada, Y., Kuramochi, R., and Nakahara, H. (2019, January 9\u201313). An FPGA implementation of real-time object detection with a thermal camera. Proceedings of the 2019 29th International Conference on Field Programmable Logic and Applications (FPL), Barcelona, Spain.","DOI":"10.1109\/FPL.2019.00072"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Redmon, J., and Farhadi, A. (2017, January 21\u201326). YOLO9000: Better, faster, stronger. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.690"},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"5728","DOI":"10.1109\/JSEN.2017.2723599","article-title":"Fast motion object detection algorithm using complementary depth image on an RGB-D camera","volume":"17","author":"Sun","year":"2017","journal-title":"IEEE Sens. J."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"3745","DOI":"10.1109\/JSEN.2019.2960796","article-title":"YOLOv3-DPFIN: A dual-path feature fusion neural network for robust real-time sonar target detection","volume":"20","author":"Kong","year":"2019","journal-title":"IEEE Sens. J."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"25372","DOI":"10.1109\/JSEN.2021.3067608","article-title":"RDNet: Regression dense and attention for object detection in traffic symbols","volume":"21","author":"Hong","year":"2021","journal-title":"IEEE Sens. J."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"5951","DOI":"10.1109\/ACCESS.2020.3048437","article-title":"Deep learning-based thermal image reconstruction and object detection","volume":"9","author":"Batchuluun","year":"2020","journal-title":"IEEE Access"},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"125459","DOI":"10.1109\/ACCESS.2020.3007481","article-title":"Thermal object detection in difficult weather conditions using YOLO","volume":"8","author":"Pobar","year":"2020","journal-title":"IEEE Access"},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"1130","DOI":"10.1109\/TIV.2022.3158094","article-title":"Evaluation of thermal imaging on embedded GPU platforms for application in vehicular assistance systems","volume":"8","author":"Farooq","year":"2022","journal-title":"IEEE Trans. Intell. Veh."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"103754","DOI":"10.1016\/j.infrared.2021.103754","article-title":"Infrared machine vision and infrared thermography with deep learning: A review","volume":"116","author":"He","year":"2021","journal-title":"Infrared Phys. Technol."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Zhu, J.-Y., Park, T., Isola, P., and Efros, A.A. (2017, January 22\u201329). Unpaired image-to-image translation using cycle-consistent adversarial networks. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.244"},{"key":"ref_21","first-page":"1","article-title":"Faster r-cnn: Towards real-time object detection with region proposal networks","volume":"28","author":"Ren","year":"2015","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_22","unstructured":"Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.-Y., and Berg, A.C. (2016). Computer Vision\u2013ECCV 2016, Proceedings of the 14th European Conference, Amsterdam, The Netherlands, 11\u201314 October 2016, Springer. Proceedings, Part I."},{"key":"ref_23","unstructured":"Redmon, J., and Farhadi, A. (2018). Yolov3: An incremental improvement. arXiv."},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Woo, S., Park, J., Lee, J.-Y., and Kweon, I.S. (2018, January 8\u201314). Cbam: Convolutional block attention module. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01234-2_1"},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Han, K., Wang, Y., Tian, Q., Guo, J., Xu, C., and Xu, C. (2020, January 13\u201319). Ghostnet: More features from cheap operations. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.00165"}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/24\/11\/3321\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T14:46:59Z","timestamp":1760107619000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/24\/11\/3321"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,5,23]]},"references-count":25,"journal-issue":{"issue":"11","published-online":{"date-parts":[[2024,6]]}},"alternative-id":["s24113321"],"URL":"https:\/\/doi.org\/10.3390\/s24113321","relation":{},"ISSN":["1424-8220"],"issn-type":[{"value":"1424-8220","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,5,23]]}}}