{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,29]],"date-time":"2026-01-29T20:05:16Z","timestamp":1769717116843,"version":"3.49.0"},"reference-count":29,"publisher":"SAGE Publications","issue":"3","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["IFS"],"published-print":{"date-parts":[[2023,8,24]]},"abstract":"<jats:p>Unmanned aerial vehicles (UAVs) play a crucial role in maritime search and rescue missions, capturing images of open water scenarios and assisting in object detection. Previous object detection models have mainly focused on general scenarios. However, existing object detection models have mainly focused on general scenarios, while images captured by UAVs in vast ocean scenarios often contain numerous small objects that significantly degrade the performance of the original models. To address this challenge, we propose a model that can automatically detect objects in images captured by UAVs during maritime search and rescue missions. Our approach involves designing a new detection head with higher resolution feature maps and more comprehensive feature information to improve the detection of small objects. Additionally, we integrate Swin Transformer blocks into the small object detection head, which can improve the model\u2019s ability to obtain abundant contextual information and thus improves the model\u2019s ability to detect small objects. Moreover, we fuse the Convolutional Block Attention Model into the small object detection head to help the model focus on important features. Finally, we adopt a model ensemble strategy to further improve the mean average precision (mAP). Our proposed model achieves a 4.05% improvement in mAP compared to the baseline model. Furthermore, our model outperforms the previous state-of-the-art model on the SeaDronesSee dataset in terms of fewer parameters, lower training costs, and higher mAP.<\/jats:p>","DOI":"10.3233\/jifs-230200","type":"journal-article","created":{"date-parts":[[2023,6,13]],"date-time":"2023-06-13T10:12:33Z","timestamp":1686651153000},"page":"3575-3586","source":"Crossref","is-referenced-by-count":3,"title":["Sea-YOLOv5s: A UAV image-based model for detecting objects in SeaDronesSee dataset"],"prefix":"10.1177","volume":"45","author":[{"given":"Xiaotian","family":"Wang","sequence":"first","affiliation":[{"name":"College of Software, Nankai University, Tianjin, China"}]},{"given":"Zhongjie","family":"Pan","sequence":"additional","affiliation":[{"name":"College of Software, Nankai University, Tianjin, China"}]},{"given":"Ningxin","family":"He","sequence":"additional","affiliation":[{"name":"College of Software, Nankai University, Tianjin, China"}]},{"given":"Tiegang","family":"Gao","sequence":"additional","affiliation":[{"name":"College of Software, Nankai University, Tianjin, China"}]}],"member":"179","reference":[{"issue":"12","key":"10.3233\/JIFS-230200_ref1","doi-asserted-by":"crossref","first-page":"7700","DOI":"10.1109\/TII.2020.2974047","article-title":"Maritime search and rescue based on group mobile computing for unmanned aerial vehicles and unmanned surface vehicles","volume":"16","author":"Yang","year":"2020","journal-title":"IEEE, Trans. Ind. Informatics"},{"key":"10.3233\/JIFS-230200_ref2","doi-asserted-by":"crossref","first-page":"110098","DOI":"10.1016\/j.oceaneng.2021.110098","article-title":"Coverage path planning for maritime search and rescue using reinforcement learning","volume":"241","author":"Ai","year":"2021","journal-title":"Ocean Eng"},{"key":"10.3233\/JIFS-230200_ref3","doi-asserted-by":"crossref","first-page":"109376","DOI":"10.1016\/j.oceaneng.2021.109376","article-title":"Cooperative mass path planning for marine man overboard search","volume":"235","author":"Mou","year":"2021","journal-title":"Ocean Eng"},{"key":"10.3233\/JIFS-230200_ref4","doi-asserted-by":"crossref","unstructured":"Albanese A. , Sciancalepore V. and Costa-P\u00e9rez X. , SARDO:an automated search-and-rescue drone-based solution for victims localization, IEEE, Trans. Mob. Comput. 21(9) (2022) 3312\u20133325. [Online]. Available: https:\/\/doi.org\/10.1109\/TMC.2021.3051273","DOI":"10.1109\/TMC.2021.3051273"},{"issue":"4","key":"10.3233\/JIFS-230200_ref5","doi-asserted-by":"crossref","first-page":"782","DOI":"10.1002\/rob.21849","article-title":"Fisher, Detection of bodies in maritime rescue operations using unmanned aerial vehicles with multispectral cameras","volume":"36","author":"Gallego","year":"2019","journal-title":"J. Field Robotics"},{"key":"10.3233\/JIFS-230200_ref6","unstructured":"Zhu P. , Wen L. , Bian X. , Ling H. and Hu Q. , Vision meets drones: Achallenge, CoRR, vol. abs\/abs\/1804.07437, 2018."},{"key":"10.3233\/JIFS-230200_ref7","doi-asserted-by":"crossref","unstructured":"Du D. , Qi Y. , Yu H. , Yang Y. , Duan K. , Li G. , Zhang W. , Huang Q. and Tian Q. , The unmanned aerial vehicle benchmark: Object detection and tracking, in Computer Vision - ECCV 2018 - 15th European Conference, Munich, Germany, September 8-14, 2018, Proceedings, Part X, ser. Lecture Notes in Computer Science, V. Ferrari, M. Hebert, C. Sminchisescu and Y. Weiss, Eds. 11214 (2018), 375\u2013391.","DOI":"10.1007\/978-3-030-01249-6_23"},{"key":"10.3233\/JIFS-230200_ref8","doi-asserted-by":"crossref","unstructured":"Varga L.A. , Kiefer B. , Messmer M. and Zell A. , Seadronessee: Amaritime benchmark for detecting humans in open water, in IEEE\/CVF Winter Conference on Applications of Computer Vision, WACV Waikoloa, HI, USA, January 3-8, 2022, (2022), pp. 3686\u20133696.","DOI":"10.1109\/WACV51458.2022.00374"},{"key":"10.3233\/JIFS-230200_ref9","doi-asserted-by":"crossref","unstructured":"Girshick R.B. , Fast R-CNN, in 2015 IEEE International Conference on Computer Vision, ICCV 2015, Santiago, Chile, December 7-13, 2015, (2015), pp. 1440\u20131448.","DOI":"10.1109\/ICCV.2015.169"},{"key":"10.3233\/JIFS-230200_ref10","unstructured":"Zhou X. , Wang D. and Kr\u00e4henb\u00fchl P. , Objects as points, CoRR, vol. abs\/1904.07850, 2019."},{"key":"10.3233\/JIFS-230200_ref11","first-page":"10778","article-title":"Efficientdet: Scalable and efficient object detection","author":"Tan","year":"2020","journal-title":"2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition, CVPR Seattle, WA, USA, June 13-19, 2020"},{"key":"10.3233\/JIFS-230200_ref12","unstructured":"Kiefer B. , Ott D. and Zell A. , Leveraging synthetic data in object detection on unmanned aerial vehicles, CoRR vol.abs\/2112.12252, 2021, [Online]. Available: https:\/\/arxiv.org\/abs\/2112.12252."},{"key":"10.3233\/JIFS-230200_ref13","unstructured":"Jocher G. , \u201cyolov5,\u201d accessed 28 June 2022 [Online]. Available: https:\/\/github.com\/ultralytics\/yolov5."},{"key":"10.3233\/JIFS-230200_ref14","doi-asserted-by":"crossref","unstructured":"Redmon J. , Divvala S.K. , Girshick R.B. and Farhadi A. , You only lookonce: Unified, real-time object detection, in 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR Las Vegas, NV, USA, June 27-30, 2016, 2016 pp. 779\u2013788.","DOI":"10.1109\/CVPR.2016.91"},{"key":"10.3233\/JIFS-230200_ref15","first-page":"6517","article-title":"YOLO: better, faster, stronger","author":"Redmon","year":"2017","journal-title":"2017 IEEE Conference on Computer Vision and Pattern Recognition, CVPR Honolulu, HI, USA, July 21-26, 2017"},{"key":"10.3233\/JIFS-230200_ref16","unstructured":"Redmon J. and Farhadi A. , \u201cYolov3: An incremental improvement,\u201d, CoRR, vol. abs\/1804.02767, 2018."},{"key":"10.3233\/JIFS-230200_ref17","unstructured":"Bochkovskiy A. , Wang C. and Liao H.M. , Yolov4: Optimal speed andaccuracy of object detection, CoRR, vol. abs\/abs\/2004.10934, 2020. [Online]. Available: https:\/\/arxiv.org\/abs\/2004.10934"},{"key":"10.3233\/JIFS-230200_ref18","unstructured":"U. of Tuebingen, \u201cSeadronessee,\u201d 2022 accessed 29 June 2022. [Online]. Available: https:\/\/seadronessee.cs.uni-tuebingen.de\/leaderboard."},{"key":"10.3233\/JIFS-230200_ref19","doi-asserted-by":"crossref","unstructured":"Wang C. , Liao H.M. , Wu Y. , Chen P. , Hsieh J. and Yeh I. , \u201cCspnet:A new backbone that can enhance learning capability of CNN,\u201d in 2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition, CVPR Workshops Seattle, WA, USA, June 14-19, 2020, 2020 pp. 1571\u20131580.","DOI":"10.1109\/CVPRW50498.2020.00203"},{"key":"10.3233\/JIFS-230200_ref20","doi-asserted-by":"crossref","unstructured":"Liu S. , Qi L. , Qin H. , Shi J. and Jia J. , \u201cPath aggregation network for instance segmentation,\u201d in 2018 IEEE Conference on Computer Vision and Pattern Recognition, CVPR Salt Lake City, UT, USA, June 18-22, 2018, 2018 pp. 8759\u20138768.","DOI":"10.1109\/CVPR.2018.00913"},{"key":"10.3233\/JIFS-230200_ref21","first-page":"9992","article-title":"Swin transformer: Hierarchical vision transformer using shifted windows","author":"Liu","year":"2021","journal-title":"2021 IEEE\/CVF International Conference on Computer Vision, ICCV Montreal, QC, Canada, October 10-17, 2021"},{"key":"10.3233\/JIFS-230200_ref22","doi-asserted-by":"crossref","unstructured":"Woo S. , Park J. , Lee J. and Kweon I.S. , \u201cCBAM: convolutional block attention module, in Computer Vision \u2013 ECCV - 15th European Conference, Munich, Germany, September 8-14, 2018, Proceedings, Part VII, ser. Lecture Notes in Computer Science, V. Ferrari, M. Hebert, C. Sminchisescu and Y. Weiss, Eds. 11211 (2018), pp. 3\u201319.","DOI":"10.1007\/978-3-030-01234-2_1"},{"key":"10.3233\/JIFS-230200_ref23","unstructured":"D. Programmer, Darklabel, 2022, accessed 20 June 2022. [Online]. Available: https:\/\/github.com\/darkpgmr\/DarkLabel."},{"issue":"1","key":"10.3233\/JIFS-230200_ref24","doi-asserted-by":"crossref","first-page":"361","DOI":"10.1007\/s00500-021-06407-8","article-title":"An improved yolov5 real-time detection method for small objects captured by UAV","volume":"26","author":"Zhan","year":"2022","journal-title":"Soft Comput."},{"key":"10.3233\/JIFS-230200_ref25","doi-asserted-by":"crossref","unstructured":"Lin T. , Maire M. , Belongie S.J. , Hays J. , Perona P. , Ramanan D. Doll\u00e1r P. and Zitnick C.L. , Microsoft COCO: common objects in context, in Computer Vision - ECCV \u2013 13th European Conference, Zurich, Switzerland, September 6-12, 2014, Proceedings, Part V, ser. Lecture Notes in Computer Science, D.J. Fleet,T. Pajdla, B. Schiele and T. Tuytelaars, Eds. 8693 (2014), 740\u2013755.","DOI":"10.1007\/978-3-319-10602-1_48"},{"key":"10.3233\/JIFS-230200_ref26","doi-asserted-by":"crossref","first-page":"103910","DOI":"10.1016\/j.imavis.2020.103910","article-title":"Recent advances in small object detection based on deep learning: A review","volume":"97","author":"Tong","year":"2020","journal-title":"Image Vis. Comput."},{"key":"10.3233\/JIFS-230200_ref27","doi-asserted-by":"crossref","unstructured":"Zhu X. , Lyu S. , Wang X. and Zhao Q. , Tph-yolov5: Improved yolov5 based on transformer prediction head for object detection on drone-captured scenarios, in IEEE\/CVF International Conference on Computer Vision Workshops, ICCVW 2021 Montreal, BC, Canada, October 11\u201317, 2021, pp. 2778\u20132788.","DOI":"10.1109\/ICCVW54120.2021.00312"},{"key":"10.3233\/JIFS-230200_ref28","doi-asserted-by":"crossref","first-page":"17","DOI":"10.1016\/B978-0-12-801460-8.00002-1","article-title":"Chapter 2 \u2013 data mining process","author":"Kotu","year":"2015","journal-title":"Predictive Analytics and Data Mining"},{"key":"10.3233\/JIFS-230200_ref29","first-page":"850","article-title":"Efficient non-maximum suppression","author":"Neubeck","year":"2006","journal-title":"18th International Conference on Pattern Recognition (ICPR 2006), 20-24 August 2006, Hong Kong, China"}],"container-title":["Journal of Intelligent &amp; Fuzzy Systems"],"original-title":[],"link":[{"URL":"https:\/\/content.iospress.com\/download?id=10.3233\/JIFS-230200","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,1,29]],"date-time":"2026-01-29T08:56:42Z","timestamp":1769677002000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/full\/10.3233\/JIFS-230200"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,8,24]]},"references-count":29,"journal-issue":{"issue":"3"},"URL":"https:\/\/doi.org\/10.3233\/jifs-230200","relation":{},"ISSN":["1064-1246","1875-8967"],"issn-type":[{"value":"1064-1246","type":"print"},{"value":"1875-8967","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,8,24]]}}}