{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,7]],"date-time":"2026-04-07T17:13:02Z","timestamp":1775581982377,"version":"3.50.1"},"reference-count":42,"publisher":"MDPI AG","issue":"3","license":[{"start":{"date-parts":[[2022,2,6]],"date-time":"2022-02-06T00:00:00Z","timestamp":1644105600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["No.12003018"],"award-info":[{"award-number":["No.12003018"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100012226","name":"Fundamental Research Funds for the Central Universities","doi-asserted-by":"publisher","award":["No. XJS191305"],"award-info":[{"award-number":["No. XJS191305"]}],"id":[{"id":"10.13039\/501100012226","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100002858","name":"China Postdoctoral Science Foundation","doi-asserted-by":"publisher","award":["No. 2018M633471"],"award-info":[{"award-number":["No. 2018M633471"]}],"id":[{"id":"10.13039\/501100002858","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Deep learning has attracted increasing attention across a number of disciplines in recent years. In the field of remote sensing, ship detection based on deep learning for synthetic aperture radar (SAR) imagery is replacing traditional methods as a mainstream research method. The multiple scales of ship objects make the detection of ship targets a challenging task in SAR images. This paper proposes a new methodology for better detection of multi-scale ship objects in SAR images, which is based on YOLOv5 with a small model size (YOLOv5s), namely the multi-scale ship detection network (MSSDNet). We construct two modules in MSSDNet: the CSPMRes2 (Cross Stage Partial network with Modified Res2Net) module for improving feature representation capability and the FC-FPN (Feature Pyramid Network with Fusion Coefficients) module for fusing feature maps adaptively. Firstly, the CSPMRes2 module introduces modified Res2Net (MRes2) with a coordinate attention module (CAM) for multi-scale features extraction in scale dimension, then the CSPMRes2 module will be used as a basic module in the depth dimension of the MSSDNet backbone. Thus, our backbone of MSSDNet has the capabilities of features extraction in both depth and scale dimensions. In the FC-FPN module, we set a learnable fusion coefficient for each feature map participating in fusion, which helps the FC-FPN module choose the best features to fuse for multi-scale objects detection tasks. After the feature fusion, we pass the output through the CSPMRes2 module for better feature representation. The performance evaluation for this study is conducted using an RTX2080Ti GPU, and two different datasets: SSDD and SARShip are used. These experiments on SSDD and SARShip datasets confirm that MSSDNet leads to superior multi-scale ship detection compared with the state-of-the-art methods. Moreover, in comparisons of network model size and inference time, our MSSDNet also has huge advantages with related methods.<\/jats:p>","DOI":"10.3390\/rs14030755","type":"journal-article","created":{"date-parts":[[2022,2,6]],"date-time":"2022-02-06T20:38:40Z","timestamp":1644179920000},"page":"755","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":102,"title":["Ship Detection in SAR Images Based on Multi-Scale Feature Extraction and Adaptive Feature Fusion"],"prefix":"10.3390","volume":"14","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-8384-457X","authenticated-orcid":false,"given":"Kexue","family":"Zhou","sequence":"first","affiliation":[{"name":"School of Aerospace Science & Technology, Xidian University, Xi\u2019an 710126, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8681-5889","authenticated-orcid":false,"given":"Min","family":"Zhang","sequence":"additional","affiliation":[{"name":"School of Aerospace Science & Technology, Xidian University, Xi\u2019an 710126, China"}]},{"given":"Hai","family":"Wang","sequence":"additional","affiliation":[{"name":"School of Aerospace Science & Technology, Xidian University, Xi\u2019an 710126, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1096-3973","authenticated-orcid":false,"given":"Jinlin","family":"Tan","sequence":"additional","affiliation":[{"name":"School of Aerospace Science & Technology, Xidian University, Xi\u2019an 710126, China"},{"name":"Shaanxi Academy of Aerospace Technology Application Co., Ltd., Xi\u2019an 710199, China"}]}],"member":"1968","published-online":{"date-parts":[[2022,2,6]]},"reference":[{"key":"ref_1","first-page":"1","article-title":"A Novel Multidimensional Domain Deep Learning Network for SAR Ship Detection","volume":"60","author":"Li","year":"2022","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"1200","DOI":"10.1109\/TGRS.2020.3004911","article-title":"Multiscale CNN with Autoencoder Regularization Joint Contextual Attention Network for SAR Image Classification","volume":"59","author":"Wu","year":"2021","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Xie, E., Ding, J., Wang, W., Zhan, X., Xu, H., Sun, P., Li, Z., and Luo, P. (2021, January 11\u201317). Detco: Unsupervised Contrastive Learning for Object Detection. Proceedings of the Proceedings of the IEEE\/CVF International Conference on Computer Vision (ICCV), Online.","DOI":"10.1109\/ICCV48922.2021.00828"},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Qian, X., Lin, S., Cheng, G., Yao, X., Ren, H., and Wang, W. (2020). Object Detection in Remote Sensing Images Based on Improved Bounding Box Regression and Multi-Level Features Fusion. Remote Sens., 12.","DOI":"10.3390\/rs12010143"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Sun, P., Zhang, R., Jiang, Y., Kong, T., Xu, C., Zhan, W., Tomizuka, M., Li, L., Yuan, Z., and Wang, C. (2021, January 19\u201325). Sparse R-Cnn: End-to-End Object Detection with Learnable Proposals. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Online.","DOI":"10.1109\/CVPR46437.2021.01422"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C., and Berg, A.C. (2016, January 8\u201316). SSD: Single shot multibox detector. Proceedings of the European Conference on Computer Vision (ECCV), Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46448-0_2"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Girshick, R.B. (2015, January 7\u201313). Fast R-CNN. Proceedings of the IEEE International Conference on Computer Vision (ICCV), Santiago, Chile.","DOI":"10.1109\/ICCV.2015.169"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"1137","DOI":"10.1109\/TPAMI.2016.2577031","article-title":"Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks","volume":"39","author":"Ren","year":"2017","journal-title":"IEEE Trans Pattern Anal. Mach. Intell."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"318","DOI":"10.1109\/TPAMI.2018.2858826","article-title":"Focal Loss for Dense Object Detection","volume":"42","author":"Lin","year":"2020","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"He, K., Gkioxari, G., Dollar, P., and Girshick, R. (2017, January 22\u201329). Mask R-CNN. Proceedings of the IEEE International Conference on Computer Vision (ICCV), Venice, Italy.","DOI":"10.1109\/ICCV.2017.322"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Redmon, J., and Farhadi, A. (2017, January 21\u201326). YOLO9000: Better faster stronger. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.690"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Redmon, J., Divvala, S., Girshick, R., and Farhadi, A. (2016, January 27\u201330). You only look once: Unified, real-time object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.91"},{"key":"ref_13","unstructured":"Redmon, J., and Farhadi, A. (2018). YOLOv3: An incremental improvement. arXiv."},{"key":"ref_14","unstructured":"Bochkovskiy, A., Wang, C., and Liao, H. (2020). YOLOv4: Optimal speed and accuracy of object detection. arXiv."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"431","DOI":"10.1109\/LGRS.2020.2975541","article-title":"Cross-scale feature fusion for object detection in optical remote sensing images","volume":"18","author":"Cheng","year":"2020","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Huang, Z., Pan, Z., and Lei, B. (2017). Transfer learning with deep convolutional neural network for SAR target classification with limited labeled data. Remote Sens., 9.","DOI":"10.3390\/rs9090907"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Yu, J., Zhou, G., Zhou, S., and Qin, M. (2022). A Fast and Lightweight Detection Network for Multi-Scale SAR Ship Detection under Complex Backgrounds. Remote Sens., 14.","DOI":"10.3390\/rs14010031"},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"9805389","DOI":"10.34133\/2021\/9805389","article-title":"Feature enhancement network for object detection in optical remote sensing images","volume":"2021","author":"Cheng","year":"2021","journal-title":"J. Remote Sens."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Kang, M., Leng, X., Lin, Z., and Ji, K. (2017, January 18\u201321). A Modified Faster R-CNN Based on CFAR Algorithm for SAR Ship Detection. Proceedings of the 2017 International Workshop on Remote Sensing with Intelligent Processing (RSIP), Shanghai, China.","DOI":"10.1109\/RSIP.2017.7958815"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Li, J., Qu, C., and Shao, J. (2017, January 13\u201314). Ship detection in SAR images based on an improved faster R-CNN. Proceedings of the SAR in Big Data Era (BIGSARDATA), Beijing, China.","DOI":"10.1109\/BIGSARDATA.2017.8124934"},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"20881","DOI":"10.1109\/ACCESS.2018.2825376","article-title":"A Densely Connected End-to-End Neural Network for Multiscale and Multiscene SAR Ship Detection","volume":"6","author":"Jiao","year":"2018","journal-title":"IEEE Access"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Kang, M., Ji, K., Leng, X., and Lin, Z. (2017). Contextual Region-Based Convolutional Neural Network with Multilayer Fusion for SAR Ship Detection. Remote Sens., 9.","DOI":"10.3390\/rs9080860"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Liu, N., Cao, Z., Cui, Z., Pi, Y., and Dang, S. (2019). Multi-Scale Proposal Generation for Ship Detection in SAR Images. Remote Sens., 11.","DOI":"10.3390\/rs11050526"},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"1331","DOI":"10.1109\/TGRS.2020.3005151","article-title":"An Anchor-Free Method Based on Feature Balancing and Refinement Network for Multiscale Ship Detection in SAR Images","volume":"59","author":"Fu","year":"2021","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"8983","DOI":"10.1109\/TGRS.2019.2923988","article-title":"Dense Attention Pyramid Networks for Multi-Scale Ship Detection in SAR Images","volume":"57","author":"Cui","year":"2019","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Gao, F., He, Y., Wang, J., Hussain, A., and Zhou, H. (2020). Anchor-free Convolutional Network with Dense Attention Feature Aggregation for Ship Detection in SAR Images. Remote Sens., 12.","DOI":"10.3390\/rs12162619"},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"107787","DOI":"10.1016\/j.patcog.2020.107787","article-title":"A CenterNet++ model for ship detection in SAR images","volume":"112","author":"Guo","year":"2021","journal-title":"Pattern Recognit."},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Wu, Z., Hou, B., Ren, B., Ren, Z., Wang, S., and Jiao, L. (2021). A Deep Detection Network Based on Interaction of Instance Segmentation and Object Detection for SAR Images. Remote Sens., 13.","DOI":"10.3390\/rs13132582"},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Yu, L., Wu, H., Zhong, Z., Zheng, L., Deng, Q., and Hu, H. (2021). TWC-Net: A SAR Ship Detection Using Two-Way Convolution and Multiscale Feature Mapping. Remote Sens., 13.","DOI":"10.3390\/rs13132558"},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Zhang, T., Zhang, X., and Ke, X. (2021). Quad-FPN: A Novel Quad Feature Pyramid Network for SAR Ship Detection. Remote Sens., 13.","DOI":"10.3390\/rs13142771"},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Wang, Y., Wang, C., Zhang, H., Dong, Y., and Wei, S. (2019). A SAR Dataset of Ship Detection for Deep Learning under Complex Backgrounds. Remote Sens., 11.","DOI":"10.3390\/rs11070765"},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Wang, C.-Y., Liao, H.-Y.M., Wu, Y.-H., Chen, P.-Y., Hsieh, J.-W., and Yeh, I.-H. (2020, January 14\u201319). CSPNet: A New Backbone That Can Enhance Learning Capability of CNN. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR) workshops, Online.","DOI":"10.1109\/CVPRW50498.2020.00203"},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Hou, Q., Zhou, D., and Feng, J. (2021, January 19\u201325). Coordinate Attention for Efficient Mobile Network Design. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Online.","DOI":"10.1109\/CVPR46437.2021.01350"},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"652","DOI":"10.1109\/TPAMI.2019.2938758","article-title":"Res2net: A New Multi-Scale Backbone Architecture","volume":"43","author":"Gao","year":"2019","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Qiu, H., Ma, Y., Li, Z., Liu, S., and Sun, J. (2020, January 23\u201328). Borderdet: Border Feature for Dense Object Detection. Proceedings of the European Conference on Computer Vision (ECCV), Glasgow, UK.","DOI":"10.1007\/978-3-030-58452-8_32"},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Wang, J., Song, L., Li, Z., Sun, H., Sun, J., and Zheng, N. (2021, January 19\u201325). End-to-End Object Detection with Fully Convolutional Network. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Online.","DOI":"10.1109\/CVPR46437.2021.01559"},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Li, X., Wang, W., Hu, X., Li, J., Tang, J., and Yang, J. (2021, January 19\u201325). Generalized Focal Loss v2: Learning Reliable Localization Quality Estimation for Dense Object Detection. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Online.","DOI":"10.1109\/CVPR46437.2021.01146"},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Ge, Z., Liu, S., Li, Z., Yoshie, O., and Sun, J. (2021, January 19\u201325). OTA: Optimal Transport Assignment for Object Detection. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Online.","DOI":"10.1109\/CVPR46437.2021.00037"},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Chen, Q., Wang, Y., Yang, T., Zhang, X., Cheng, J., and Sun, J. (2021, January 19\u201325). You Only Look One-Level Feature. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Online.","DOI":"10.1109\/CVPR46437.2021.01284"},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Kim, K., and Lee, H.S. (2020, January 23\u201328). Probabilistic Anchor Assignment with Iou Prediction for Object Detection. Proceedings of the European Conference on Computer Vision (ECCV), Glasgow, UK.","DOI":"10.1007\/978-3-030-58595-2_22"},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"98","DOI":"10.1007\/s11263-014-0733-5","article-title":"The Pascal Visual Object Classes Challenge: A Retrospective","volume":"111","author":"Everingham","year":"2015","journal-title":"Int. J. Comput. Vis."},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Lin, T.Y., Maire, M., Belongie, S., Hays, J., Perona, P., Ramanan, D., Doll\u00e1r, P., and Zitnick, C.L. (2014, January 6\u201312). Microsoft Coco: Common Objects in Context. Proceedings of the European Conference on Computer Vision (ECCV), Zurich, Switzerland.","DOI":"10.1007\/978-3-319-10602-1_48"}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/14\/3\/755\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T22:15:02Z","timestamp":1760134502000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/14\/3\/755"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,2,6]]},"references-count":42,"journal-issue":{"issue":"3","published-online":{"date-parts":[[2022,2]]}},"alternative-id":["rs14030755"],"URL":"https:\/\/doi.org\/10.3390\/rs14030755","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,2,6]]}}}