{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,14]],"date-time":"2026-02-14T05:18:27Z","timestamp":1771046307762,"version":"3.50.1"},"reference-count":51,"publisher":"MDPI AG","issue":"16","license":[{"start":{"date-parts":[[2021,8,11]],"date-time":"2021-08-11T00:00:00Z","timestamp":1628640000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["62071339"],"award-info":[{"award-number":["62071339"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61872277"],"award-info":[{"award-number":["61872277"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"name":"National Key R&D Program of China","award":["2020YFC1522703"],"award-info":[{"award-number":["2020YFC1522703"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>The detection of elongated objects, such as ships, from satellite images has important applications in marine transportation, shipping management, and many other scenarios. At present, research on general object detection using neural networks has made significant progress. However, in ship detection from remote sensing images, the elongated shape of ships and the wide variety of ship sizes often make the detection accuracy unsatisfactory. In particular, the detection accuracy for small-scale ships is much lower than that for large-scale ones. To this end, we propose a hierarchical scale-sensitive CenterNet (HSSCenterNet) for ship detection from remote sensing images. 
HSSCenterNet adopts a multi-task learning strategy. First, it introduces a dual-direction vector to represent the posture or direction of the tilted bounding box, and employs a two-layer network to predict this vector, which improves the detection block of CenterNet and enables it to detect targets with tilted posture. Second, it divides the full-scale detection task into three parallel sub-tasks for large-scale, medium-scale, and small-scale ship detection, respectively, and obtains the final results with non-maximum suppression. Experimental results show that HSSCenterNet achieves significantly improved performance in detecting small-scale ship targets while maintaining high performance at medium and large scales.<\/jats:p>","DOI":"10.3390\/rs13163182","type":"journal-article","created":{"date-parts":[[2021,8,11]],"date-time":"2021-08-11T08:35:52Z","timestamp":1628670952000},"page":"3182","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":11,"title":["Elongated Small Object Detection from Remote Sensing Images Using Hierarchical Scale-Sensitive Networks"],"prefix":"10.3390","volume":"13","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-7700-0901","authenticated-orcid":false,"given":"Zheng","family":"He","sequence":"first","affiliation":[{"name":"School of Computer Science, Wuhan University, Wuhan 430072, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Li","family":"Huang","sequence":"additional","affiliation":[{"name":"School of Computer Science, Wuhan University, Wuhan 430072, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Weijiang","family":"Zeng","sequence":"additional","affiliation":[{"name":"School of Computer Science, Wuhan University, Wuhan 430072, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Xining","family":"Zhang","sequence":"additional","affiliation":[{"name":"School of Computer Science, 
Wuhan University, Wuhan 430072, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yongxin","family":"Jiang","sequence":"additional","affiliation":[{"name":"Department of Navigation, Dalian Naval Academy, Dalian 116018, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7955-0782","authenticated-orcid":false,"given":"Qin","family":"Zou","sequence":"additional","affiliation":[{"name":"School of Computer Science, Wuhan University, Wuhan 430072, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2021,8,11]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Girshick, R., Donahue, J., Darrell, T., and Malik, J. (2014, January 23\u201328). Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA.","DOI":"10.1109\/CVPR.2014.81"},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"7695","DOI":"10.3390\/rs70607695","article-title":"Automatic ship detection in SAR images using multi-scale heterogeneities and an a contrario decision","volume":"7","author":"Huang","year":"2015","journal-title":"Remote Sens."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"175","DOI":"10.1016\/j.infrared.2015.09.015","article-title":"Small object detection in forward-looking infrared images with sea clutter using context-driven Bayesian saliency model","volume":"73","author":"Yu","year":"2015","journal-title":"Infrared Phys. Technol."},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Gao, F., Shi, W., Wang, J., Yang, E., and Zhou, H. (2019). Enhanced feature extraction for ship detection from multi-resolution and multi-scene synthetic aperture radar (SAR) images. 
Remote Sens., 11.","DOI":"10.3390\/rs11222694"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Fan, Q., Chen, F., Cheng, M., Lou, S., Xiao, R., Zhang, B., Wang, C., and Li, J. (2019). Ship detection using a fully convolutional network with compact polarimetric sar images. Remote Sens., 11.","DOI":"10.3390\/rs11182171"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Chen, L., Shi, W., and Deng, D. (2021). Improved YOLOv3 Based on Attention Mechanism for Fast and Accurate Ship Detection in Optical Remote Sensing Images. Remote Sens., 13.","DOI":"10.3390\/rs13040660"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Xu, P., Li, Q., Zhang, B., Wu, F., Zhao, K., Du, X., Yang, C., and Zhong, R. (2021). On-Board Real-Time Ship Detection in HISEA-1 SAR Images Based on CFAR and Lightweight Deep Learning. Remote Sens., 13.","DOI":"10.3390\/rs13101995"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.rse.2017.12.033","article-title":"Vessel detection and classification from spaceborne optical images: A literature survey","volume":"207","author":"Kanjir","year":"2018","journal-title":"Remote Sens. Environ."},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Wang, Y., Wang, C., Zhang, H., Dong, Y., and Wei, S. (2019). A SAR dataset of ship detection for deep learning under complex backgrounds. Remote Sens., 11.","DOI":"10.3390\/rs11070765"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Wang, Y., Wang, C., Zhang, H., Dong, Y., and Wei, S. (2019). Automatic ship detection based on RetinaNet using multi-resolution Gaofen-3 imagery. Remote Sens., 11.","DOI":"10.3390\/rs11050531"},{"key":"ref_11","unstructured":"Ren, S., He, K., Girshick, R., and Sun, J. (2015, January 7\u201312). Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. 
Proceedings of the Advances in Neural Information Processing Systems, Montreal, QC, Canada."},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Ding, J., Xue, N., Long, Y., Xia, G.S., and Lu, Q. (2019, January 15\u201320). Learning roi transformer for oriented object detection in aerial images. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00296"},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Han, J., Ding, J., Xue, N., and Xia, G.S. (2021, January 19\u201325). Redet: A rotation-equivariant detector for aerial object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Online.","DOI":"10.1109\/CVPR46437.2021.00281"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"781","DOI":"10.1109\/TCSVT.2019.2897980","article-title":"Saliency-aware convolution neural network for ship detection in surveillance video","volume":"30","author":"Shao","year":"2020","journal-title":"IEEE Trans. Circuits Syst. Video Technol."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"5110","DOI":"10.1109\/TITS.2019.2949005","article-title":"Surrounding Vehicle Detection Using an FPGA Panoramic Camera and Deep CNNs","volume":"21","author":"Chen","year":"2019","journal-title":"IEEE Trans. Intell. Transp. Syst."},{"key":"ref_16","unstructured":"Everingham, M., Zisserman, A., and Williams, C. (2005, January 11). The 2005 PASCAL Visual Object Classes Challenge. Proceedings of the First International Conference on Machine Learning Challenges: Evaluating Predictive Uncertainty Visual Object Classification, and Recognizing Textual Entailment, Southampton, UK."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"98","DOI":"10.1007\/s11263-014-0733-5","article-title":"The Pascal Visual Object Classes Challenge: A Retrospective","volume":"111","author":"Everingham","year":"2014","journal-title":"Int. J. Comput. 
Vis."},{"key":"ref_18","unstructured":"Chen, C., Liu, M.Y., Tuzel, O., and Xiao, J. (2016). R-CNN for small object detection. Asian Conference on Computer Vision, Springer."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"103910","DOI":"10.1016\/j.imavis.2020.103910","article-title":"Recent advances in small object detection based on deep learning: A review","volume":"97","author":"Tong","year":"2020","journal-title":"Image Vis. Comput."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"10600","DOI":"10.1109\/TIE.2019.2962413","article-title":"DenseLightNet: A light-weight vehicle detection network for autonomous driving","volume":"67","author":"Chen","year":"2020","journal-title":"IEEE Trans. Ind. Electron."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Lin, T.Y., Dollar, P., Girshick, R., He, K., Hariharan, B., and Belongie, S. (2017, January 21\u201326). Feature Pyramid Networks for Object Detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.106"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Liu, S., Qi, L., Qin, H., Shi, J., and Jia, J. (2018, January 18\u201322). Path Aggregation Network for Instance Segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00913"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Singh, B., and Davis, L.S. (2018, January 18\u201322). An Analysis of Scale Invariance in Object Detection-SNIP. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00377"},{"key":"ref_24","unstructured":"Singh, B., Najibi, M., and Davis, L.S. (2018, January 3\u20138). SNIPER: Efficient Multi-Scale Training. 
Proceedings of the Advances in Neural Information Processing Systems, Montr\u00e9al, QC, Canada."},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Hu, P., and Ramanan, D. (2017, January 21\u201326). Finding Tiny Faces. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.166"},{"key":"ref_26","unstructured":"Li, Y., Chen, Y., Wang, N., and Zhang, Z. (2019, October 27\u2013November 2). Scale-Aware Trident Networks for Object Detection. Proceedings of the IEEE\/CVF International Conference on Computer Vision (ICCV), Seoul, Korea."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Bai, Y., Zhang, Y., Ding, M., and Ghanem, B. (2018, January 18\u201322). Finding Tiny Faces in the Wild with Generative Adversarial Network. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00010"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Bai, Y., and Ghanem, B. (2017). Multi-Branch Fully Convolutional Network for Face Detection. arXiv.","DOI":"10.1109\/CVPRW.2017.259"},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Li, J., Liang, X., Wei, Y., Xu, T., Feng, J., and Yan, S. (2017, January 21\u201326). Perceptual Generative Adversarial Networks for Small Object Detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.211"},{"key":"ref_30","unstructured":"Zhou, X., Wang, D., and Kr\u00e4henb\u00fchl, P. (2019). Objects as points. arXiv."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"2394","DOI":"10.1109\/TIP.2017.2676342","article-title":"Unsupervised simplification of image hierarchies via evolution analysis in scale-sets framework","volume":"26","author":"Hu","year":"2017","journal-title":"IEEE Trans. 
Image Process."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"2461","DOI":"10.1109\/JSTARS.2018.2833102","article-title":"Stepwise evolution analysis of the region-merging segmentation for scale parameterization","volume":"11","author":"Hu","year":"2018","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_33","unstructured":"Dai, J., Li, Y., He, K., and Sun, J. (2016, January 5\u201310). R-FCN: Object Detection via Region-based Fully Convolutional Networks. Proceedings of the Advances in Neural Information Processing Systems, Barcelona, Spain."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Dai, J., Qi, H., Xiong, Y., Li, Y., Zhang, G., Hu, H., and Wei, Y. (2017, January 22\u201329). Deformable Convolutional Networks. Proceedings of the IEEE International Conference on Computer Vision (ICCV), Venice, Italy.","DOI":"10.1109\/ICCV.2017.89"},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Lin, Y., Abdelfatah, K., Zhou, Y., Fan, X., Yu, H., Qian, H., and Wang, S. (2015, January 7\u201313). Co-interest person detection from multiple wearable camera videos. Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile.","DOI":"10.1109\/ICCV.2015.503"},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"3111","DOI":"10.1109\/TMM.2018.2818020","article-title":"Arbitrary-Oriented Scene Text Detection via Rotation Proposals","volume":"20","author":"Ma","year":"2018","journal-title":"IEEE Trans. Multimed."},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Jiang, Y., Zhu, X., Wang, X., Yang, S., Li, W., Wang, H., Fu, P., and Luo, Z. (2017). R2CNN: Rotational Region CNN for Orientation Robust Scene Text Detection. arXiv.","DOI":"10.1109\/ICPR.2018.8545598"},{"key":"ref_38","unstructured":"Yang, X., Yang, J., Yan, J., Zhang, Y., Zhang, T., Guo, Z., Sun, X., and Fu, K. (2019, October 27\u2013November 2). 
SCRDet: Towards More Robust Detection for Small, Cluttered and Rotated Objects. Proceedings of the IEEE International Conference on Computer Vision, Seoul, Korea."},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Zhou, L., Wei, H., and Li, H. (2020). Objects detection for remote sensing images based on polar coordinates. arXiv.","DOI":"10.1109\/ACCESS.2020.3041025"},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Law, H., and Deng, J. (2018, January 8\u201314). CornerNet: Detecting Objects as Paired Keypoints. Proceedings of the European Conference on Computer Vision, Munich, Germany.","DOI":"10.1007\/978-3-030-01264-9_45"},{"key":"ref_41","unstructured":"Law, H., Teng, Y., Russakovsky, O., and Deng, J. (2020). CornerNet-Lite: Efficient Keypoint Based Object Detection. arXiv."},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Tian, Z., Shen, C., Chen, H., and He, T. (2019, January 27\u201328). FCOS: Fully Convolutional One-Stage Object Detection. Proceedings of the IEEE International Conference on Computer Vision (ICCV), Seoul, Korea.","DOI":"10.1109\/ICCV.2019.00972"},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep Residual Learning for Image Recognition. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition CVPR, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Cao, Y., Ju, L., Zou, Q., Qu, C., and Wang, S. (2011, January 20\u201325). A Multichannel Edge-Weighted Centroidal Voronoi Tessellation algorithm for 3D super-alloy image segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Colorado Springs, CO, USA.","DOI":"10.1109\/CVPR.2011.5995590"},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Lin, T.Y., Maire, M., Belongie, S., and Hays, J. (2014, January 6\u201312). 
Microsoft COCO: Common Objects in Context. Proceedings of the European Conference on Computer Vision, Zurich, Switzerland.","DOI":"10.1007\/978-3-319-10602-1_48"},{"key":"ref_46","unstructured":"Paszke, A., Gross, S., Massa, F., Lerer, A., Bradbury, J., Chanan, G., Killeen, T., Lin, Z., Gimelshein, N., and Antiga, L. (2019, January 8\u201314). PyTorch: An Imperative Style, High-Performance Deep Learning Library. Proceedings of the Advances in Neural Information Processing Systems, Vancouver, BC, Canada."},{"key":"ref_47","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2015, January 7\u201313). Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification. Proceedings of the IEEE International Conference on Computer Vision (ICCV), Santiago, Chile.","DOI":"10.1109\/ICCV.2015.123"},{"key":"ref_48","doi-asserted-by":"crossref","unstructured":"Lin, T.Y., Goyal, P., Girshick, R., He, K., and Doll\u00e1r, P. (2017, January 22\u201329). Focal loss for dense object detection. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.324"},{"key":"ref_49","unstructured":"Sutskever, I., Martens, J., Dahl, G., and Hinton, G. (2013, January 16\u201321). On the importance of initialization and momentum in deep learning. Proceedings of the International Conference on Machine Learning, Atlanta, GA, USA."},{"key":"ref_50","unstructured":"Ioffe, S., and Szegedy, C. (2015, January 6\u201311). Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift. Proceedings of the International Conference on Machine Learning, Lille, France."},{"key":"ref_51","doi-asserted-by":"crossref","unstructured":"Wu, Y., and He, K. (2018, January 8\u201314). Group Normalization. 
Proceedings of the European Conference on Computer Vision, Munich, Germany.","DOI":"10.1007\/978-3-030-01261-8_1"}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/13\/16\/3182\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T06:44:13Z","timestamp":1760165053000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/13\/16\/3182"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,8,11]]},"references-count":51,"journal-issue":{"issue":"16","published-online":{"date-parts":[[2021,8]]}},"alternative-id":["rs13163182"],"URL":"https:\/\/doi.org\/10.3390\/rs13163182","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,8,11]]}}}