{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,8]],"date-time":"2026-04-08T16:24:15Z","timestamp":1775665455416,"version":"3.50.1"},"reference-count":89,"publisher":"MDPI AG","issue":"21","license":[{"start":{"date-parts":[[2022,10,30]],"date-time":"2022-10-30T00:00:00Z","timestamp":1667088000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"National Natural Science Foundation of China","award":["62076190"],"award-info":[{"award-number":["62076190"]}]},{"name":"National Natural Science Foundation of China","award":["61572384"],"award-info":[{"award-number":["61572384"]}]},{"name":"National Natural Science Foundation of China","award":["41831072"],"award-info":[{"award-number":["41831072"]}]},{"name":"National Natural Science Foundation of China","award":["2022ZDLGY01-11"],"award-info":[{"award-number":["2022ZDLGY01-11"]}]},{"name":"The Key Industry Innovation Chain of Shaanxi","award":["62076190"],"award-info":[{"award-number":["62076190"]}]},{"name":"The Key Industry Innovation Chain of Shaanxi","award":["61572384"],"award-info":[{"award-number":["61572384"]}]},{"name":"The Key Industry Innovation Chain of Shaanxi","award":["41831072"],"award-info":[{"award-number":["41831072"]}]},{"name":"The Key Industry Innovation Chain of Shaanxi","award":["2022ZDLGY01-11"],"award-info":[{"award-number":["2022ZDLGY01-11"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Ships comprise the only and most important ocean transportation mode. Thus, ship detection is one of the most critical technologies in ship monitoring, which plays an essential role in maintaining marine safety. Optical remote-sensing images contain rich color and texture information, which is beneficial to ship detection. However, few optical remote-sensing datasets are open publicly due to the issue of sensitive data and copyrights, and only the HRSC2016 dataset is built for the ship-detection task. Moreover, almost all general object detectors suffer from the failure of multi-scale ship detection because of the diversity of spatial resolution and ship size. In this paper, we re-annotate the HRSC2016 dataset and supplement 610 optical remote-sensing images to build a new open source ship-detection benchmark dataset with rich multi-scale ship objects named the HRSC2016-MS dataset. In addition, we further explore the potential of a recursive mechanism in the field of object detection and propose a novel multi-scale ship-detection framework (MSSDet) in optical remote-sensing images. The success of detecting multi-scale objects depends on the hierarchical pyramid structure in the object-detection framework. However, the inherent semantic and spatial gaps among hierarchical pyramid levels seriously affect detection performance. To alleviate this problem, we propose a joint recursive feature pyramid (JRFP), which can generate semantically strong and spatially refined multi-scale features. Extensive experiments were conducted on the HRSC2016-MS, HRSC2016, and DIOR datasets. Detailed ablation studies directly demonstrated the effectiveness of the proposed JRFP architecture and also showed that the proposed method has excellent generalizability. Comparisons with state-of-the-art methods showed that the proposed method achieves competitive performance, i.e., 77.3%, 95.8%, and 73.3% mean average precision accuracy on the HRSC2016-MS, HRSC2016, and DIOR datasets, respectively.<\/jats:p>","DOI":"10.3390\/rs14215460","type":"journal-article","created":{"date-parts":[[2022,10,30]],"date-time":"2022-10-30T10:47:57Z","timestamp":1667126877000},"page":"5460","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":32,"title":["MSSDet: Multi-Scale Ship-Detection Framework in Optical Remote-Sensing Images and New Benchmark"],"prefix":"10.3390","volume":"14","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-0586-1278","authenticated-orcid":false,"given":"Weiming","family":"Chen","sequence":"first","affiliation":[{"name":"School of Electronic Engineering, Xidian University, Xi\u2019an 710071, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6473-0438","authenticated-orcid":false,"given":"Bing","family":"Han","sequence":"additional","affiliation":[{"name":"School of Electronic Engineering, Xidian University, Xi\u2019an 710071, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5491-8912","authenticated-orcid":false,"given":"Zheng","family":"Yang","sequence":"additional","affiliation":[{"name":"School of Electronic Engineering, Xidian University, Xi\u2019an 710071, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1443-0776","authenticated-orcid":false,"given":"Xinbo","family":"Gao","sequence":"additional","affiliation":[{"name":"School of Electronic Engineering, Xidian University, Xi\u2019an 710071, China"},{"name":"Chongqing Key Laboratory of Image Cognition, Chongqing University of Posts and Telecommunications, Chongqing 400065, China"}]}],"member":"1968","published-online":{"date-parts":[[2022,10,30]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"226","DOI":"10.1109\/LGRS.2009.2031826","article-title":"Characterization of a Bayesian Ship Detection Method in Optical Satellite Images","volume":"7","author":"Proia","year":"2010","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"3446","DOI":"10.1109\/TGRS.2010.2046330","article-title":"A Novel Hierarchical Method of Ship Detection from Spaceborne Optical Image Based on Shape and Texture Features","volume":"48","author":"Zhu","year":"2010","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"937","DOI":"10.1109\/LGRS.2018.2813094","article-title":"Arbitrary-Oriented Ship Detection Framework in Optical Remote-Sensing Images","volume":"15","author":"Liu","year":"2018","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"50839","DOI":"10.1109\/ACCESS.2018.2869884","article-title":"Position Detection and Direction Prediction for Arbitrary-Oriented Ships via Multitask Rotation Region Convolutional Neural Network","volume":"6","author":"Yang","year":"2018","journal-title":"IEEE Access"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"5772","DOI":"10.1109\/TGRS.2020.2969979","article-title":"A Rotational Libra R-CNN Method for Ship Detection","volume":"58","author":"Guo","year":"2020","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"686","DOI":"10.1109\/TGRS.2020.2995477","article-title":"A Novel CNN-Based Method for Accurate Ship Detection in HR Optical Remote Sensing Images via Rotated Bounding Box","volume":"59","author":"Li","year":"2021","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_7","first-page":"1","article-title":"A Cascade Rotated Anchor-Aided Detector for Ship Detection in Remote Sensing Images","volume":"60","author":"Yu","year":"2022","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_8","first-page":"112","article-title":"A ship detection method with high-resolution remote sensing images","volume":"38","author":"Sun","year":"2013","journal-title":"Sci. Surv. Mapp."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"1920","DOI":"10.1109\/LGRS.2016.2618385","article-title":"A Novel Inshore Ship Detection via Ship Head Classification and Body Boundary Determination","volume":"13","author":"Li","year":"2016","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"151","DOI":"10.1007\/s11801-017-7014-9","article-title":"Ship detection in optical remote sensing image based on visual saliency and AdaBoost classifier","volume":"13","author":"Wang","year":"2017","journal-title":"Optoelectron. Lett."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"5837","DOI":"10.1080\/01431161.2010.512310","article-title":"A complete processing chain for ship detection using optical satellite imagery","volume":"31","author":"Corbane","year":"2010","journal-title":"Int. J. Remote Sens."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"3014","DOI":"10.1109\/JSTARS.2019.2919382","article-title":"Geospatial Object Detection via Deconvolutional Region Proposal Network","volume":"12","author":"Wang","year":"2019","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Wang, Y., Wang, C., Zhang, H., Dong, Y., and Wei, S. (2019). Automatic Ship Detection Based on RetinaNet Using Multi-Resolution Gaofen-3 Imagery. Remote Sens., 11.","DOI":"10.3390\/rs11050531"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"1645","DOI":"10.1109\/TGRS.2020.2999082","article-title":"X-LineNet: Detecting Aircraft in Remote Sensing Images by a Pair of Intersecting Line Segments","volume":"59","author":"Wei","year":"2021","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Lin, T.Y., Doll\u00e1r, P., Girshick, R., He, K., Hariharan, B., and Belongie, S. (2017, January 21\u201326). Feature Pyramid Networks for Object Detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.106"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Liu, S., Qi, L., Qin, H., Shi, J., and Jia, J. (2018, January 18\u201323). Path Aggregation Network for Instance Segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00913"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Zhou, P., Ni, B., Geng, C., Hu, J., and Xu, Y. (2018, January 18\u201323). Scale-Transferrable Object Detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00062"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Pang, J., Chen, K., Shi, J., Feng, H., Ouyang, W., and Lin, D. (2019, January 15\u201320). Libra R-CNN: Towards Balanced Learning for Object Detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00091"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Ghiasi, G., Lin, T.Y., and Le, Q.V. (2019, January 15\u201320). NAS-FPN: Learning Scalable Feature Pyramid Architecture for Object Detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00720"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Xu, A., Yao, A., Li, A., Liang, A., and Zhang, A. (2019, January 15\u201320). Auto-FPN: Automatic Network Architecture Adaptation for Object Detection Beyond Classification. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/ICCV.2019.00675"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Wang, N., Gao, Y., Chen, H., Wang, P., Tian, Z., Shen, C., and Zhang, Y. (2020, January 14\u201319). NAS-FCOS: Fast Neural Architecture Search for Object Detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.01196"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Tan, M., Pang, R., and Le, Q.V. (2020, January 14\u201319). EfficientDet: Scalable and Efficient Object Detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.01079"},{"key":"ref_23","unstructured":"Liang, M., and Hu, X. (2015, January 7\u201312). Recurrent convolutional neural network for object recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA."},{"key":"ref_24","unstructured":"Kim, J., Lee, J.K., and Lee, K.M. (July, January 26). Deeply-Recursive Convolutional Network for Image Super-Resolution. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA."},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Tai, Y., Yang, J., and Liu, X. (2017, January 21\u201326). Image Super-Resolution via Deep Recursive Residual Network. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.298"},{"key":"ref_26","first-page":"11653","article-title":"CBNet: A Novel Composite Backbone Network Architecture for Object Detection","volume":"34","author":"Liu","year":"2020","journal-title":"Proc. AAAI Conf. Artif. Intell."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Qiao, S., Chen, L.C., and Yuille, A. (2020). DetectoRS: Detecting Objects with Recursive Feature Pyramid and Switchable Atrous Convolution. arXiv.","DOI":"10.1109\/CVPR46437.2021.01008"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Liu, Z., Yuan, L., Weng, L., and Yang, Y. (2016, January 20). A High Resolution Optical Satellite Image Dataset for Ship Recognition and Some New Baselines. Proceedings of the 6th International Conference on Pattern Recognition Applications and Methods, Porto, Portugal.","DOI":"10.5220\/0006120603240331"},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"296","DOI":"10.1016\/j.isprsjprs.2019.11.023","article-title":"Object detection in optical remote sensing images: A survey and a new benchmark","volume":"159","author":"Li","year":"2020","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"1137","DOI":"10.1109\/TPAMI.2016.2577031","article-title":"Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks","volume":"39","author":"Ren","year":"2017","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_31","unstructured":"Li, Z., Peng, C., Yu, G., Zhang, X., Deng, Y., and Sun, J. (2017). Light-Head R-CNN: In Defense of Two-Stage Object Detector. arXiv."},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"He, K., Gkioxari, G., Doll\u00e1r, P., and Girshick, R. (2017, January 22\u201329). Mask R-CNN. Proceedings of the International Confenrece Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.322"},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Cai, Z., and Vasconcelos, N. (2018, January 18\u201322). Cascade R-CNN: Delving Into High Quality Object Detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00644"},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Lu, X., Li, B., Yue, Y., Li, Q., and Yan, J. (2019, January 15\u201320). Grid R-CNN. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00754"},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Wu, Y., Chen, Y., Yuan, L., Liu, Z., Wang, L., Li, H., and Fu, Y. (2019). Rethinking Classification and Localization for Object Detection. arXiv.","DOI":"10.1109\/CVPR42600.2020.01020"},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Zhang, H., Chang, H., Ma, B., Wang, N., and Chen, X. (2020). Dynamic R-CNN: Towards High Quality Object Detection via Dynamic Training. arXiv.","DOI":"10.1007\/978-3-030-58555-6_16"},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Sun, P., Zhang, R., Jiang, Y., Kong, T., Xu, C., Zhan, W., Tomizuka, M., Li, L., Yuan, Z., and Wang, C. (2020). SparseR-CNN: End-to-End Object Detection with Learnable Proposals. arXiv.","DOI":"10.1109\/CVPR46437.2021.01422"},{"key":"ref_38","unstructured":"Redmon, J., Divvala, S., Girshick, R., and Farhadi, A. (July, January 26). You Only Look Once: Unified, Real-Time Object Detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA."},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Redmon, J., and Farhadi, A. (2017, January 21\u201326). YOLO9000: Better, Faster, Stronger. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.690"},{"key":"ref_40","unstructured":"Joseph, R., and Ali, F. (2018). YOLOv3: An Incremental Improvement. arXiv."},{"key":"ref_41","unstructured":"Alexey, B., Chien-Yao, W., and Hong-Yuan, M.L. (2020). YOLOv4: Optimal Speed and Accuracy of Object Detection. arXiv."},{"key":"ref_42","unstructured":"Jocher, G. (2022, April 20). YOLOv5. Available online: https:\/\/github.com\/ultralytics\/yolov5."},{"key":"ref_43","unstructured":"Ge, Z., Liu, S., Wang, F., Li, Z., and Sun, J. (2021). YOLOX: Exceeding YOLO Series in 2021. arXiv."},{"key":"ref_44","doi-asserted-by":"crossref","first-page":"5832","DOI":"10.1109\/TGRS.2016.2572736","article-title":"Ship Detection in Spaceborne Optical Image With SVD Networks","volume":"54","author":"Zou","year":"2016","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_45","first-page":"1","article-title":"Ship detection in optical remote sensing images based on deep convolutional neural networks","volume":"11","author":"Yuan","year":"2017","journal-title":"J. Appl. Remote Sens."},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"7147","DOI":"10.1109\/TGRS.2018.2848901","article-title":"HSF-Net: Multiscale Deep Feature Embedding for Ship Detection in Optical Remote Sensing Imagery","volume":"56","author":"Li","year":"2018","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_47","doi-asserted-by":"crossref","first-page":"449","DOI":"10.1109\/LGRS.2018.2793960","article-title":"Ship Detection from Thermal Remote Sensing Imagery through Region-Based Deep Forest","volume":"15","author":"Yang","year":"2018","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_48","doi-asserted-by":"crossref","unstructured":"Nie, S., Jiang, Z., Zhang, H., Cai, B., and Yao, Y. (2018, January 22\u201327). Inshore Ship Detection Based on Mask R-CNN. Proceedings of the IGARSS 2018\u20142018 IEEE International Geoscience and Remote Sensing Symposium, Valencia, Spain.","DOI":"10.1109\/IGARSS.2018.8519123"},{"key":"ref_49","doi-asserted-by":"crossref","unstructured":"Jiang, Y., Zhu, X., Wang, X., Yang, S., Li, W., Wang, H., Fu, P., and Luo, Z. (2017). R2CNN: Rotational Region CNN for Orientation Robust Scene Text Detection. arXiv.","DOI":"10.1109\/ICPR.2018.8545598"},{"key":"ref_50","doi-asserted-by":"crossref","first-page":"3111","DOI":"10.1109\/TMM.2018.2818020","article-title":"Arbitrary-Oriented Scene Text Detection via Rotation Proposals","volume":"20","author":"Ma","year":"2018","journal-title":"IEEE Trans. Multimed."},{"key":"ref_51","doi-asserted-by":"crossref","unstructured":"Liu, Z., Hu, J., Weng, L., and Yang, Y. (2017, January 17\u201320). Rotated region based CNN for ship detection. Proceedings of the IEEE International Conference Image Process, Beijing, China.","DOI":"10.1109\/ICIP.2017.8296411"},{"key":"ref_52","doi-asserted-by":"crossref","unstructured":"Yang, X., Hao, S., Fu, K., Yang, J., Xian, S., Yan, M., and Zhi, G. (2018). Automatic Ship Detection in Remote Sensing Images from Google Earth of Complex Scenes Based on Multiscale Rotation Dense Feature Pyramid Networks. Remote Sens., 10.","DOI":"10.3390\/rs10010132"},{"key":"ref_53","doi-asserted-by":"crossref","unstructured":"Cai, Z., Fan, Q., Feris, R.S., and Vasconcelos, N. (2016, January 11\u201314). A Unified Multi-scale Deep Convolutional Neural Network for Fast Object Detection. Proceedings of the European Conference on Computer Vision, Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46493-0_22"},{"key":"ref_54","doi-asserted-by":"crossref","unstructured":"Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.Y., and Berg, A.C. (2016, January 11\u201314). SSD: Single Shot MultiBox Detector. Proceedings of the European Conference on Computer Vision, Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46448-0_2"},{"key":"ref_55","doi-asserted-by":"crossref","unstructured":"Islam, M.A., Rochan, M., Bruce, N.D.B., and Wang, Y. (2017, January 21\u201326). Gated Feedback Refinement Network for Dense Image Labeling. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.518"},{"key":"ref_56","unstructured":"Barret, Z., and Quoc, V.L. (2016). Neural architecture search with reinforcement learning. arXiv."},{"key":"ref_57","first-page":"71500R","article-title":"Fully automated procedure for ship detection using optical satellite imagery","volume":"7150","author":"Corbane","year":"2008","journal-title":"Int. Soc. Opt. Photonics"},{"key":"ref_58","doi-asserted-by":"crossref","first-page":"2053","DOI":"10.1109\/JSTARS.2015.2404578","article-title":"Object Detection Based on Sparse Representation and Hough Voting for Optical Remote Sensing Imagery","volume":"8","author":"Yokoya","year":"2015","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_59","doi-asserted-by":"crossref","first-page":"617","DOI":"10.1109\/LGRS.2013.2272492","article-title":"A New Method on Inshore Ship Detection in High-Resolution Satellite Images Using Shape and Context Information","volume":"11","author":"Liu","year":"2014","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_60","doi-asserted-by":"crossref","first-page":"1451","DOI":"10.1109\/LGRS.2015.2408355","article-title":"Unsupervised Ship Detection Based on Saliency and S-HOG Descriptor From Optical Satellite Images","volume":"12","author":"Qi","year":"2015","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_61","doi-asserted-by":"crossref","first-page":"4511","DOI":"10.1109\/TGRS.2013.2282355","article-title":"Ship Detection in High-Resolution Optical Imagery Based on Anomaly Detector and Local Shape Feature","volume":"52","author":"Shi","year":"2014","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_62","doi-asserted-by":"crossref","unstructured":"Heitz, G., and Koller, D. (2008, January 12\u201318). Learning Spatial Context: Using Stuff to Find Things. Proceedings of the European Conference on Computer Vision, Marseille, France.","DOI":"10.1007\/978-3-540-88682-2_4"},{"key":"ref_63","doi-asserted-by":"crossref","first-page":"33","DOI":"10.1109\/TPAMI.2011.94","article-title":"Building Development Monitoring in Multitemporal Remotely Sensed Image Pairs with Stochastic Birth-Death Dynamics","volume":"34","author":"Benedek","year":"2012","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_64","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1016\/j.isprsjprs.2016.03.014","article-title":"A survey on object detection in optical remote sensing images","volume":"117","author":"Cheng","year":"2016","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_65","doi-asserted-by":"crossref","first-page":"187","DOI":"10.1016\/j.jvcir.2015.11.002","article-title":"Vehicle detection in aerial imagery: A small target detection benchmark","volume":"34","author":"Razakarivony","year":"2016","journal-title":"J. Vis. Commun. Image Represent."},{"key":"ref_66","doi-asserted-by":"crossref","unstructured":"Zhu, H., Chen, X., Dai, W., Fu, K., Ye, Q., and Jiao, J. (2015, January 27\u201330). Orientation robust object detection in aerial images using deep convolutional neural network. Proceedings of the IEEE International Conference Image Process, Quebec City, QC, Canada.","DOI":"10.1109\/ICIP.2015.7351502"},{"key":"ref_67","doi-asserted-by":"crossref","first-page":"1938","DOI":"10.1109\/LGRS.2015.2439517","article-title":"Fast Multiclass Vehicle Detection on Aerial Images","volume":"12","author":"Liu","year":"2015","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_68","doi-asserted-by":"crossref","first-page":"618","DOI":"10.1080\/01431161.2014.999881","article-title":"Elliptic Fourier transformation-based histograms of oriented gradients for rotationally invariant object detection in remote-sensing images","volume":"36","author":"Xiao","year":"2015","journal-title":"Int. J. Remote Sens."},{"key":"ref_69","doi-asserted-by":"crossref","unstructured":"Xia, G.S., Bai, X., Ding, J., Zhu, Z., Belongie, S., Luo, J., Datcu, M., Pelillo, M., and Zhang, L. (2018, January 18\u201322). DOTA: A Large-Scale Dataset for Object Detection in Aerial Images. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00418"},{"key":"ref_70","doi-asserted-by":"crossref","unstructured":"Ding, J., Xue, N., Long, Y., Xia, G.S., and Lu, Q. (2019, January 15\u201320). Learning RoI Transformer for Detecting Oriented Objects in Aerial Images. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00296"},{"key":"ref_71","doi-asserted-by":"crossref","first-page":"5535","DOI":"10.1109\/TGRS.2019.2900302","article-title":"Hierarchical and Robust Convolutional Neural Network for Very High-Resolution Remote Sensing Object Detection","volume":"57","author":"Zhang","year":"2019","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_72","doi-asserted-by":"crossref","first-page":"7778","DOI":"10.1109\/TPAMI.2021.3117983","article-title":"Object Detection in Aerial Images: A Large-Scale Benchmark and Challenges","volume":"44","author":"Ding","year":"2022","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_73","unstructured":"You, H. (2022, September 14). roLabelImg. Available online: https:\/\/github.com\/cgvict\/roLabelImg."},{"key":"ref_74","doi-asserted-by":"crossref","unstructured":"Lin, T.Y., Maire, M., Belongie, S., Hays, J., Perona, P., Ramanan, D., Doll\u00e1r, P., and Zitnick, C.L. (2014, January 6\u201312). Microsoft COCO: Common Objects in Context. Proceedings of the European Conference on Computer Vision, Zurich, Switzerland.","DOI":"10.1007\/978-3-319-10602-1_48"},{"key":"ref_75","doi-asserted-by":"crossref","first-page":"2011","DOI":"10.1109\/TPAMI.2019.2913372","article-title":"Squeeze-and-Excitation Networks","volume":"42","author":"Hu","year":"2020","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_76","doi-asserted-by":"crossref","unstructured":"Woo, S., Park, J., Lee, J.Y., and Kweon, I.S. (2018, January 8\u201314). CBAM: Convolutional Block Attention Module. Proceedings of the European Conference on Computer Vision, Munich, Germany.","DOI":"10.1007\/978-3-030-01234-2_1"},{"key":"ref_77","doi-asserted-by":"crossref","first-page":"303","DOI":"10.1007\/s11263-009-0275-4","article-title":"The Pascal Visual Object Classes (VOC) Challenge","volume":"88","author":"Everingham","year":"2010","journal-title":"Int. J. Comput. Vis."},{"key":"ref_78","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep Residual Learning for Image Recognition. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_79","unstructured":"Chen, K., Wang, J., Pang, J., Cao, Y., Xiong, Y., Li, X., Sun, S., Feng, W., Liu, Z., and Xu, J. (2019). MMDetection: Open MMLab Detection Toolbox and Benchmark. arXiv."},{"key":"ref_80","doi-asserted-by":"crossref","unstructured":"Chen, Q., Wang, Y., Yang, T., Zhang, X., Cheng, J., and Sun, J. (2021, January 19\u201325). You Only Look One-level Feature. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.01284"},{"key":"ref_81","doi-asserted-by":"crossref","unstructured":"Lin, T.Y., Goyal, P., Girshick, R., He, K., and Doll\u00e1r, P. (2017, January 20\u201329). Focal Loss for Dense Object Detection. Proceedings of the International Conference Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.324"},{"key":"ref_82","unstructured":"Tian, Z., Shen, C., Chen, H., and He, T. (November, January 27). FCOS: Fully Convolutional One-Stage Object Detection. Proceedings of the International Confenrece Computer Vision, Seoul, Korea."},{"key":"ref_83","doi-asserted-by":"crossref","unstructured":"Chen, K., Pang, J., Wang, J., Xiong, Y., Li, X., Sun, S., Feng, W., Liu, Z., Shi, J., and Ouyang, W. (2019, January 16\u201320). Hybrid task cascade for instance segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00511"},{"key":"ref_84","doi-asserted-by":"crossref","unstructured":"Law, H., and Deng, J. (2018, January 8\u201314). Cornernet: Detecting objects as paired keypoints. Proceedings of the 15th European Conference on Computer Vision, ECCV, Munich, Germany.","DOI":"10.1007\/978-3-030-01264-9_45"},{"key":"ref_85","doi-asserted-by":"crossref","unstructured":"Girshick, R., Donahue, J., Darrell, T., and Malik, J. (2014, January 23\u201328). Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA.","DOI":"10.1109\/CVPR.2014.81"},{"key":"ref_86","doi-asserted-by":"crossref","first-page":"7405","DOI":"10.1109\/TGRS.2016.2601622","article-title":"Learning Rotation-Invariant Convolutional Neural Networks for Object Detection in VHR Optical Remote Sensing Images","volume":"54","author":"Cheng","year":"2016","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_87","doi-asserted-by":"crossref","first-page":"2337","DOI":"10.1109\/TGRS.2017.2778300","article-title":"Rotation-Insensitive and Context-Augmented Object Detection in Remote Sensing Images","volume":"56","author":"Li","year":"2018","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_88","doi-asserted-by":"crossref","first-page":"265","DOI":"10.1109\/TIP.2018.2867198","article-title":"Learning Rotation-Invariant and Fisher Discriminative Convolutional Neural Networks for Object Detection","volume":"28","author":"Cheng","year":"2019","journal-title":"IEEE Trans. Image Process."},{"key":"ref_89","first-page":"1","article-title":"Guiding Clean Features for Object Detection in Remote Sensing Images","volume":"19","author":"Cheng","year":"2022","journal-title":"IEEE Geosci. Remote Sens. Lett."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/14\/21\/5460\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T01:06:13Z","timestamp":1760144773000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/14\/21\/5460"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,10,30]]},"references-count":89,"journal-issue":{"issue":"21","published-online":{"date-parts":[[2022,11]]}},"alternative-id":["rs14215460"],"URL":"https:\/\/doi.org\/10.3390\/rs14215460","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,10,30]]}}}