{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,19]],"date-time":"2025-12-19T09:40:34Z","timestamp":1766137234098,"version":"build-2065373602"},"reference-count":54,"publisher":"MDPI AG","issue":"16","license":[{"start":{"date-parts":[[2019,8,14]],"date-time":"2019-08-14T00:00:00Z","timestamp":1565740800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["41701508"],"award-info":[{"award-number":["41701508"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Ship category classification in high-resolution aerial images has attracted great interest in applications such as maritime security, naval construction, and port management. However, the applications of previous methods were mainly limited by the following issues: (i) The existing ship category classification methods were mainly to classify on accurately-cropped image patches. This is unsatisfactory for the results of the existing methods in practical applications, because the location of the ship in the patch obtained by the object detection varies greatly. (ii) The factors such as target scale variations and class imbalance have a great influence on the performance of ship category classification. Aiming at the issues above, we propose a novel ship detection and category classification framework. The category classification is based on accurate location. The detection network can generate more precise rotated bounding boxes in large-scale aerial images by introducing a novel Sequence Local Context (SLC) module. Besides, three different ship category classification networks are proposed to eliminate the effect of scale variations, and the Spatial Transform Crop (STC) operation is used to get aligned image patches. Whatever the problem of insufficient samples or class imbalance have, the Proposals Simulation Generator (PSG) is considered to handle this properly. Most remarkably, the state-of-the-art performance of our framework is demonstrated by experiments based on the 19-class ship dataset HRSC2016 and our multiclass warship dataset.<\/jats:p>","DOI":"10.3390\/rs11161901","type":"journal-article","created":{"date-parts":[[2019,8,15]],"date-time":"2019-08-15T04:22:54Z","timestamp":1565842974000},"page":"1901","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":38,"title":["Towards Automated Ship Detection and Category Recognition from High-Resolution Aerial Images"],"prefix":"10.3390","volume":"11","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-4017-8885","authenticated-orcid":false,"given":"Yingchao","family":"Feng","sequence":"first","affiliation":[{"name":"Institute of Electronics, Chinese Academy of Sciences, Beijing 100190, China"},{"name":"Key Laboratory of Network Information System Technology (NIST), Institute of Electronics, Chinese Academy of Sciences, Beijing 100190, China"},{"name":"School of Electronic, Electrical and Communication Engineering, University of Chinese Academy of Sciences, Beijing 100190, China"}]},{"given":"Wenhui","family":"Diao","sequence":"additional","affiliation":[{"name":"Institute of Electronics, Chinese Academy of Sciences, Beijing 100190, China"},{"name":"Key Laboratory of Network Information System Technology (NIST), Institute of Electronics, Chinese Academy of Sciences, Beijing 100190, China"}]},{"given":"Xian","family":"Sun","sequence":"additional","affiliation":[{"name":"Institute of Electronics, Chinese Academy of Sciences, Beijing 100190, China"},{"name":"Key Laboratory of Network Information System Technology (NIST), Institute of Electronics, Chinese Academy of Sciences, Beijing 100190, China"}]},{"given":"Menglong","family":"Yan","sequence":"additional","affiliation":[{"name":"Institute of Electronics, Chinese Academy of Sciences, Beijing 100190, China"},{"name":"Key Laboratory of Network Information System Technology (NIST), Institute of Electronics, Chinese Academy of Sciences, Beijing 100190, China"}]},{"given":"Xin","family":"Gao","sequence":"additional","affiliation":[{"name":"Institute of Electronics, Chinese Academy of Sciences, Beijing 100190, China"},{"name":"Key Laboratory of Network Information System Technology (NIST), Institute of Electronics, Chinese Academy of Sciences, Beijing 100190, China"}]}],"member":"1968","published-online":{"date-parts":[[2019,8,14]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"3616","DOI":"10.1109\/JSTARS.2017.2692820","article-title":"A novel ship detector based on the generalized-likelihood ratio test for SAR imagery","volume":"10","author":"Iervolino","year":"2017","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_2","first-page":"63","article-title":"A multifamily GLRT for oil spill detection","volume":"55","author":"Orlando","year":"2016","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"528","DOI":"10.1049\/iet-rsn.2010.0223","article-title":"Generalised likelihood ratio test detector for micro-motion targets in synthetic aperture radar raw signals","volume":"5","author":"Deng","year":"2011","journal-title":"IET Radar Sonar Navig."},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Iervolino, P., Guida, R., and Whittaker, P. (2015, January 26\u201331). A new GLRT-based ship detection technique in SAR images. Proceedings of the IEEE International Geoscience and Remote Sensing Symposium 2015 (IGARSS), Milan, Italy.","DOI":"10.1109\/IGARSS.2015.7326480"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Graziano, M., D\u2019Errico, M., and Rufino, G. (2016). Wake component detection in X-band SAR images for ship heading and velocity estimation. Remote Sens., 8.","DOI":"10.3390\/rs8060498"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Graziano, M., Grasso, M., and D\u2019Errico, M. (2017). Performance analysis of ship wake detection on Sentinel-1 SAR images. Remote Sens., 9.","DOI":"10.3390\/rs9111107"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Biondi, F. (2016, January 19\u201322). Low rank plus sparse decomposition of synthetic aperture radar data for maritime surveillance. Proceedings of the 2016 4th International Workshop on Compressed Sensing Theory and Its Applications to Radar, Sonar and Remote Sensing (CoSeRa), Aachen, Germany.","DOI":"10.1109\/CoSeRa.2016.7745703"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"117","DOI":"10.1109\/LGRS.2017.2777264","article-title":"Low-rank plus sparse decomposition and localized radon transform for ship-wake detection in synthetic aperture radar images","volume":"15","author":"Biondi","year":"2017","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"75","DOI":"10.1109\/LGRS.2018.2868365","article-title":"A polarimetric extension of low-rank plus sparse decomposition and radon transform for ship wake detection in synthetic aperture radar images","volume":"16","author":"Biondi","year":"2018","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Filippo, B. (2019). COSMO-SkyMed staring spotlight SAR data for micro-motion and inclination angle estimation of ships by pixel tracking and convex optimization. Remote Sens., 11.","DOI":"10.3390\/rs11070766"},{"key":"ref_11","unstructured":"Krizhevsky, A., Sutskever, I., and Hinton, G.E. (2012, January 3\u20139). Imagenet classification with deep convolutional neural networks. Proceedings of the Advances in Neural Information Processing Systems, Lake Tahoe, NV, USA."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"5832","DOI":"10.1109\/TGRS.2016.2572736","article-title":"Ship detection in spaceborne optical image with SVD networks","volume":"54","author":"Zou","year":"2016","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1109\/TGRS.2018.2829166","article-title":"HSF-net: Multiscale deep feature embedding for ship detection in optical remote sensing imagery","volume":"56","author":"Li","year":"2018","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"215","DOI":"10.1016\/j.neucom.2018.12.050","article-title":"Ship detection based on squeeze excitation skip-connection path networks for optical remote sensing images","volume":"332","author":"Huang","year":"2019","journal-title":"Neurocomputing"},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"2337","DOI":"10.1109\/TGRS.2017.2778300","article-title":"Rotation-insensitive and context-augmented object detection in remote sensing images","volume":"56","author":"Li","year":"2018","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"042611","DOI":"10.1117\/1.JRS.11.042611","article-title":"Ship detection in optical remote sensing images based on deep convolutional neural networks","volume":"11","author":"Yao","year":"2017","journal-title":"J. Appl. Remote Sens."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Liu, Z., Hu, J., Weng, L., and Yang, Y. (2018, January 7\u201310). Rotated region based CNN for ship detection. Proceedings of the IEEE International Conference on Image Processing, Athens, Greece.","DOI":"10.1109\/ICIP.2017.8296411"},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"1745","DOI":"10.1109\/LGRS.2018.2856921","article-title":"Toward arbitrary-oriented ship detection with rotated region proposal and discrimination networks","volume":"15","author":"Zhang","year":"2018","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Yang, X., Sun, H., Fu, K., Yang, J., Sun, X., Yan, M., and Guo, Z. (2018). Automatic ship detection in remote sensing images from google earth of complex scenes based on multiscale rotation dense feature pyramid networks. Remote Sens., 10.","DOI":"10.3390\/rs10010132"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Fu, K., Li, Y., Sun, H., Yang, X., Xu, G., Li, Y., and Sun, X. (2018). A ship rotation detection model in remote sensing images based on feature fusion pyramid network and deep reinforcement learning. Remote Sens., 10.","DOI":"10.3390\/rs10121922"},{"key":"ref_21","unstructured":"Ren, S., He, K., Girshick, R., and Sun, J. (2015, January 7\u201312). Faster r-cnn: Towards real-time object detection with region proposal networks. Proceedings of the Advances in Neural Information Processing Systems, Montreal, QC, Canada."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"3111","DOI":"10.1109\/TMM.2018.2818020","article-title":"Arbitrary-oriented scene text detection via rotation proposals","volume":"20","author":"Ma","year":"2018","journal-title":"IEEE Trans. Multimed."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"He, K., Gkioxari, G., Dollar, P., and Girshick, R.B. (2017, January 22\u201329). Mask R-CNN. Proceedings of the International Conference on Computer Cision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.322"},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"1275","DOI":"10.1109\/LGRS.2012.2237377","article-title":"Merchant vessel classification based on scattering component analysis for COSMO-SkyMed SAR images","volume":"10","author":"Hong","year":"2013","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"1562","DOI":"10.1109\/LGRS.2013.2262073","article-title":"Ship classification in TerraSAR-X images with feature space based sparse representation","volume":"10","author":"Xing","year":"2013","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"6457","DOI":"10.1080\/01431161.2017.1356487","article-title":"A novel ship classification approach for high resolution SAR images based on the BDA-KELM classification model","volume":"38","author":"Wu","year":"2017","journal-title":"Int. J. Remote Sens."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"2830","DOI":"10.1109\/JSTARS.2017.2665346","article-title":"Learning attribute representations for remote sensing ship category classification","volume":"10","author":"Oliveau","year":"2017","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_28","unstructured":"Simonyan, K., and Zisserman, A. (2015, January 7\u20139). Very deep convolutional networks for large-scale image recognition. Proceedings of the International Conference on Learning Representations, San Diego, CA, USA."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., and Rabinovich, A. (2015, January 7\u201312). Going deeper with convolutions. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298594"},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Huang, G., Liu, Z., Der Maaten, L.V., and Weinberger, K.Q. (2017, January 21\u201326). Densely connected convolutional networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.243"},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"258","DOI":"10.1109\/JOE.2017.2767106","article-title":"Ship classification in terrasar-x images with convolutional neural networks","volume":"43","author":"Bentes","year":"2018","journal-title":"IEEE J. Ocean. Eng."},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Ma, M., Chen, J., Liu, W., and Yang, W. (2018). Ship classification and detection based on CNN using GF-3 SAR images. Remote Sens., 10.","DOI":"10.3390\/rs10122043"},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"32","DOI":"10.1007\/s11263-016-0981-7","article-title":"Visual genome: Connecting language and vision using crowdsourced dense image annotations","volume":"123","author":"Krishna","year":"2016","journal-title":"Int. J. Comput. Vis."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Chen, Z., and Gao, X. (2018, January 9\u201311). An Improved Algorithm for Ship Target Detection in SAR Images Based on Faster R-CNN. Proceedings of the Ninth International Conference on Intelligent Control and Information Processing (ICICIP 2018), Wanzhou, China.","DOI":"10.1109\/ICICIP.2018.8606720"},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Li, S., Zhang, Z., Li, B., and Li, C. (2018). Multiscale rotated bounding box-based deep learning method for detecting ship targets in remote sensing images. Sensors, 18.","DOI":"10.3390\/s18082702"},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Koo, J., Seo, J., Jeon, S., Choe, J., and Jeon, T. (2018, January 6\u20139). RBox-CNN: Rotated bounding box based CNN for ship detection in remote sensing image. Proceedings of the 26th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems (ACM 2018), Seattle, WA, USA.","DOI":"10.1145\/3274895.3274915"},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Liao, M., Zhu, Z., Shi, B., Xia, G.s., and Bai, X. (2018, January 18\u201322). Rotation-sensitive regression for oriented scene text detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00619"},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Lin, T.Y., Doll\u00e1r, P., Girshick, R., He, K., Hariharan, B., and Belongie, S. (2017, January 21\u201326). Feature pyramid networks for object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.106"},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Lin, T.Y., Goyal, P., Girshick, R., He, K., and Doll\u00e1r, P. (2017, January 21\u201326). Focal loss for dense object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/ICCV.2017.324"},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Girshick, R.B., Donahue, J., Darrell, T., and Malik, J. (2014, January 24\u201327). Rich feature hierarchies for accurate object detection and semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA.","DOI":"10.1109\/CVPR.2014.81"},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"405","DOI":"10.1016\/0196-6774(87)90020-4","article-title":"A heuristic triangulation algorithm","volume":"8","author":"Plaisted","year":"1987","journal-title":"J. Algorithm."},{"key":"ref_42","doi-asserted-by":"crossref","first-page":"1074","DOI":"10.1109\/LGRS.2016.2565705","article-title":"Ship rotated bounding box space for ship extraction from high-resolution optical satellite images with complex backgrounds","volume":"13","author":"Liu","year":"2016","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Xia, G.S., Bai, X., Ding, J., Zhu, Z., Belongie, S., Luo, J., Datcu, M., Pelillo, M., and Zhang, L. (2018, January 18\u201322). DOTA: A large-scale dataset for object detection in aerial images. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00418"},{"key":"ref_44","doi-asserted-by":"crossref","first-page":"119","DOI":"10.1016\/j.isprsjprs.2014.10.002","article-title":"Multi-class geospatial object detection and geographic image classification based on collection of part detectors","volume":"98","author":"Cheng","year":"2014","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Girshick, R. (2015, January 7\u201313). Fast R-CNN. Proceedings of the IEEE International Conference on Computer Vision (ICCV), Santiago, Chile.","DOI":"10.1109\/ICCV.2015.169"},{"key":"ref_46","doi-asserted-by":"crossref","unstructured":"Feng, Y., Diao, W., Chang, Z., Yan, M., Sun, X., and Gao, X. (2019). Ship instance segmentation from remote sensing images using sequence local context module. arXiv.","DOI":"10.1109\/IGARSS.2019.8897948"},{"key":"ref_47","doi-asserted-by":"crossref","first-page":"1904","DOI":"10.1109\/TPAMI.2015.2389824","article-title":"Spatial pyramid pooling in deep convolutional networks for visual recognition","volume":"37","author":"He","year":"2015","journal-title":"IEEE Trans. Pattern Anal. Machine Intell."},{"key":"ref_48","doi-asserted-by":"crossref","first-page":"119","DOI":"10.1007\/s11633-017-1053-3","article-title":"A survey on deep learning-based fine-grained object classification and semantic segmentation","volume":"14","author":"Zhao","year":"2017","journal-title":"Int. J. Autom. Comput."},{"key":"ref_49","doi-asserted-by":"crossref","unstructured":"Long, J., Shelhamer, E., and Darrell, T. (2015, January 7\u201312). Fully convolutional networks for semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298965"},{"key":"ref_50","doi-asserted-by":"crossref","unstructured":"Zhang, X., Wei, Y., Feng, J., Yang, Y., and Huang, T.S. (2018, January 18\u201322). Adversarial complementary learning for weakly supervised object localization. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00144"},{"key":"ref_51","doi-asserted-by":"crossref","unstructured":"Liu, Z., Yuan, L., Weng, L., and Yang, Y. (2017, January 24\u201326). A high resolution optical satellite image dataset for ship recognition and some new baselines. Proceedings of the 6th International Conference on Pattern Recognition Applications and Methods, Porto, Portugal.","DOI":"10.5220\/0006120603240331"},{"key":"ref_52","doi-asserted-by":"crossref","first-page":"303","DOI":"10.1007\/s11263-009-0275-4","article-title":"The pascal visual object classes (voc) challenge","volume":"88","author":"Everingham","year":"2010","journal-title":"Int. J. Comput. Vis."},{"key":"ref_53","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_54","doi-asserted-by":"crossref","unstructured":"Lin, T., Maire, M., Belongie, S.J., Hays, J., Perona, P., Ramanan, D., Dollar, P., and Zitnick, C.L. (2014, January 5\u201312). Microsoft COCO: Common Objects in Context. Proceedings of the European Conference on Computer Vision, Zurich, Switzerland.","DOI":"10.1007\/978-3-319-10602-1_48"}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/11\/16\/1901\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T13:11:01Z","timestamp":1760188261000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/11\/16\/1901"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,8,14]]},"references-count":54,"journal-issue":{"issue":"16","published-online":{"date-parts":[[2019,8]]}},"alternative-id":["rs11161901"],"URL":"https:\/\/doi.org\/10.3390\/rs11161901","relation":{},"ISSN":["2072-4292"],"issn-type":[{"type":"electronic","value":"2072-4292"}],"subject":[],"published":{"date-parts":[[2019,8,14]]}}}