{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,15]],"date-time":"2026-03-15T23:22:25Z","timestamp":1773616945695,"version":"3.50.1"},"reference-count":55,"publisher":"MDPI AG","issue":"21","license":[{"start":{"date-parts":[[2019,10,26]],"date-time":"2019-10-26T00:00:00Z","timestamp":1572048000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>It is still challenging to effectively detect ship objects in optical remote-sensing images with complex backgrounds. Many current CNN-based one-stage and two-stage detection methods usually first predefine a series of anchors with various scales, aspect ratios and angles, and then the detection results can be outputted by performing once or twice classification and bounding box regression for predefined anchors. However, most of the defined anchors have relatively low accuracy, and are useless for the following classification and regression. In addition, the preset anchors are not robust to produce good performance for other different detection datasets. To avoid the above problems, in this paper we design a paired semantic segmentation network to generate more accurate rotated anchors with smaller numbers. Specifically, the paired segmentation network predicts four parts (i.e., top-left, bottom-right, top-right, and bottom-left parts) of ships. By combining paired top-left and bottom-right parts (or top-right and bottom-left parts), we can take the minimum bounding box of these two parts as the rotated anchor. This way can be more robust to different ship datasets, and the generated anchors are more accurate and have fewer numbers. Furthermore, to effectively use fine-scale detail information and coarse-scale semantic information, we use the magnified convolutional features to classify and regress the generated rotated anchors. Meanwhile, the horizontal minimum bounding box of the rotated anchor is also used to combine more context information. We compare the proposed algorithm with state-of-the-art object-detection methods for natural images and ship-detection methods, and demonstrate the superiority of our method.<\/jats:p>","DOI":"10.3390\/rs11212506","type":"journal-article","created":{"date-parts":[[2019,10,28]],"date-time":"2019-10-28T04:44:31Z","timestamp":1572237871000},"page":"2506","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":20,"title":["Ship Detection under Complex Backgrounds Based on Accurate Rotated Anchor Boxes from Paired Semantic Segmentation"],"prefix":"10.3390","volume":"11","author":[{"given":"Xiaowu","family":"Xiao","sequence":"first","affiliation":[{"name":"School of Automation, Beijing Institute of Technology, Beijing 100081, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6871-8236","authenticated-orcid":false,"given":"Zhiqiang","family":"Zhou","sequence":"additional","affiliation":[{"name":"School of Automation, Beijing Institute of Technology, Beijing 100081, China"}]},{"given":"Bo","family":"Wang","sequence":"additional","affiliation":[{"name":"School of Automation, Beijing Institute of Technology, Beijing 100081, China"}]},{"given":"Linhao","family":"Li","sequence":"additional","affiliation":[{"name":"School of Automation, Beijing Institute of Technology, Beijing 100081, China"}]},{"given":"Lingjuan","family":"Miao","sequence":"additional","affiliation":[{"name":"School of Automation, Beijing Institute of Technology, Beijing 100081, China"}]}],"member":"1968","published-online":{"date-parts":[[2019,10,26]]},"reference":[{"key":"ref_1","first-page":"88","article-title":"An AIAC-based Inshore Ship Target Detection Approach","volume":"22","author":"Jiang","year":"2007","journal-title":"Remote Sens. Technol. Appl."},{"key":"ref_2","unstructured":"Lin, J., Yang, X., Xiao, S., Yu, Y., and Jia, C. (2010). A Line Segment Based Inshore Ship Detection Method. International Conference on Remote Sensing, Springer."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Xu, J., Fu, K., and Sun, X. (2011, January 9\u201311). An Invariant Generalized Hough Transform Based Method of Inshore Ships Detection. Proceedings of the International Symposium on Image and Data Fusion, Tengchong, China.","DOI":"10.1109\/ISIDF.2011.6024201"},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"617","DOI":"10.1109\/LGRS.2013.2272492","article-title":"A New Method on Inshore Ship Detection in High-Resolution Satellite Images Using Shape and Context Information","volume":"11","author":"Liu","year":"2013","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"1174","DOI":"10.1109\/TGRS.2014.2335751","article-title":"Compressed-Domain Ship Detection on Spaceborne Optical Image Using Deep Neural Network and Extreme Learning Machine","volume":"53","author":"Tang","year":"2014","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"2070","DOI":"10.1109\/LGRS.2014.2319082","article-title":"Automatic Detection of Inshore Ships in High-Resolution Remote Sensing Images Using Robust Invariant Generalized Hough Transform","volume":"11","author":"Jian","year":"2014","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"1920","DOI":"10.1109\/LGRS.2016.2618385","article-title":"A Novel Inshore Ship Detection via Ship Head Classification and Body Boundary Determination","volume":"13","author":"Li","year":"2017","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_8","unstructured":"Hong, S., Roh, B., Kim, K.H., Cheon, Y., and Park, M. (2016). Pvanet: Lightweight deep neural networks for real-time object detection. arXiv."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"1137","DOI":"10.1109\/TPAMI.2016.2577031","article-title":"Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks","volume":"39","author":"Ren","year":"2017","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.Y., and Berg, A.C. (2016). SSD: Single Shot MultiBox Detector. European Conference on Computer Vision, Springer.","DOI":"10.1007\/978-3-319-46448-0_2"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Redmon, J., and Farhadi, A. (2017, January 21\u201326). YOLO9000: Better, Faster, Stronger. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.690"},{"key":"ref_12","unstructured":"Redmon, J., and Farhadi, A. (2018). Yolov3: An incremental improvement. arXiv."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Lin, T.Y., Goyal, P., Girshick, R., He, K., and Dollar, P. (2017, January 22\u201329). Focal Loss for Dense Object Detection. Proceedings of the 2017 IEEE International Conference on Computer Vision (ICCV), Venice, Italy.","DOI":"10.1109\/ICCV.2017.324"},{"key":"ref_14","unstructured":"Liu, L., Pan, Z., and Lei, B. (2017). Learning a rotation invariant detector with rotatable bounding box. arXiv."},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Liu, Z., Hu, J., Weng, L., and Yang, Y. (2017, January 17\u201320). Rotated region based CNN for ship detection. Proceedings of the IEEE International Conference on Image Processing, Beijing, China.","DOI":"10.1109\/ICIP.2017.8296411"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Yang, X., Sun, H., Fu, K., Yang, J., Sun, X., Yan, M., and Guo, Z. (2018). Automatic ship detection in remote sensing images from google earth of complex scenes based on multiscale rotation dense feature pyramid networks. Remote Sens., 10.","DOI":"10.3390\/rs10010132"},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"937","DOI":"10.1109\/LGRS.2018.2813094","article-title":"Arbitrary-Oriented Ship Detection Framework in Optical Remote-Sensing Images","volume":"15","author":"Liu","year":"2018","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"50839","DOI":"10.1109\/ACCESS.2018.2869884","article-title":"Position Detection and Direction Prediction for Arbitrary-Oriented Ships via Multitask Rotation Region Convolutional Neural Network","volume":"6","author":"Xue","year":"2018","journal-title":"IEEE Access"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"154","DOI":"10.1007\/s11263-013-0620-5","article-title":"Selective Search for Object Recognition","volume":"104","author":"Uijlings","year":"2013","journal-title":"Int. J. Comput. Vis."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Zitnick, C.L., and Dollar, P. (2014). Edge Boxes: Locating Object Proposals from Edges. European Conference on Computer Vision, Springer.","DOI":"10.1007\/978-3-319-10602-1_26"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Cheng, M.M., Zhang, Z., Lin, W.Y., and Torr, P. (2014, January 23\u201328). BING: Binarized Normed Gradients for Objectness Estimation at 300fps. Proceedings of the Computer Vision and Pattern Recognition, Washington, DC, USA.","DOI":"10.1109\/CVPR.2014.414"},{"key":"ref_22","unstructured":"Pont-Tuset, J., Barron, J., Marques, F., and Malik, J. (2014, January 23\u201328). Multiscale Combinatorial Grouping. Proceedings of the Computer Vision and Pattern Recognition, Washington, DC, USA."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"1074","DOI":"10.1109\/LGRS.2016.2565705","article-title":"Ship Rotated Bounding Box Space for Ship Extraction From High-Resolution Optical Satellite Images With Complex Backgrounds","volume":"13","author":"Liu","year":"2017","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Girshick, R., Donahue, J., Darrell, T., and Malik, J. (2014, January 23\u201328). Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA.","DOI":"10.1109\/CVPR.2014.81"},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"1904","DOI":"10.1109\/TPAMI.2015.2389824","article-title":"Spatial pyramid pooling in deep convolutional networks for visual recognition","volume":"37","author":"He","year":"2015","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_26","first-page":"423","article-title":"S-cnn-based ship detection from high-resolution remote sensing images","volume":"XLI-B7","author":"Zhang","year":"2016","journal-title":"ISPRS Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Girshick, R. (2015, January 7\u201313). Fast r-cnn. Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile.","DOI":"10.1109\/ICCV.2015.169"},{"key":"ref_28","unstructured":"Dai, J., Li, Y., He, K., and Sun, J. (2016, January 5\u201310). R-fcn: Object detection via region-based fully convolutional networks. Proceedings of the 30th International Conference on Advances in Neural Information Processing Systems, Barcelona, Spain."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Liu, S., Qi, L., Qin, H., Shi, J., and Jia, J. (2018, January 18\u201323). Path aggregation network for instance segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00913"},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Najibi, M., Samangouei, P., Chellappa, R., and Davis, L.S. (2017, January 22\u201329). Ssh: Single stage headless face detector. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.522"},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Zhu, C., He, Y., and Savvides, M. (2019). Feature Selective Anchor-Free Module for Single-Shot Object Detection. arXiv.","DOI":"10.1109\/CVPR.2019.00093"},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Tian, Z., Shen, C., Chen, H., and He, T. (2019). FCOS: Fully Convolutional One-Stage Object Detection. arXiv.","DOI":"10.1109\/ICCV.2019.00972"},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Milletari, F., Navab, N., and Ahmadi, S.A. (2016, January 25\u201328). V-net: Fully convolutional neural networks for volumetric medical image segmentation. Proceedings of the 2016 Fourth International Conference on 3D Vision (3DV), Stanford, CA, USA.","DOI":"10.1109\/3DV.2016.79"},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"2481","DOI":"10.1109\/TPAMI.2016.2644615","article-title":"Segnet: A deep convolutional encoder-decoder architecture for image segmentation","volume":"39","author":"Badrinarayanan","year":"2017","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_35","unstructured":"Iglovikov, V., and Shvets, A. (2018). Ternausnet: U-net with vgg11 encoder pre-trained on imagenet for image segmentation. arXiv."},{"key":"ref_36","unstructured":"Fu, J., Liu, J., Wang, Y., Zhou, J., Wang, C., and Lu, H. (2019). Stacked deconvolutional network for semantic segmentation. IEEE Trans. Image Process."},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"247","DOI":"10.1109\/LGRS.2016.2637439","article-title":"SeNet: Structured edge network for sea\u2013land segmentation","volume":"14","author":"Cheng","year":"2016","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"881","DOI":"10.1109\/TGRS.2016.2616585","article-title":"Dense semantic labeling of subdecimeter resolution images with convolutional neural networks","volume":"55","author":"Volpi","year":"2016","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_39","unstructured":"Paszke, A., Chaurasia, A., Kim, S., and Culurciello, E. (2016). Enet: A deep neural network architecture for real-time semantic segmentation. arXiv."},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"474","DOI":"10.1109\/LGRS.2018.2795531","article-title":"Fully Convolutional Networks for Semantic Segmentation of Very High Resolution Remotely Sensed Images Combined with DSM","volume":"15","author":"Sun","year":"2018","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"1119","DOI":"10.3390\/rs10071119","article-title":"Very Deep Convolutional Neural Networks for Complex Land Cover Mapping Using Multispectral Remote Sensing Imagery","volume":"10","author":"Masoud","year":"2018","journal-title":"Remote Sens."},{"key":"ref_42","unstructured":"Ioffe, S., and Szegedy, C. (2015). Batch normalization: Accelerating deep network training by reducing internal covariate shift. arXiv."},{"key":"ref_43","unstructured":"Nair, V., and Hinton, G.E. (2010, January 21\u201324). Rectified linear units improve restricted boltzmann machines. Proceedings of the 27th International Conference on Machine Learning (ICML-10), Haifa, Israel."},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Xie, S., and Tu, Z. (2015, January 7\u201313). Holistically-nested edge detection. Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile.","DOI":"10.1109\/ICCV.2015.164"},{"key":"ref_46","doi-asserted-by":"crossref","unstructured":"Liu, Y., Cheng, M.M., Hu, X., Wang, K., and Bai, X. (2017, January 21\u201326). Richer convolutional features for edge detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.622"},{"key":"ref_47","unstructured":"Hara, K., Liu, M.Y., Tuzel, O., and Farahmand, A.m. (2017). Attentional network for visual object detection. arXiv."},{"key":"ref_48","doi-asserted-by":"crossref","unstructured":"Lin, T.Y., Doll\u00e1r, P., Girshick, R., He, K., Hariharan, B., and Belongie, S. (2017, January 21\u201326). Feature pyramid networks for object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.106"},{"key":"ref_49","unstructured":"Simonyan, K., and Zisserman, A. (2014). Very deep convolutional networks for large-scale image recognition. arXiv."},{"key":"ref_50","doi-asserted-by":"crossref","unstructured":"Zeiler, M.D., and Fergus, R. (2014). Visualizing and understanding convolutional networks. European Conference on Computer Vision, Springer.","DOI":"10.1007\/978-3-319-10590-1_53"},{"key":"ref_51","doi-asserted-by":"crossref","first-page":"3111","DOI":"10.1109\/TMM.2018.2818020","article-title":"Arbitrary-oriented scene text detection via rotation proposals","volume":"20","author":"Ma","year":"2018","journal-title":"IEEE Trans. Multimedia"},{"key":"ref_52","first-page":"249","article-title":"Understanding the difficulty of training deep feedforward neural networks","volume":"9","author":"Glorot","year":"2010","journal-title":"J. Mach. Learn. Res."},{"key":"ref_53","unstructured":"Paszke, A., Gross, S., Chintala, S., Chanan, G., Yang, E., DeVito, Z., Lin, Z., Desmaison, A., Antiga, L., and Lerer, A. (2019, October 20). Automatic Differentiation in Pytorch. Available online: https:\/\/openreview.net\/forum?id=BJJsrmfCZ."},{"key":"ref_54","doi-asserted-by":"crossref","unstructured":"Jia, Y., Shelhamer, E., Donahue, J., Karayev, S., Long, J., Girshick, R., Girshick, R., and Darrell, T. (2014, January 3\u20137). Caffe: Convolutional Architecture for Fast Feature Embedding. Proceedings of the 22nd ACM International Conference on Multimedia, Orlando, FL, USA.","DOI":"10.1145\/2647868.2654889"},{"key":"ref_55","unstructured":"Abadi, M., Agarwal, A., Barham, P., Brevdo, E., Chen, Z., Citro, C., Corrado, G.S., Davis, A., Dean, J., and Devin, M. (2016). TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems. arXiv."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/11\/21\/2506\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T13:29:32Z","timestamp":1760189372000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/11\/21\/2506"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,10,26]]},"references-count":55,"journal-issue":{"issue":"21","published-online":{"date-parts":[[2019,11]]}},"alternative-id":["rs11212506"],"URL":"https:\/\/doi.org\/10.3390\/rs11212506","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2019,10,26]]}}}