{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,15]],"date-time":"2026-01-15T11:54:31Z","timestamp":1768478071005,"version":"3.49.0"},"reference-count":42,"publisher":"MDPI AG","issue":"24","license":[{"start":{"date-parts":[[2022,12,8]],"date-time":"2022-12-08T00:00:00Z","timestamp":1670457600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61971141"],"award-info":[{"award-number":["61971141"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Oriented object detection is a fundamental and challenging task in remote sensing image analysis and has received much attention in recent years. Optical remote sensing images often have more complex background information than natural images, and the number of annotated samples varies in different categories. To enhance the difference between foreground and background, current one-stage object detection algorithms attempt to exploit focus loss to balance the foreground and background weights, thus making the network more focused on the foreground part. However, the current one-stage object detectors still face two main challenges: (1) the detection network pays little attention to the foreground and does not make full use of the foreground information; (2) the distinction of similar object categories has not attracted attention. To address the above challenges, this paper presents a foreground feature enhancement method applied to one-stage object detection. The proposed method mainly includes two important components: keypoint attention module (KAM) and prototype contrastive learning module (PCLM). The KAM is used to enhance the features of the foreground part of the image and reduce the features of the background part of the image, and the PCLM is utilized to enhance the discrimination of samples between foreground categories and reduce the confusion of samples between different categories. Furthermore, the proposed method designs and adopts an equalized modulation focal loss (EMFL) to optimize the training process of the model and increase the loss weight of the foreground later in the model training. Experimental results on the publicly available DOTA datasets and HRSC2016 datasets show that our method exhibits state-of-the-art performance.<\/jats:p>","DOI":"10.3390\/rs14246226","type":"journal-article","created":{"date-parts":[[2022,12,9]],"date-time":"2022-12-09T03:23:49Z","timestamp":1670556229000},"page":"6226","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":5,"title":["Oriented Object Detection Based on Foreground Feature Enhancement in Remote Sensing Images"],"prefix":"10.3390","volume":"14","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-2982-8878","authenticated-orcid":false,"given":"Peng","family":"Lin","sequence":"first","affiliation":[{"name":"Key Laboratory for Information Science of Electromagnetic Waves (MoE), Fudan University, Shanghai 200433, China"},{"name":"Image and Intelligence Laboratory, School of Information Science and Technology, Fudan University, Shanghai 200433, China"}]},{"given":"Xiaofeng","family":"Wu","sequence":"additional","affiliation":[{"name":"Key Laboratory for Information Science of Electromagnetic Waves (MoE), Fudan University, Shanghai 200433, China"},{"name":"Image and Intelligence Laboratory, School of Information Science and Technology, Fudan University, Shanghai 200433, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4748-6426","authenticated-orcid":false,"given":"Bin","family":"Wang","sequence":"additional","affiliation":[{"name":"Key Laboratory for Information Science of Electromagnetic Waves (MoE), Fudan University, Shanghai 200433, China"},{"name":"Image and Intelligence Laboratory, School of Information Science and Technology, Fudan University, Shanghai 200433, China"}]}],"member":"1968","published-online":{"date-parts":[[2022,12,8]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"1137","DOI":"10.1109\/TPAMI.2016.2577031","article-title":"Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks","volume":"39","author":"Ren","year":"2017","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Redmon, J., Divvala, S., Girshick, R., and Farhadi, A. (2016, January 27\u201330). You Only Look Once: Unified, Real-Time Object Detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.91"},{"key":"ref_3","unstructured":"Tian, Z., Shen, C., Chen, H., and He, T. (November, January 27). Fcos: Fully Convolutional One-Stage Object Detection. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Seoul, Republic of Korea."},{"key":"ref_4","unstructured":"Liu, L., Pan, Z., and Lei, B. (2017). Learning a Rotation Invariant Detector with Rotatable Bounding Box. arXiv."},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Ding, J., Xue, N., Long, Y., Xia, G.S., and Lu, Q. (2019, January 15\u201320). Learning RoI Transformer for Oriented Object Detection in Aerial Images. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00296"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Han, J., Ding, J., Xue, N., and Xia, G.S. (2021, January 19\u201325). Redet: A Rotation-Equivariant Detector for Aerial Object Detection. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.00281"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Xie, X., Cheng, G., Wang, J., Yao, X., and Han, J. (2021, January 11\u201317). Oriented R-CNN for Object Detection. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Montreal, BC, Canada.","DOI":"10.1109\/ICCV48922.2021.00350"},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Yang, X., Yan, J., Feng, Z., and He, T. (2021, January 2\u20139). R3det: Refined Single-Stage Detector with Feature Refinement for Rotating Object. Proceedings of the AAAI Conference on Artificial Intelligence, Virtually.","DOI":"10.1609\/aaai.v35i4.16426"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Yang, X., Yan, J., Liao, W., Yang, X., Tang, J., and He, T. (2022). Scrdet++: Detecting Small, Cluttered and Rotated Objects via Instance-Level Feature Denoising and Rotation Loss Smoothing. IEEE Trans. Pattern Anal. Mach. Intell., Early Access.","DOI":"10.1109\/TPAMI.2022.3166956"},{"key":"ref_10","first-page":"1","article-title":"Align Deep Features for Oriented Object Detection","volume":"60","author":"Han","year":"2021","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_11","unstructured":"Lin, Y., Feng, P., Guan, J., Wang, W., and Chambers, J. (2019). IENet: Interacting Embranchment One Stage Anchor Free Detector for Orientation Aerial Object Detection. arXiv."},{"key":"ref_12","unstructured":"Yang, X., Yan, J., Ming, Q., Wang, W., Zhang, X., and Tian, Q. (2021, January 9\u201311). Rethinking Rotated Object Detection with Gaussian Wasserstein Distance Loss. Proceedings of the International Conference on Machine Learning, Chongqing, China."},{"key":"ref_13","first-page":"18381","article-title":"Learning High-Precision Bounding Box for Rotated Object Detection via Kullback-Leibler Divergence","volume":"34","author":"Yang","year":"2021","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_14","unstructured":"Llerena, J.M., Zeni, L.F., Kristen, L.N., and Jung, C. (2021). Gaussian Bounding Boxes and Probabilistic Intersection-over-Union for Object Detection. arXiv."},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Chen, Z., Chen, K., Lin, W., See, J., Yu, H., Ke, Y., and Yang, C. (2020, January 23\u201328). Piou Loss: Towards Accurate Oriented Object Detection in Complex Environments. Proceedings of the European Conference on Computer Vision, Glasgow, UK.","DOI":"10.1007\/978-3-030-58558-7_12"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Yi, J., Wu, P., Liu, B., Huang, Q., Qu, H., and Metaxas, D. (2021, January 5\u20139). Oriented Object Detection in Aerial Images with Box Boundary-Aware Vectors. Proceedings of the Proceedings of the IEEE\/CVF Winter Conference on Applications of Computer Vision, Virtual.","DOI":"10.1109\/WACV48630.2021.00220"},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"5831","DOI":"10.1080\/01431161.2021.1931535","article-title":"Polardet: A Fast, More Precise Detector for Rotated Target in Aerial Images","volume":"42","author":"Zhao","year":"2021","journal-title":"Int. J. Remote Sens."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"3731","DOI":"10.3390\/rs13183731","article-title":"Predicting Arbitrary-Oriented Objects as Points in Remote Sensing Images","volume":"13","author":"Wang","year":"2021","journal-title":"Remote Sens."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Lin, T.Y., Goyal, P., Girshick, R., He, K., and Doll\u00e1r, P. (2017, January 22\u201329). Focal Loss for Dense Object Detection. Proceedings of the Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.324"},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"1745","DOI":"10.1109\/LGRS.2018.2856921","article-title":"Toward Arbitrary-Oriented Ship Detection with Rotated Region Proposal and Discrimination Networks","volume":"15","author":"Zhang","year":"2018","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"132","DOI":"10.3390\/rs10010132","article-title":"Automatic Ship Detection in Remote Sensing Images from Google Earth of Complex Scenes Based on Multiscale Rotation Dense Feature Pyramid Networks","volume":"10","author":"Yang","year":"2018","journal-title":"Remote Sens."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Lin, T.Y., Doll\u00e1r, P., Girshick, R., He, K., Hariharan, B., and Belongie, S. (2017, January 21\u201326). Feature Pyramid Networks for Object Detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.106"},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"50839","DOI":"10.1109\/ACCESS.2018.2869884","article-title":"Position Detection and Direction Prediction for Arbitrary-Oriented Ships via Multitask Rotation Region Convolutional Neural Network","volume":"6","author":"Yang","year":"2018","journal-title":"IEEE Access"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Liao, M., Zhu, Z., Shi, B., Xia, G.S., and Bai, X. (2018, January 18\u201323). Rotation-Sensitive Regression for Oriented Scene Text Detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00619"},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"1452","DOI":"10.1109\/TPAMI.2020.2974745","article-title":"Gliding Vertex on the Horizontal Bounding Box for Multi-Oriented Object Detection","volume":"43","author":"Xu","year":"2020","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"908","DOI":"10.3390\/rs12060908","article-title":"Axis Learning for Orientated Objects Detection in Aerial Images","volume":"12","author":"Xiao","year":"2020","journal-title":"Remote Sens."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"223373","DOI":"10.1109\/ACCESS.2020.3041025","article-title":"Arbitrary-Oriented Object Detection in Remote Sensing Images Based on Polar Coordinates","volume":"8","author":"Zhou","year":"2020","journal-title":"IEEE Access"},{"key":"ref_28","unstructured":"Zhou, X., Wang, D., and Kr\u00e4henb\u00fchl, P. (2019). Objects as Points. arXiv."},{"key":"ref_29","unstructured":"Lang, S., Ventola, F., and Kersting, K. (2021). DAFNe: A One-Stage Anchor-Free Approach for Oriented Object Detection. arXiv."},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"He, K., Fan, H., Wu, Y., Xie, S., and Girshick, R. (2020, January 13\u201319). Momentum Contrast for Unsupervised Visual Representation Learning. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.00975"},{"key":"ref_31","unstructured":"Chen, X., Fan, H., Girshick, R., and He, K. (2020). Improved Baselines with Momentum Contrastive Learning. arXiv."},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Zhu, R., Zhao, B., Liu, J., Sun, Z., and Chen, C.W. (2021, January 11\u201317). Improving Contrastive Learning by Visualizing Feature Transformation. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Montreal, BC, Canada.","DOI":"10.1109\/ICCV48922.2021.01014"},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Xie, E., Ding, J., Wang, W., Zhan, X., Xu, H., Sun, P., and Luo, P. (2021, January 11\u201317). Detco: Unsupervised Contrastive Learning for Object Detection. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Montreal, BC, Canada.","DOI":"10.1109\/ICCV48922.2021.00828"},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Xie, Z., Lin, Y., Zhang, Z., Cao, Y., Lin, S., and Hu, H. (2021, January 19\u201325). Propagate Yourself: Exploring Pixel-Level Consistency for Unsupervised Visual Representation Learning. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.01641"},{"key":"ref_35","unstructured":"Wang, X., Gao, J., Long, M., and Wang, J. (2021, January 18\u201324). Self-Tuning for Data-Efficient Deep Learning. Proceedings of the International Conference on Machine Learning, Virtual."},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Law, H., and Deng, J. (2018, January 8\u201314). Cornernet: Detecting Objects as Paired Keypoints. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01264-9_45"},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Chen, Q., Wang, Y., Yang, T., Zhang, X., Cheng, J., and Sun, J. (2021, January 19\u201325). You Only Look One-Level Feature. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.01284"},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Xia, G.S., Bai, X., Ding, J., Zhu, Z., Belongie, S., Luo, J., Datcu, M., Pelillo, M., and Zhang, L. (2018, January 18\u201323). DOTA: A Large-Scale Dataset for Object Detection in Aerial Images. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00418"},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"324","DOI":"10.5220\/0006120603240331","article-title":"A High Resolution Optical Satellite Image Dataset for Ship Recognition and Some New Baselines","volume":"Volume 2","author":"Liu","year":"2017","journal-title":"Proceedings of the International Conference on Pattern Recognition Applications and Methods"},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep Residual Learning for Image Recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_41","first-page":"2579","article-title":"Visualizing Data Using T-SNE","volume":"9","author":"Hinton","year":"2008","journal-title":"J. Mach. Learn. Res."},{"key":"ref_42","first-page":"5998","article-title":"Attention Is All You Need","volume":"30","author":"Vaswani","year":"2017","journal-title":"Adv. Neural Inf. Process. Syst."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/14\/24\/6226\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T01:36:45Z","timestamp":1760146605000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/14\/24\/6226"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,12,8]]},"references-count":42,"journal-issue":{"issue":"24","published-online":{"date-parts":[[2022,12]]}},"alternative-id":["rs14246226"],"URL":"https:\/\/doi.org\/10.3390\/rs14246226","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,12,8]]}}}