{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,28]],"date-time":"2026-02-28T04:31:08Z","timestamp":1772253068023,"version":"3.50.1"},"reference-count":34,"publisher":"MDPI AG","issue":"16","license":[{"start":{"date-parts":[[2022,8,16]],"date-time":"2022-08-16T00:00:00Z","timestamp":1660608000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"National Natural Science Foundation of China","award":["42101448"],"award-info":[{"award-number":["42101448"]}]},{"name":"National Natural Science Foundation of China","award":["OEIP-O-202009"],"award-info":[{"award-number":["OEIP-O-202009"]}]},{"name":"Open Project Program Foundation of the Key Laboratory of Opto-Electronics Information Processing, Chinese Academy of Sciences","award":["42101448"],"award-info":[{"award-number":["42101448"]}]},{"name":"Open Project Program Foundation of the Key Laboratory of Opto-Electronics Information Processing, Chinese Academy of Sciences","award":["OEIP-O-202009"],"award-info":[{"award-number":["OEIP-O-202009"]}]},{"name":"Supercomputing Center of Wuhan University","award":["42101448"],"award-info":[{"award-number":["42101448"]}]},{"name":"Supercomputing Center of Wuhan University","award":["OEIP-O-202009"],"award-info":[{"award-number":["OEIP-O-202009"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>The object detection task is usually affected by complex backgrounds. In this paper, a new image object detection method is proposed, which can perform multi-feature selection on multi-scale feature maps. By this method, a bidirectional multi-scale feature fusion network was designed to fuse semantic features and shallow features to improve the detection effects of small objects in complex backgrounds. When the shallow features are transferred to the top layer, a bottom-up path is added to reduce the number of network layers experienced by the feature fusion network, reducing the loss of shallow features. In addition, a multi-feature selection module based on the attention mechanism is used to minimize the interference of useless information in subsequent classification and regression, allowing the network to adaptively focus on appropriate information for classification or regression to improve detection accuracy. Because the traditional five-parameter regression method has severe boundary problems when predicting objects with large aspect ratios, the proposed network treats angle prediction as a classification task. The experimental results on the DOTA dataset, the self-made DOTA-GF dataset and the HRSC 2016 dataset show that, compared with several popular object detection algorithms, the proposed method has certain advantages in detection accuracy.<\/jats:p>","DOI":"10.3390\/rs14163969","type":"journal-article","created":{"date-parts":[[2022,8,16]],"date-time":"2022-08-16T03:40:32Z","timestamp":1660621232000},"page":"3969","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":14,"title":["Multi-Scale Object Detection with the Pixel Attention Mechanism in a Complex Background"],"prefix":"10.3390","volume":"14","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-5403-1895","authenticated-orcid":false,"given":"Jinsheng","family":"Xiao","sequence":"first","affiliation":[{"name":"School of Electronic Information, Wuhan University, Wuhan 430064, China"}]},{"given":"Haowen","family":"Guo","sequence":"additional","affiliation":[{"name":"School of Electronic Information, Wuhan University, Wuhan 430064, China"}]},{"given":"Yuntao","family":"Yao","sequence":"additional","affiliation":[{"name":"School of Electronic Information, Wuhan University, Wuhan 430064, China"}]},{"given":"Shuhao","family":"Zhang","sequence":"additional","affiliation":[{"name":"School of Electronic Information, Wuhan University, Wuhan 430064, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6707-6542","authenticated-orcid":false,"given":"Jian","family":"Zhou","sequence":"additional","affiliation":[{"name":"State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan 430079, China"}]},{"given":"Zhijun","family":"Jiang","sequence":"additional","affiliation":[{"name":"Aerospace System Development Research Center, China Aerospace Science and Technology Corporation, Beijing 100094, China"},{"name":"Qian Xuesen Laboratory of Space Technology, Beijing 100094, China"}]}],"member":"1968","published-online":{"date-parts":[[2022,8,16]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"296","DOI":"10.1016\/j.isprsjprs.2019.11.023","article-title":"Object detection in optical remote sensing images: A survey and a new benchmark","volume":"159","author":"Li","year":"2020","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Fatima, S.A., Kumar, A., Pratap, A., and Raoof, S.S. (2020, January 10\u201312). Object Recognition and Detection in Remote Sensing Images: A Comparative Study. Proceedings of the 2020 International Conference on Artificial Intelligence and Signal Processing, AISP 2020, Amaravati, India.","DOI":"10.1109\/AISP48273.2020.9073614"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"33","DOI":"10.1016\/j.isprsjprs.2022.07.006","article-title":"CG-SSD: Corner guided single stage 3D object detection from LiDAR point cloud","volume":"191","author":"Ma","year":"2022","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_4","first-page":"102858","article-title":"Geometric feature enhanced line segment extraction from large-scale point clouds with hierarchical topological optimization","volume":"112","author":"Hu","year":"2022","journal-title":"Int. J. Appl. Earth Obs. Geoinf."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"3111","DOI":"10.1109\/TMM.2018.2818020","article-title":"Arbitrary-oriented scene text detection via rotation proposals","volume":"20","author":"Ma","year":"2018","journal-title":"IEEE Trans. Multimed."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"143","DOI":"10.1007\/s10032-019-00320-5","article-title":"Scene text detection and recognition with advances in deep learning: A survey","volume":"22","author":"Liu","year":"2019","journal-title":"Int. J. Doc. Anal. Recognit. (IJDAR)"},{"key":"ref_7","unstructured":"Lin, T.Y., Dollar, P., Girshick, R., He, K., Hariharan, B., and Belongie, S. (2017, January 21\u201326). Image-to-image translation with conditional adversarial networks. Proceedings of the 30th IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA."},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Liu, S., Qi, L., Qin, H., Shi, J., and Jia, J. (2018, January 18\u201322). Path Aggregation Network for Instance Segmentation. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00913"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Guo, C., Fan, B., Zhang, Q., Xiang, S., and Pan, C. (2020, January 14\u201319). AUGFPN: Improving multi-scale feature learning for object detection. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.01261"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Ghiasi, G., Lin, T.Y., and Le, Q.V. (2019, January 15\u201320). NAS-FPN: Learning scalable feature pyramid architecture for object detection. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00720"},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"188","DOI":"10.1109\/JMASS.2020.3025970","article-title":"Multiclass Object Detection in UAV Images Based on Rotation Region Network","volume":"1","author":"Xiao","year":"2020","journal-title":"IEEE J. Miniaturization Air Space Syst."},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Yang, X., Liu, Q., Yan, J., and Li, A. (2021, January 2\u20139). R3Det: Refined Single-Stage Detector with Feature Refinement for Rotating Object. Proceedings of the AAAI Conference on Artificial Intelligence, Online.","DOI":"10.1609\/aaai.v35i4.16426"},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Yang, X., Yang, J., Yan, J., Zhang, Y., Zhang, T., Guo, Z., Sun, X., and Fu, K. (November, January 27). SCRDet: Towards More Robust Detection for Small, Cluttered and Rotated Objects. Proceedings of the 2019 IEEE\/CVF International Conference on Computer Vision (ICCV), Seoul, Korea.","DOI":"10.1109\/ICCV.2019.00832"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"402","DOI":"10.1016\/j.ins.2018.06.028","article-title":"Kernel Wiener Filtering Model with Low-Rank Approximation for Image Denoising","volume":"462","author":"Zhang","year":"2018","journal-title":"Inf. Sci."},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Li, Q., Mou, L.M., Jiang, K., Liu, Q., Wang, Y., and Zhu, X. (2018, January 22\u201327). Hierarchical Region Based Convolution Neural Network for Multi-scale Object Detection in Remote Sensing Images. Proceedings of the 2018 IEEE International Geoscience and Remote Sensing Symposium, Valencia, Spain.","DOI":"10.1109\/IGARSS.2018.8518345"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Xie, H., Wang, T., Qiao, M., Zhang, M., Shan, G., and Snoussi, H. (2017, January 20\u201322). Robust object detection for tiny and dense targets in VHR aerial images. Proceedings of the 2017 Chinese Automation Congress, Jinan, China.","DOI":"10.1109\/CAC.2017.8243930"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Xia, G.S., Bai, X., Ding, J., Zhu, Z., Belongie, S., Luo, J., Datcu, M., Pelillo, M., and Zhang, L. (2018, January 18\u201322). DOTA: A Large-scale Dataset for Object Detection in Aerial Images. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00418"},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"1137","DOI":"10.1109\/TPAMI.2016.2577031","article-title":"Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks","volume":"39","author":"Ren","year":"2017","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.Y., and Berg, A. (2016, January 11\u201314). SSD: Single shot multibox detector. Proceedings of the 14th European Conference on Computer Vision, Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46448-0_2"},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"318","DOI":"10.1109\/TPAMI.2018.2858826","article-title":"Focal Loss for Dense Object Detection","volume":"42","author":"Lin","year":"2020","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_21","first-page":"1","article-title":"Anchor-Free Oriented Proposal Generator for Object Detection","volume":"60","author":"Cheng","year":"2022","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Xie, X., Cheng, G., Wang, J., Yao, X., and Han, J. (2021, January 11\u201317). Oriented R-CNN for Object Detection. Proceedings of the IEEE\/CVF International Conference on Computer Vision (ICCV), Montreal, BC, Canada.","DOI":"10.1109\/ICCV48922.2021.00350"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep residual learning for image recognition. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Wang, F., Jiang, M., Qian, C., Yang, S., Li, C., Zhang, H., Wang, X., and Tang, X. (2017, January 21\u201326). Residual attention network for image classification. Proceedings of the 30th IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.683"},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"2281","DOI":"10.1109\/TGRS.2020.3007921","article-title":"Hyperspectral Image Classification With Attention-Aided CNNs","volume":"59","author":"Hang","year":"2021","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Zhong, Z., Lin, Z.Q., Bidart, R., Hu, X., Daya, I.B., Li, Z., Zheng, W.S., Li, J., and Wong, A. (2020, January 14\u201319). Squeeze-and-attention networks for semantic segmentation. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.01308"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Woo, S., Park, J., Lee, J.Y., and Kweon, I.S. (2018, January 8\u201314). CBAM: Convolutional block attention module. Proceedings of the Computer Vision\u2014ECCV 2018\u201415th European Conference, Munich, Germany.","DOI":"10.1007\/978-3-030-01234-2_1"},{"key":"ref_28","first-page":"677","article-title":"Arbitrary-Oriented Object Detection with Circular Smooth Label","volume":"12353","author":"Lin","year":"2020","journal-title":"Yang Xue Yan Junchi"},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Yang, X., Hou, L., Zhou, Y., Wang, W., and Yan, J. (2021, January 19\u201325). Dense Label Encoding for Boundary Discontinuity Free Rotation Detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2021, Virtual.","DOI":"10.1109\/CVPR46437.2021.01556"},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Hu, J., Shen, L., and Sun, G. (2018, January 18\u201322). Squeeze-and-excitation networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00745"},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1016\/j.isprsjprs.2016.03.014","article-title":"A survey on object detection in optical remote sensing images","volume":"117","author":"Cheng","year":"2016","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"119","DOI":"10.1016\/j.isprsjprs.2014.10.002","article-title":"Multi-class geospatial object detection and geographic image classification based on collection of part detectors","volume":"98","author":"Cheng","year":"2014","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"5512","DOI":"10.1109\/TGRS.2019.2899955","article-title":"R2-CNN: Fast Tiny Object Detection in Large-Scale Remote Sensing Images","volume":"57","author":"Pang","year":"2019","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Ding, J., Xue, N., Long, Y., Xia, G.S., and Lu, Q. (2019, January 15\u201320). Learning roi transformer for oriented object detection in aerial images. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00296"}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/14\/16\/3969\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T00:09:14Z","timestamp":1760141354000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/14\/16\/3969"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,8,16]]},"references-count":34,"journal-issue":{"issue":"16","published-online":{"date-parts":[[2022,8]]}},"alternative-id":["rs14163969"],"URL":"https:\/\/doi.org\/10.3390\/rs14163969","relation":{"has-preprint":[{"id-type":"doi","id":"10.20944\/preprints202206.0390.v1","asserted-by":"object"}]},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,8,16]]}}}