{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,8]],"date-time":"2026-04-08T16:53:43Z","timestamp":1775667223568,"version":"3.50.1"},"reference-count":37,"publisher":"MDPI AG","issue":"21","license":[{"start":{"date-parts":[[2022,10,31]],"date-time":"2022-10-31T00:00:00Z","timestamp":1667174400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China (NSFC)","doi-asserted-by":"publisher","award":["62101160"],"award-info":[{"award-number":["62101160"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Compared with anchor-based detectors, anchor-free detectors have the advantage of flexibility and a lower calculation complexity. However, in complex remote sensing scenes, the limited geometric size, weak features of objects, and widely distributed environmental elements similar to the characteristics of objects make small object detection a challenging task. To solve these issues, we propose an anchor-free detector named FE-CenterNet, which can accurately detect small objects such as vehicles in complicated remote sensing scenes. First, we designed a feature enhancement module (FEM) composed of a feature aggregation structure (FAS) and an attention generation structure (AGS). This module contributes to suppressing the interference of false alarms in the scene by mining multiscale contextual information and combining a coordinate attention mechanism, thus improving the perception of small objects. Meanwhile, to meet the high positioning accuracy requirements of small objects, we proposed a new loss function without extra calculation and time cost during the inference process. Finally, to verify the algorithm performance and provide a foundation for subsequent research, we established a dim and small vehicle dataset (DSVD) containing various objects and complex scenes. The experiment results demonstrate that the proposed method performs better than mainstream object detectors. Specifically, the average precision (AP) metric of our method is 7.2% higher than that of the original CenterNet with only a decrease of 1.3 FPS.<\/jats:p>","DOI":"10.3390\/rs14215488","type":"journal-article","created":{"date-parts":[[2022,11,1]],"date-time":"2022-11-01T06:01:28Z","timestamp":1667282488000},"page":"5488","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":56,"title":["Feature-Enhanced CenterNet for Small Object Detection in Remote Sensing Images"],"prefix":"10.3390","volume":"14","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-3927-2436","authenticated-orcid":false,"given":"Tianjun","family":"Shi","sequence":"first","affiliation":[{"name":"Research Center for Space Optical Engineering, Harbin Institute of Technology, Harbin 150001, China"}]},{"given":"Jinnan","family":"Gong","sequence":"additional","affiliation":[{"name":"Research Center for Space Optical Engineering, Harbin Institute of Technology, Harbin 150001, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4418-605X","authenticated-orcid":false,"given":"Jianming","family":"Hu","sequence":"additional","affiliation":[{"name":"Research Center for Space Optical Engineering, Harbin Institute of Technology, Harbin 150001, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5504-8480","authenticated-orcid":false,"given":"Xiyang","family":"Zhi","sequence":"additional","affiliation":[{"name":"Research Center for Space Optical Engineering, Harbin Institute of Technology, Harbin 150001, China"}]},{"given":"Wei","family":"Zhang","sequence":"additional","affiliation":[{"name":"Research Center for Space Optical Engineering, Harbin Institute of Technology, Harbin 150001, China"}]},{"given":"Yin","family":"Zhang","sequence":"additional","affiliation":[{"name":"Key Laboratory of Space Photoelectric Detection and Perception (Nanjing University of Aeronautics and Astronautics), Ministry of Industry and Information Technology, Nanjing 211106, China"}]},{"given":"Pengfei","family":"Zhang","sequence":"additional","affiliation":[{"name":"Research Center for Space Optical Engineering, Harbin Institute of Technology, Harbin 150001, China"}]},{"given":"Guangzheng","family":"Bao","sequence":"additional","affiliation":[{"name":"Research Center for Space Optical Engineering, Harbin Institute of Technology, Harbin 150001, China"}]}],"member":"1968","published-online":{"date-parts":[[2022,10,31]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"296","DOI":"10.1016\/j.isprsjprs.2019.11.023","article-title":"Object detection in optical remote sensing images: A survey and a new benchmark","volume":"159","author":"Li","year":"2020","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"10015","DOI":"10.1109\/TGRS.2019.2930982","article-title":"CAD-Net: A context-aware detection network for objects in remote sensing imagery","volume":"57","author":"Zhang","year":"2019","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Chen, L., Shi, W., and Deng, D. (2021). Improved YOLOv3 based on attention mechanism for fast and accurate ship detection in optical remote sensing images. Remote Sens., 13.","DOI":"10.3390\/rs13040660"},{"key":"ref_4","first-page":"5615113","article-title":"LOVD: Land Vehicle Detection in Complex Scenes of Optical Remote Sensing Image","volume":"60","author":"Yan","year":"2021","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_5","first-page":"8020805","article-title":"Target-guided feature super-resolution for vehicle detection in remote sensing images","volume":"19","author":"Li","year":"2021","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Hu, J., Zhi, X., Shi, T., Zhang, W., Cui, Y., and Zhao, S. (2021). PAG-YOLO: A portable attention-guided YOLO network for small ship detection. Remote Sens., 13.","DOI":"10.3390\/rs13163059"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Cheng, G., Yuan, X., Yao, X., Yan, K., Zeng, Q., and Han, J. (2022). Towards Large-Scale Small Object Detection: Survey and Benchmarks. arXiv.","DOI":"10.1109\/TPAMI.2023.3290594"},{"key":"ref_8","unstructured":"Krizhevsky, A., Sutskever, I., and Hinton, G.E. (2012, January 3\u20136). Imagenet classification with deep convolutional neural networks. Proceedings of the Advances in Neural Information Processing Systems 25, Lake Tahoe, NV, USA."},{"key":"ref_9","unstructured":"Ren, S., He, K., Girshick, R., and Sun, J. (2015, January 7\u201312). Faster r-cnn: Towards real-time object detection with region proposal networks. Proceedings of the Advances in Neural Information Processing Systems 28, Montreal, QC, Canada."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Redmon, J., and Farhadi, A. (2017, January 21\u201326). YOLO9000: Better, faster, stronger. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.690"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Xia, G.-S., Bai, X., Ding, J., Zhu, Z., Belongie, S., Luo, J., Datcu, M., Pelillo, M., and Zhang, L. (2018, January 18\u201323). DOTA: A large-scale dataset for object detection in aerial images. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00418"},{"key":"ref_12","unstructured":"Zhou, X., Wang, D., and Kr\u00e4henb\u00fchl, P. (2019). Objects as points. arXiv."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Girshick, R., Donahue, J., Darrell, T., and Malik, J. (2014, January 23\u201328). Rich feature hierarchies for accurate object detection and semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA.","DOI":"10.1109\/CVPR.2014.81"},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Girshick, R. (2015, January 7\u201313). Fast r-cnn. Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile.","DOI":"10.1109\/ICCV.2015.169"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"He, K., Gkioxari, G., Doll\u00e1r, P., and Girshick, R. (2017, January 22\u201329). Mask r-cnn. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.322"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Cai, Z., and Vasconcelos, N. (2018, January 18\u201323). Cascade r-cnn: Delving into high quality object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00644"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Wang, Y., Ye, S., Bai, Y., Gao, G., and Gu, Y. (2021, January 11\u201316). Vehicle Detection Using Deep Learning with Deformable Convolution. Proceedings of the 2021 IEEE International Geoscience and Remote Sensing Symposium IGARSS, Brussels, Belgium.","DOI":"10.1109\/IGARSS47720.2021.9553028"},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"18091","DOI":"10.1007\/s11042-022-12609-5","article-title":"Small-Scale Aircraft Detection in Remote Sensing Images Based on Faster-RCNN","volume":"81","author":"Zhang","year":"2022","journal-title":"Multimed. Tools Appl."},{"key":"ref_19","unstructured":"Redmon, J., and Farhadi, A. (2018). Yolov3: An incremental improvement. arXiv."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Lin, T.-Y., Goyal, P., Girshick, R., He, K., and Doll\u00e1r, P. (2017, January 22\u201329). Focal loss for dense object detection. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.324"},{"key":"ref_21","unstructured":"Bochkovskiy, A., Wang, C.-Y., and Liao, H.-Y.M. (2020). Yolov4: Optimal speed and accuracy of object detection. arXiv."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Wang, C.-Y., Bochkovskiy, A., and Liao, H.-Y.M. (2022). YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. arXiv.","DOI":"10.1109\/CVPR52729.2023.00721"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Bashir, S.M.A., and Wang, Y. (2021). Small Object Detection in Remote Sensing Images with Residual Feature Aggregation-Based Super-Resolution and Object Detector Network. Remote Sens., 13.","DOI":"10.3390\/rs13091854"},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"8683226","DOI":"10.1155\/2021\/8683226","article-title":"Vehicle Detection in Remote Sensing Image Based on Machine Vision","volume":"2021","author":"Zhou","year":"2021","journal-title":"Comput. Intell. Neurosci."},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Kim, M., Jeong, J., and Kim, S. (2021). ECAP-YOLO: Efficient Channel Attention Pyramid YOLO for Small Object Detection in Aerial Image. Remote Sens., 13.","DOI":"10.3390\/rs13234851"},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Law, H., and Deng, J. (2018, January 8\u201314). Cornernet: Detecting objects as paired keypoints. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01264-9_45"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Dong, Z., Li, G., Liao, Y., Wang, F., Ren, P., and Qian, C. (2020, January 13\u201319). Centripetalnet: Pursuing high-quality keypoint pairs for object detection. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.01053"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Zhou, X., Zhuo, J., and Krahenbuhl, P. (2019, January 15\u201320). Bottom-up object detection by grouping extreme and center points. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00094"},{"key":"ref_29","unstructured":"Tian, Z., Shen, C., Chen, H., and He, T. (November, January 27). Fcos: Fully convolutional one-stage object detection. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Seoul, Korea."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"121","DOI":"10.1109\/TVT.2021.3049805","article-title":"FII-CenterNet: An anchor-free detector with foreground attention for traffic object detection","volume":"70","author":"Fan","year":"2021","journal-title":"IEEE Trans. Veh. Technol."},{"key":"ref_31","unstructured":"Zhou, X., Koltun, V., and Kr\u00e4henb\u00fchl, P. (2021). Probabilistic two-stage detection. arXiv."},{"key":"ref_32","unstructured":"Duan, K., Bai, S., Xie, L., Qi, H., Huang, Q., and Tian, Q. (2022). CenterNet++ for Object Detection. arXiv."},{"key":"ref_33","unstructured":"Chen, L.-C., Papandreou, G., Kokkinos, I., Murphy, K., and Yuille, A.L. (2016). DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs. arXiv."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Hu, J., Shen, L., and Sun, G. (2018, January 18\u201323). Squeeze-and-excitation networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00745"},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Hou, Q., Zhou, D., and Feng, J. (2021, January 20\u201325). Coordinate attention for efficient mobile network design. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.01350"},{"key":"ref_36","unstructured":"Chen, D., and Miao, D. (2021). Control Distance IoU and Control Distance IoU Loss Function for Better Bounding Box Regression. arXiv."},{"key":"ref_37","unstructured":"Leong, C., Rovito, T., Mendoza-Schrock, O., Menart, C., Bowser, J., Moore, L., Scarborough, S., Minardi, M., and Hascher, D. (2019, October 16). Unified Coincident Optical and Radar for Recognition (UNICORN) 2008 Dataset. Available online: https:\/\/github.com\/AFRL-RY\/data-unicorn-2008."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/14\/21\/5488\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T01:07:00Z","timestamp":1760144820000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/14\/21\/5488"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,10,31]]},"references-count":37,"journal-issue":{"issue":"21","published-online":{"date-parts":[[2022,11]]}},"alternative-id":["rs14215488"],"URL":"https:\/\/doi.org\/10.3390\/rs14215488","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,10,31]]}}}