{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,3]],"date-time":"2026-04-03T15:21:09Z","timestamp":1775229669584,"version":"3.50.1"},"reference-count":55,"publisher":"MDPI AG","issue":"9","license":[{"start":{"date-parts":[[2021,5,10]],"date-time":"2021-05-10T00:00:00Z","timestamp":1620604800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"National Natural Science Foundation of China","award":["62071384"],"award-info":[{"award-number":["62071384"]}]},{"name":"Natural Science Basic Research Plan in Shaanxi Province of China","award":["2019JM-311"],"award-info":[{"award-number":["2019JM-311"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>This paper deals with detecting small objects in remote sensing images from satellites or any aerial vehicle by utilizing the concept of image super-resolution for image resolution enhancement using a deep-learning-based detection method. This paper provides a rationale for image super-resolution for small objects by improving the current super-resolution (SR) framework by incorporating a cyclic generative adversarial network (GAN) and residual feature aggregation (RFA) to improve detection performance. The novelty of the method is threefold: first, a framework is proposed, independent of the final object detector used in research, i.e., YOLOv3 could be replaced with Faster R-CNN or any object detector to perform object detection; second, a residual feature aggregation network was used in the generator, which significantly improved the detection performance as the RFA network detected complex features; and third, the whole network was transformed into a cyclic GAN. The image super-resolution cyclic GAN with RFA and YOLO as the detection network is termed as SRCGAN-RFA-YOLO, which is compared with the detection accuracies of other methods. Rigorous experiments on both satellite images and aerial images (ISPRS Potsdam, VAID, and Draper Satellite Image Chronology datasets) were performed, and the results showed that the detection performance increased by using super-resolution methods for spatial resolution enhancement; for an IoU of 0.10, AP of 0.7867 was achieved for a scale factor of 16.<\/jats:p>","DOI":"10.3390\/rs13091854","type":"journal-article","created":{"date-parts":[[2021,5,10]],"date-time":"2021-05-10T10:49:51Z","timestamp":1620643791000},"page":"1854","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":99,"title":["Small Object Detection in Remote Sensing Images with Residual Feature Aggregation-Based Super-Resolution and Object Detector Network"],"prefix":"10.3390","volume":"13","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-9899-6293","authenticated-orcid":false,"given":"Syed Muhammad Arsalan","family":"Bashir","sequence":"first","affiliation":[{"name":"School of Electronics and Information, Northwestern Polytechnical University, Xi\u2019an 710129, China"},{"name":"Pakistan Space &amp; Upper Atmosphere Research Commission, Karachi, Sindh 59201, Pakistan"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7743-1779","authenticated-orcid":false,"given":"Yi","family":"Wang","sequence":"additional","affiliation":[{"name":"School of Electronics and Information, Northwestern Polytechnical University, Xi\u2019an 710129, China"}]}],"member":"1968","published-online":{"date-parts":[[2021,5,10]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"106838","DOI":"10.1109\/ACCESS.2019.2932731","article-title":"An Improved Faster R-CNN for Small Object Detection","volume":"7","author":"Cao","year":"2019","journal-title":"IEEE Access"},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Stuparu, D.G., Ciobanu, R.I., and Dobre, C. (2020). Vehicle detection in overhead satellite images using a one-stage object detection model. Sensors, 20.","DOI":"10.3390\/s20226485"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"951","DOI":"10.2991\/ijcis.11.1.72","article-title":"SCAN: Semantic context aware network for accurate small object detection","volume":"11","author":"Guan","year":"2018","journal-title":"Int. J. Comput. Intell. Syst."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"23","DOI":"10.1109\/LGRS.2020.2967819","article-title":"Multiscale Convolutional Neural Networks for Geospatial Object Detection in VHR Satellite Images","volume":"18","author":"Yao","year":"2021","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Alganci, U., Soydas, M., and Sertel, E. (2020). Comparative research on deep learning approaches for airplane detection from very high-resolution satellite images. Remote Sens., 12.","DOI":"10.3390\/rs12030458"},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"103910","DOI":"10.1016\/j.imavis.2020.103910","article-title":"Recent advances in small object detection based on deep learning: A review","volume":"97","author":"Tong","year":"2020","journal-title":"Image Vis. Comput."},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Robinson, Y.H., Vimal, S., Khari, M., Hern\u00e1ndez, F.C.L., and Crespo, R.G. (2020). Tree-based convolutional neural networks for object classification in segmented satellite images. Int. J. High. Perform. Comput. Appl.","DOI":"10.1177\/1094342020945026"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"752","DOI":"10.1080\/01431161.2019.1648904","article-title":"Multi-channel higher-order local autocorrelation for object detection on satellite images","volume":"41","author":"Uehara","year":"2020","journal-title":"Int. J. Remote Sens."},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Zhang, W., Wang, S., Thachan, S., Chen, J., and Qian, Y. (2018, January 22\u201327). Deconv R-CNN for small object detection on remote sensing images. Proceedings of the International Geoscience and Remote Sensing Symposium (IGARSS), Valencia, Spain.","DOI":"10.1109\/IGARSS.2018.8517436"},{"key":"ref_10","unstructured":"Ren, S., He, K., Girshick, R., and Sun, J. (2015, January 10\u201312). Faster R-CNN: Towards real-time object detection with region proposal networks. Proceedings of the Advances in Neural Information Processing Systems, Montreal, QC, Canada."},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Lin, T.Y., Doll\u00e1r, P., Girshick, R., He, K., Hariharan, B., and Belongie, S. (2017, January 21\u201326). Feature pyramid networks for object detection. Proceedings of the 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.106"},{"key":"ref_12","unstructured":"Redmon, J., and Farhadi, A. (2018). YOLOv3: An incremental improvement. arXiv."},{"key":"ref_13","unstructured":"Bochkovskiy, A., Wang, C.Y., and Liao, H.Y.M. (2020). YOLOv4: Optimal Speed and Accuracy of Object Detection. arXiv."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.Y., and Berg, A.C. (2016, January 11\u201314). SSD: Single shot multibox detector. Proceedings of the European Conference on Computer Vision (ECCV), Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46448-0_2"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Lin, T.Y., Goyal, P., Girshick, R., He, K., and Dollar, P. (2017, January 22\u201329). Focal Loss for Dense Object Detection. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.324"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Yan, J., Wang, H., Yan, M., Diao, W., Sun, X., and Li, H. (2019). IoU-adaptive deformable R-CNN: Make full use of IoU for multi-class object detection in remote sensing imagery. Remote Sens., 11.","DOI":"10.3390\/rs11030286"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Xu, Z., Shi, H., Li, N., Xiang, C., and Zhou, H. (2018, January 10\u201312). Vehicle Detection under UAV Based on Optimal Dense YOLO Method. Proceedings of the 2018 5th International Conference on Systems and Informatics, ICSAI 2018, Nanjing, China.","DOI":"10.1109\/ICSAI.2018.8599403"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Liu, M., Wang, X., Zhou, A., Fu, X., Ma, Y., and Piao, C. (2020). Uav-yolo: Small object detection on unmanned aerial vehicle perspective. Sensors, 20.","DOI":"10.3390\/s20082238"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Froidevaux, A., Julier, A., Lifschitz, A., Pham, M.T., Dambreville, R., Lef\u00e8vre, S., Lassalle, P., and Huynh, T.L. (October, January 26). Vehicle Detection and Counting from VHR Satellite Images: Efforts and Open Issues. Proceedings of the IGARSS 2020\u20142020 IEEE International Geoscience and Remote Sensing Symposium, Virtual Symposium.","DOI":"10.1109\/IGARSS39084.2020.9323827"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Pham, M.T., Courtrai, L., Friguet, C., Lef\u00e8vre, S., and Baussard, A. (2020). YOLO-fine: One-stage detector of small objects under various backgrounds in remote sensing images. Remote Sens., 12.","DOI":"10.3390\/rs12152501"},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"122","DOI":"10.1080\/2150704X.2020.1837988","article-title":"SSS-YOLO: Towards more accurate detection for small ships in SAR image","volume":"12","author":"Wang","year":"2021","journal-title":"Remote Sens. Lett."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Courtrai, L., Pham, M.T., and Lef\u00e8vre, S. (2020). Small object detection in remote sensing images based on super-resolution with auxiliary generative adversarial networks. Remote Sens., 12.","DOI":"10.3390\/rs12193152"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Hui, Z., Li, J., Gao, X., and Wang, X. (2021). Progressive perception-oriented network for single image super-resolution. Inf. Sci., 769\u2013786.","DOI":"10.1016\/j.ins.2020.08.114"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Bashir, S.M.A., Wang, Y., and Khan, M. (2021). A Comprehensive Review of Deep Learning-based Single Image Super-resolution. arXiv.","DOI":"10.7717\/peerj-cs.621"},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Dong, C., Loy, C.C., He, K., and Tang, X. (2014). Learning a deep convolutional network for image super-resolution. Lecture Notes in Computer Science, Springer.","DOI":"10.1007\/978-3-319-10593-2_13"},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Kim, J., Lee, J.K., and Lee, K.M. (2016, January 27\u201330). Deeply-recursive convolutional network for image super-resolution. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.181"},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"391","DOI":"10.1007\/978-3-319-46475-6_25","article-title":"Accelerating the super-resolution convolutional neural network","volume":"Volume 9906 LNCS","author":"Dong","year":"2016","journal-title":"Lecture Notes in Computer Science"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Kim, J., Lee, J.K., and Lee, K.M. (2016, January 27\u201330). Accurate image super-resolution using very deep convolutional networks. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.182"},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Lai, W.S., Huang, J.B., Ahuja, N., and Yang, M.H. (2017, January 21\u201326). Deep laplacian pyramid networks for fast and accurate super-resolution. Proceedings of the 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.618"},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Lim, B., Son, S., Kim, H., Nah, S., and Lee, K.M. (2017, January 21\u201326). Enhanced Deep Residual Networks for Single Image Super-Resolution. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, Honolulu, HI, USA.","DOI":"10.1109\/CVPRW.2017.151"},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Sajjadi, M.S.M., Scholkopf, B., and Hirsch, M. (2017, January 21\u201326). EnhanceNet: Single Image Super-Resolution through Automated Texture Synthesis. Proceedings of the IEEE International Conference on Computer Vision, Honolulu, HI, USA.","DOI":"10.1109\/ICCV.2017.481"},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Haris, M., Shakhnarovich, G., and Ukita, N. (2019, January 16\u201320). Recurrent back-projection network for video super-resolution. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00402"},{"key":"ref_33","unstructured":"Ioffe, S., and Szegedy, C. (2015, January 6\u201311). Batch normalization: Accelerating deep network training by reducing internal covariate shift. Proceedings of the 32nd International Conference on Machine Learning, ICML 2015, Lille, France."},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"116","DOI":"10.1016\/j.neucom.2019.11.044","article-title":"Wavelet-based residual attention network for image super-resolution","volume":"382","author":"Xue","year":"2020","journal-title":"Neurocomputing"},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Zhang, L., and Ma, J. (2021). Salient Object Detection Based on Progressively Supervised Learning for Remote Sensing Images. IEEE Trans. Geosci. Remote. Sens.","DOI":"10.1109\/TGRS.2020.3045708"},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Ferdous, S.N., Mostofa, M., and Nasrabadi, N. (2019, January 15\u201317). Super resolution-assisted deep aerial vehicle detection. Proceedings of the Artificial Intelligence and Machine Learning for Multi-Domain Operations Applications, Baltimore, MD, USA.","DOI":"10.1117\/12.2519045"},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Rabbi, J., Ray, N., Schubert, M., Chowdhury, S., and Chao, D. (2020). Small-object detection in remote sensing images with end-to-end edge-enhanced GAN and object detector network. Remote Sens., 12.","DOI":"10.20944\/preprints202003.0313.v2"},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Leal-Taix\u00e9, L., and Roth, S. (2019). ESRGAN: Enhanced Super-Resolution Generative Adversarial Networks. Computer Vision\u2014ECCV 2018 Workshops. ECCV 2018, Springer.","DOI":"10.1007\/978-3-030-11024-6"},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"5799","DOI":"10.1109\/TGRS.2019.2902431","article-title":"Edge-Enhanced GAN for Remote Sensing Image Superresolution","volume":"57","author":"Jiang","year":"2019","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"293","DOI":"10.5194\/isprsannals-I-3-293-2012","article-title":"The ISPRS benchmark on urban object classification and 3D building reconstruction","volume":"I-3","author":"Rottensteiner","year":"2012","journal-title":"ISPRS Ann. Photogramm. Remote Sens. Spat. Inf. Sci."},{"key":"ref_41","unstructured":"(2021, May 08). Draper Draper Satellite Image Chronology. Available online: https:\/\/www.kaggle.com\/c\/draper-satellite-image-chronology\/overview."},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Li, C.Y., and Lin, H.Y. (2020, January 27\u201329). Vehicle detection and classification in aerial images using convolutional neural networks. Proceedings of the VISIGRAPP 2020\u201415th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, Valetta, Malta.","DOI":"10.5220\/0008941707750782"},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Shermeyer, J., and Van Etten, A. (2019, January 16\u201320). The effects of super-resolution on object detection performance in satellite imagery. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, Long Beach, CA, USA.","DOI":"10.1109\/CVPRW.2019.00184"},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Wei, Z., and Liu, Y. (2021). Deep Intelligent Neural Network for Medical Geographic Small-target Intelligent Satellite Image Super-resolution. J. Imaging Sci. Technol.","DOI":"10.2352\/J.ImagingSci.Technol.2021.65.3.030406"},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Shi, W., Caballero, J., Huszar, F., Totz, J., Aitken, A.P., Bishop, R., Rueckert, D., and Wang, Z. (2016, January 27\u201330). Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.207"},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"295","DOI":"10.1109\/TPAMI.2015.2439281","article-title":"Image Super-Resolution Using Deep Convolutional Networks","volume":"38","author":"Dong","year":"2016","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_47","doi-asserted-by":"crossref","first-page":"56","DOI":"10.1109\/38.988747","article-title":"Example-based super-resolution","volume":"22","author":"Freeman","year":"2002","journal-title":"IEEE Comput. Graph. Appl."},{"key":"ref_48","doi-asserted-by":"crossref","unstructured":"Liu, J., Zhang, W., Tang, Y., Tang, J., and Wu, G. (2020, January 14\u201319). Residual Feature Aggregation Network for Image Super-Resolution. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Virtual Conference.","DOI":"10.1109\/CVPR42600.2020.00243"},{"key":"ref_49","doi-asserted-by":"crossref","unstructured":"Audebert, N., Le Saux, B., and Lef\u00e8vre, S. (2017). Segment-before-detect: Vehicle detection and classification through semantic segmentation of aerial images. Remote Sens., 9.","DOI":"10.3390\/rs9040368"},{"key":"ref_50","unstructured":"Arjovsky, M., Chintala, S., and Bottou, L. (2017, January 6\u201311). Wasserstein generative adversarial networks. Proceedings of the 34th International Conference on Machine Learning, ICML 2017, Sydney, Australia."},{"key":"ref_51","doi-asserted-by":"crossref","unstructured":"Ledig, C., Theis, L., Husz\u00e1r, F., Caballero, J., Cunningham, A., Acosta, A., Aitken, A., Tejani, A., Totz, J., and Wang, Z. (2017, January 21\u201326). Photo-realistic single image super-resolution using a generative adversarial network. Proceedings of the 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.19"},{"key":"ref_52","unstructured":"Gulrajani, I., Ahmed, F., Arjovsky, M., Dumoulin, V., and Courville, A. (2017, January 4\u20139). Improved training of wasserstein GANs. Proceedings of the Advances in Neural Information Processing Systems, Long Beach, CA, USA."},{"key":"ref_53","doi-asserted-by":"crossref","unstructured":"Zhu, J.Y., Park, T., Isola, P., and Efros, A.A. (2017, January 21\u201326). Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks. Proceedings of the IEEE International Conference on Computer Vision, Honolulu, HI, USA.","DOI":"10.1109\/ICCV.2017.244"},{"key":"ref_54","doi-asserted-by":"crossref","unstructured":"Tan, M., Pang, R., and Le, Q.V. (2020, January 14\u201319). EfficientDet: Scalable and efficient object detection. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Virtual Conference.","DOI":"10.1109\/CVPR42600.2020.01079"},{"key":"ref_55","unstructured":"Sumbul, G., Charfuelan, M., Demir, B., and Markl, V. (August, January 28). Bigearthnet: A Large-Scale Benchmark Archive for Remote Sensing Image Understanding. Proceedings of the IEEE International Geoscience and Remote Sensing Symposium, Yokohama, Japan."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/13\/9\/1854\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T05:58:47Z","timestamp":1760162327000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/13\/9\/1854"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,5,10]]},"references-count":55,"journal-issue":{"issue":"9","published-online":{"date-parts":[[2021,5]]}},"alternative-id":["rs13091854"],"URL":"https:\/\/doi.org\/10.3390\/rs13091854","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,5,10]]}}}