{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,16]],"date-time":"2026-01-16T03:58:05Z","timestamp":1768535885006,"version":"3.49.0"},"reference-count":75,"publisher":"MDPI AG","issue":"10","license":[{"start":{"date-parts":[[2021,5,14]],"date-time":"2021-05-14T00:00:00Z","timestamp":1620950400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Object detection in remote sensing images (RSIs) is one of the basic tasks in the field of remote sensing image automatic interpretation. In recent years, the deep object detection frameworks of natural scene images (NSIs) have been introduced into object detection on RSIs, and the detection performance has improved significantly because of the powerful feature representation. However, there are still many challenges concerning the particularities of remote sensing objects. One of the main challenges is the missed detection of small objects which have less than five percent of the pixels of the big objects. Generally, the existing algorithms choose to deal with this problem by multi-scale feature fusion based on a feature pyramid. However, the benefits of this strategy are limited, considering that the location of small objects in the feature map will disappear when the detection task is processed at the end of the network. In this study, we propose a subtask attention network (StAN), which handles the detection task directly on the shallow layer of the network. First, StAN contains one shared feature branch and two subtask attention branches of a semantic auxiliary subtask and a detection subtask based on the multi-task attention network (MTAN). Second, the detection branch uses only low-level features considering small objects. Third, the attention map guidance mechanism is put forward to optimize the network for keeping the identification ability. Fourth, the multi-dimensional sampling module (MdS), global multi-view channel weights (GMulW) and target-guided pixel attention (TPA) are designed for further improvement of the detection accuracy in complex scenes. The experimental results on the NWPU VHR-10 dataset and DOTA dataset demonstrated that the proposed algorithm achieved the SOTA performance, and the missed detection of small objects decreased. On the other hand, ablation experiments also proved the effects of MdS, GMulW and TPA.<\/jats:p>","DOI":"10.3390\/rs13101925","type":"journal-article","created":{"date-parts":[[2021,5,17]],"date-time":"2021-05-17T02:31:34Z","timestamp":1621218694000},"page":"1925","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":13,"title":["Subtask Attention Based Object Detection in Remote Sensing Images"],"prefix":"10.3390","volume":"13","author":[{"given":"Shengzhou","family":"Xiong","sequence":"first","affiliation":[{"name":"National Key Laboratory of Science &amp; Technology on Multi-Spectral Information Processing, School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan 430074, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0963-5339","authenticated-orcid":false,"given":"Yihua","family":"Tan","sequence":"additional","affiliation":[{"name":"National Key Laboratory of Science &amp; Technology on Multi-Spectral Information Processing, School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan 430074, China"},{"name":"Shenzhen Huazhong University of Science and Technology Research Institute, Shenzhen 518000, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8203-1246","authenticated-orcid":false,"given":"Yansheng","family":"Li","sequence":"additional","affiliation":[{"name":"School of Remote Sensing and Information Engineering, Wuhan University, Wuhan 430079, China"}]},{"given":"Cai","family":"Wen","sequence":"additional","affiliation":[{"name":"National Key Laboratory of Science &amp; Technology on Multi-Spectral Information Processing, School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan 430074, China"}]},{"given":"Pei","family":"Yan","sequence":"additional","affiliation":[{"name":"National Key Laboratory of Science &amp; Technology on Multi-Spectral Information Processing, School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan 430074, China"}]}],"member":"1968","published-online":{"date-parts":[[2021,5,14]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1016\/j.isprsjprs.2016.03.014","article-title":"A Survey on Object Detection in Optical Remote Sensing Images","volume":"117","author":"Cheng","year":"2016","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"2207","DOI":"10.1109\/JPROC.2016.2598228","article-title":"Big data for remote sensing: Challenges and opportunities","volume":"104","author":"Chi","year":"2016","journal-title":"Proc. IEEE"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"94","DOI":"10.1016\/j.inffus.2020.10.008","article-title":"Image retrieval from remote sensing big data: A survey","volume":"67","author":"Li","year":"2021","journal-title":"Inf. Fusion"},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"1756","DOI":"10.1109\/TCYB.2020.2989241","article-title":"Error-Tolerant Deep Learning for Remote Sensing Image Scene Classification","volume":"51","author":"Li","year":"2021","journal-title":"IEEE Trans. Cybern."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"573","DOI":"10.1109\/LGRS.2012.2214022","article-title":"Aircraft Recognition in High-Resolution Satellite Images Using Coarse-to-Fine Shape Prior","volume":"10","author":"Liu","year":"2013","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"1701","DOI":"10.1109\/TGRS.2012.2207123","article-title":"Automated detection of arbitrarily shaped buildings in complex environments from monocular VHR optical satellite imagery","volume":"51","author":"Ok","year":"2013","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"180","DOI":"10.1016\/j.isprsjprs.2013.09.014","article-title":"Geographic object-based image analysis\u2013towards a new paradigm","volume":"87","author":"Blaschke","year":"2014","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_8","first-page":"1938","article-title":"Fast multiclass vehicle detection on aerial images","volume":"9","author":"Liu","year":"2015","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"32","DOI":"10.1016\/j.isprsjprs.2013.08.001","article-title":"Object detection in remote sensing imagery using a discriminatively trained mixture model","volume":"85","author":"Cheng","year":"2013","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"128","DOI":"10.1109\/LGRS.2010.2051792","article-title":"Airport detection from large IKONOS images using clustered SIFT keypoints and region information","volume":"1","author":"Tao","year":"2011","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"113","DOI":"10.1016\/j.patcog.2017.12.012","article-title":"Robust infrared small target detection using local steering kernel reconstruction","volume":"77","author":"Li","year":"2018","journal-title":"Pattern Recognit."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"374","DOI":"10.1109\/LGRS.2010.2077272","article-title":"A multifeature tensor for remotesensing target recognition","volume":"8","author":"Zhang","year":"2011","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"366","DOI":"10.1109\/LGRS.2009.2035644","article-title":"Object classification of aerial images with bag of visual words","volume":"7","author":"Xu","year":"2010","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"15014","DOI":"10.3390\/rs71115014","article-title":"Accurate annotation of remote sensing images via active spectral clustering with little expert knowledge","volume":"7","author":"Xia","year":"2015","journal-title":"Remote Sens."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"119","DOI":"10.1016\/j.isprsjprs.2014.10.002","article-title":"Multi-class geospatial object detection and geographic image classification based on collection of part detectors","volume":"98","author":"Cheng","year":"2014","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"701","DOI":"10.1109\/LGRS.2014.2358994","article-title":"Weakly supervised learning for target detection in remote sensing images","volume":"12","author":"Zhang","year":"2015","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"3978","DOI":"10.1109\/TGRS.2007.907109","article-title":"A multiple conditional random fields ensemble model for urban area detection in remote sensing optical images","volume":"45","author":"Zhong","year":"2007","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"4483","DOI":"10.1109\/TGRS.2015.2400462","article-title":"Robust rooftop extraction from visible band images using higher order CRF","volume":"53","author":"Li","year":"2015","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"162","DOI":"10.1016\/j.neucom.2015.02.073","article-title":"A coarse-to-fine model for airport detection from remote sensing images using targetoriented visual saliency and CRF","volume":"164","author":"Yao","year":"2015","journal-title":"Neurocomputing"},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"45","DOI":"10.1080\/01431161.2012.705443","article-title":"Automatic landslide detection from remote-sensing imagery using a scene classification method based on BoVW and pLSA","volume":"34","author":"Cheng","year":"2013","journal-title":"Int. J. Remote Sens."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"6627","DOI":"10.1109\/TGRS.2014.2299540","article-title":"Detection of compound structures using a Gaussian mixture model with spectral and spatial constraints","volume":"52","author":"Ari","year":"2014","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"4511","DOI":"10.1109\/TGRS.2013.2282355","article-title":"Ship detection in high-resolution optical imagery based on anomaly detector and local shape feature","volume":"52","author":"Shi","year":"2014","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"1830","DOI":"10.1109\/JSTARS.2015.2416255","article-title":"Target detection based on random forest metric learning","volume":"8","author":"Dong","year":"2015","journal-title":"IEEE J. Sel. Top. Appl. Earth Observ. Remote Sens."},{"key":"ref_24","unstructured":"Krizhevsky, A., Sutskever, I., and Hinton, G.E. (2012, January 3\u20136). Imagenet classification with deep convolutional neural networks. Proceedings of the Advances in Neural Information Processing Systems, Lake Tahoe, NV, USA."},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Girshick, R., Donahue, J., Darrell, T., and Malik, J. (2014, January 23\u201328). Rich feature hierarchies for accurate object detection and semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA.","DOI":"10.1109\/CVPR.2014.81"},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Redmon, J., Divvala, S., Girshick, R., and Farhadi, A. (2016, January 27\u201330). You only look once: Unified, real-time object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.91"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, C., Fu, C., and Berg, A.C. (2016, January 11\u201314). SSD: Single Shot Multibox Detector. Proceedings of the European Conference on Computer Vision, Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46448-0_2"},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"182","DOI":"10.1016\/j.isprsjprs.2018.09.014","article-title":"Deep networks under scene-level supervision for multi-class geospatial object detection from remote sensing images","volume":"146","author":"Li","year":"2018","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"112045","DOI":"10.1016\/j.rse.2020.112045","article-title":"Accurate cloud detection in high-resolution remote sensing imagery by weakly supervised deep learning","volume":"250","author":"Li","year":"2020","journal-title":"Remote Sens. Environ."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"4895","DOI":"10.1109\/JSTARS.2015.2467377","article-title":"A Hierarchical Oil Tank Detector with Deep Surrounding Features for High-Resolution Optical Satellite Imagery","volume":"8","author":"Zhang","year":"2017","journal-title":"IEEE J. Sel. Top. Appl. Earth Observ. Remote Sens."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"1293","DOI":"10.1109\/LGRS.2017.2708722","article-title":"M-FCN: Effective Fully Convolutional Network-Based Airplane Detection Framework","volume":"14","author":"Yang","year":"2017","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"2486","DOI":"10.1109\/TGRS.2016.2645610","article-title":"Accurate Object Localization in Remote Sensing Images Based on Convolutional Neural Networks","volume":"55","author":"Long","year":"2017","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"20","DOI":"10.1016\/j.isprsjprs.2021.02.009","article-title":"Learning deep semantic segmentation network under multiple weakly-supervised constraints for cross-domain remote sensing image semantic segmentation","volume":"175","author":"Li","year":"2021","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Kang, M., Ji, K., Leng, X., and Lin, Z. (2017). Contextual region-based convolutional neural network with multilayer fusion for SAR ship detection. Remote Sens., 9.","DOI":"10.3390\/rs9080860"},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"2337","DOI":"10.1109\/TGRS.2017.2778300","article-title":"Rotation-insensitive and context-augmented object detection in remote sensing images","volume":"56","author":"Li","year":"2017","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_36","unstructured":"Cheng, L., Liu, X., Li, L., Jiao, L., and Tang, X. (2018). Deep Adaptive Proposal Network for Object Detection in Optical Remote Sensing Images. arXiv."},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Tang, T., Zhou, S., Deng, Z., Zou, H., and Lei, L. (2017). Vehicle Detection in Aerial Images Based on Region Convolutional Neural Networks and Hard Negative Example Mining. Sensors, 17.","DOI":"10.3390\/s17020336"},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"937","DOI":"10.1109\/LGRS.2018.2813094","article-title":"Arbitrary-Oriented Ship Detection Framework in Optical Remote-Sensing Images","volume":"15","author":"Liu","year":"2018","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_39","unstructured":"Etten, A.V. (2018). You Only Look Twice: Rapid Multi-Scale Object Detection in Satellite Imagery. arXiv."},{"key":"ref_40","unstructured":"Liu, L., Pan, Z., and Lei, B. (2017). Learning a Rotation Invariant Detector with Rotatable Bounding Box. arXiv."},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Chen, S., Zhan, R., and Zhang, J. (2018). Geospatial Object Detection in Remote Sensing Imagery Based on Multiscale Single-Shot Detector with Activated Semantics. Remote Sens., 10.","DOI":"10.3390\/rs10060820"},{"key":"ref_42","doi-asserted-by":"crossref","first-page":"120113:1","DOI":"10.1007\/s11432-019-2723-1","article-title":"MDSSD: Multi-scale deconvolutional single shot detector for small objects","volume":"63","author":"Cui","year":"2020","journal-title":"Sci. China Inf. Sci."},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Liu, S., Johns, E., and Davison, A.J. (2019, January 16\u201320). End-to-end multi-task learning with attention. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00197"},{"key":"ref_44","unstructured":"Redmon, J., and Farhadi, A. (2018). Yolov3: An Incremental Improvement. arXiv."},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep Residual Learning for Image Recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_46","doi-asserted-by":"crossref","unstructured":"Bi, F., Hou, J., Chen, L., Yang, Z., and Wang, Y. (2019). Ship Detection for Optical Remote Sensing Images Based on Visual Attention Enhanced Network. Sensors, 19.","DOI":"10.3390\/s19102271"},{"key":"ref_47","doi-asserted-by":"crossref","first-page":"681","DOI":"10.1109\/LGRS.2019.2930462","article-title":"Multi-Scale Spatial and Channel-wise Attention for Improving Object Detection in Remote Sensing Imagery","volume":"17","author":"Chen","year":"2019","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_48","doi-asserted-by":"crossref","first-page":"1305","DOI":"10.1109\/TIP.2020.3042084","article-title":"Dense Attention Fluid Network for Salient Object Detection in Optical Remote Sensing Images","volume":"30","author":"Zhang","year":"2020","journal-title":"IEEE Trans. Image Process."},{"key":"ref_49","doi-asserted-by":"crossref","unstructured":"Yin, W., Diao, W., Wang, P., Gao, X., Li, Y., and Sun, X. (2021). PCAN\u2014Part-Based Context Attention Network for Thermal Power Plant Detection in Remote Sensing Imagery. Remote Sens., 13.","DOI":"10.3390\/rs13071243"},{"key":"ref_50","doi-asserted-by":"crossref","unstructured":"Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., and Rabinovich, A. (2015, January 7\u201312). Going Deeper with Convolutions. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298594"},{"key":"ref_51","unstructured":"Yu, F., and Koltun, V. (2015). Multi-scale Context Aggregation by Dilated Convolutions. arXiv."},{"key":"ref_52","doi-asserted-by":"crossref","unstructured":"Chollet, F. (2017, January 21\u201326). Xception: Deep Learning with Depthwise Separable Convolutions. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.195"},{"key":"ref_53","doi-asserted-by":"crossref","unstructured":"Woo, S., Park, J., Lee, J.Y., and Kweon, I.S. (2018, January 8\u201314). CBAM: Convolutional Block Attention Module. Proceedings of the European Conference on Computer Vision, Munich, Germany.","DOI":"10.1007\/978-3-030-01234-2_1"},{"key":"ref_54","doi-asserted-by":"crossref","unstructured":"Zhao, Z., Li, J., Luo, Z., Li, J., and Chen, C. (2020). Remote Sensing Image Scene Classification Based on an Enhanced Attention Module. IEEE Geosci. Remote Sens. Lett.","DOI":"10.1109\/LGRS.2020.3011405"},{"key":"ref_55","doi-asserted-by":"crossref","unstructured":"Kalantidis, Y., Mellina, C., and Osindero, S. (2016, January 11\u201314). Cross-dimensional Weighting for Aggregated Deep Convolutional Features. Proceedings of the European Conference on Computer Vision, Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46604-0_48"},{"key":"ref_56","unstructured":"Yang, X., Yang, J., Yan, J., Zhang, Y., Zhang, T., Guo, Z., Sun, X., and Kun, F. (November, January 27). SCRDet: Towards More Robust Detection for Small, Cluttered and Rotated Objects. Proceedings of the International Conference on Computer Vision, Seoul, Korea."},{"key":"ref_57","doi-asserted-by":"crossref","first-page":"7405","DOI":"10.1109\/TGRS.2016.2601622","article-title":"Learning Rotation-Invariant Convolutional Neural Networks for Object Detection in VHR Optical Remote Sensing Images","volume":"54","author":"Cheng","year":"2016","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_58","doi-asserted-by":"crossref","unstructured":"Xia, G., Bai, X., Ding, J., Zhu, Z., Belongie, S., Luo, J., Datcu, M., Pelillo, M., and Zhang, L. (2018, January 18\u201323). DOTA: A Large-scale Dataset for Object Detection in Aerial Images. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00418"},{"key":"ref_59","doi-asserted-by":"crossref","unstructured":"Xu, Z., Xu, X., Wang, L., Yang, R., and Pu, F. (2017). Deformable ConvNet with Aspect Ratio Constrained NMS for Object Detection in Remote Sensing Imagery. Remote Sens., 9.","DOI":"10.3390\/rs9121312"},{"key":"ref_60","doi-asserted-by":"crossref","first-page":"109","DOI":"10.1109\/LGRS.2011.2161569","article-title":"Automatic Target Detection in High-Resolution Remote Sensing Images Using Spatial Sparse Coding Bag-of-Words Model","volume":"9","author":"Sun","year":"2011","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_61","doi-asserted-by":"crossref","unstructured":"Han, X., Zhong, Y., and Zhang, L. (2017). An Efficient and Robust Integrated Geospatial Object Detection Framework for High Spatial Resolution Remote Sensing Imagery. Remote Sens., 9.","DOI":"10.3390\/rs9070666"},{"key":"ref_62","doi-asserted-by":"crossref","unstructured":"Shen, Z., Liu, Z., Li, J., Jiang, Y., Chen, Y., and Xue, X. (2017, January 22\u201329). DSOD: Learning Deeply Supervised Object Detectors from Scratch. Proceedings of the International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.212"},{"key":"ref_63","unstructured":"Ren, S., He, K., Girshick, R., and Sun, J. (2015, January 7\u201312). Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. Proceedings of the Advances in Neural Information Processing Systems, Montreal, QC, Canada."},{"key":"ref_64","doi-asserted-by":"crossref","first-page":"4193","DOI":"10.1080\/01431161.2018.1562260","article-title":"A subclass supported convolutional neural network for object detection and localization in remote-sensing images","volume":"40","author":"Kilic","year":"2019","journal-title":"Int. J. Remote Sens."},{"key":"ref_65","doi-asserted-by":"crossref","first-page":"864","DOI":"10.1109\/LGRS.2018.2888887","article-title":"Scale Adaptive Proposal Network for Object Detection in Remote Sensing Images","volume":"16","author":"Zhang","year":"2019","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_66","doi-asserted-by":"crossref","unstructured":"Ren, Y., Zhu, C., and Xiao, S. (2018). Deformable Faster R-CNN with Aggregating Multi-Layer Features for Partially Occluded Object Detection in Optical Remote Sensing Images. Remote Sens., 10.","DOI":"10.3390\/rs10091470"},{"key":"ref_67","doi-asserted-by":"crossref","unstructured":"Redmon, J., and Farhadi, A. (2017, January 21\u201326). YOLO9000: Better, Faster, Stronger. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.690"},{"key":"ref_68","unstructured":"Dai, J., Li, Y., He, K., and Sun, J. (2016, January 5\u201310). R-FCN: Object Detection via Region-based Fully Convolutional Networks. Proceedings of the Advances in Neural Information Processing Systems, Barcelona, Spain."},{"key":"ref_69","doi-asserted-by":"crossref","unstructured":"Yang, X., Sun, H., Fu, K., Yang, J., Sun, X., Yan, M., and Guo, Z. (2018). Automatic Ship Detection in Remote Sensing Images from Google Earth of Complex Scenes Based on Multiscale Rotation Dense Feature Pyramid Networks. Remote Sens., 10.","DOI":"10.3390\/rs10010132"},{"key":"ref_70","doi-asserted-by":"crossref","first-page":"3111","DOI":"10.1109\/TMM.2018.2818020","article-title":"Arbitrary-Oriented Scene Text Detection via Rotation Proposals","volume":"20","author":"Ma","year":"2018","journal-title":"IEEE Trans. Multimed."},{"key":"ref_71","doi-asserted-by":"crossref","unstructured":"Ding, J., Xue, N., Long, Y., Xia, G., and Lu, Q. (2018). Learning RoI Transformer for Detecting Oriented Objects in Aerial Images. arXiv.","DOI":"10.1109\/CVPR.2019.00296"},{"key":"ref_72","doi-asserted-by":"crossref","unstructured":"Azimi, S.M., Vig, E., Bahmanyar, R., K\u00f6rner, M., and Reinartz, P. (2018, January 2\u20136). Towards Multi-class Object Detection in Unconstrained Remote Sensing Imagery. Proceedings of the Asian Conference on Computer Vision, Perth, WA, Australia.","DOI":"10.1007\/978-3-030-20893-6_10"},{"key":"ref_73","doi-asserted-by":"crossref","first-page":"959","DOI":"10.1080\/2150704X.2019.1633486","article-title":"A multiscale object detection approach for remote sensing images based on MSE-DenseNet and the dynamic anchor assignment","volume":"10","author":"Zhu","year":"2019","journal-title":"Remote Sens. Lett."},{"key":"ref_74","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2015, January 7\u201313). Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification. Proceedings of the International Conference on Computer Vision, Santiago, Chile.","DOI":"10.1109\/ICCV.2015.123"},{"key":"ref_75","unstructured":"Kingma, D.P., and Ba, J. (2014). Adam: A Method for Stochastic Optimization. arXiv."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/13\/10\/1925\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T06:00:55Z","timestamp":1760162455000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/13\/10\/1925"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,5,14]]},"references-count":75,"journal-issue":{"issue":"10","published-online":{"date-parts":[[2021,5]]}},"alternative-id":["rs13101925"],"URL":"https:\/\/doi.org\/10.3390\/rs13101925","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,5,14]]}}}