{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,7]],"date-time":"2026-01-07T08:01:38Z","timestamp":1767772898504,"version":"build-2065373602"},"reference-count":50,"publisher":"MDPI AG","issue":"5","license":[{"start":{"date-parts":[[2019,3,12]],"date-time":"2019-03-12T00:00:00Z","timestamp":1552348800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"CETC Key Laboratory of Aerospace Information Applications","award":["XX17629X009"],"award-info":[{"award-number":["XX17629X009"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>With the rapid advances in remote-sensing technologies and the larger number of satellite images, fast and effective object detection plays an important role in understanding and analyzing image information, which could be further applied to civilian and military fields. Recently object detection methods with region-based convolutional neural network have shown excellent performance. However, these two-stage methods contain region proposal generation and object detection procedures, resulting in low computation speed. Because of the expensive manual costs, the quantity of well-annotated aerial images is scarce, which also limits the progress of geospatial object detection in remote sensing. In this paper, on the one hand, we construct and release a large-scale remote-sensing dataset for geospatial object detection (RSD-GOD) that consists of 5 different categories with 18,187 annotated images and 40,990 instances. On the other hand, we design a single shot detection framework with multi-scale feature fusion. The feature maps from different layers are fused together through the up-sampling and concatenation blocks to predict the detection results. High-level features with semantic information and low-level features with fine details are fully explored for detection tasks, especially for small objects. Meanwhile, a soft non-maximum suppression strategy is put into practice to select the final detection results. Extensive experiments have been conducted on two datasets to evaluate the designed network. Results show that the proposed approach achieves a good detection performance and obtains the mean average precision value of 89.0% on a newly constructed RSD-GOD dataset and 83.8% on the Northwestern Polytechnical University very high spatial resolution-10 (NWPU VHR-10) dataset at 18 frames per second (FPS) on a NVIDIA GTX-1080Ti GPU.<\/jats:p>","DOI":"10.3390\/rs11050594","type":"journal-article","created":{"date-parts":[[2019,3,13]],"date-time":"2019-03-13T04:07:37Z","timestamp":1552450057000},"page":"594","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":35,"title":["A Single Shot Framework with Multi-Scale Feature Fusion for Geospatial Object Detection"],"prefix":"10.3390","volume":"11","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-8052-3877","authenticated-orcid":false,"given":"Shuo","family":"Zhuang","sequence":"first","affiliation":[{"name":"School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China"},{"name":"CETC Key Laboratory of Aerospace Information Applications, Shijiazhuang 050081, China"}]},{"given":"Ping","family":"Wang","sequence":"additional","affiliation":[{"name":"School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China"}]},{"given":"Boran","family":"Jiang","sequence":"additional","affiliation":[{"name":"School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China"}]},{"given":"Gang","family":"Wang","sequence":"additional","affiliation":[{"name":"CETC Key Laboratory of Aerospace Information Applications, Shijiazhuang 050081, China"}]},{"given":"Cong","family":"Wang","sequence":"additional","affiliation":[{"name":"School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China"}]}],"member":"1968","published-online":{"date-parts":[[2019,3,12]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Chen, Z., Zhang, T., and Ouyang, C. (2018). End-to-End Airplane Detection Using Transfer Learning in Remote Sensing Images. Remote Sens., 10.","DOI":"10.3390\/rs10010139"},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"1250","DOI":"10.1109\/TPAMI.2010.182","article-title":"Vehicle Detection Using Partial Least Squares","volume":"33","author":"Kembhavi","year":"2011","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"2327","DOI":"10.1109\/JSTARS.2013.2242846","article-title":"Airborne Vehicle Detection in Dense Urban Areas Using HoG Features and Disparity Maps","volume":"6","author":"Tuermer","year":"2013","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"382","DOI":"10.1016\/j.isprsjprs.2007.10.005","article-title":"On-line boosting-based car detection from aerial images","volume":"63","author":"Grabner","year":"2008","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Han, X., Zhong, Y., and Zhang, L. (2017). An Efficient and Robust Integrated Geospatial Object Detection Framework for High Spatial Resolution Remote Sensing Imagery. Remote Sens., 9.","DOI":"10.3390\/rs9070666"},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"5553","DOI":"10.1109\/TGRS.2016.2569141","article-title":"Weakly Supervised Learning Based on Coupled Convolutional Neural Networks for Aircraft Detection","volume":"54","author":"Zhang","year":"2016","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"1938","DOI":"10.1109\/LGRS.2015.2439517","article-title":"Fast Multiclass Vehicle Detection on Aerial Images","volume":"12","author":"Liu","year":"2015","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"1074","DOI":"10.1109\/LGRS.2016.2565705","article-title":"Ship Rotated Bounding Box Space for Ship Extraction from High-Resolution Optical Satellite Images With Complex Backgrounds","volume":"13","author":"Liu","year":"2016","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"119","DOI":"10.1016\/j.isprsjprs.2014.10.002","article-title":"Multi-class geospatial object detection and geographic image classification based on collection of part detectors","volume":"98","author":"Cheng","year":"2014","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1016\/j.isprsjprs.2016.03.014","article-title":"A survey on object detection in optical remote sensing images","volume":"117","author":"Cheng","year":"2016","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"971","DOI":"10.1109\/TPAMI.2002.1017623","article-title":"Multiresolution gray-scale and rotation invariant texture classification with local binary patterns","volume":"24","author":"Ojala","year":"2002","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"886","DOI":"10.1109\/CVPR.2005.177","article-title":"Histograms of Oriented Gradients for Human Detection","volume":"Volume 1","author":"Dalal","year":"2005","journal-title":"Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR\u201905)"},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"524","DOI":"10.1109\/CVPR.2005.16","article-title":"A Bayesian Hierarchical Model for Learning Natural Scene Categories","volume":"Volume 2","author":"Li","year":"2005","journal-title":"Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR\u201905)"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"128","DOI":"10.1109\/LGRS.2010.2051792","article-title":"Airport Detection From Large IKONOS Images Using Clustered SIFT Keypoints and Region Information","volume":"8","author":"Tao","year":"2011","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"471","DOI":"10.1109\/LGRS.2012.2210189","article-title":"Texture-Based Airport Runway Detection","volume":"10","author":"Aytekin","year":"2013","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"701","DOI":"10.1109\/LGRS.2014.2358994","article-title":"Weakly Supervised Learning for Target Detection in Remote Sensing Images","volume":"12","author":"Zhang","year":"2015","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"3325","DOI":"10.1109\/TGRS.2014.2374218","article-title":"Object Detection in Optical Remote Sensing Images Based on Weakly Supervised Learning and High-Level Feature Learning","volume":"53","author":"Han","year":"2015","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"32","DOI":"10.1016\/j.isprsjprs.2013.08.001","article-title":"Object detection in remote sensing imagery using a discriminatively trained mixture model","volume":"85","author":"Cheng","year":"2013","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"109","DOI":"10.1109\/LGRS.2011.2161569","article-title":"Automatic Target Detection in High-Resolution Remote Sensing Images Using Spatial Sparse Coding Bag-of-Words Model","volume":"9","author":"Sun","year":"2012","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"74","DOI":"10.1109\/LGRS.2013.2246538","article-title":"Object Detection in High-Resolution Remote Sensing Images Using Rotation Invariant Parts Based Model","volume":"11","author":"Zhang","year":"2014","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"247","DOI":"10.1016\/j.isprsjprs.2010.11.001","article-title":"Support vector machines in remote sensing: A review","volume":"66","author":"Mountrakis","year":"2011","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"749","DOI":"10.1109\/LGRS.2011.2180695","article-title":"A Visual Search Inspired Computational Model for Ship Detection in Optical Satellite Images","volume":"9","author":"Bi","year":"2012","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"2795","DOI":"10.1109\/TGRS.2010.2043109","article-title":"Vehicle Detection in Very High Resolution Satellite Images of City Areas","volume":"48","author":"Leitloff","year":"2010","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"4511","DOI":"10.1109\/TGRS.2013.2282355","article-title":"Ship Detection in High-Resolution Optical Imagery Based on Anomaly Detector and Local Shape Feature","volume":"52","author":"Shi","year":"2014","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Girshick, R., Donahue, J., Darrell, T., and Malik, J. (arXiv, 2013). Rich feature hierarchies for accurate object detection and semantic segmentation, arXiv.","DOI":"10.1109\/CVPR.2014.81"},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Girshick, R. (2015, January 13\u201316). Fast R-CNN. Proceedings of the 2015 IEEE International Conference on Computer Vision (ICCV), Santiago, Chile.","DOI":"10.1109\/ICCV.2015.169"},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"154","DOI":"10.1007\/s11263-013-0620-5","article-title":"Selective Search for Object Recognition","volume":"104","author":"Uijlings","year":"2013","journal-title":"Int. J. Comput. Vis."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"1137","DOI":"10.1109\/TPAMI.2016.2577031","article-title":"Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks","volume":"39","author":"Ren","year":"2017","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Redmon, J., Divvala, S., Girshick, R., and Farhadi, A. (arXiv, 2015). You Only Look Once: Unified, Real-Time Object Detection, arXiv.","DOI":"10.1109\/CVPR.2016.91"},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Redmon, J., and Farhadi, A. (arXiv, 2016). YOLO9000: Better, Faster, Stronger, arXiv.","DOI":"10.1109\/CVPR.2017.690"},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"21","DOI":"10.1007\/978-3-319-46448-0_2","article-title":"SSD: Single Shot MultiBox Detector","volume":"Volume 9905","author":"Leibe","year":"2016","journal-title":"Computer Vision\u2014ECCV 2016"},{"key":"ref_32","unstructured":"Dai, J., Li, Y., He, K., and Sun, J. (arXiv, 2016). R-FCN: Object Detection via Region-based Fully Convolutional Networks, arXiv."},{"key":"ref_33","unstructured":"Xu, M., Cui, L., Lv, P., Jiang, X., Niu, J., Zhou, B., and Wang, M. (arXiv, 2018). MDSSD: Multi-scale Deconvolutional Single Shot Detector for Small Objects, arXiv."},{"key":"ref_34","unstructured":"Fu, C.-Y., Liu, W., Ranga, A., Tyagi, A., and Berg, A.C. (arXiv, 2017). DSSD: Deconvolutional Single Shot Detector, arXiv."},{"key":"ref_35","unstructured":"Redmon, J., and Farhadi, A. (arXiv, 2018). YOLOv3: An Incremental Improvement, arXiv."},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Xie, H., Wang, T., Qiao, M., Zhang, M., Shan, G., and Snoussi, H. (2017, January 20\u201322). Robust object detection for tiny and dense targets in VHR aerial images. Proceedings of the 2017 Chinese Automation Congress (CAC), Jinan, China.","DOI":"10.1109\/CAC.2017.8243930"},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"2486","DOI":"10.1109\/TGRS.2016.2645610","article-title":"Accurate Object Localization in Remote Sensing Images Based on Convolutional Neural Networks","volume":"55","author":"Long","year":"2017","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"281","DOI":"10.1016\/j.isprsjprs.2018.02.014","article-title":"Multi-class geospatial object detection based on a position-sensitive balancing framework for high spatial resolution remote sensing imagery","volume":"138","author":"Zhong","year":"2018","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"5832","DOI":"10.1109\/TGRS.2016.2572736","article-title":"Ship Detection in Spaceborne Optical Image With SVD Networks","volume":"54","author":"Zou","year":"2016","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Yang, X., Sun, H., Fu, K., Yang, J., Sun, X., Yan, M., and Guo, Z. (2018). Automatic Ship Detection in Remote Sensing Images from Google Earth of Complex Scenes Based on Multiscale Rotation Dense Feature Pyramid Networks. Remote Sens., 10.","DOI":"10.3390\/rs10010132"},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Miao, K., Ji, K., Leng, X., and Lin, Z. (2017). Contextual Region-Based Convolutional Neural Network with Multilayer Fusion for SAR Ship Detection. Remote Sens., 9.","DOI":"10.3390\/rs9080860"},{"key":"ref_42","doi-asserted-by":"crossref","first-page":"7405","DOI":"10.1109\/TGRS.2016.2601622","article-title":"Learning Rotation-Invariant Convolutional Neural Networks for Object Detection in VHR Optical Remote Sensing Images","volume":"54","author":"Cheng","year":"2016","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Cai, B., Jiang, Z., Zhang, H., Zhao, D., and Yao, Y. (2017). Airport Detection Using End-to-End Convolutional Neural Network with Hard Example Mining. Remote Sens., 9.","DOI":"10.3390\/rs9111198"},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Xu, Y., Zhu, M., Li, S., Feng, H., Ma, S., and Che, J. (2018). End-to-End Airport Detection in Remote Sensing Images Combining Cascade Region Proposal Networks and Multi-Threshold Detection Networks. Remote Sens., 10.","DOI":"10.3390\/rs10101516"},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Guo, W., Yang, W., Zhang, H., and Hua, G. (2018). Geospatial Object Detection in High Resolution Satellite Images Based on Multi-Scale Convolutional Neural Network. Remote Sens., 10.","DOI":"10.3390\/rs10010131"},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"3","DOI":"10.1016\/j.isprsjprs.2018.04.003","article-title":"Multi-scale object detection in remote sensing imagery with convolutional neural networks","volume":"145","author":"Deng","year":"2018","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_47","doi-asserted-by":"crossref","unstructured":"Ren, Y., Zhu, C., and Xiao, S. (2018). Deformable Faster R-CNN with Aggregating Multi-Layer Features for Partially Occluded Object Detection in Optical Remote Sensing Images. Remote Sens., 10.","DOI":"10.3390\/rs10091470"},{"key":"ref_48","unstructured":"Li, Z., and Zhou, F. (arXiv, 2017). FSSD: Feature Fusion Single Shot Multibox Detector, arXiv."},{"key":"ref_49","doi-asserted-by":"crossref","unstructured":"Bodla, N., Singh, B., Chellappa, R., and Davis, L.S. (arXiv, 2017). Soft-NMS\u2014Improving Object Detection with One Line of Code, arXiv.","DOI":"10.1109\/ICCV.2017.593"},{"key":"ref_50","unstructured":"Ioffe, S., and Szegedy, C. (arXiv, 2015). Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift, arXiv."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/11\/5\/594\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T12:38:13Z","timestamp":1760186293000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/11\/5\/594"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,3,12]]},"references-count":50,"journal-issue":{"issue":"5","published-online":{"date-parts":[[2019,3]]}},"alternative-id":["rs11050594"],"URL":"https:\/\/doi.org\/10.3390\/rs11050594","relation":{},"ISSN":["2072-4292"],"issn-type":[{"type":"electronic","value":"2072-4292"}],"subject":[],"published":{"date-parts":[[2019,3,12]]}}}