{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,13]],"date-time":"2026-02-13T05:38:16Z","timestamp":1770961096323,"version":"3.50.1"},"reference-count":47,"publisher":"MDPI AG","issue":"12","license":[{"start":{"date-parts":[[2018,12,8]],"date-time":"2018-12-08T00:00:00Z","timestamp":1544227200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61331021;61421001"],"award-info":[{"award-number":["61331021;61421001"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Geospatial object detection is a fundamental but challenging problem in the remote sensing community. Although deep learning has shown its power in extracting discriminative features, there is still room for improvement in its detection performance, particularly for objects with large ranges of variations in scale and direction. To this end, a novel approach, entitled multi-scale and rotation-insensitive convolutional channel features (MsRi-CCF), is proposed for geospatial object detection by integrating robust low-level feature generation, classifier generation with outlier removal, and detection with a power law. The low-level feature generation step consists of rotation-insensitive and multi-scale convolutional channel features, which were obtained by learning a regularized convolutional neural network (CNN) and integrating multi-scaled convolutional feature maps, followed by the fine-tuning of high-level connections in the CNN, respectively. Then, these generated features were fed into AdaBoost (chosen due to its lower computation and storage costs) with outlier removal to construct an object detection framework that facilitates robust classifier training. In the test phase, we adopted a log-space sampling approach instead of fine-scale sampling by using the fast feature pyramid strategy based on a computable power law. Extensive experimental results demonstrate that compared with several state-of-the-art baselines, the proposed MsRi-CCF approach yields better detection results, with 90.19% precision with the satellite dataset and 81.44% average precision with the NWPU VHR-10 datasets. Importantly, MsRi-CCF incurs no additional computational cost, which is only 0.92 s and 0.7 s per test image on the two datasets. Furthermore, we determined that most previous methods fail to gain an acceptable detection performance, particularly when they face several obstacles, such as deformations in objects (e.g., rotation, illumination, and scaling). Yet, these factors are effectively addressed by MsRi-CCF, yielding a robust geospatial object detection method.<\/jats:p>","DOI":"10.3390\/rs10121990","type":"journal-article","created":{"date-parts":[[2018,12,10]],"date-time":"2018-12-10T03:36:41Z","timestamp":1544413001000},"page":"1990","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":34,"title":["MsRi-CCF: Multi-Scale and Rotation-Insensitive Convolutional Channel Features for Geospatial Object Detection"],"prefix":"10.3390","volume":"10","author":[{"given":"Xin","family":"Wu","sequence":"first","affiliation":[{"name":"School of Information and Electronics, Beijing Institute of Technology (BIT), Beijing 100081, China"},{"name":"Beijing Key Laboratory of Fractional Signals and Systems, School of Information and Electronics, Beijing Institute of Technology (BIT), Beijing 100081, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3212-9584","authenticated-orcid":false,"given":"Danfeng","family":"Hong","sequence":"additional","affiliation":[{"name":"Remote Sensing Technology Institute (IMF), German Aerospace Center (DLR), 82234 Wessling, Germany"},{"name":"Signal Processing in Earth Observation (SiPEO), Technical University of Munich (TUM), 80333 Munich, Germany"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1203-741X","authenticated-orcid":false,"given":"Pedram","family":"Ghamisi","sequence":"additional","affiliation":[{"name":"Machine Learning Group, Exploration Division, Helmholtz Institute Freiberg for Resource Technology, Helmholtz-Zentrum Dresden-Rossendorf, 09599 Freiberg, Germany"}]},{"given":"Wei","family":"Li","sequence":"additional","affiliation":[{"name":"School of Information and Electronics, Beijing Institute of Technology (BIT), Beijing 100081, China"},{"name":"Beijing Key Laboratory of Fractional Signals and Systems, School of Information and Electronics, Beijing Institute of Technology (BIT), Beijing 100081, China"}]},{"given":"Ran","family":"Tao","sequence":"additional","affiliation":[{"name":"School of Information and Electronics, Beijing Institute of Technology (BIT), Beijing 100081, China"},{"name":"Beijing Key Laboratory of Fractional Signals and Systems, School of Information and Electronics, Beijing Institute of Technology (BIT), Beijing 100081, China"}]}],"member":"1968","published-online":{"date-parts":[[2018,12,8]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Hong, D., Yokoya, N., Xu, J., and Zhu, X. (2018, January 8\u201314). Joint & Progressive Learning from High-Dimensional Data for Multi-Label Classification. Proceedings of the European Conference on Computer Vision (ECCV) (2018), Munich, Germany.","DOI":"10.1007\/978-3-030-01237-3_29"},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"1363","DOI":"10.1109\/JSTARS.2018.2799698","article-title":"Open data for global multimodal land use classification: Outcome of the 2017 IEEE GRSS Data Fusion Contest","volume":"11","author":"Yokoya","year":"2018","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"2960","DOI":"10.1109\/JSTARS.2017.2682189","article-title":"Learning a Robust Local Manifold Representation for Hyperspectral Dimensionality Reduction","volume":"10","author":"Hong","year":"2017","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"1907","DOI":"10.1109\/JPROC.2012.2190811","article-title":"Very high-resolution remote sensing: Challenges and opportunities","volume":"100","author":"Benediktsson","year":"2012","journal-title":"Proc. IEEE"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"4567","DOI":"10.1109\/TGRS.2017.2694159","article-title":"Object tracking by hierarchical decomposition of hyperspectral video sequences: Application to chemical gas plume tracking","volume":"55","author":"Tochon","year":"2017","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Hong, D., Yokoya, N., Chanussot, J., and Zhu, X. (2017, January 17\u201320). Learning a low-coherence dictionary to address spectral variability for hyperspectral unmixing. Proceedings of the 24th IEEE International Conference on Image Processing (ICIP) (2017), Beijing, China.","DOI":"10.1109\/ICIP.2017.8296278"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Hong, D., Yokoya, N., Chanussot, J., and Zhu, X. (arXiv, 2018). An Augmented Linear Mixing Model to Address Spectral Variability for Hyperspectral Unmixing, arXiv.","DOI":"10.1109\/ICIP.2017.8296278"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1016\/j.isprsjprs.2016.03.014","article-title":"A Survey on Object Detection in Optical Remote Sensing Images","volume":"117","author":"Cheng","year":"2016","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"84","DOI":"10.1109\/MSP.2017.2749125","article-title":"Advanced deep learning techniques for salient and category-specific object detection: A survey","volume":"35","author":"Han","year":"2018","journal-title":"IEEE Signal Process. Mag."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"154","DOI":"10.1007\/s11263-013-0620-5","article-title":"Selective search for object recognition","volume":"104","author":"Uijlings","year":"2013","journal-title":"Int. J. Comput. Vis."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"937","DOI":"10.1109\/TGRS.2017.2756851","article-title":"Multisource Remote Sensing Data Classification Based on Convolutional Neural Network","volume":"56","author":"Xu","year":"2018","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"2623","DOI":"10.1109\/TIP.2018.2809606","article-title":"Diverse Region-Based CNN for Hyperspectral Image Classification","volume":"27","author":"Zhang","year":"2018","journal-title":"IEEE Trans. Image Process."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"6232","DOI":"10.1109\/TGRS.2016.2584107","article-title":"Deep feature extraction and classification of hyperspectral images based on convolutional neural networks","volume":"54","author":"Chen","year":"2016","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"2355","DOI":"10.1109\/LGRS.2017.2764915","article-title":"Hyperspectral images classification with Gabor filtering and convolutional neural network","volume":"14","author":"Chen","year":"2017","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"794","DOI":"10.1109\/LGRS.2018.2806945","article-title":"IMG2DSM: Height Simulation From Single Imagery Using Conditional Generative Adversarial Net","volume":"15","author":"Ghamisi","year":"2018","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"95","DOI":"10.1145\/2001269.2001295","article-title":"Unsupervised learning of hierarchical representations with convolutional deep belief networks","volume":"54","author":"Lee","year":"2011","journal-title":"Commun. ACM"},{"key":"ref_17","unstructured":"Yosinski, J., Clune, J., Nguyen, A., Fuchs, T., and Lipson, H. (2015, January 6\u201311). Understanding Neural Networks Through Deep Visualization. Proceedings of the 31st International Conference on Machine Learning (ICML), Lille, France."},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Zeiler, M.D., and Fergus, R. (2014, January 6\u201312). Visualizing and Understanding Convolutional Networks. Proceedings of the 13th European Conference on Computer Vision (ECCV), Zurich, Switzerland.","DOI":"10.1007\/978-3-319-10590-1_53"},{"key":"ref_19","first-page":"142","article-title":"Deep Adaptive Proposal Network for Object Detection in Optical Remote Sensing Images","volume":"10","author":"Cheng","year":"2008","journal-title":"IEEE Trans. Geosci. Remote Sens. Mag."},{"key":"ref_20","first-page":"142","article-title":"Multi-scale object detection in remote sensing imagery with convolutional neural networks","volume":"10","author":"Deng","year":"2018","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"1915","DOI":"10.5194\/isprs-archives-XLII-3-1915-2018","article-title":"Rapid Target Detection in High Resolution Remote Sensing Images Using YOLO Model","volume":"XLII-3","author":"Wu","year":"2018","journal-title":"Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Xia, G.S., Bai, X., Ding, J., Zhu, Z., Belongie, S., Luo, J.B., Datcu, M.H., Pelillo, M., and Zhang, L.P. (2018, January 18\u201322). DOTA: A Large-scale Dataset for Object Detection in Aerial Images. Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00418"},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"7405","DOI":"10.1109\/TGRS.2016.2601622","article-title":"Learning rotation-invariant convolutional neural networks for object detection in vhr optical remote sensing images","volume":"54","author":"Cheng","year":"2016","journal-title":"IEEE Trans. Geosci. Remote Sens. Mag."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"1797","DOI":"10.1109\/LGRS.2014.2309695","article-title":"Vehicle detection in satellite images by hybrid deep convolutional neural networks","volume":"11","author":"Chen","year":"2014","journal-title":"IEEE Trans. Geosci. Remote Sens. Lett."},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Chen, X., Xiang, S., Liu, C.L., and Pan, C.H. (2013, January 5\u20138). Vehicle detection in satellite images by parallel deep convolutional neural networks. Proceedings of the 2nd IAPR Asian Conference on Pattern Recognition (ACPR) (2013), Okinawa, Japan.","DOI":"10.1109\/ACPR.2013.33"},{"key":"ref_26","unstructured":"Krizhevsky, A., Sutskever, I., and Hinton, G.E. (2012, January 3\u20136). ImageNet Classification with Deep Convolutional Neural Networks. Proceedings of the 25th International Conference on Neural Information Processing Systems (NIPS) (2012), Lake Tahoe, NV, USA."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"He, K.M., Zhang, X.Y., Ren, S.Q., Sun, J., and Torr, P. (2016, January 27\u201330). Deep Residual Learning for Image Recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2016), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"841","DOI":"10.1214\/aos\/1024691079","article-title":"Rejoinder: Arcing classifiers","volume":"26","author":"Breiman","year":"1998","journal-title":"Ann. Stat."},{"key":"ref_29","first-page":"1722","article-title":"The state of boosting","volume":"31","author":"Breiman","year":"2001","journal-title":"Comput. Sci. Stat."},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Yang, B., Yan, J.J., Lei, Z., and Li, S. (2015, January 11\u201318). Convolutional Channel Features. Proceedings of the International Conference on Computer Vision (ICCV) (2015), Santiago, Chile.","DOI":"10.1109\/ICCV.2015.18"},{"key":"ref_31","unstructured":"Moghimi, M., Belongie, S., Saberian, M., and Yang, J. (2016, January 19\u201322). Convolutional Channel Features. Proceedings of the 27th British Machine Vision Conference (2016), York, UK."},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Wu, C.H., Gan, W.H., Lan, D., and Jay, C.C. (2017, January 24\u201331). Boosted Convolutional Neural Networks (BCNN) for Pedestrian Detection. Proceedings of the IEEE Winter Conference on Applications of Computer Vision (WACV) (2017), Santa Rosa, CA, USA.","DOI":"10.1109\/WACV.2017.66"},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"814","DOI":"10.1103\/PhysRevLett.73.814","article-title":"Statistics of Natural Images: Scaling in the Woods","volume":"73","author":"Ruderman","year":"1994","journal-title":"Phys. Rev. Lett."},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"533","DOI":"10.1038\/323533a0","article-title":"Learning representations by back-propagating errors","volume":"323","author":"Rumelhart","year":"1986","journal-title":"Nature"},{"key":"ref_35","unstructured":"Zhang, C., Bengio, S., Hardt, M., Recht, B., and Vinyals, O. (2017, January 24\u201326). Understanding deep learning requires rethinking generalization. Proceedings of the European Conference on Computer Vision (ICLR) (2017), Toulon, France."},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"KapoorEmail, A., Hemani, H., Sakthivel, N., and Chaturvedi, S. (2015, January 10\u201312). Mpi implementation of expectation maximization algorithm for gaussian mixture models. Proceedings of the 12th Annual Conference of the Italian Association for Cognitive Sciences (AISC), Genova, Italy.","DOI":"10.1007\/978-3-319-13731-5_56"},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Cai, L., Xu, Y.R., He, L., Zhao, Y.M., and Yang, X. (2009, January 23\u201327). An effective segmentation for noise-based image verification using gamma mixture models. Proceedings of the 9th Asian Conference on Computer Vision (ACCV) (2009), Xi\u00e1n, China.","DOI":"10.1007\/978-3-642-12297-2_3"},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Wu, X., Cai, L., and Ji, R.R. (2018, January 7\u201310). Gamma Mixture Models for Outlier Removal. Proceedings of the 25th IEEE International Conference on Image Processing (ICIP) (2018), Athens, Greece.","DOI":"10.1109\/ICIP.2018.8451217"},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"95","DOI":"10.1214\/aos\/1176346060","article-title":"On the convergence properties of the em algorithm","volume":"11","author":"Wu","year":"1983","journal-title":"Ann. Stat."},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"1532","DOI":"10.1109\/TPAMI.2014.2300479","article-title":"Fast Feature Pyramids for Object Detection","volume":"36","author":"Piotr","year":"2014","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"119","DOI":"10.1016\/j.isprsjprs.2014.10.002","article-title":"Multi-class geospatial object detection and geographic imageclassification based on collection of part detectors","volume":"98","author":"Cheng","year":"2014","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Heitz, G., and Koller, D. (2008, January 12\u201318). Learning spatial context: Using stuff to find things. Proceedings of the 10th European Conference on Computer Vision (ECCV), Marseille-France, Palais.","DOI":"10.1007\/978-3-540-88682-2_4"},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Malisiewicz, T., Gupta, A., and Efros, A. (2011, January 6\u201313). Ensemble of exemplar-svms for object detection and beyond. Proceedings of the International Conference on Computer Vision (ICCV) (2011), Barcelona, Spain.","DOI":"10.1109\/ICCV.2011.6126229"},{"key":"ref_44","doi-asserted-by":"crossref","first-page":"366","DOI":"10.1109\/LGRS.2009.2035644","article-title":"Object classification of aerial images with bag of visual words","volume":"7","author":"Xu","year":"2010","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Redmon, J., Divvala, S., Girshick, R., and Farhadi, A. (July, January 26). You only look once: unified, real-time object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2016), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.91"},{"key":"ref_46","doi-asserted-by":"crossref","unstructured":"Redmon, J., and Farhadi, A. (2017, January 21\u201326). Yolo9000: Better, faster, stronger. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition(CVPR) (2016), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.690"},{"key":"ref_47","unstructured":"Lin, M., Chen, Q., and Yan, S.C. (2014, January 14\u201316). Network In Network. Proceedings of the International Conference on Learning Representations (ICLR) (2014), Banff, AB, Canada."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/10\/12\/1990\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T15:32:12Z","timestamp":1760196732000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/10\/12\/1990"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,12,8]]},"references-count":47,"journal-issue":{"issue":"12","published-online":{"date-parts":[[2018,12]]}},"alternative-id":["rs10121990"],"URL":"https:\/\/doi.org\/10.3390\/rs10121990","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2018,12,8]]}}}