{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,13]],"date-time":"2025-10-13T20:00:42Z","timestamp":1760385642051,"version":"build-2065373602"},"reference-count":50,"publisher":"MDPI AG","issue":"6","license":[{"start":{"date-parts":[[2018,6,5]],"date-time":"2018-06-05T00:00:00Z","timestamp":1528156800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61602517","61501475"],"award-info":[{"award-number":["61602517","61501475"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"name":"National Key R&amp;D Program of China","award":["SQ2017YFB140187"],"award-info":[{"award-number":["SQ2017YFB140187"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Effective feature representations play an important role in remote sensing image analysis tasks. With the rapid progress of deep learning techniques, deep features have been widely applied to remote sensing image understanding in recent years and shown powerful ability in image representation. The existing deep feature extraction approaches are usually carried out on the whole image directly. However, such deep feature representation strategies may not effectively capture the local geometric invariance of target regions in remote sensing images. In this paper, we propose a novel region-wise deep feature extraction framework for remote sensing images. First, regions that may contain the target information are extracted from one whole image. Then, these regions are fed into a pre-trained convolutional neural network (CNN) model to extract regional deep features. Finally, the regional deep features are encoded by an improved Vector of Locally Aggregated Descriptors (VLAD) algorithm to generate the feature representation for the image. We conducted extensive experiments on remote sensing image classification and retrieval tasks based on the proposed region-wise deep feature extraction framework. The comparison results show that the proposed approach is superior to the existing CNN feature extraction methods.<\/jats:p>","DOI":"10.3390\/rs10060871","type":"journal-article","created":{"date-parts":[[2018,6,5]],"date-time":"2018-06-05T04:16:43Z","timestamp":1528172203000},"page":"871","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":39,"title":["Region-Wise Deep Feature Representation for Remote Sensing Images"],"prefix":"10.3390","volume":"10","author":[{"given":"Peng","family":"Li","sequence":"first","affiliation":[{"name":"College of Information and Control Engineering, China University of Petroleum (East China), Qingdao 266580, China"},{"name":"State Key Laboratory of Mathematical Engineering and Advanced Computing, Wuxi 214125, China"}]},{"given":"Peng","family":"Ren","sequence":"additional","affiliation":[{"name":"College of Information and Control Engineering, China University of Petroleum (East China), Qingdao 266580, China"},{"name":"State Key Laboratory of Mathematical Engineering and Advanced Computing, Wuxi 214125, China"}]},{"given":"Xiaoyu","family":"Zhang","sequence":"additional","affiliation":[{"name":"Institute of Information Engineering, Chinese Academy of Sciences, Beijing 100093, China"}]},{"given":"Qian","family":"Wang","sequence":"additional","affiliation":[{"name":"College of Computer and Information Engineering, Beijing Technology and Business University, Beijing 100048, China"}]},{"given":"Xiaobin","family":"Zhu","sequence":"additional","affiliation":[{"name":"College of Computer and Information Engineering, Beijing Technology and Business University, Beijing 100048, China"}]},{"given":"Lei","family":"Wang","sequence":"additional","affiliation":[{"name":"Academy of Broadcasting Science, SARFT, Beijing 100045, China"}]}],"member":"1968","published-online":{"date-parts":[[2018,6,5]]},"reference":[{"key":"ref_1","unstructured":"Du, R., Chen, Y., Tang, H., and Fang, T. (2005, January 25\u201329). Study on content-based remote sensing image retrieval. Proceedings of the IEEE International Geoscience and Remote Sensing Symposium (IGARSS), Seoul, Korea."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"2770","DOI":"10.1109\/TGRS.2012.2219314","article-title":"Latent dirichlet allocation for spatial analysis of satellite images","volume":"51","author":"Vaduva","year":"2013","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"248","DOI":"10.1109\/TGRS.2016.2604680","article-title":"Structure tensor Riemannian statistical models for CBIR and classification of remote sensing images","volume":"55","author":"Rosu","year":"2017","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"1865","DOI":"10.1109\/JPROC.2017.2675998","article-title":"Remote sensing image scene classification: Benchmark and state of the art","volume":"105","author":"Cheng","year":"2017","journal-title":"Proc. IEEE"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"1996","DOI":"10.1109\/LGRS.2014.2316143","article-title":"Performance analysis of state-of-the-art representation methods for geographical image retrieval and categorization","volume":"11","author":"Ozkan","year":"2014","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"273","DOI":"10.1080\/17538947.2014.882420","article-title":"An improved Bag-of-Words framework for remote sensing image retrieval in large-scale image databases","volume":"8","author":"Yang","year":"2015","journal-title":"Int. J. Digit. Earth"},{"key":"ref_7","unstructured":"Dos Santos, J., Penatti, O., and Da Silva Torres, R. (2010, January 17\u201321). Evaluating the potential of texture and color descriptors for remote sensing image retrieval and classification. Proceedings of the Fifth International Conference on Computer Vision Theory and Applications (VISAPP), Angers, France."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"1349","DOI":"10.1109\/TGRS.2015.2478379","article-title":"Unsupervised deep feature extraction for remote sensing image classification","volume":"54","author":"Romero","year":"2016","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"272","DOI":"10.1016\/j.patcog.2017.03.030","article-title":"Locality constraint distance metric learning for traffic congestion detection","volume":"75","author":"Wang","year":"2018","journal-title":"Pattern Recogn."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"3023","DOI":"10.1109\/TGRS.2013.2268736","article-title":"Remote sensing image retrieval with global morphological texture descriptors","volume":"52","author":"Aptoula","year":"2014","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"210","DOI":"10.1364\/AO.43.000210","article-title":"Using texture to analyze and manage large collections of remote sensed image and video data","volume":"43","author":"Newsam","year":"2004","journal-title":"Appl. Opt."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"1465","DOI":"10.1109\/TIP.2008.925367","article-title":"Indexing of satellite images with different resolutions by wavelet features","volume":"17","author":"Luo","year":"2008","journal-title":"IEEE Trans. Image Process."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"818","DOI":"10.1109\/TGRS.2012.2205158","article-title":"Geographic image retrieval using local invariant features","volume":"51","author":"Yang","year":"2013","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"813","DOI":"10.1016\/j.neucom.2016.05.061","article-title":"Local structure learning in high resolution remote sensing image retrieval","volume":"207","author":"Du","year":"2016","journal-title":"Neurocomputing"},{"key":"ref_15","unstructured":"Lazebnik, S., Schmid, C., and Ponce, J. (2006, January 17\u201322). Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), New York, NY, USA."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Yang, Y., and Newsam, S. (2010, January 3\u20135). Bag-of-visual-words and spatial extensions for land-use classification. Proceedings of the ACM SIGSPATIAL International Symposium on Advances in Geographic Information Systems (GIS), San Jose, CA, USA.","DOI":"10.1145\/1869790.1869829"},{"key":"ref_17","unstructured":"Yang, Y., and Newsam, S. (2011, January 6\u201313). Spatial pyramid co-occurrence for image classification. Proceedings of the IEEE International Conference on Computer Vision (ICCV), Barcelona, Spain."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"439","DOI":"10.1109\/TGRS.2013.2241444","article-title":"Unsupervised feature learning for aerial scene classification","volume":"52","author":"Cheriyadat","year":"2014","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"2175","DOI":"10.1109\/TGRS.2014.2357078","article-title":"Saliency-guided unsupervised feature learning for scene classification","volume":"53","author":"Zhang","year":"2015","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"775","DOI":"10.1080\/2150704X.2015.1074756","article-title":"High-resolution remotesensing imagery retrieval using sparse features by auto-encoder","volume":"6","author":"Zhou","year":"2015","journal-title":"Remote Sens. Lett."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Li, Y., Zhang, Y., Tao, C., and Zhu, H. (2016). Content-based high-resolution remote sensing image retrieval via unsupervised feature learning and collaborative affinity metric fusion. Remote Sens., 8.","DOI":"10.3390\/rs8090709"},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"6020","DOI":"10.1109\/TGRS.2016.2579648","article-title":"A three-layered graph-based learning approach for remote sensing image retrieval","volume":"54","author":"Wang","year":"2016","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_23","unstructured":"Krizhevsky, A., Sutskever, I., and Hinton, G. (2012, January 3\u20136). ImageNet classification with deep convolutional neural networks. Proceedings of the 25th International Conference on Neural Information Processing Systems (NIPS), Lake Tahoe, NV, USA."},{"key":"ref_24","unstructured":"Simonyan, K., and Zisserman, A. (2015, January 7\u20139). Very deep convolutional networks for large-scale image recognition. Proceedings of the International Conference on Learning Representations (ICLR), San Diego, CA, USA."},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., and Rabinovich, A. (2015, January 7\u201312). Going deeper with convolutions. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298594"},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Penatti, O., Nogueira, K., and Dos Santos, J. (2015, January 7\u201312). Do deep features generalize from everyday objects to remote sensing and aerial scenes domains?. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Boston, MA, USA.","DOI":"10.1109\/CVPRW.2015.7301382"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Wang, Q., Wan, J., and Yuan, Y. (2017). Deep metric learning for crowdedness regression. IEEE Trans. Circ. Syst. Video.","DOI":"10.1109\/TCSVT.2017.2703920"},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"1457","DOI":"10.1109\/TITS.2017.2726546","article-title":"A joint convolutional neural networks and context transfer for street scenes labeling","volume":"19","author":"Wang","year":"2018","journal-title":"IEEE Trans. Intell. Transp."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"3188","DOI":"10.1038\/srep03188","article-title":"Deep cognitive imaging systems enable estimation of continental-scale fire incidence from climate data","volume":"3","author":"Dutta","year":"2013","journal-title":"Sci. Rep."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"150241","DOI":"10.1098\/rsos.150241","article-title":"Big data integration shows Australian bush-fire frequency is increasing significantly","volume":"3","author":"Dutta","year":"2016","journal-title":"R. Soc. Open Sci."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"1793","DOI":"10.1109\/TGRS.2015.2488681","article-title":"Scene classification via a gradient boosting random convolutional network framework","volume":"54","author":"Zhang","year":"2016","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"14680","DOI":"10.3390\/rs71114680","article-title":"Transferring deep convolutional neural networks for the scene classification of high-resolution remote sensing imagery","volume":"7","author":"Hu","year":"2015","journal-title":"Remote Sens."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"2321","DOI":"10.1109\/LGRS.2015.2475299","article-title":"Deep learning based feature selection for remote sensing scene classification","volume":"12","author":"Zou","year":"2015","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Zhou, W., Newsam, S., Li, C., and Shao, Z. (2017). Learning low dimensional convolutional neural networks for high-resolution remote sensing image retrieval. Remote Sens., 9.","DOI":"10.3390\/rs9050489"},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"2092","DOI":"10.1109\/LGRS.2017.2752750","article-title":"MARTA GANs: Unsupervised representation learning for remote sensing image classification","volume":"14","author":"Lin","year":"2017","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_36","first-page":"23","article-title":"An unsupervised convolutional feature fusion network for deep representation of remote sensing images","volume":"15","author":"Yu","year":"2018","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"5148","DOI":"10.1109\/TGRS.2017.2702596","article-title":"Remote sensing scene classification by unsupervised representation learning","volume":"55","author":"Lu","year":"2017","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"1735","DOI":"10.1109\/LGRS.2017.2731997","article-title":"Remote sensing image scene classification using bag of convolutional features","volume":"14","author":"Cheng","year":"2017","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"950","DOI":"10.1109\/TGRS.2017.2756911","article-title":"Large-scale remote sensing image retrieval by deep hashing neural networks","volume":"56","author":"Li","year":"2018","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Zitnick, C., and Doll\u00e1r, P. (2014, January 6\u201312). Edge boxes: Locating object proposals from edges. Proceedings of the 13th European Conference on Computer Vision, Zurich, Switzerland.","DOI":"10.1007\/978-3-319-10602-1_26"},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"1704","DOI":"10.1109\/TPAMI.2011.235","article-title":"Aggregating local image descriptors into compact codes","volume":"34","author":"Perronnin","year":"2012","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_42","doi-asserted-by":"crossref","first-page":"3965","DOI":"10.1109\/TGRS.2017.2685945","article-title":"AID: A benchmark dataset for performance evaluation of aerial scene classification","volume":"55","author":"Xia","year":"2017","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_43","doi-asserted-by":"crossref","first-page":"137","DOI":"10.1016\/j.isprsjprs.2018.04.002","article-title":"A review of accuracy assessment for object-based image analysis: From per-pixel to per-polygon approaches","volume":"141","author":"Ye","year":"2018","journal-title":"ISPRS J. Photogramm."},{"key":"ref_44","doi-asserted-by":"crossref","first-page":"852","DOI":"10.1109\/TGRS.2005.843569","article-title":"Use of the Bradley-Terry model to quantify association in remotely sensed images","volume":"43","author":"Stein","year":"2005","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_45","doi-asserted-by":"crossref","first-page":"892","DOI":"10.1109\/TGRS.2015.2469138","article-title":"Hashing-based scalable remote sensing image search and retrieval in large archives","volume":"54","author":"Demir","year":"2016","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"464","DOI":"10.1109\/LGRS.2017.2651056","article-title":"Partial randomness hashing for large-scale remote sensing image retrieval","volume":"14","author":"Li","year":"2017","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_47","doi-asserted-by":"crossref","unstructured":"Ye, D., Li, Y., Tao, C., Xie, X., and Wang, X. (2017). Multiple feature hashing learning for large-scale remote sensing image retrieval. ISPRS Int. J. Geo-Inf., 6.","DOI":"10.3390\/ijgi6110364"},{"key":"ref_48","unstructured":"Liu, W., Wang, J., Ji, R., Jiang, Y., and Chang, S. (2012, January 16\u201321). Supervised hashing with kernels. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Providence, RI, USA."},{"key":"ref_49","doi-asserted-by":"crossref","unstructured":"Shen, F., Shen, C., Liu, W., and Shen, H. (2015, January 7\u201312). Supervised discrete hashing. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298598"},{"key":"ref_50","doi-asserted-by":"crossref","unstructured":"Kang, W., Li, W., and Zhou, Z. (2016, January 12\u201317). Column sampling based discrete supervised hashing. Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence (AAAI), Phoenix, AZ, USA.","DOI":"10.1609\/aaai.v30i1.10176"}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/10\/6\/871\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T15:07:16Z","timestamp":1760195236000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/10\/6\/871"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,6,5]]},"references-count":50,"journal-issue":{"issue":"6","published-online":{"date-parts":[[2018,6]]}},"alternative-id":["rs10060871"],"URL":"https:\/\/doi.org\/10.3390\/rs10060871","relation":{},"ISSN":["2072-4292"],"issn-type":[{"type":"electronic","value":"2072-4292"}],"subject":[],"published":{"date-parts":[[2018,6,5]]}}}