{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,27]],"date-time":"2025-11-27T10:43:25Z","timestamp":1764240205369,"version":"build-2065373602"},"reference-count":45,"publisher":"MDPI AG","issue":"3","license":[{"start":{"date-parts":[[2019,2,1]],"date-time":"2019-02-01T00:00:00Z","timestamp":1548979200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61790550","61790554"],"award-info":[{"award-number":["61790550","61790554"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Effective feature representations play a decisive role in content-based remote sensing image retrieval (CBRSIR). Recently, learning-based features have been widely used in CBRSIR and they show powerful ability of feature representations. In addition, a significant effort has been made to improve learning-based features from the perspective of the network structure. However, these learning-based features are not sufficiently discriminative for CBRSIR. In this paper, we propose two effective schemes for generating discriminative features for CBRSIR. In the first scheme, the attention mechanism and a new attention module are introduced to the Convolutional Neural Networks (CNNs) structure, causing more attention towards salient features, and the suppression of other features. In the second scheme, a multi-task learning network structure is proposed, to force learning-based features to be more discriminative, with inter-class dispersion and intra-class compaction, through penalizing the distances between the feature representations and their corresponding class centers. Then, a new method for constructing more challenging datasets is first used for remote sensing image retrieval, to better validate our schemes. Extensive experiments on challenging datasets are conducted to evaluate the effectiveness of our two schemes, and the comparison of the results demonstrate that our proposed schemes, especially the fusion of the two schemes, can improve the baseline methods by a significant margin.<\/jats:p>","DOI":"10.3390\/rs11030281","type":"journal-article","created":{"date-parts":[[2019,2,1]],"date-time":"2019-02-01T03:08:05Z","timestamp":1548990485000},"page":"281","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":62,"title":["A Discriminative Feature Learning Approach for Remote Sensing Image Retrieval"],"prefix":"10.3390","volume":"11","author":[{"given":"Wei","family":"Xiong","sequence":"first","affiliation":[{"name":"Research Institute of information Fusion, Naval Aviation University, Yantai 264001, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2779-5099","authenticated-orcid":false,"given":"Yafei","family":"Lv","sequence":"additional","affiliation":[{"name":"Research Institute of information Fusion, Naval Aviation University, Yantai 264001, China"}]},{"given":"Yaqi","family":"Cui","sequence":"additional","affiliation":[{"name":"Research Institute of information Fusion, Naval Aviation University, Yantai 264001, China"}]},{"given":"Xiaohan","family":"Zhang","sequence":"additional","affiliation":[{"name":"Research Institute of information Fusion, Naval Aviation University, Yantai 264001, China"}]},{"given":"Xiangqi","family":"Gu","sequence":"additional","affiliation":[{"name":"Research Institute of information Fusion, Naval Aviation University, Yantai 264001, China"}]}],"member":"1968","published-online":{"date-parts":[[2019,2,1]]},"reference":[{"key":"ref_1","unstructured":"Du, P., Chen, Y., Hong, T., and Tao, F. (2005, January 29\u201329). Study on content-based remote sensing image retrieval. Proceedings of the IEEE International Geoscience and Remote Sensing Symposium, Seoul, Korea."},{"key":"ref_2","first-page":"60440Q","article-title":"Content-based remote sensing image retrieval","volume":"6044","author":"Li","year":"2005","journal-title":"Proc. SPIE Int. Soc. Opt. Eng."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"1343","DOI":"10.1080\/01431161.2017.1399472","article-title":"Visual descriptors for content-based retrieval of remote-sensing images","volume":"39","author":"Napoletano","year":"2018","journal-title":"Int. J. Remote Sens."},{"key":"ref_4","first-page":"328","article-title":"Remote Sensing Image Retrieval Using Color and Texture Fused Features","volume":"9","author":"Lu","year":"2004","journal-title":"J. Image Graph."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"3023","DOI":"10.1109\/TGRS.2013.2268736","article-title":"Remote Sensing Image Retrieval with Global Morphological Texture Descriptors","volume":"52","author":"Aptoula","year":"2014","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"8","DOI":"10.1109\/MGRS.2017.2762307","article-title":"Deep learning in remote sensing: A review","volume":"5","author":"Zhu","year":"2017","journal-title":"IEEE Geosci. Remote Sens. Mag."},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Wan, J., Wang, D., Hoi, S.C.H., Wu, P., Zhu, J., Zhang, Y., and Li, J. (2014, January 3\u20137). Deep Learning for Content-Based Image Retrieval:A Comprehensive Study. Proceedings of the ACM International Conference on Multimedia, Orlando, FL, USA.","DOI":"10.1145\/2647868.2654948"},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Lowe, D.G. (1999, January 20\u201327). Object recognition from local scale-invariant features. Proceedings of the Seventh IEEE International Conference on Computer Vision, Kerkyra, Greece.","DOI":"10.1109\/ICCV.1999.790410"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Bay, H., Tuytelaars, T., and Van Gool, L. (2006, January 7\u201313). Surf: Speeded up robust features. Proceedings of the European Conference on Computer Vision, Graz, Austria.","DOI":"10.1007\/11744023_32"},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"273","DOI":"10.1080\/17538947.2014.882420","article-title":"An improved Bag-of-Words framework for remote sensing image retrieval in large-scale image databases","volume":"8","author":"Yang","year":"2015","journal-title":"Int. J. Digit. Earth"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Sivic, J., and Zisserman, A. (2003, January 13\u201316). Video Google: A text retrieval approach to object matching in videos. Proceedings of the Ninth IEEE International Conference on Computer Vision, Nice, France.","DOI":"10.1109\/ICCV.2003.1238663"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Tang, X., Zhang, X., Liu, F., and Jiao, L. (2018). Unsupervised Deep Feature Learning for Remote Sensing Image Retrieval. Remote Sens., 10.","DOI":"10.3390\/rs10081243"},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"J\u00e9gou, H., Douze, M., Schmid, C., and P\u00e9rez, P. (2010, January 13\u201318). Aggregating local descriptors into a compact image representation. Proceedings of the Computer Vision and Pattern Recognition (CVPR), San Francisco, CA, USA.","DOI":"10.1109\/CVPR.2010.5540039"},{"key":"ref_14","unstructured":"Krizhevsky, A., Sutskever, I., and Hinton, G.E. (2017, January 4\u20139). ImageNet classification with deep convolutional neural networks. Proceedings of the International Conference on Neural Information Processing Systems, Long Beach, CA, USA."},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., and Rabinovich, A. (2015, January 7\u201312). Going deeper with convolutions. Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298594"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (arXiv, 2015). Deep Residual Learning for Image Recognition, arXiv.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Lin, T.Y., Dollar, P., Girshick, R., He, K., Hariharan, B., and Belongie, S. (2017, January 21\u201326). Feature Pyramid Networks for Object Detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.106"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.Y., and Berg, A.C. (2016, January 8\u201316). SSD: Single Shot MultiBox Detector. Proceedings of the European Conference on Computer Vision, Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46448-0_2"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Redmon, J., and Farhadi, A. (2017, January 21\u201326). YOLO9000: Better, Faster, Stronger. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.690"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Long, J., Shelhamer, E., and Darrell, T. (2015, January 7\u201312). Fully convolutional networks for semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298965"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Szegedy, C., Ioffe, S., Vanhoucke, V., and Alemi, A. (arXiv, 2016). Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning, arXiv.","DOI":"10.1609\/aaai.v31i1.11231"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Ge, Y., Jiang, S., Xu, Q., Jiang, C., and Ye, F. (2017). Exploiting representations from pre-trained convolutional neural networks for high-resolution remote sensing image retrieval. Multimedia Tools Appl., 1\u201327.","DOI":"10.1007\/s11042-017-5314-5"},{"key":"ref_23","first-page":"1","article-title":"AID: A Benchmark Data Set for Performance Evaluation of Aerial Scene Classification","volume":"PP","author":"Xia","year":"2016","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Shao, Z., Yang, K., and Zhou, W. (2018). Performance Evaluation of Single-Label and Multi-Label Remote Sensing Image Retrieval Using a Dense Labeling Dataset. Remote Sens., 10.","DOI":"10.3390\/rs10060964"},{"key":"ref_25","first-page":"1","article-title":"Multilabel Remote Sensing Image Retrieval Using a Semisupervised Graph-Theoretic Method","volume":"PP","author":"Chaudhuri","year":"2017","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_26","unstructured":"Zhou, W., Deng, X., and Shao, Z. (arXiv, 2018). Region Convolutional Features for Multi-Label Remote Sensing Image Retrieval, arXiv."},{"key":"ref_27","unstructured":"Xia, G.S., Tong, X.Y., Hu, F., Zhong, Y., Datcu, M., and Zhang, L. (arXiv, 2017). Exploiting Deep Features for Remote Sensing Image Retrieval: A Systematic Investigation, arXiv."},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Zhou, W., Newsam, S., Li, C., and Shao, Z. (2016). Learning Low Dimensional Convolutional Neural Networks for High-Resolution Remote Sensing Image Retrieval. Remote Sens., 9.","DOI":"10.3390\/rs9050489"},{"key":"ref_29","first-page":"1","article-title":"Large-Scale Remote Sensing Image Retrieval by Deep Hashing Neural Networks","volume":"PP","author":"Li","year":"2017","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_30","first-page":"2204","article-title":"Recurrent models of visual attention","volume":"3","author":"Mnih","year":"2014","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_31","unstructured":"Gregor, K., Danihelka, I., Graves, A., Rezende, D.J., and Wierstra, D. (2015). DRAW: A recurrent neural network for image generation. Comput. Sci., 1462\u20131471."},{"key":"ref_32","unstructured":"Ba, J., Mnih, V., and Kavukcuoglu, K. (arXiv, 2014). Multiple Object Recognition with Visual Attention, arXiv."},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Wang, F., Jiang, M., Qian, C., Yang, S., Li, C., Zhang, H., Wang, X., and Tang, X. (2017, January 21\u201326). Residual Attention Network for Image Classification. Proceedings of the Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.683"},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Hu, J., Shen, L., and Sun, G. (arXiv, 2017). Squeeze-and-Excitation Networks, arXiv.","DOI":"10.1109\/CVPR.2018.00745"},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Woo, S., Park, J., Lee, J.Y., and Kweon, I.S. (arXiv, 2018). CBAM: Convolutional Block Attention Module, arXiv.","DOI":"10.1007\/978-3-030-01234-2_1"},{"key":"ref_36","unstructured":"Chopra, S., Hadsell, R., and Lecun, Y. (2005, January 20\u201325). Learning a Similarity Metric Discriminatively, with Application to Face Verification. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2005, San Diego, CA, USA."},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Wen, Y., Li, Z., and Qiao, Y. (2016, January 27\u201330). Latent Factor Guided Convolutional Neural Networks for Age-Invariant Face Recognition. Proceedings of the Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.529"},{"key":"ref_38","unstructured":"Chen, Y., Chen, Y., Wang, X., and Tang, X. (2014, January 8\u201313). Deep learning face representation by joint identification-verification. Proceedings of the International Conference on Neural Information Processing Systems, Montreal, QC, Canada."},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Schroff, F., Kalenichenko, D., and Philbin, J. (2015, January 7\u201312). FaceNet: A unified embedding for face recognition and clustering. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298682"},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Wen, Y., Zhang, K., Li, Z., and Qiao, Y. (2016, January 11\u201314). A Discriminative Feature Learning Approach for Deep Face Recognition. Proceedings of the Computer Vision\u2014ECCV 2016: 14th European Conference, Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46478-7_31"},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"197","DOI":"10.1016\/j.isprsjprs.2018.01.004","article-title":"PatternNet: A benchmark dataset for performance evaluation of remote sensing image retrieval","volume":"145","author":"Zhou","year":"2018","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_42","doi-asserted-by":"crossref","first-page":"2321","DOI":"10.1109\/LGRS.2015.2475299","article-title":"Deep Learning Based Feature Selection for Remote Sensing Scene Classification","volume":"12","author":"Zou","year":"2015","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Yang, Y., and Newsam, S. (2010, January 2\u20135). Bag-of-visual-words and spatial extensions for land-use classification. Proceedings of the Sigspatial International Conference on Advances in Geographic Information Systems, San Jose, CA, USA.","DOI":"10.1145\/1869790.1869829"},{"key":"ref_44","unstructured":"Xia, G.-S., Yang, W., Delon, J., Gousseau, Y., Sun, H., and Ma\u00eetre, H. (2010, January 5\u20137). Structural high-resolution satellite image indexing. Proceedings of the ISPRS TC VII Symposium-100 Years ISPRS, Vienna, Austria."},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Selvaraju, R.R., Cogswell, M., Das, A., Vedantam, R., Parikh, D., and Batra, D. (arXiv, 2016). Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization, arXiv.","DOI":"10.1109\/ICCV.2017.74"}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/11\/3\/281\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T12:30:05Z","timestamp":1760185805000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/11\/3\/281"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,2,1]]},"references-count":45,"journal-issue":{"issue":"3","published-online":{"date-parts":[[2019,2]]}},"alternative-id":["rs11030281"],"URL":"https:\/\/doi.org\/10.3390\/rs11030281","relation":{},"ISSN":["2072-4292"],"issn-type":[{"type":"electronic","value":"2072-4292"}],"subject":[],"published":{"date-parts":[[2019,2,1]]}}}