{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,1]],"date-time":"2025-12-01T11:18:26Z","timestamp":1764587906140,"version":"build-2065373602"},"reference-count":58,"publisher":"MDPI AG","issue":"3","license":[{"start":{"date-parts":[[2018,3,6]],"date-time":"2018-03-06T00:00:00Z","timestamp":1520294400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["41671400","41701446","61602429"],"award-info":[{"award-number":["41671400","41701446","61602429"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"name":"National key R & D program of China","award":["2017YFC0602204"],"award-info":[{"award-number":["2017YFC0602204"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Remote sensing (RS) scene classification is important for RS imagery semantic interpretation. Although tremendous strides have been made in RS scene classification, one of the remaining open challenges is recognizing RS scenes in low quality variance (e.g., various scales and noises). This paper proposes a deep salient feature based anti-noise transfer network (DSFATN) method that effectively enhances and explores the high-level features for RS scene classification in different scales and noise conditions. In DSFATN, a novel discriminative deep salient feature (DSF) is introduced by saliency-guided DSF extraction, which conducts a patch-based visual saliency (PBVS) algorithm using \u201cvisual attention\u201d mechanisms to guide pre-trained CNNs for producing the discriminative high-level features. 
Then, an anti-noise network is proposed to learn and enhance the robust and anti-noise structure information of RS scene by directly propagating the label information to fully-connected layers. A joint loss is used to minimize the anti-noise network by integrating anti-noise constraint and a softmax classification loss. The proposed network architecture can be easily trained with a limited amount of training data. The experiments conducted on three different scale RS scene datasets show that the DSFATN method has achieved excellent performance and great robustness in different scales and noise conditions. It obtains classification accuracy of 98.25%, 98.46%, and 98.80%, respectively, on the UC Merced Land Use Dataset (UCM), the Google image dataset of SIRI-WHU, and the SAT-6 dataset, advancing the state-of-the-art substantially.<\/jats:p>","DOI":"10.3390\/rs10030410","type":"journal-article","created":{"date-parts":[[2018,3,6]],"date-time":"2018-03-06T12:16:27Z","timestamp":1520338587000},"page":"410","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":41,"title":["Deep Salient Feature Based Anti-Noise Transfer Network for Scene Classification of Remote Sensing Imagery"],"prefix":"10.3390","volume":"10","author":[{"given":"Xi","family":"Gong","sequence":"first","affiliation":[{"name":"Department of Information Engineering, China University of Geosciences, Wuhan 430075, China"},{"name":"National Engineering Research Center of Geographic Information System, Wuhan 430075, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Zhong","family":"Xie","sequence":"additional","affiliation":[{"name":"Department of Information Engineering, China University of Geosciences, Wuhan 430075, China"},{"name":"National Engineering Research Center of Geographic Information System, Wuhan 430075, 
China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0465-3976","authenticated-orcid":false,"given":"Yuanyuan","family":"Liu","sequence":"additional","affiliation":[{"name":"Department of Information Engineering, China University of Geosciences, Wuhan 430075, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2815-7897","authenticated-orcid":false,"given":"Xuguo","family":"Shi","sequence":"additional","affiliation":[{"name":"Department of Information Engineering, China University of Geosciences, Wuhan 430075, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Zhuo","family":"Zheng","sequence":"additional","affiliation":[{"name":"Department of Information Engineering, China University of Geosciences, Wuhan 430075, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2018,3,6]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Yang, Y., and Newsam, S. (2010, January 3\u20135). Bag-of-visual-words and spatial extensions for land-use classification. Proceedings of the 18th SIGSPATIAL International Conference on Advances in Geographic Information Systems, San Jose, CA, USA.","DOI":"10.1145\/1869790.1869829"},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"439","DOI":"10.1109\/TGRS.2013.2241444","article-title":"Unsupervised feature learning for aerial scene classification","volume":"52","author":"Cheriyadat","year":"2014","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"4472","DOI":"10.1109\/TGRS.2015.2400449","article-title":"Learning high-level features for satellite image classification with limited labeled samples","volume":"53","author":"Yang","year":"2015","journal-title":"IEEE Trans. Geosci. 
Remote Sens."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"8588","DOI":"10.1080\/01431161.2013.845925","article-title":"Extreme value theory-based calibration for the fusion of multiple features in high-resolution satellite scene classification","volume":"34","author":"Shao","year":"2013","journal-title":"Int. J. Remote Sens."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"2077","DOI":"10.1109\/LGRS.2017.2751559","article-title":"Locality Adaptive Discriminant Analysis for Spectral\u2013Spatial Classification of Hyperspectral Images","volume":"14","author":"Wang","year":"2017","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"778","DOI":"10.1016\/j.imavis.2006.07.015","article-title":"Which is the best way to organize\/classify images by content?","volume":"25","author":"Bosch","year":"2007","journal-title":"Image Vis. Comput."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"6207","DOI":"10.1109\/TGRS.2015.2435801","article-title":"Scene classification based on the multifeature fusion probabilistic topic model for high spatial resolution remote sensing imagery","volume":"53","author":"Zhong","year":"2015","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"424","DOI":"10.1016\/j.patcog.2012.07.017","article-title":"Scene classification using a multi-resolution bag-of-features model","volume":"46","author":"Zhou","year":"2013","journal-title":"Pattern Recognit."},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Zhao, B., Zhong, Y., Zhang, L., and Huang, B. (2016). The fisher kernel coding framework for high spatial resolution scene classification. 
Remote Sens., 8.","DOI":"10.3390\/rs8020157"},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"2108","DOI":"10.1109\/TGRS.2015.2496185","article-title":"Dirichlet-derived multiple topic scene classification model fusing heterogeneous features for high spatial resolution remote sensing imagery","volume":"54","author":"Zhao","year":"2016","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"025006","DOI":"10.1117\/1.JRS.10.025006","article-title":"Large patch convolutional neural networks for the scene classification of high spatial resolution imagery","volume":"10","author":"Zhong","year":"2016","journal-title":"J. Appl. Remote Sens."},{"key":"ref_12","unstructured":"Lazebnik, S., Schmid, C., and Ponce, J. (2006, January 17\u201322). Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, New York, NY, USA."},{"key":"ref_13","unstructured":"Yang, Y., and Newsam, S. (2011, January 6\u201313). Spatial pyramid co-occurrence for image classification. Proceedings of the IEEE International Conference on Computer Vision, Barcelona, Spain."},{"key":"ref_14","first-page":"993","article-title":"Latent dirichlet allocation","volume":"3","author":"Blei","year":"2003","journal-title":"J. Mach. Learn. Res."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"28","DOI":"10.1109\/LGRS.2009.2023536","article-title":"Semantic annotation of satellite images using latent dirichlet allocation","volume":"7","author":"Lienou","year":"2010","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"2770","DOI":"10.1109\/TGRS.2012.2219314","article-title":"Latent dirichlet allocation for spatial analysis of satellite images","volume":"51","author":"Vaduva","year":"2013","journal-title":"IEEE Trans. Geosci. 
Remote Sens."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"712","DOI":"10.1109\/TPAMI.2007.70716","article-title":"Scene classification using a hybrid generative\/discriminative approach","volume":"30","author":"Bosch","year":"2008","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"45","DOI":"10.1080\/01431161.2012.705443","article-title":"Automatic landslide detection from remote-sensing imagery using a scene classification method based on BoVW and pLSA","volume":"34","author":"Cheng","year":"2013","journal-title":"Int. J. Remote Sens."},{"key":"ref_19","unstructured":"Simonyan, K., and Zisserman, A. (2015, January 7\u20139). Very deep convolutional networks for large-scale image recognition. Proceedings of the International Conference on Learning Representations, San Diego, CA, USA."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"5832","DOI":"10.1109\/TGRS.2016.2572736","article-title":"Ship detection in spaceborne optical image with SVD networks","volume":"54","author":"Zou","year":"2016","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_21","unstructured":"Gao, J., Wang, Q., and Yuan, Y. (June, January 29). Embedding structured contour and location prior in siamesed fully convolutional networks for road detection. Proceedings of the IEEE International Conference on Robotics and Automation, Singapore."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Li, C., and Wand, M. (arXiv, 2016). Combining markov random fields and convolutional neural networks for image synthesis, arXiv.","DOI":"10.1109\/CVPR.2016.272"},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"22","DOI":"10.1109\/MGRS.2016.2540798","article-title":"Deep Learning for Remote Sensing Data: A Technical Tutorial on the State of the Art","volume":"4","author":"Zhang","year":"2016","journal-title":"IEEE Geosci. Remote Sens. 
Mag."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"8","DOI":"10.1109\/MGRS.2017.2762307","article-title":"Deep Learning in Remote Sensing: A Comprehensive Review and List of Resources","volume":"5","author":"Zhu","year":"2017","journal-title":"IEEE Geosci. Remote Sens. Mag."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"5148","DOI":"10.1109\/TGRS.2017.2702596","article-title":"Remote sensing scene classification by unsupervised representation learning","volume":"55","author":"Lu","year":"2017","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"2381","DOI":"10.1109\/JSTARS.2015.2388577","article-title":"Spectral\u2013spatial classification of hyperspectral data based on deep belief network","volume":"8","author":"Chen","year":"2015","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Deng, J., Dong, W., Socher, R., Li, L.J., Li, K., and Fei-Fei, L. (2009, January 20\u201325). Imagenet: A large-scale hierarchical image database. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Miami, FL, USA.","DOI":"10.1109\/CVPR.2009.5206848"},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"262","DOI":"10.1080\/2150704X.2016.1258127","article-title":"Vehicle detection in remote sensing images using denoising-based convolutional neural networks","volume":"8","author":"Li","year":"2017","journal-title":"Remote Sens. Lett."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"1293","DOI":"10.1109\/LGRS.2017.2708722","article-title":"M-FCN: Effective fully convolutional network-based airplane detection framework","volume":"14","author":"Yang","year":"2017","journal-title":"IEEE Geosci. Remote Sens. 
Lett."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"1797","DOI":"10.1109\/LGRS.2014.2309695","article-title":"Vehicle detection in satellite images by hybrid deep convolutional neural networks","volume":"11","author":"Chen","year":"2017","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_31","unstructured":"Simard, P.Y., Steinkraus, D., and Platt, J.C. (2003, January 3\u20136). Best practices for convolutional neural networks applied to visual document analysis. Proceedings of the Seventh International Conference on Document Analysis and Recognition, Edinburgh, UK."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"1441","DOI":"10.1093\/mnras\/stv632","article-title":"Rotation-invariant convolutional neural networks for galaxy morphology prediction","volume":"450","author":"Dieleman","year":"2015","journal-title":"Mon. Not. R. Astron. Soc."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"2175","DOI":"10.1109\/TGRS.2014.2357078","article-title":"Saliency-guided unsupervised feature learning for scene classification","volume":"53","author":"Zhang","year":"2015","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_34","unstructured":"Castelluccio, M., Poggi, G., Sansone, C., and Verdoliva, L. (arXiv, 2015). Land use classification in remote sensing images by convolutional neural networks, arXiv."},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., and Anguelov, D. (2015, January 11\u201312). Going deeper with convolutions. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298594"},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Penatti, O.A., Nogueira, K., and dos Santos, J.A. (2015, January 12). Do deep features generalize from everyday objects to remote sensing and aerial scenes domains?. 
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, Boston, MA, USA.","DOI":"10.1109\/CVPRW.2015.7301382"},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"14680","DOI":"10.3390\/rs71114680","article-title":"Transferring deep convolutional neural networks for the scene classification of high-resolution remote sensing imagery","volume":"7","author":"Hu","year":"2015","journal-title":"Remote Sens."},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Gong, Y., Wang, L., Guo, R., and Lazebnik, S. (2014). Multi-scale orderless pooling of deep convolutional activation features. The European Conference on Computer Vision (ECCV), Springer.","DOI":"10.1007\/978-3-319-10584-0_26"},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Lee, H., Grosse, R., Ranganath, R., and Ng, A.Y. (2009, January 14\u201318). Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations. Proceedings of the 26th annual international conference on machine learning, Montreal, QC, Canada.","DOI":"10.1145\/1553374.1553453"},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"1254","DOI":"10.1109\/34.730558","article-title":"A model of saliency-based visual attention for rapid scene analysis","volume":"20","author":"Itti","year":"1998","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Harel, J., Koch, C., and Perona, P. (2007, January 3). Graph-based visual saliency. Proceedings of the Advances in Neural Information Processing Systems, Vancouver, BC, Canada.","DOI":"10.7551\/mitpress\/7503.003.0073"},{"key":"ref_42","unstructured":"Harel, J. (2018, January 10). A Saliency Implementation in MATLAB. Available online: http:\/\/www.vision.caltech.edu\/~harel\/share\/gbvs.php."},{"key":"ref_43","unstructured":"Caldwell, D.R. (2018, January 10). Unlocking the mysteries of the bounding box. 
Available online: http:\/\/www.stonybrook.edu\/libmap\/coordinates\/seriesa\/no2\/a2.pdf."},{"key":"ref_44","unstructured":"Glorot, X., Bordes, A., and Bengio, Y. (2011, January 11\u201313). Deep sparse rectifier neural networks. Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, Fort Lauderdale, FL, USA."},{"key":"ref_45","unstructured":"Peng, X., Lu, C., Yi, Z., and Tang, H. (2016). Connections between Nuclear Norm and Frobenius-Norm-Based Representations. IEEE Trans. Neural Netw. Learn. Syst."},{"key":"ref_46","doi-asserted-by":"crossref","unstructured":"Basu, S., Ganguly, S., Mukhopadhyay, S., Dibiano, R., Karki, M., and Nemani, R. (2015, January 3\u20136). Deepsat: A learning framework for satellite imagery. Proceedings of the 23rd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, Seattle, WA, USA.","DOI":"10.1145\/2820783.2820816"},{"key":"ref_47","doi-asserted-by":"crossref","first-page":"1947","DOI":"10.1109\/TGRS.2014.2351395","article-title":"Pyramid of spatial relatons for scene-level land use classification","volume":"53","author":"Chen","year":"2015","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_48","unstructured":"Krizhevsky, A., Sutskever, I., and Hinton, G.E. (2012, January 3\u20138). ImageNet classification with deep convolutional neural networks. Proceedings of the Twenty-Sixth International Conference on Neural Information Processing Systems, Lake Tahoe, NV, USA."},{"key":"ref_49","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1023\/A:1010933404324","article-title":"Random forests","volume":"45","author":"Breiman","year":"2001","journal-title":"Mach. Learn."},{"key":"ref_50","unstructured":"Dalal, N., and Triggs, B. (2005, January 20\u201326). Histograms of oriented gradients for human detection. 
Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, San Diego, CA, USA."},{"key":"ref_51","doi-asserted-by":"crossref","first-page":"91","DOI":"10.1023\/B:VISI.0000029664.99615.94","article-title":"Distinctive image features from scale-invariant keypoints","volume":"60","author":"Lowe","year":"2004","journal-title":"Int. J. Comput. Vis."},{"key":"ref_52","doi-asserted-by":"crossref","first-page":"971","DOI":"10.1109\/TPAMI.2002.1017623","article-title":"Multiresolution gray-scale and rotation invariant texture classification with local binary patterns","volume":"24","author":"Ojala","year":"2002","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_53","first-page":"2579","article-title":"Visualizing data using t-SNE","volume":"9","author":"Hinton","year":"2008","journal-title":"J. Mach. Learn. Res."},{"key":"ref_54","doi-asserted-by":"crossref","unstructured":"Jia, Y., Shelhamer, E., Donahue, J., Karayev, S., Long, J., Girshick, R., Guadarrama, S., and Darrell, T. (2014, January 3\u20137). Caffe: Convolutional architecture for fast feature embedding. Proceedings of the ACM International Conference on Multimedia, Orlando, FL, USA.","DOI":"10.1145\/2647868.2654889"},{"key":"ref_55","doi-asserted-by":"crossref","unstructured":"Chatfield, K., Simonyan, K., Vedaldi, A., and Zisserman, A. (2014, January 1\u20135). Return of the devil in the details: Delving deep into convolutional nets. Proceedings of the British Machine Vision Conference, Nottingham, UK.","DOI":"10.5244\/C.28.6"},{"key":"ref_56","unstructured":"Ioffe, S., and Szegedy, C. (2015, January 6\u201311). Batch normalization: Accelerating deep network training by reducing internal covariate shift. Proceedings of the International Conference on Machine Learning, Lille, France."},{"key":"ref_57","unstructured":"Szegedy, C., Vanhoucke, V., Ioffe, S., Shlens, J., and Wojna, Z. (July, January 26). 
Rethinking the inception architecture for computer vision. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA."},{"key":"ref_58","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (July, January 26). Deep residual learning for image recognition. Proceedings of the IEEE conference on computer vision and pattern recognition, Las Vegas, NV, USA."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/10\/3\/410\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T14:57:45Z","timestamp":1760194665000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/10\/3\/410"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,3,6]]},"references-count":58,"journal-issue":{"issue":"3","published-online":{"date-parts":[[2018,3]]}},"alternative-id":["rs10030410"],"URL":"https:\/\/doi.org\/10.3390\/rs10030410","relation":{},"ISSN":["2072-4292"],"issn-type":[{"type":"electronic","value":"2072-4292"}],"subject":[],"published":{"date-parts":[[2018,3,6]]}}}