{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,1]],"date-time":"2026-04-01T08:22:41Z","timestamp":1775031761591,"version":"3.50.1"},"reference-count":95,"publisher":"MDPI AG","issue":"7","license":[{"start":{"date-parts":[[2018,7,23]],"date-time":"2018-07-23T00:00:00Z","timestamp":1532304000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["71771216"],"award-info":[{"award-number":["71771216"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["71701209"],"award-info":[{"award-number":["71701209"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Aerial scene classification is an active and challenging problem in high-resolution remote sensing imagery understanding. Deep learning models, especially convolutional neural networks (CNNs), have achieved prominent performance in this field. The extraction of deep features from the layers of a CNN model is widely used in these CNN-based methods. Although the CNN-based approaches have obtained great success, there is still plenty of room to further increase the classification accuracy. As a matter of fact, the fusion with other features has great potential for leading to the better performance of aerial scene classification. Therefore, we propose two effective architectures based on the idea of feature-level fusion. 
The first architecture, i.e., texture coded two-stream deep architecture, uses the raw RGB network stream and the mapped local binary patterns (LBP) coded network stream to extract two different sets of features and fuses them using a novel deep feature fusion model. In the second architecture, i.e., saliency coded two-stream deep architecture, we employ the saliency coded network stream as the second stream and fuse it with the raw RGB network stream using the same feature fusion model. For sake of validation and comparison, our proposed architectures are evaluated via comprehensive experiments with three publicly available remote sensing scene datasets. The classification accuracies of saliency coded two-stream architecture with our feature fusion model achieve 97.79%, 98.90%, 94.09%, 95.99%, 85.02%, and 87.01% on the UC-Merced dataset (50% and 80% training samples), the Aerial Image Dataset (AID) (20% and 50% training samples), and the NWPU-RESISC45 dataset (10% and 20% training samples), respectively, overwhelming state-of-the-art methods.<\/jats:p>","DOI":"10.3390\/rs10071158","type":"journal-article","created":{"date-parts":[[2018,7,24]],"date-time":"2018-07-24T02:58:56Z","timestamp":1532401136000},"page":"1158","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":66,"title":["Dense Connectivity Based Two-Stream Deep Feature Fusion Framework for Aerial Scene Classification"],"prefix":"10.3390","volume":"10","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-4809-9738","authenticated-orcid":false,"given":"Yunlong","family":"Yu","sequence":"first","affiliation":[{"name":"Institute of Air Defense and Anti-Missile, Air Force Engineering University, Xi\u2019an 710051, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Fuxian","family":"Liu","sequence":"additional","affiliation":[{"name":"Institute of Air Defense and Anti-Missile, Air Force Engineering University, Xi\u2019an 710051, 
China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2018,7,23]]},"reference":[{"key":"ref_1","first-page":"2","article-title":"Integration of Remote Sensing and GIS Techniques for Flood Monitoring and Damage Assessment: A Case Study of Naogaon District","volume":"7","author":"Faisal","year":"2018","journal-title":"Bangladesh J. Remote Sens. GIS"},{"key":"ref_2","first-page":"105400Q","article-title":"Development technology of principle prototype of high-resolution quantum remote sensing imaging","volume":"Volume 10540","author":"Bi","year":"2018","journal-title":"Quantum Sensing and Nano Electronics and Photonics XV"},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Weng, Q., Quattrochi, D., and Gamba, P.E. (2018). Urban Remote Sensing, CRC Press.","DOI":"10.1201\/9781315166612"},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Mukherjee, A.B., Krishna, A.P., and Patel, N. (2018). Application of Remote Sensing Technology, GIS and AHP-TOPSIS Model to Quantify Urban Landscape Vulnerability to Land Use Transformation. Information and Communication Technology for Sustainable Development, Springer.","DOI":"10.1007\/978-981-10-3920-1_4"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"818","DOI":"10.1109\/TGRS.2012.2205158","article-title":"Geographic image retrieval using local invariant features","volume":"51","author":"Yang","year":"2013","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"652","DOI":"10.1109\/LGRS.2012.2216499","article-title":"Automatic annotation of satellite images via multifeature joint sparse coding with spatial relation constraint","volume":"10","author":"Zheng","year":"2013","journal-title":"IEEE Geosci. Remote Sens. 
Lett."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"14988","DOI":"10.3390\/rs71114988","article-title":"A comparative study of sampling analysis in the scene classification of optical high-spatial resolution remote sensing imagery","volume":"7","author":"Hu","year":"2015","journal-title":"Remote Sens."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"597","DOI":"10.1109\/LGRS.2018.2800642","article-title":"Asymmetric Adaptation of Deep Features for Cross-Domain Classification in Remote Sensing Imagery","volume":"15","author":"Ammour","year":"2018","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Alhichri, H., Othman, E., Zuair, M., Ammour, N., and Bazi, Y. (2018). Tile-Based Semisupervised Classification of Large-Scale VHR Remote Sensing Images. J. Sens., 2018.","DOI":"10.1155\/2018\/6257810"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Banerjee, B., and Chaudhuri, S. (2018). Scene Recognition From Optical Remote Sensing Images Using Mid-Level Deep Feature Mining. IEEE Geosci. Remote Sens. Lett., 15.","DOI":"10.1109\/LGRS.2018.2822779"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Minetto, R., Segundo, M.P., and Sarkar, S. (arXiv, 2018). Hydra: An Ensemble of Convolutional Neural Networks for Geospatial Land Classification, arXiv.","DOI":"10.1109\/TGRS.2019.2906883"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Yang, Y., and Newsam, S. (2008, January 12\u201315). Comparing SIFT descriptors and Gabor texture features for classification of remote sensed imagery. Proceedings of the 15th IEEE International Conference on Image Processing (ICIP 2008), San Diego, CA, USA.","DOI":"10.1109\/ICIP.2008.4712139"},{"key":"ref_13","unstructured":"Dos Santos, J.A., Penatti, O.A.B., and da Silva Torres, R. (2010, January 17\u201321). Evaluating the Potential of Texture and Color Descriptors for Remote Sensing Image Retrieval and Classification. 
Proceedings of the Fifth International Conference on Computer Vision Theory and Applications, Angers, France."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"2296","DOI":"10.1080\/01431161.2014.890762","article-title":"A 2-D wavelet decomposition-based bag-of-visual-words model for land-use scene classification","volume":"35","author":"Zhao","year":"2014","journal-title":"Int. J. Remote Sens."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"1947","DOI":"10.1109\/TGRS.2014.2351395","article-title":"Pyramid of spatial relatons for scene-level land use classification","volume":"53","author":"Chen","year":"2015","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"14680","DOI":"10.3390\/rs71114680","article-title":"Transferring deep convolutional neural networks for the scene classification of high-resolution remote sensing imagery","volume":"7","author":"Hu","year":"2015","journal-title":"Remote Sens."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"2448","DOI":"10.1109\/LGRS.2015.2483680","article-title":"Multiview deep learning for land-use classification","volume":"12","author":"Luus","year":"2015","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Chen, J., Wang, C., Ma, Z., Chen, J., He, D., and Ackland, S. (2018). Remote Sensing Scene Classification Based on Convolutional Neural Networks Pre-Trained Using Attention-Guided Sparse Filters. Remote Sens., 10.","DOI":"10.3390\/rs10020290"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"12","DOI":"10.1186\/s12942-018-0132-1","article-title":"Residential scene classification for gridded population sampling in developing countries using deep convolutional neural networks on satellite imagery","volume":"17","author":"Chew","year":"2018","journal-title":"Int. J. 
Health Geogr."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Jia, Y., Shelhamer, E., Donahue, J., Karayev, S., Long, J., Girshick, R., Guadarrama, S., and Darrell, T. (2014, January 3\u20137). Caffe: Convolutional architecture for fast feature embedding. Proceedings of the 22nd ACM International Conference on Multimedia, Orlando, FL, USA.","DOI":"10.1145\/2647868.2654889"},{"key":"ref_21","unstructured":"Krizhevsky, A., Sutskever, I., and Hinton, G.E. (2012). Imagenet Classification with Deep Convolutional Neural Networks, Neural Information Processing Systems Foundation, Inc."},{"key":"ref_22","unstructured":"Simonyan, K., and Zisserman, A. (arXiv, 2014). Very deep convolutional networks for large-scale image recognition, arXiv."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., and Rabinovich, A. (2015, January 7\u201312). Going deeper with convolutions. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR\u201915), Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298594"},{"key":"ref_24","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (July, January 26). Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR\u201916), Las Vegas Valley, NV, USA."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"8","DOI":"10.1109\/MGRS.2017.2762307","article-title":"Deep Learning in Remote Sensing: A Comprehensive Review and List of Resources","volume":"5","author":"Zhu","year":"2017","journal-title":"IEEE Geosci. Remote Sens. Mag."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"042609","DOI":"10.1117\/1.JRS.11.042609","article-title":"Comprehensive survey of deep learning in remote sensing: Theories, tools, and challenges for the community","volume":"11","author":"Ball","year":"2017","journal-title":"J. Appl. 
Remote Sens."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Li, Y., Zhang, H., Xue, X., Jiang, Y., and Shen, Q. (2018). Deep learning for remote sensing image classification: A survey. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, John Wiley & Sons.","DOI":"10.1002\/widm.1264"},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"971","DOI":"10.1109\/TPAMI.2002.1017623","article-title":"Multiresolution gray-scale and rotation invariant texture classification with local binary patterns","volume":"24","author":"Ojala","year":"2002","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"91","DOI":"10.1023\/B:VISI.0000029664.99615.94","article-title":"Distinctive image features from scale-invariant keypoints","volume":"60","author":"Lowe","year":"2004","journal-title":"Int. J. Comput. Vis."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1007\/BF00130487","article-title":"Color indexing","volume":"7","author":"Swain","year":"1991","journal-title":"Int. J. Comput. Vis."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"145","DOI":"10.1023\/A:1011139631724","article-title":"Modeling the shape of the scene: A holistic representation of the spatial envelope","volume":"42","author":"Oliva","year":"2001","journal-title":"Int. J. Comput. Vis."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"1899","DOI":"10.1109\/JSTARS.2012.2228254","article-title":"Indexing of remote sensing images with different resolutions by multiple features","volume":"6","author":"Luo","year":"2013","journal-title":"IEEE J. Sel. Top. Appl. Earth Observ. Remote Sens."},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Yang, Y., and Newsam, S. (2010, January 2\u20135). Bag-of-visual-words and spatial extensions for land-use classification. 
Proceedings of the 18th SIGSPATIAL International Conference on Advances in Geographic Information Systems, San Jose, CA, USA.","DOI":"10.1145\/1869790.1869829"},{"key":"ref_34","unstructured":"Yang, Y., and Newsam, S. (2011, January 6\u201313). Spatial pyramid co-occurrence for image classification. Proceedings of the 2011 IEEE International Conference on Computer Vision, Barcelona, Spain."},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Shao, W., Yang, W., Xia, G.S., and Liu, G. (2013). A hierarchical scheme of multiple feature fusion for high-resolution satellite scene categorization. Computer Vision Systems, Proceedings of the 9th International Conference, ICVS 2013, St. Petersburg, Russia, 16\u201318 July 2013, Springer.","DOI":"10.1007\/978-3-642-39402-7_33"},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"4620","DOI":"10.1109\/JSTARS.2014.2339842","article-title":"Land-use scene classification using a concentric circle-structured multiscale bag-of-visual-words model","volume":"7","author":"Zhao","year":"2014","journal-title":"IEEE J. Sel. Top. Appl. Earth Observ. Remote Sens."},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"676","DOI":"10.1109\/LGRS.2014.2357392","article-title":"Bag of lines (BoL) for improved aerial scene representation","volume":"12","author":"Sridharan","year":"2015","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Hu, J., Jiang, T., Tong, X., Xia, G.S., and Zhang, L. (2015, January 26\u201331). A benchmark for scene classification of high spatial resolution remote sensing imagery. Proceedings of the 2015 IEEE International Geoscience and Remote Sensing Symposium (IGARSS), Milan, Italy.","DOI":"10.1109\/IGARSS.2015.7326956"},{"key":"ref_39","unstructured":"Lazebnik, S., Schmid, C., and Ponce, J. (2006, January 17\u201322). Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. 
Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR\u201906), New York, NY, USA."},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Wang, J., Yang, J., Yu, K., Lv, F., Huang, T., and Gong, Y. (2010, January 13\u201318). Locality-constrained linear coding for image classification. Proceedings of the 2010 IEEE Conference on Computer Vision and Pattern Recognition (CVPR\u201910), San Francisco, CA, USA.","DOI":"10.1109\/CVPR.2010.5540018"},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Bosch, A., Zisserman, A., and Mu\u00f1oz, X. (2006). Scene classification via pLSA. Computer Vision\u2014ECCV 2006: Proceeding of the 9th European Conference on Computer Vision, Graz, Austria, 7\u201313 May 2006, Springer.","DOI":"10.1007\/11744085_40"},{"key":"ref_42","first-page":"993","article-title":"Latent dirichlet allocation","volume":"3","author":"Blei","year":"2003","journal-title":"J. Mach. Learn. Res."},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Perronnin, F., S\u00e1nchez, J., and Mensink, T. (2010). Improving the fisher kernel for large-scale image classification. Computer Vision\u2014ECCV 2010, Proceeding of the 11th European Conference on Computer Vision, Heraklion, Crete, Greece, 5\u201311 September 2010, Springer.","DOI":"10.1007\/978-3-642-15561-1_11"},{"key":"ref_44","doi-asserted-by":"crossref","first-page":"1704","DOI":"10.1109\/TPAMI.2011.235","article-title":"Aggregating local image descriptors into compact codes","volume":"34","author":"Jegou","year":"2012","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Penatti, O.A., Nogueira, K., and dos Santos, J.A. (2015, January 7). Do deep features generalize from everyday objects to remote sensing and aerial scenes domains?. 
Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Boston, MA, USA.","DOI":"10.1109\/CVPRW.2015.7301382"},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"1793","DOI":"10.1109\/TGRS.2015.2488681","article-title":"Scene classification via a gradient boosting random convolutional network framework","volume":"54","author":"Zhang","year":"2016","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_47","doi-asserted-by":"crossref","unstructured":"Aryal, J., and Dutta, R. (2015, January 13\u201317). Smart city and geospatiality: Hobart deeply learned. In Proceeding of the 2015 31st IEEE International Conference on Data Engineering Workshops (ICDEW), Seoul, Korea.","DOI":"10.1109\/ICDEW.2015.7129557"},{"key":"ref_48","doi-asserted-by":"crossref","first-page":"3188","DOI":"10.1038\/srep03188","article-title":"Deep cognitive imaging systems enable estimation of continental-scale fire incidence from climate data","volume":"3","author":"Dutta","year":"2013","journal-title":"Sci. Rep."},{"key":"ref_49","doi-asserted-by":"crossref","first-page":"211","DOI":"10.1007\/s11263-015-0816-y","article-title":"Imagenet large scale visual recognition challenge","volume":"115","author":"Russakovsky","year":"2015","journal-title":"Int. J. Comput. Vis."},{"key":"ref_50","unstructured":"Sermanet, P., Eigen, D., Zhang, X., Mathieu, M., Fergus, R., and LeCun, Y. (arXiv, 2013). Overfeat: Integrated recognition, localization and detection using convolutional networks, arXiv."},{"key":"ref_51","doi-asserted-by":"crossref","first-page":"3965","DOI":"10.1109\/TGRS.2017.2685945","article-title":"AID: A benchmark data set for performance evaluation of aerial scene classification","volume":"55","author":"Xia","year":"2017","journal-title":"IEEE Trans. Geosci. 
Remote Sens."},{"key":"ref_52","doi-asserted-by":"crossref","first-page":"1865","DOI":"10.1109\/JPROC.2017.2675998","article-title":"Remote sensing image scene classification: benchmark and state of the art","volume":"105","author":"Cheng","year":"2017","journal-title":"Proc. IEEE"},{"key":"ref_53","unstructured":"Castelluccio, M., Poggi, G., Sansone, C., and Verdoliva, L. (arXiv, 2015). Land use classification in remote sensing images by convolutional neural networks, arXiv."},{"key":"ref_54","doi-asserted-by":"crossref","first-page":"539","DOI":"10.1016\/j.patcog.2016.07.001","article-title":"Towards better exploiting convolutional neural networks for remote sensing scene classification","volume":"61","author":"Nogueira","year":"2017","journal-title":"Pattern Recognit."},{"key":"ref_55","doi-asserted-by":"crossref","first-page":"220","DOI":"10.1109\/JSTARS.2017.2761800","article-title":"Scene classification via triplet networks","volume":"11","author":"Liu","year":"2018","journal-title":"IEEE J. Sel. Top. Appl. Earth Observ. Remote Sens."},{"key":"ref_56","doi-asserted-by":"crossref","unstructured":"Qi, K., Yang, C., Guan, Q., Wu, H., and Gong, J. (2017). A Multiscale Deeply Described Correlatons-Based Model for Land-Use Scene Classification. Remote Sens., 9.","DOI":"10.3390\/rs9090917"},{"key":"ref_57","doi-asserted-by":"crossref","first-page":"1735","DOI":"10.1109\/LGRS.2017.2731997","article-title":"Remote sensing image scene classification using bag of convolutional features","volume":"14","author":"Cheng","year":"2017","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_58","doi-asserted-by":"crossref","first-page":"015010","DOI":"10.1117\/1.JRS.12.015010","article-title":"Multiscale deep features learning for land-use scene recognition","volume":"12","author":"Yuan","year":"2018","journal-title":"J. Appl. 
Remote Sens."},{"key":"ref_59","doi-asserted-by":"crossref","first-page":"11215","DOI":"10.1109\/ACCESS.2018.2798799","article-title":"Exploiting Convolutional Neural Networks with Deeply Local Description for Remote Sensing Image Classification","volume":"6","author":"Liu","year":"2018","journal-title":"IEEE Access"},{"key":"ref_60","doi-asserted-by":"crossref","unstructured":"Liu, N., Lu, X., Wan, L., Huo, H., and Fang, T. (2018). Improving the Separability of Deep Features with Discriminative Convolution Filters for RSI Classification. ISPRS Int. J. Geo-Inf., 7.","DOI":"10.3390\/ijgi7030095"},{"key":"ref_61","doi-asserted-by":"crossref","unstructured":"Anwer, R.M., Khan, F.S., van de Weijer, J., Molinier, M., and Laaksonen, J. (arXiv, 2017). Binary patterns encoded convolutional neural networks for texture recognition and remote sensing scene classification, arXiv.","DOI":"10.1016\/j.isprsjprs.2018.01.023"},{"key":"ref_62","doi-asserted-by":"crossref","first-page":"4775","DOI":"10.1109\/TGRS.2017.2700322","article-title":"Deep feature fusion for VHR remote sensing scene classification","volume":"55","author":"Chaib","year":"2017","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_63","doi-asserted-by":"crossref","first-page":"5653","DOI":"10.1109\/TGRS.2017.2711275","article-title":"Integrating Multilayer Features of Convolutional Neural Networks for Remote Sensing Scene Classification","volume":"55","author":"Li","year":"2017","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_64","doi-asserted-by":"crossref","first-page":"295","DOI":"10.1080\/2150704X.2017.1415477","article-title":"Parallel multi-stage features fusion of deep convolutional neural networks for aerial scene classification","volume":"9","author":"Ye","year":"2018","journal-title":"Remote Sens. 
Lett."},{"key":"ref_65","doi-asserted-by":"crossref","first-page":"183","DOI":"10.1109\/LGRS.2017.2779469","article-title":"Scene Classification Based on Two-Stage Deep Feature Fusion","volume":"15","author":"Liu","year":"2018","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_66","doi-asserted-by":"crossref","first-page":"287","DOI":"10.1109\/LGRS.2017.2786241","article-title":"Aerial Scene Classification via Multilevel Fusion Based on Deep Convolutional Neural Networks","volume":"15","author":"Yu","year":"2018","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_67","doi-asserted-by":"crossref","unstructured":"Chowdhury, A.R., Lin, T.Y., Maji, S., and Learned-Miller, E. (arXiv, 2015). One-to-many face recognition with bilinear cnns, arXiv.","DOI":"10.1109\/WACV.2016.7477593"},{"key":"ref_68","doi-asserted-by":"crossref","unstructured":"Lin, T.Y., RoyChowdhury, A., and Maji, S. (2015, January 7\u201313). Bilinear cnn models for fine-grained visual recognition. Proceedings of the IEEE International Conference on Computer Vision (ICCV), Santiago, Chile.","DOI":"10.1109\/ICCV.2015.170"},{"key":"ref_69","unstructured":"Feichtenhofer, C., Pinz, A., and Zisserman, A. (July, January 26). Convolutional two-stream network fusion for video action recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR\u201916), Las Vegas Valley, NV, USA."},{"key":"ref_70","doi-asserted-by":"crossref","unstructured":"Park, E., Han, X., Berg, T.L., and Berg, A.C. (2016, January 7\u20139). Combining multiple sources of knowledge in deep cnns for action recognition. Proceedings of the 2016 IEEE Winter Conference on Applications of Computer Vision (WACV), Lake Placid, NY, USA.","DOI":"10.1109\/WACV.2016.7477589"},{"key":"ref_71","doi-asserted-by":"crossref","unstructured":"Wu, Z., Wang, X., Jiang, Y.G., Ye, H., and Xue, X. (2015, January 26\u201330). 
Modeling spatial-temporal clues in a hybrid deep learning framework for video classification. Proceedings of the 23rd ACM international conference on Multimedia, Brisbane, Australia.","DOI":"10.1145\/2733373.2806222"},{"key":"ref_72","doi-asserted-by":"crossref","unstructured":"Bodla, N., Zheng, J., Xu, H., Chen, J.C., Castillo, C., and Chellappa, R. (2017, January 24\u201331). Deep heterogeneous feature fusion for template-based face recognition. Proceedings of the 2017 IEEE Winter Conference on Applications of Computer Vision (WACV), Santa Rosa, CA, USA.","DOI":"10.1109\/WACV.2017.71"},{"key":"ref_73","doi-asserted-by":"crossref","unstructured":"Levi, G., and Hassner, T. (2015, January 9\u201313). Emotion recognition in the wild via convolutional neural networks and mapped binary patterns. Proceedings of the 2015 ACM on International Conference on Multimodal Interaction, Seattle, WA, USA.","DOI":"10.1145\/2818346.2830587"},{"key":"ref_74","unstructured":"Borg, I., and Groenen, P.J. (2005). Modern Multidimensional Scaling: Theory and Applications, Springer Science & Business Media."},{"key":"ref_75","unstructured":"Seber, G.A. (2009). Multivariate Observations, John Wiley & Sons."},{"key":"ref_76","doi-asserted-by":"crossref","first-page":"99","DOI":"10.1023\/A:1026543900054","article-title":"The earth mover\u2019s distance as a metric for image retrieval","volume":"40","author":"Rubner","year":"2000","journal-title":"Int. J. Comput. Vis."},{"key":"ref_77","doi-asserted-by":"crossref","first-page":"47","DOI":"10.1146\/annurev-psych-122414-033400","article-title":"Neural mechanisms of selective visual attention","volume":"68","author":"Moore","year":"2017","journal-title":"Annual Rev. 
Psychol."},{"key":"ref_78","doi-asserted-by":"crossref","first-page":"1647","DOI":"10.1007\/s10802-017-0263-z","article-title":"Selective visual attention towards oneself and associated state body satisfaction: An eye-tracking study in adolescents with different types of eating disorders","volume":"45","author":"Bauer","year":"2017","journal-title":"J. Abnormal Child. Psychol."},{"key":"ref_79","doi-asserted-by":"crossref","first-page":"1580","DOI":"10.1364\/OL.37.001580","article-title":"Saliency model for object detection: Searching for novel items in the scene","volume":"37","author":"Zheng","year":"2012","journal-title":"Opt. Lett."},{"key":"ref_80","doi-asserted-by":"crossref","first-page":"193","DOI":"10.1038\/nrn1052","article-title":"Cognitive neuroscience: Neural mechanisms for detecting and remembering novel events","volume":"4","author":"Ranganath","year":"2003","journal-title":"Nat. Rev. Neurosci."},{"key":"ref_81","doi-asserted-by":"crossref","unstructured":"Huang, G., Liu, Z., Weinberger, K.Q., and van der Maaten, L. (2017, January 21\u201326). Densely connected convolutional networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR\u201917), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.243"},{"key":"ref_82","unstructured":"Ioffe, S., and Szegedy, C. (arXiv, 2015). Batch normalization: Accelerating deep network training by reducing internal covariate shift, arXiv."},{"key":"ref_83","unstructured":"Glorot, X., Bordes, A., and Bengio, Y. (2011, January 6\u20138). Deep sparse rectifier neural networks. Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics (ICAIS), Klagenfurt, Austria."},{"key":"ref_84","unstructured":"Abadi, M., Agarwal, A., Barham, P., Brevdo, E., Chen, Z., Citro, C., Corrado, G.S., Davis, A., Dean, J., and Devin, M. (arXiv, 2016). 
Tensorflow: Large-scale machine learning on heterogeneous distributed systems, arXiv."},{"key":"ref_85","doi-asserted-by":"crossref","unstructured":"Steinwart, I., and Christmann, A. (2008). Support Vector Machines, Springer Science & Business Media.","DOI":"10.1007\/978-0-387-77242-4"},{"key":"ref_86","doi-asserted-by":"crossref","first-page":"704","DOI":"10.1109\/LGRS.2017.2672643","article-title":"Land-use classification via extreme learning classifier based on deep convolutional features","volume":"14","author":"Weng","year":"2017","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_87","doi-asserted-by":"crossref","first-page":"439","DOI":"10.1109\/TGRS.2013.2241444","article-title":"Unsupervised feature learning for aerial scene classification","volume":"52","author":"Cheriyadat","year":"2014","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_88","doi-asserted-by":"crossref","first-page":"2175","DOI":"10.1109\/TGRS.2014.2357078","article-title":"Saliency-guided unsupervised feature learning for scene classification","volume":"53","author":"Zhang","year":"2015","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_89","doi-asserted-by":"crossref","first-page":"3180","DOI":"10.1016\/j.patcog.2015.02.001","article-title":"Learning LBP structure by maximizing the conditional mutual information","volume":"48","author":"Ren","year":"2015","journal-title":"Pattern Recognit."},{"key":"ref_90","doi-asserted-by":"crossref","unstructured":"Negrel, R., Picard, D., and Gosselin, P.H. (2014, January 18\u201320). Evaluation of second-order visual features for land-use classification. Proceedings of the 2014 12th International Workshop on Content-Based Multimedia Indexing (CBMI 2014), Klagenfurt, Austria.","DOI":"10.1109\/CBMI.2014.6849835"},{"key":"ref_91","doi-asserted-by":"crossref","unstructured":"Huang, L., Chen, C., Li, W., and Du, Q. (2016). 
Remote sensing image scene classification using multi-scale completed local binary patterns and fisher vectors. Remote Sens., 8.","DOI":"10.3390\/rs8060483"},{"key":"ref_92","doi-asserted-by":"crossref","unstructured":"Ji, W., Li, X., and Lu, X. (2017). Bidirectional Adaptive Feature Fusion for Remote Sensing Scene Classification. Computer Vision, Proceedings of the Second CCF Chinese Conference, CCCV 2017, Tianjin, China, 11\u201314 October 2017, Springer.","DOI":"10.1007\/978-981-10-7302-1_40"},{"key":"ref_93","doi-asserted-by":"crossref","first-page":"2889","DOI":"10.1109\/JSTARS.2017.2683799","article-title":"Fusing local and global features for high-resolution scene classification","volume":"10","author":"Bian","year":"2017","journal-title":"IEEE J. Sel. Top. Appl. Earth Observ. Remote Sens."},{"key":"ref_94","doi-asserted-by":"crossref","first-page":"345","DOI":"10.1109\/LGRS.2017.2787421","article-title":"A massively parallel deep rule-based ensemble classifier for remote sensing scenes","volume":"15","author":"Gu","year":"2018","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_95","doi-asserted-by":"crossref","first-page":"2689","DOI":"10.1109\/TGRS.2017.2781712","article-title":"Scene Classification Based on the Sparse Homogeneous\u2013Heterogeneous Topic Feature Model","volume":"56","author":"Zhu","year":"2018","journal-title":"IEEE Trans. Geosci. 
Remote Sens."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/10\/7\/1158\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T15:13:43Z","timestamp":1760195623000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/10\/7\/1158"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,7,23]]},"references-count":95,"journal-issue":{"issue":"7","published-online":{"date-parts":[[2018,7]]}},"alternative-id":["rs10071158"],"URL":"https:\/\/doi.org\/10.3390\/rs10071158","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2018,7,23]]}}}