{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,5]],"date-time":"2026-03-05T03:30:15Z","timestamp":1772681415928,"version":"3.50.1"},"reference-count":63,"publisher":"MDPI AG","issue":"2","license":[{"start":{"date-parts":[[2023,1,5]],"date-time":"2023-01-05T00:00:00Z","timestamp":1672876800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"the National Natural Science Foundation of China","award":["42261068"],"award-info":[{"award-number":["42261068"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Remote sensing scene classification is quite important in earth observation and other fields. Previous research has found that most of the existing models are based on deep learning models. However, the classification accuracy of the deep learning model is difficult to break through due to the challenges of difficulty distinguishing the socio-economic attributes of scenes, high interclass similarity, and large intraclass differences. To tackle the challenges, we propose a novel scene classification model that integrates heterogeneous features of multi-source data. Firstly, a multi-granularity feature learning module is designed, which can conduct uniform grid sampling of images to learn multi-granularity features. In this module, in addition to the features of our previous research, we also supplemented the socio-economic semantic features of the scene, and attention-based pooling is introduced to achieve different levels of representation of images. Then, to reduce the dimension of the feature, we adopt the feature-level fusion method. Next, the maxout-based module is designed to fuse the features of different granularity and extract the most distinguishing second-order latent ontology essence features. The weighted adaptive fusion method is used to fuse all the features. Finally, the Lie Group Fisher algorithm is used for scene classification. Extensive experimentation and evaluations show that our proposed model can find better solutions to the above challenges.<\/jats:p>","DOI":"10.3390\/rs15020325","type":"journal-article","created":{"date-parts":[[2023,1,5]],"date-time":"2023-01-05T06:11:51Z","timestamp":1672899111000},"page":"325","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":14,"title":["Scene Classification Based on Heterogeneous Features of Multi-Source Data"],"prefix":"10.3390","volume":"15","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-2052-7419","authenticated-orcid":false,"given":"Chengjun","family":"Xu","sequence":"first","affiliation":[{"name":"School of Software, Jiangxi Normal University, Nanchang 330022, China"},{"name":"School of Remote Sensing and Information Engineering, Wuhan University, Wuhan 430079, China"}]},{"given":"Jingqian","family":"Shu","sequence":"additional","affiliation":[{"name":"School of Software, Jiangxi Normal University, Nanchang 330022, China"}]},{"given":"Guobin","family":"Zhu","sequence":"additional","affiliation":[{"name":"School of Remote Sensing and Information Engineering, Wuhan University, Wuhan 430079, China"}]}],"member":"1968","published-online":{"date-parts":[[2023,1,5]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"166","DOI":"10.1016\/j.isprsjprs.2019.04.015","article-title":"Deep learning in remote sensing applications: A meta-analysis and review","volume":"152","author":"Ma","year":"2019","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"4928","DOI":"10.1109\/TGRS.2011.2151866","article-title":"Segment optimization and data-driven thresholding for knowledge-based landslide detection by object-based image analysis","volume":"49","author":"Martha","year":"2011","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"296","DOI":"10.1016\/j.isprsjprs.2019.11.023","article-title":"Object detection in optical remote sensing images: A survey and a new benchmark","volume":"159","author":"Li","year":"2020","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"8775","DOI":"10.1109\/TGRS.2019.2922908","article-title":"A multi-level semantic scene interpretation strategy for change interpretation in remote sensing imagery","volume":"57","author":"Ghazouani","year":"2019","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"1675","DOI":"10.1080\/13658816.2017.1324976","article-title":"Classifying urban land use by integrating remote sensing and social media data","volume":"31","author":"Liu","year":"2017","journal-title":"Int. J. Geogr. Inf. Sci."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"111838","DOI":"10.1016\/j.rse.2020.111838","article-title":"Open-source data-driven urban land-use mapping integrating point-line-polygon semantic objects: A case study of Chinese cities","volume":"247","author":"Zhong","year":"2020","journal-title":"Remote Sens. Environ."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"82","DOI":"10.1109\/TGRS.2019.2931801","article-title":"Remote sensing scene classification by gated bidirectional network","volume":"58","author":"Sun","year":"2020","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Xu, C., Zhu, G., and Shu, J. (2022). A Combination of Lie Group Machine Learning and Deep Learning for Remote Sensing Scene Classification Using Multi-Layer Heterogeneous Feature Extraction and Fusion. Remote Sens., 14.","DOI":"10.3390\/rs14061445"},{"key":"ref_9","first-page":"1","article-title":"A Lightweight and Robust Lie Group-Convolutional Neural Networks Joint Representation for Remote Sensing Scene Classification","volume":"60","author":"Xu","year":"2022","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_10","first-page":"796","article-title":"Robust Joint Representation of Intrinsic Mean and Kernel Function of Lie Group for Remote Sensing Scene Classification","volume":"118","author":"Xu","year":"2020","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"1741","DOI":"10.1109\/LGRS.2020.3007775","article-title":"A Lightweight Intrinsic Mean for Remote Sensing Classification With Lie Group Kernel Function","volume":"18","author":"Xu","year":"2020","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"2461","DOI":"10.1080\/01431161.2022.2061318","article-title":"Lie Group spatial attention mechanism model for remote sensing scene classification","volume":"43","author":"Xu","year":"2022","journal-title":"Int. J. Remote Sens."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"2395","DOI":"10.1080\/01431161.2011.608740","article-title":"High-resolution satellite scene classification using a sparse coding based multiple feature combination","volume":"33","author":"Sheng","year":"2011","journal-title":"Int. J. Remote Sens."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1007\/BF00130487","article-title":"Color indexing","volume":"7","author":"Swain","year":"1991","journal-title":"Int. J. Comput. Vis."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"91","DOI":"10.1023\/B:VISI.0000029664.99615.94","article-title":"Distinctive image features from scale-invariant keypoints","volume":"60","author":"Lowe","year":"2004","journal-title":"Int. J. Comput. Vis."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"75","DOI":"10.1007\/s11760-014-0704-x","article-title":"Block-based semantic classification of high-resolution multispectral aerial images","volume":"10","year":"2016","journal-title":"Signal Image Video Process."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"4775","DOI":"10.1109\/TGRS.2017.2700322","article-title":"Deep feature fusion for VHR remote sensing scene classification","volume":"55","author":"Chaib","year":"2017","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_18","first-page":"993","article-title":"Latent dirichlet allocation","volume":"3","author":"Blei","year":"2003","journal-title":"J. Mach. Learn. Res."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"177","DOI":"10.1023\/A:1007617005950","article-title":"Unsupervised learning by probabilistic latent semantic analysis","volume":"42","author":"Hofmann","year":"2001","journal-title":"Mach. Learn."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"6916","DOI":"10.1109\/TGRS.2019.2909695","article-title":"Scale-free convolutional neural network for remote sensing scene classification","volume":"57","author":"Xie","year":"2019","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Peng, F., Lu, W., Tan, W., Qi, K., Zhang, X., and Zhu, Q. (2022). Multi-Output Network Combining GNN and CNN for Remote Sensing Scene Classification. Remote Sens., 14.","DOI":"10.3390\/rs14061478"},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"112916","DOI":"10.1016\/j.rse.2022.112916","article-title":"Knowledge-guided land pattern depiction for urban land use mapping: A case study of Chinese cities","volume":"272","author":"Zhu","year":"2022","journal-title":"Remote Sens. Environ."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"1647","DOI":"10.1109\/LGRS.2019.2949253","article-title":"Combining multilevel features for remote sensing image scene classification with attention model","volume":"17","author":"Ji","year":"2020","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_24","first-page":"1723","article-title":"A new feature fusion method for hyperspectral image classification","volume":"17","author":"Marandi","year":"2017","journal-title":"Proc. Iran. Conf. Electr. Eng. (ICEE)"},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Jia, S., and Xian, J. (2018, January 22\u201327). Multi-feature-based decision fusion framework for hyperspectral imagery classification. Proceedings of the IGARSS 2018\u20142018 IEEE International Geoscience and Remote Sensing Symposium, Valencia, Spain.","DOI":"10.1109\/IGARSS.2018.8518355"},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"118472","DOI":"10.1109\/ACCESS.2019.2936295","article-title":"Fusion High-and-Low-Level Features via Ridgelet and Convolutional Neural Networks for Very High-Resolution Remote Sensing Imagery Classification","volume":"7","author":"Zheng","year":"2019","journal-title":"IEEE Access"},{"key":"ref_27","first-page":"1","article-title":"Cohesion Intensive Hash Code Book Co-construction for Efficiently Localizing Sketch Depicted Scenes","volume":"60","author":"Fang","year":"2022","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1109\/TGRS.2022.3231215","article-title":"Multisensor Fusion and Explicit Semantic Preserving-Based Deep Hashing for Cross-Modal Remote Sensing Image Retrieval","volume":"60","author":"Sun","year":"2022","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_29","first-page":"1245","article-title":"An introduction to cognitive linguistics","volume":"17","author":"Ungerer","year":"2006","journal-title":"J. Chengdu Coll. Educ."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"2520","DOI":"10.1109\/TGRS.2020.3001401","article-title":"RSNet: The search for remote sensing deep neural networks in recognition tasks","volume":"59","author":"Wang","year":"2020","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"76038","DOI":"10.1109\/ACCESS.2021.3081922","article-title":"MGFN: A Multi-Granularity Fusion Convolutional Neural Network for Remote Sensing Scene Classification","volume":"9","author":"Zeng","year":"2021","journal-title":"IEEE Access"},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"3965","DOI":"10.1109\/TGRS.2017.2685945","article-title":"AID: A benchmark data set for performance evaluation of aerial scene classification","volume":"55","author":"Xia","year":"2017","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_33","unstructured":"Fei-Fei, L., and Perona, P. (2005, January 20\u201325). A Bayesian hierarchical model for learning natural scene categories. Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR\u201905), San Diego, CA, USA."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Soliman, A., Soltani, K., Yin, J., Padmanabhan, A., and Wang, S. (2017). Social sensing of urban land use based on analysis of twitter users\u2019 mobility patterns. PLoS ONE, 14.","DOI":"10.1371\/journal.pone.0181657"},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"643","DOI":"10.1080\/02693799608902102","article-title":"A spatial data model design for feature-based geographical information systems","volume":"10","author":"Tang","year":"1996","journal-title":"Int. J. Geogr. Inf. Syst."},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"825","DOI":"10.1080\/13658816.2016.1244608","article-title":"Sensing spatial distribution of urban land use by integrating points-of-interest and Google Word2Vec model","volume":"31","author":"Yao","year":"2017","journal-title":"Int. J. Geogr. Inf. Sci."},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Fonte, C.C., Minghini, M., Patriarca, J., Antoniou, V., See, L., and Skopeliti, A. (2017). Generating up-to-date and detailed land use and land cover maps using openstreetmap and GlobeLand30. ISPRS Int. J. Geo-Inform., 6.","DOI":"10.3390\/ijgi6040125"},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Chen, C., Du, Z., Zhu, D., Zhang, C., and Yang, J. (2016, January 18\u201320). Land use classification in construction areas based on volunteered geographic information. Proceedings of the International Conference on Agro-Geoinformatics, Tianjin, China.","DOI":"10.1109\/Agro-Geoinformatics.2016.7577633"},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Sandler, M., Howard, A., Zhu, M., Zhmoginov, A., and Chen, L.C. (2018, January 15\u201317). Mobilenetv2: Inverted residuals and linear bottlenecks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, New York, NY, USA.","DOI":"10.1109\/CVPR.2018.00474"},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"5653","DOI":"10.1109\/TGRS.2017.2711275","article-title":"Integrating multilayer features of convolutional neural networks for remote sensing scene classification","volume":"55","author":"Li","year":"2017","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"74","DOI":"10.1016\/j.isprsjprs.2018.01.023","article-title":"Binary patterns encoded convolutional neural networks for texture recognition and remote sensing scene classification","volume":"138","author":"Anwer","year":"2018","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_42","doi-asserted-by":"crossref","first-page":"217628","DOI":"10.1109\/ACCESS.2020.3042501","article-title":"Remote Sensing Scene Classification Using Heterogeneous Feature Extraction and Multi-Level Fusion","volume":"8","author":"Wang","year":"2020","journal-title":"IEEE Access"},{"key":"ref_43","doi-asserted-by":"crossref","first-page":"2015","DOI":"10.1109\/JSTARS.2015.2444405","article-title":"Unsupervised feature learning via spectral clustering of multidimensional patches for remotely sensed scene classification","volume":"8","author":"Hu","year":"2015","journal-title":"IEEE J. Sel. Top. Appl. Earth Observ. Remote Sens."},{"key":"ref_44","doi-asserted-by":"crossref","first-page":"1017","DOI":"10.1109\/TCYB.2016.2536638","article-title":"Stacked convolutional denoising auto-encoders for feature representation","volume":"47","author":"Du","year":"2017","journal-title":"IEEE Trans. Cybern."},{"key":"ref_45","unstructured":"Baker, A. (2012). Matrix Groups: An Introduction to Lie Group Theory, Springer Science & Business Media."},{"key":"ref_46","doi-asserted-by":"crossref","unstructured":"Yang, Y., and Newsam, S. (2010, January 2\u20135). Bag-of-visual-words and spatial extensions for land-use classification. Proceedings of the 18th SIGSPATIAL International Conference on Advances in Geographic Information Systems, San Jose, CA, USA.","DOI":"10.1145\/1869790.1869829"},{"key":"ref_47","doi-asserted-by":"crossref","first-page":"1865","DOI":"10.1109\/JPROC.2017.2675998","article-title":"Remote sensing image scene classification: Benchmark and state of the art","volume":"105","author":"Cheng","year":"2017","journal-title":"Proc. IEEE"},{"key":"ref_48","unstructured":"Hensman, P., and Masko, D. (2015). The Impact of Imbalanced Training Data for Convolutional Neural Networks, Degree Project in Computer Science; KTH Royal Institute of Technology."},{"key":"ref_49","unstructured":"Dalal, N., and Triggs, B. (2005, January 20\u201325). Histograms of oriented gradients for human detection. Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR\u201905), San Diego, CA, USA."},{"key":"ref_50","doi-asserted-by":"crossref","first-page":"436","DOI":"10.1038\/nature14539","article-title":"Deep learning","volume":"521","author":"LeCun","year":"2015","journal-title":"Nature"},{"key":"ref_51","doi-asserted-by":"crossref","first-page":"2811","DOI":"10.1109\/TGRS.2017.2783902","article-title":"When deep learning meets metric learning: Remote sensing image scene classification via learning discriminative CNNs","volume":"56","author":"Cheng","year":"2018","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_52","first-page":"1","article-title":"A Supervised Progressive Growing Generative Adversarial Network for Remote Sensing Image Scene Classification","volume":"60","author":"Ma","year":"2022","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_53","doi-asserted-by":"crossref","first-page":"18195","DOI":"10.1109\/ACCESS.2021.3052977","article-title":"A Multi-Level Convolution Pyramid Semantic Fusion Framework for High-Resolution Remote Sensing Image Scene Classification and Annotation","volume":"9","author":"Sun","year":"2021","journal-title":"IEEE Access"},{"key":"ref_54","first-page":"1","article-title":"A Two-Stage Adaptation Network (TSAN) for Remote Sensing Scene Classification in Single-Source-Mixed-Multiple-Target Domain Adaptation (S\u00b2M\u00b2T DA) Scenarios","volume":"60","author":"Zheng","year":"2021","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_55","doi-asserted-by":"crossref","first-page":"2636","DOI":"10.1109\/TNNLS.2020.3007412","article-title":"C-CNN: Contourlet convolutional neural networks","volume":"32","author":"Liu","year":"2020","journal-title":"IEEE Trans. Neural Netw. Learn. Syst."},{"key":"ref_56","first-page":"1603","article-title":"APDC-Net: Attention pooling-based convolutional network for aerial scene classification","volume":"9","author":"Bi","year":"2019","journal-title":"Remote Sens. Lett."},{"key":"ref_57","doi-asserted-by":"crossref","first-page":"1986","DOI":"10.1109\/JSTARS.2020.2988477","article-title":"Classification of high spatial resolution remote sensing scenes methodusing transfer learning and deep convolutional neural network","volume":"13","author":"Li","year":"2020","journal-title":"IEEE J. Sel. Top. Appl. Earth Observ. Remote Sens."},{"key":"ref_58","doi-asserted-by":"crossref","unstructured":"Aral, R.A., Keskin, \u015e.R., Kaya, M., and Hac\u0131\u00f6mero\u011flu, M. (2018, January 10\u201313). Classification of trashnet dataset based on deep learning models. Proceedings of the 2018 IEEE International Conference on Big Data (Big Data), Seattle, WA, USA.","DOI":"10.1109\/BigData.2018.8622212"},{"key":"ref_59","doi-asserted-by":"crossref","first-page":"119951","DOI":"10.1109\/ACCESS.2020.3005450","article-title":"A New Image Recognition and Classification Method Combining Transfer Learning Algorithm and MobileNet Model for Welding Defects","volume":"8","author":"Pan","year":"2020","journal-title":"IEEE Access"},{"key":"ref_60","doi-asserted-by":"crossref","first-page":"136668","DOI":"10.1109\/ACCESS.2020.3005044","article-title":"Automatic Detection and Monitoring of Diabetic Retinopathy using Efficient Convolutional Neural Networks and Contrast Limited Adaptive Histogram Equalization","volume":"8","author":"Pour","year":"2020","journal-title":"IEEE Access"},{"key":"ref_61","doi-asserted-by":"crossref","first-page":"1986","DOI":"10.1155\/2018\/8639367","article-title":"A two-stream deep fusion framework for high-resolution aerial scene classification","volume":"2018","author":"Yu","year":"2018","journal-title":"Comput. Intell. Neurosci."},{"key":"ref_62","doi-asserted-by":"crossref","first-page":"2636","DOI":"10.1109\/JSTARS.2019.2919317","article-title":"A Lightweight and Discriminative Model for Remote Sensing Scene Classification With Multidilation Pooling Module","volume":"12","author":"Zhang","year":"2019","journal-title":"IEEE J. Sel. Top. Appl. Earth Observ. Remote Sens."},{"key":"ref_63","doi-asserted-by":"crossref","first-page":"183","DOI":"10.1109\/LGRS.2017.2779469","article-title":"Scene classification based on two-stage deep feature fusion","volume":"15","author":"Liu","year":"2018","journal-title":"IEEE Geosci. Remote Sens. Lett."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/15\/2\/325\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T18:00:11Z","timestamp":1760119211000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/15\/2\/325"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,1,5]]},"references-count":63,"journal-issue":{"issue":"2","published-online":{"date-parts":[[2023,1]]}},"alternative-id":["rs15020325"],"URL":"https:\/\/doi.org\/10.3390\/rs15020325","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,1,5]]}}}