{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,6]],"date-time":"2026-04-06T09:14:17Z","timestamp":1775466857782,"version":"3.50.1"},"reference-count":39,"publisher":"MDPI AG","issue":"24","license":[{"start":{"date-parts":[[2019,12,17]],"date-time":"2019-12-17T00:00:00Z","timestamp":1576540800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Funda\u00e7\u00e3o para a Ci\u00eancia e Tecnologia","award":["PCIF\/SSI\/0102\/2017"],"award-info":[{"award-number":["PCIF\/SSI\/0102\/2017"]}]},{"name":"Funda\u00e7\u00e3o para a Ci\u00eancia e Tecnologia","award":["DSAIPA\/AI\/0100\/2018 -  IPSTERS"],"award-info":[{"award-number":["DSAIPA\/AI\/0100\/2018 -  IPSTERS"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>The automatic production of land use\/land cover maps continues to be a challenging problem, with important impacts on the ability to promote sustainability and good resource management. The ability to build robust automatic classifiers and produce accurate maps can have a significant impact on the way we manage and optimize natural resources. The difficulty in achieving these results comes from many different factors, such as data quality and uncertainty. In this paper, we address the imbalanced learning problem, a common and difficult conundrum in remote sensing that affects the quality of classification results, by proposing Geometric-SMOTE, a novel oversampling method, as a tool for addressing the imbalanced learning problem in remote sensing. Geometric-SMOTE is a sophisticated oversampling algorithm which increases the quality of the instances generated in previous methods, such as the synthetic minority oversampling technique. The performance of Geometric- SMOTE, in the LUCAS (Land Use\/Cover Area Frame Survey) dataset, is compared to other oversamplers using a variety of classifiers. The results show that Geometric-SMOTE significantly outperforms all the other oversamplers and improves the robustness of the classifiers. These results indicate that, when using imbalanced datasets, remote sensing researchers should consider the use of these new generation oversamplers to increase the quality of the classification results.<\/jats:p>","DOI":"10.3390\/rs11243040","type":"journal-article","created":{"date-parts":[[2019,12,20]],"date-time":"2019-12-20T03:19:36Z","timestamp":1576811976000},"page":"3040","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":86,"title":["Imbalanced Learning in Land Cover Classification: Improving Minority Classes\u2019 Prediction Accuracy Using the Geometric SMOTE Algorithm"],"prefix":"10.3390","volume":"11","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-7019-3782","authenticated-orcid":false,"given":"Georgios","family":"Douzas","sequence":"first","affiliation":[{"name":"NOVA Information Management School (NOVA IMS), Campus de Campolide, Universidade Nova de Lisboa, 1070-312 Lisboa, Portugal"}]},{"given":"Fernando","family":"Bacao","sequence":"additional","affiliation":[{"name":"NOVA Information Management School (NOVA IMS), Campus de Campolide, Universidade Nova de Lisboa, 1070-312 Lisboa, Portugal"}]},{"given":"Joao","family":"Fonseca","sequence":"additional","affiliation":[{"name":"NOVA Information Management School (NOVA IMS), Campus de Campolide, Universidade Nova de Lisboa, 1070-312 Lisboa, Portugal"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9774-3190","authenticated-orcid":false,"given":"Manvel","family":"Khudinyan","sequence":"additional","affiliation":[{"name":"NOVA Information Management School (NOVA IMS), Campus de Campolide, Universidade Nova de Lisboa, 1070-312 Lisboa, Portugal"}]}],"member":"1968","published-online":{"date-parts":[[2019,12,17]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"155","DOI":"10.1016\/j.isprsjprs.2015.03.014","article-title":"Exploring issues of training data imbalance and mislabelling on random forest performance for large area land cover classification using the ensemble margin","volume":"105","author":"Mellor","year":"2015","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"89","DOI":"10.1016\/j.rse.2016.02.028","article-title":"A meta-analysis of remote sensing research on supervised pixel-based land-cover image classification processes: General guidelines for practitioners and future research","volume":"177","author":"Khatami","year":"2016","journal-title":"Remote Sens. Environ."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.rse.2015.01.006","article-title":"A critical synthesis of remotely sensed optical image change detection techniques","volume":"160","author":"Tewkesbury","year":"2015","journal-title":"Remote Sens. Environ."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"1231","DOI":"10.1109\/TGRS.2007.910220","article-title":"An Active Learning Approach to Hyperspectral Data Classification","volume":"46","author":"Rajan","year":"2008","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Feng, W., Huang, W., and Bao, W. (2019). Imbalanced Hyperspectral Image Classification With an Adaptive Ensemble Method Based on SMOTE and Rotation Forest With Differentiated Sampling Rates. IEEE Geosci. Remote Sens. Lett., 1\u20135.","DOI":"10.1109\/LGRS.2019.2913387"},{"key":"ref_6","unstructured":"Eurostat (2015). LUCAS 2015 (Land Use\/Cover Area Frame Survey), Eurostat. Technical Reference Document C1, Instructions for Surveyors."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"583","DOI":"10.1016\/j.rse.2018.12.001","article-title":"Mapping pan-European land cover using Landsat spectral-temporal metrics and the European LUCAS survey","volume":"221","author":"Pflugmacher","year":"2019","journal-title":"Remote Sens. Environ."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"244","DOI":"10.1080\/2150704X.2016.1249299","article-title":"A semi-automated approach for the generation of a new land use and land cover product for Germany based on Landsat time-series and Lucas in-situ data","volume":"8","author":"Mack","year":"2017","journal-title":"Remote Sens. Lett."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/1007730.1007733","article-title":"Editorial: Special issue on learning from imbalanced data sets","volume":"6","author":"Chawla","year":"2004","journal-title":"ACM SIGKDD Explor. Newsl."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"238","DOI":"10.1109\/TKDE.2015.2458858","article-title":"To Combat Multi-Class Imbalanced Problems by Means of Over-Sampling Techniques","volume":"28","author":"Abdi","year":"2016","journal-title":"IEEE Trans. Knowl. Data Eng."},{"key":"ref_11","first-page":"22","article-title":"Dynamic ensemble selection for multi-class imbalanced datasets","volume":"445\u2013446","author":"Zhang","year":"2018","journal-title":"Inf. Sci."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"164","DOI":"10.1016\/j.patcog.2016.03.012","article-title":"Analyzing the oversampling of different classes and types of examples in multi-class imbalanced datasets","volume":"57","author":"Krawczyk","year":"2016","journal-title":"Pattern Recognit."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"97","DOI":"10.1016\/j.knosys.2013.01.018","article-title":"Analysing the classification of imbalanced data-sets with multiple classes: Binarization techniques and ad-hoc approaches","volume":"42","author":"Galar","year":"2013","journal-title":"Knowl.-Based Syst."},{"key":"ref_14","unstructured":"Eurostat (2015). LUCAS 2015 (Land Use\/Cover Area Frame Survey), Eurostat. Technical Reference Document C3 Classification (Land cover and Land Use)."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"118","DOI":"10.1016\/j.ins.2019.06.007","article-title":"Geometric SMOTE a geometrically enhanced drop-in replacement for SMOTE","volume":"501","author":"Douzas","year":"2019","journal-title":"Inf. Sci."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"321","DOI":"10.1613\/jair.953","article-title":"SMOTE: Synthetic Minority Over-sampling Technique","volume":"16","author":"Chawla","year":"2002","journal-title":"J. Artif. Intell. Res."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Han, H., Wang, W.Y., and Mao, B.H. (2005, January 23\u201326). Borderline-SMOTE: A New Over-Sampling Method in Imbalanced Data Sets Learning. Proceedings of the International Conference on Intelligent Computing, Hefei, China.","DOI":"10.1007\/11538059_91"},{"key":"ref_18","unstructured":"He, H., Bai, Y., Garcia, E.A., and Li, S. (2008, January 1\u20138). ADASYN: Adaptive synthetic sampling approach for imbalanced learning. Proceedings of the 2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence), Hong Kong, China."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"973","DOI":"10.14358\/PERS.82.12.973","article-title":"Improved urban scene classification using full-waveform LiDAR","volume":"82","author":"Azadbakht","year":"2016","journal-title":"Photogramm. Eng. Remote Sens."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"221","DOI":"10.1007\/s13748-016-0094-0","article-title":"Learning from imbalanced data: Open challenges and future directions","volume":"5","author":"Krawczyk","year":"2016","journal-title":"Prog. Artif. Intell."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"2784","DOI":"10.1080\/01431161.2018.1433343","article-title":"Implementation of machine-learning classification in remote sensing: An applied review","volume":"39","author":"Maxwell","year":"2018","journal-title":"Int. J. Remote Sens."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Feng, W., Huang, W., Ye, H., and Zhao, L. (2018, January 22\u201327). Synthetic Minority Over-Sampling Technique Based Rotation Forest for the Classification of Unbalanced Hyperspectral Data. Proceedings of the IGARSS 2018-2018 IEEE International Geoscience and Remote Sensing Symposium, Valencia, Spain.","DOI":"10.1109\/IGARSS.2018.8518242"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Cenggoro, T.W., Isa, S.M., Kusuma, G.P., and Pardamean, B. (2017, January 2\u20134). Classification of imbalanced land-use\/land-cover data using variational semi-supervised learning. Proceedings of the 2017 International Conference on Innovative and Creative Information Technology: Computational Intelligence and IoT, ICITech 2017, Salatiga, Indonesia.","DOI":"10.1109\/INNOCIT.2017.8319149"},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"140","DOI":"10.1016\/j.apgeog.2015.12.006","article-title":"Integrating OpenStreetMap crowdsourced data and Landsat time-series imagery for rapid land use\/land cover (LULC) mapping: Case study of the Laguna de Bay area of the Philippines","volume":"67","author":"Johnson","year":"2016","journal-title":"Appl. Geogr."},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Bogner, C., Seo, B., Rohner, D., and Reineking, B. (2018). Classification of rare land cover types: Distinguishing annual and perennial crops in an agricultural catchment in South Korea. PLoS ONE, 13.","DOI":"10.1371\/journal.pone.0190476"},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Panda, A., Singh, A., Kumar, K., Kumar, A., and Swetapadma, A. (2018, January 20\u201321). Land Cover Prediction from Satellite Imagery Using Machine Learning Techniques. Proceedings of the 2018 Second International Conference on Inventive Communication and Computational Technologies (ICICCT), Coimbatore, India.","DOI":"10.1109\/ICICCT.2018.8473241"},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"40","DOI":"10.1016\/j.eswa.2017.03.073","article-title":"Self-Organizing Map Oversampling (SOMO) for imbalanced data set learning","volume":"82","author":"Douzas","year":"2017","journal-title":"Expert Syst. Appl."},{"key":"ref_28","unstructured":"Nguyen, H.M., Cooper, E.W., and Kamei, K. (2009, January 10\u201312). Borderline over-sampling for imbalanced data classification. Proceedings of the 5th International Workshop on Computational Intelligence & Applications (IWCIA2009), Hiroshima, Japan."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"245","DOI":"10.1007\/s10115-011-0465-6","article-title":"SMOTE-RSB*: A hybrid preprocessing approach based on oversampling and undersampling for high imbalanced data-sets using SMOTE and rough sets theory","volume":"33","author":"Ramentol","year":"2012","journal-title":"Knowl. Inf. Syst."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"606","DOI":"10.1016\/j.rse.2006.10.010","article-title":"Comparative assessment of the measures of thematic classification accuracy","volume":"107","author":"Liu","year":"2007","journal-title":"Remote Sens. Environ."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"122","DOI":"10.1016\/j.rse.2012.10.031","article-title":"Making better use of accuracy data in land change studies: Estimating accuracy and area and quantifying uncertainty using stratified estimation","volume":"129","author":"Olofsson","year":"2013","journal-title":"Remote Sens. Environ."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"1263","DOI":"10.1109\/TKDE.2008.239","article-title":"Learning from Imbalanced Data","volume":"21","author":"He","year":"2009","journal-title":"IEEE Trans. Knowl. Data Eng."},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"McCullagh, P., and Nelder, J. (1989). Generalized Linear Models, Chapman and Hall.","DOI":"10.1007\/978-1-4899-3242-6"},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"21","DOI":"10.1109\/TIT.1967.1053964","article-title":"Nearest neighbor pattern classification","volume":"13","author":"Cover","year":"1967","journal-title":"IEEE Trans. Inf. Theory"},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"235","DOI":"10.1007\/BF00993309","article-title":"C4.5: Programs for Machine Learning by J. Ross Quinlan. Morgan Kaufmann Publishers, Inc., 1993","volume":"16","author":"Salzberg","year":"1994","journal-title":"Mach. Learn."},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"1189","DOI":"10.1214\/aos\/1013203451","article-title":"Greedy function approximation: A gradient boosting machine","volume":"29","author":"Friedman","year":"2001","journal-title":"Ann. Stat."},{"key":"ref_37","first-page":"18","article-title":"Classification and regression by randomForest","volume":"2","author":"Liaw","year":"2002","journal-title":"R News"},{"key":"ref_38","first-page":"2825","article-title":"Scikit-learn: Machine Learning in Python","volume":"12","author":"Pedregosa","year":"2011","journal-title":"J. Mach. Learn. Res."},{"key":"ref_39","first-page":"1","article-title":"Imbalanced-learn: A Python Toolbox to Tackle the Curse of Imbalanced Datasets in Machine Learning","volume":"18","author":"Nogueira","year":"2017","journal-title":"J. Mach. Learn. Res."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/11\/24\/3040\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T13:42:54Z","timestamp":1760190174000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/11\/24\/3040"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,12,17]]},"references-count":39,"journal-issue":{"issue":"24","published-online":{"date-parts":[[2019,12]]}},"alternative-id":["rs11243040"],"URL":"https:\/\/doi.org\/10.3390\/rs11243040","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2019,12,17]]}}}