{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,12]],"date-time":"2025-10-12T02:16:56Z","timestamp":1760235416327,"version":"build-2065373602"},"reference-count":32,"publisher":"MDPI AG","issue":"16","license":[{"start":{"date-parts":[[2021,8,20]],"date-time":"2021-08-20T00:00:00Z","timestamp":1629417600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Remote sensing imagery, such as that provided by the United States Geological Survey (USGS) Landsat satellites, has been widely used to study environmental protection, hazard analysis, and urban planning for decades. Clouds are a constant challenge for such imagery and, if not handled correctly, can cause a variety of issues for a wide range of remote sensing analyses. Typically, cloud mask algorithms use the entire image; in this study we present an ensemble of different pixel-based approaches to cloud pixel modeling. Based on four training subsets with a selection of different input features, 12 machine learning models were created. We evaluated these models using the cropped LC8-Biome cloud validation dataset. As a comparison, Fmask was also applied to the cropped scene Biome dataset. One goal of this research is to explore a machine learning modeling approach that uses as small a training data sample as possible but still provides an accurate model. Overall, the model trained on the sample subset (1.3% of the total training samples) that includes unsupervised Self-Organizing Map classification results as an input feature has the best performance. The approach achieves 98.57% overall accuracy, 1.18% cloud omission error, and 0.93% cloud commission error on the 88 cropped test images. 
Compared with Fmask 4.0, this model improves the accuracy by 10.12% and reduces the cloud omission error by 6.39%. Furthermore, using an additional eight independent validation images that were not sampled in model training, the model trained on the second largest subset with an additional five features has the highest overall accuracy at 86.35%, with 12.48% cloud omission error and 7.96% cloud commission error. This model\u2019s overall correctness increased by 3.26%, and the cloud omission error decreased by 1.28% compared to Fmask 4.0. The machine learning cloud classification models discussed in this paper could achieve very good performance utilizing only a small portion of the total training pixels available. We showed that, for a pixel-based cloud classification model, because each scene has unique spectral characteristics, having a small portion of example pixels from each of the sub-regions in a scene can improve the model accuracy significantly.<\/jats:p>","DOI":"10.3390\/rs13163289","type":"journal-article","created":{"date-parts":[[2021,8,22]],"date-time":"2021-08-22T22:59:27Z","timestamp":1629673167000},"page":"3289","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":5,"title":["Cloud Detection Using an Ensemble of Pixel-Based Machine Learning Models Incorporating Unsupervised Classification"],"prefix":"10.3390","volume":"13","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-9605-1141","authenticated-orcid":false,"given":"Xiaohe","family":"Yu","sequence":"first","affiliation":[{"name":"Geospatial Information Sciences, The University of Texas at Dallas, Richardson, TX 75080, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4265-9543","authenticated-orcid":false,"given":"David J.","family":"Lary","sequence":"additional","affiliation":[{"name":"Hanson Center for Space Science, The University of Texas at Dallas, Richardson, TX 75080, 
USA"}]}],"member":"1968","published-online":{"date-parts":[[2021,8,20]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"3826","DOI":"10.1109\/TGRS.2012.2227333","article-title":"Spatial and Temporal Distribution of Clouds Observed by MODIS Onboard the Terra and Aqua Satellites","volume":"51","author":"King","year":"2013","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"1137","DOI":"10.14358\/PERS.72.10.1137","article-title":"Landsat-7 long-term acquisition plan","volume":"72","author":"Arvidson","year":"2006","journal-title":"Photogramm. Eng. Remote Sens."},{"key":"ref_3","first-page":"348","article-title":"Landsat 7 automatic cloud cover assessment. Algorithms for Multispectral, Hyperspectral, and Ultraspectral Imagery VI","volume":"4049","author":"Irish","year":"2000","journal-title":"Int. Soc. Opt. Photonics"},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"83","DOI":"10.1016\/j.rse.2011.10.028","article-title":"Object-based cloud and cloud shadow detection in Landsat imagery","volume":"118","author":"Zhu","year":"2012","journal-title":"Remote Sens. Environ."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"269","DOI":"10.1016\/j.rse.2014.12.014","article-title":"Improvement and expansion of the Fmask algorithm: Cloud, cloud shadow, and snow detection for Landsats 4\u20137, 8, and Sentinel 2 images","volume":"159","author":"Zhu","year":"2015","journal-title":"Remote Sens. Environ."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"379","DOI":"10.1016\/j.rse.2017.03.026","article-title":"Cloud detection algorithm comparison and validation for operational Landsat data products","volume":"194","author":"Foga","year":"2017","journal-title":"Remote Sens. 
Environ."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"4907","DOI":"10.3390\/rs6064907","article-title":"Automated detection of cloud and cloud shadow in single-date Landsat imagery using neural networks and spatial post-processing","volume":"6","author":"Hughes","year":"2014","journal-title":"Remote Sens."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"342","DOI":"10.1016\/j.rse.2017.01.026","article-title":"Multi-feature combined cloud and cloud shadow detection in GaoFen-1 wide field of view imagery","volume":"191","author":"Li","year":"2017","journal-title":"Remote Sens. Environ."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"217","DOI":"10.1016\/j.rse.2014.06.012","article-title":"Automated cloud, cloud shadow, and snow detection in multitemporal Landsat data: An algorithm designed specifically for monitoring land cover change","volume":"152","author":"Zhu","year":"2014","journal-title":"Remote Sens. Environ."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Candra, D.S., Phinn, S., and Scarth, P. (2019). Automated Cloud and Cloud-Shadow Masking for Landsat 8 Using Multitemporal Images in a Variety of Environments. Remote Sens., 11.","DOI":"10.3390\/rs11172060"},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"8","DOI":"10.1109\/MGRS.2017.2762307","article-title":"Deep learning in remote sensing: A comprehensive review and list of resources","volume":"5","author":"Zhu","year":"2017","journal-title":"IEEE Geosci. Remote Sens. Mag."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"3631","DOI":"10.1109\/JSTARS.2017.2686488","article-title":"Multilevel cloud detection in remote sensing images based on deep learning","volume":"10","author":"Xie","year":"2017","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. 
Remote Sens."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"4206","DOI":"10.1109\/JSTARS.2015.2438015","article-title":"Scene learning for cloud detection on remote-sensing images","volume":"8","author":"An","year":"2015","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"7264","DOI":"10.1109\/TGRS.2014.2310240","article-title":"Cloud detection of RGB color aerial photographs by progressive refinement scheme","volume":"52","author":"Zhang","year":"2014","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"149","DOI":"10.5194\/isprs-annals-IV-3-149-2018","article-title":"Cloud detection by fusing multi-scale convolutional features","volume":"4","author":"Li","year":"2018","journal-title":"ISPRS Ann. Photogramm. Remote Sens. Spat. Inf. Sci."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Long, J., Shelhamer, E., and Darrell, T. (2015, January 7\u201312). Fully convolutional networks for semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298965"},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"111446","DOI":"10.1016\/j.rse.2019.111446","article-title":"Cloud detection algorithm for multi-modal satellite imagery using convolutional neural-networks (CNN)","volume":"237","author":"Li","year":"2020","journal-title":"Remote Sens. Environ."},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Zi, Y., Xie, F., and Jiang, Z. (2018). A cloud detection method for Landsat 8 images based on PCANet. Remote Sens., 10.","DOI":"10.3390\/rs10060877"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"6195","DOI":"10.1109\/TGRS.2019.2904868","article-title":"CDnet: CNN-based cloud detection for remote sensing imagery","volume":"57","author":"Yang","year":"2019","journal-title":"IEEE Trans. Geosci. 
Remote Sens."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"700","DOI":"10.1109\/TGRS.2020.2991398","article-title":"CDNetv2: CNN-Based cloud detection for remote sensing imagery with cloud-snow coexistence","volume":"59","author":"Guo","year":"2020","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Shahraki, F.F., and Prasad, S. (2018, January 26\u201329). Graph convolutional neural networks for hyperspectral data classification. Proceedings of the 2018 IEEE Global Conference on Signal and Information Processing (GlobalSIP), Anaheim, CA, USA.","DOI":"10.1109\/GlobalSIP.2018.8645969"},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"241","DOI":"10.1109\/LGRS.2018.2869563","article-title":"Spectral\u2013spatial graph convolutional networks for semisupervised hyperspectral image classification","volume":"16","author":"Qin","year":"2018","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Pu, S., Wu, Y., Sun, X., and Sun, X. (2021). Hyperspectral Image Classification with Localized Graph Convolutional Filtering. Remote Sens., 13.","DOI":"10.3390\/rs13030526"},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"5966","DOI":"10.1109\/TGRS.2020.3015157","article-title":"Graph convolutional networks for hyperspectral image classification","volume":"59","author":"Hong","year":"2020","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_25","first-page":"453","article-title":"Automated detection and removal of clouds and their shadows from Landsat TM images","volume":"82","author":"Wang","year":"1999","journal-title":"IEICE Trans. Inf. Syst."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"251","DOI":"10.1007\/s40747-019-00128-0","article-title":"Cloud detection methodologies: Variants and development\u2014A review","volume":"6","author":"Mahajan","year":"2020","journal-title":"Complex Intell. 
Syst."},{"key":"ref_27","unstructured":"U.S. Geological Survey (2016). L8 Biome Cloud Validation Masks."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"4105","DOI":"10.1109\/TGRS.2007.905312","article-title":"Cloud-screening algorithm for ENVISAT\/MERIS multispectral images","volume":"45","author":"Guanter","year":"2007","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"173","DOI":"10.1016\/S0034-4257(02)00034-2","article-title":"An image transform to characterize and compensate for spatial variations in thin cloud contamination of Landsat images","volume":"82","author":"Zhang","year":"2002","journal-title":"Remote Sens. Environ."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"59","DOI":"10.1007\/BF00337288","article-title":"Self-organized formation of topologically correct feature maps","volume":"43","author":"Kohonen","year":"1982","journal-title":"Biol. Cybern."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"1140","DOI":"10.1109\/TGRS.2011.2164087","article-title":"Development of the Landsat data continuity mission cloud-cover assessment algorithms","volume":"50","author":"Scaramuzza","year":"2011","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_32","unstructured":"Kiviluoto, K. (1996, January 3\u20136). Topology preservation in self-organizing maps. 
Proceedings of the International Conference on Neural Networks (ICNN\u201996), Washington, DC, USA."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/13\/16\/3289\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T06:48:03Z","timestamp":1760165283000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/13\/16\/3289"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,8,20]]},"references-count":32,"journal-issue":{"issue":"16","published-online":{"date-parts":[[2021,8]]}},"alternative-id":["rs13163289"],"URL":"https:\/\/doi.org\/10.3390\/rs13163289","relation":{},"ISSN":["2072-4292"],"issn-type":[{"type":"electronic","value":"2072-4292"}],"subject":[],"published":{"date-parts":[[2021,8,20]]}}}