{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,2]],"date-time":"2026-04-02T11:31:17Z","timestamp":1775129477487,"version":"3.50.1"},"reference-count":37,"publisher":"MDPI AG","issue":"19","license":[{"start":{"date-parts":[[2023,10,7]],"date-time":"2023-10-07T00:00:00Z","timestamp":1696636800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"National Natural Science Foundation of PR China","award":["42075130"],"award-info":[{"award-number":["42075130"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Cloud and cloud shadow segmentation is one of the most critical challenges in remote sensing image processing. Because of susceptibility to factors such as disturbance from terrain features and noise, as well as a poor capacity to generalize, conventional deep learning networks, when directly used to cloud and cloud shade detection and division, have a tendency to lose fine features and spatial data, leading to coarse segmentation of cloud and cloud shadow borders, false detections, and omissions of targets. To address the aforementioned issues, a multi-scale strip feature attention network (MSFANet) is proposed. This approach uses Resnet18 as the backbone for obtaining semantic data at multiple levels. It incorporates a particular attention module that we name the deep-layer multi-scale pooling attention module (DMPA), aimed at extracting multi-scale contextual semantic data, deep channel feature information, and deep spatial feature information. Furthermore, a skip connection module named the boundary detail feature perception module (BDFP) is introduced to promote information interaction and fusion between adjacent layers of the backbone network. This module performs feature exploration on both the height and width dimensions of the characteristic pattern to enhance the recovery of boundary detail intelligence of the detection targets. Finally, during the decoding phase, a self-attention module named the cross-layer self-attention feature fusion module (CSFF) is employed to direct the aggregation of deeplayer semantic feature and shallow detail feature. This approach facilitates the extraction of feature information to the maximum extent while conducting image restoration. The experimental outcomes unequivocally prove the efficacy of our network in effectively addressing complex cloud-covered scenes, showcasing good performance across the cloud and cloud shadow datasets, the HRC_WHU dataset, and the SPARCS dataset. Our model outperforms existing methods in terms of segmentation accuracy, underscoring its paramount importance in the field of cloud recognition research.<\/jats:p>","DOI":"10.3390\/rs15194853","type":"journal-article","created":{"date-parts":[[2023,10,9]],"date-time":"2023-10-09T04:52:36Z","timestamp":1696827156000},"page":"4853","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":20,"title":["MSFANet: Multi-Scale Strip Feature Attention Network for Cloud and Cloud Shadow Segmentation"],"prefix":"10.3390","volume":"15","author":[{"given":"Kai","family":"Chen","sequence":"first","affiliation":[{"name":"Collaborative Innovation Center on Atmospheric Environment and Equipment Technology, Nanjing University of Information Science and Technology, Nanjing 210044, China"}]},{"given":"Xin","family":"Dai","sequence":"additional","affiliation":[{"name":"Collaborative Innovation Center on Atmospheric Environment and Equipment Technology, Nanjing University of Information Science and Technology, Nanjing 210044, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4681-9129","authenticated-orcid":false,"given":"Min","family":"Xia","sequence":"additional","affiliation":[{"name":"Collaborative Innovation Center on Atmospheric Environment and Equipment Technology, Nanjing University of Information Science and Technology, Nanjing 210044, China"}]},{"given":"Liguo","family":"Weng","sequence":"additional","affiliation":[{"name":"Collaborative Innovation Center on Atmospheric Environment and Equipment Technology, Nanjing University of Information Science and Technology, Nanjing 210044, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7181-9935","authenticated-orcid":false,"given":"Kai","family":"Hu","sequence":"additional","affiliation":[{"name":"Collaborative Innovation Center on Atmospheric Environment and Equipment Technology, Nanjing University of Information Science and Technology, Nanjing 210044, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3835-6075","authenticated-orcid":false,"given":"Haifeng","family":"Lin","sequence":"additional","affiliation":[{"name":"College of Information Science and Technology, Nanjing Forestry University, Nanjing 210037, China"}]}],"member":"1968","published-online":{"date-parts":[[2023,10,7]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"D19","DOI":"10.1029\/2003JD004457","article-title":"Calculation of radiative fluxes from the surface to top of atmosphere based on ISCCP and other global data sets: Refinements of the radiative transfer model and the input data","volume":"109","author":"Zhang","year":"2004","journal-title":"J. Geophys. Res. Atmos."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"203","DOI":"10.1016\/0169-8095(88)90027-0","article-title":"Cloud detection and analysis: A review of recent progress","volume":"21","author":"Goodman","year":"1988","journal-title":"Atmos. Res."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"257","DOI":"10.1002\/met.5060020309","article-title":"Pattern recognition techniques for the identification of cloud and cloud systems","volume":"2","author":"Pankiewicz","year":"1995","journal-title":"Meteorol. Appl."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"9113","DOI":"10.1080\/01431161.2018.1506183","article-title":"A new Landsat 8 cloud discrimination algorithm using thresholding tests","volume":"39","author":"Oishi","year":"2018","journal-title":"Int. J. Remote Sens."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"4167","DOI":"10.1016\/j.rse.2008.06.010","article-title":"Developing clear-sky, cloud and cloud shadow mask for producing clear-sky composites at 250-meter spatial resolution for the seven MODIS land bands over Canada and North America","volume":"112","author":"Luo","year":"2008","journal-title":"Remote Sens. Environ."},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Main-Knorn, M., Pflug, B., Louis, J., Debaecker, V., M\u00fcller-Wilm, U., and Gascon, F. (2017, January 11\u201313). Sen2Cor for sentinel-2. Proceedings of the Image and Signal Processing for Remote Sensing XXIII. International Society for Optics and Photonics, Warsaw, Poland.","DOI":"10.1117\/12.2278218"},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"1388","DOI":"10.1175\/2009JTECHA1198.1","article-title":"A geometry-based approach to identifying cloud shadows in the VIIRS cloud mask algorithm for NPOESS","volume":"26","author":"Hutchison","year":"2009","journal-title":"J. Atmos. Ocean. Technol."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"83","DOI":"10.1016\/j.rse.2011.10.028","article-title":"Object-based cloud and cloud shadow detection in Landsat imagery","volume":"118","author":"Zhu","year":"2012","journal-title":"Remote Sens. Environ."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"3322","DOI":"10.1109\/TGRS.2017.2669341","article-title":"Automatic road detection and centerline extraction via cascaded end-to-end convolutional neural network","volume":"55","author":"Cheng","year":"2017","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Ji, H., Xia, M., Zhang, D., and Lin, H. (2023). Multi-Supervised Feature Fusion Attention Network for Clouds and Shadows Detection. ISPRS Int. J. Geo-Inf., 12.","DOI":"10.3390\/ijgi12060247"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Long, J., Shelhamer, E., and Darrell, T. (2015, January 7\u201312). fully convolutional networks for semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298965"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Ronneberger, O., Fischer, P., and Brox, T. (2015, January 5\u20139). U-net: Convolutional networks for biomedical image segmentation. Proceedings of the International Conference on Medical image Computing and Computer-Assisted Intervention, Munich, Germany.","DOI":"10.1007\/978-3-319-24574-4_28"},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Zhao, H., Shi, J., Qi, X., Wang, X., and Jia, J. (2017, January 21\u201326). Pyramid scene parsing network. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.660"},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Chen, L.C., Zhu, Y., Papandreou, G., Schroff, F., and Adam, H. (2018, January 8\u201314). Encoder-decoder with atrous separable convolution for semantic image segmentation. Proceedings of the European conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01234-2_49"},{"key":"ref_15","first-page":"1","article-title":"Multilevel deformable attention-aggregated networks for change detection in bitemporal remote sensing imagery","volume":"60","author":"Zhang","year":"2022","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Hou, Q., Zhang, L., Cheng, M.M., and Feng, J. (2020, January 13\u201319). Strip pooling: Rethinking spatial pooling for scene parsing. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.00406"},{"key":"ref_18","unstructured":"Li, H., Xiong, P., An, J., and Wang, L. (2018). Pyramid attention network for semantic segmentation. arXiv."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"173","DOI":"10.1016\/S0034-4257(02)00034-2","article-title":"An image transform to characterize and compensate for spatial variations in thin cloud contamination of Landsat images","volume":"82","author":"Zhang","year":"2002","journal-title":"Remote Sens. Environ."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Wang, X., Girshick, R., Gupta, A., and He, K. (2018, January 18\u201322). Non-local neural networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00813"},{"key":"ref_21","unstructured":"Li, Z., Shen, H., Cheng, Q., Liu, Y., You, S., and He, Z. (2018). Deep learning based cloud detection for remote sensing images by the fusion of multi-scale convolutional features. arXiv."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"4907","DOI":"10.3390\/rs6064907","article-title":"Automated detection of cloud and cloud shadow in single-date Landsat imagery using neural networks and spatial post-processing","volume":"6","author":"Hughes","year":"2014","journal-title":"Remote Sens."},{"key":"ref_23","unstructured":"Hughes, M. (2016). L8 SPARCS Cloud Validation Masks."},{"key":"ref_24","unstructured":"Paszke, A., Gross, S., Chintala, S., Chanan, G., Yang, E., DeVito, Z., Lin, Z., Desmaison, A., Antiga, L., and Lerer, A. (2023, August 14). Automatic Differentiation in Pytorch. Available online: https:\/\/openreview.net\/forum?id=BJJsrmfCZ."},{"key":"ref_25","unstructured":"Kingma, D.P., and Ba, J. (2014). Adam: A method for stochastic optimization. arXiv."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"104940","DOI":"10.1016\/j.cageo.2021.104940","article-title":"Strip pooling channel spatial attention network for the segmentation of cloud and cloud shadow","volume":"157","author":"Qu","year":"2021","journal-title":"Comput. Geosci."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"046512","DOI":"10.1117\/1.JRS.15.046512","article-title":"PANDA: Parallel asymmetric network with double attention for cloud and its shadow detection","volume":"15","author":"Xia","year":"2021","journal-title":"J. Appl. Remote Sens."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"4809","DOI":"10.1109\/JSTARS.2022.3181303","article-title":"LCDNet: Light-Weighted Cloud Detection Network for High-Resolution Remote Sensing Images","volume":"15","author":"Hu","year":"2022","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"2022","DOI":"10.1080\/01431161.2020.1849852","article-title":"Cloud\/shadow segmentation based on global attention feature fusion residual network for remote sensing imagery","volume":"42","author":"Xia","year":"2021","journal-title":"Int. J. Remote Sens."},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Liu, Z., Lin, Y., Cao, Y., Hu, H., Wei, Y., Zhang, Z., Lin, S., and Guo, B. (2021, January 11\u201317). Swin transformer: Hierarchical vision transformer using shifted windows. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Montreal, BC, Canada.","DOI":"10.1109\/ICCV48922.2021.00986"},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Yang, M., Yu, K., Zhang, C., Li, Z., and Yang, K. (2018, January 18\u201322). Denseaspp for semantic segmentation in street scenes. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00388"},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"5917","DOI":"10.1080\/01431161.2021.2022805","article-title":"SGBNet: An ultra light-weight network for real-time semantic segmentation of land cover","volume":"43","author":"Pang","year":"2022","journal-title":"Int. J. Remote Sens."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"3051","DOI":"10.1007\/s11263-021-01515-2","article-title":"Bisenet v2: Bilateral network with guided aggregation for real-time semantic segmentation","volume":"129","author":"Yu","year":"2021","journal-title":"Int. J. Comput. Vis."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Wang, W., Xie, E., Li, X., Fan, D.P., Song, K., Liang, D., Lu, T., Luo, P., and Shao, L. (2021, January 11\u201317). Pyramid vision transformer: A versatile backbone for dense prediction without convolutions. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Montreal, BC, Canada.","DOI":"10.1109\/ICCV48922.2021.00061"},{"key":"ref_35","unstructured":"Paszke, A., Chaurasia, A., Kim, S., and Culurciello, E. (2016). Enet: A deep neural network architecture for real-time semantic segmentation. arXiv."},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"270","DOI":"10.1109\/MGRS.2022.3145854","article-title":"Artificial intelligence for remote sensing data analysis: A review of challenges and opportunities","volume":"10","author":"Zhang","year":"2022","journal-title":"IEEE Geosci. Remote Sens. Mag."},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"133","DOI":"10.1111\/j.0033-0124.1986.00133.x","article-title":"Applications of artificial intelligence techniques to remote sensing","volume":"38","author":"Estes","year":"1986","journal-title":"Prof. Geogr."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/15\/19\/4853\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T21:02:34Z","timestamp":1760130154000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/15\/19\/4853"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,10,7]]},"references-count":37,"journal-issue":{"issue":"19","published-online":{"date-parts":[[2023,10]]}},"alternative-id":["rs15194853"],"URL":"https:\/\/doi.org\/10.3390\/rs15194853","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,10,7]]}}}