{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,19]],"date-time":"2026-04-19T15:52:47Z","timestamp":1776613967103,"version":"3.51.2"},"reference-count":47,"publisher":"MDPI AG","issue":"1","license":[{"start":{"date-parts":[[2022,1,3]],"date-time":"2022-01-03T00:00:00Z","timestamp":1641168000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["42075130"],"award-info":[{"award-number":["42075130"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Water area segmentation is an important branch of remote sensing image segmentation, but in reality, most water area images have complex and diverse backgrounds. Traditional detection methods cannot accurately identify small tributaries due to incomplete mining and insufficient utilization of semantic information, and the edge information of segmentation is rough. To solve the above problems, we propose a multi-scale feature aggregation network. In order to improve the ability of the network to process boundary information, we design a deep feature extraction module using a multi-scale pyramid to extract features, combined with the designed attention mechanism and strip convolution, extraction of multi-scale deep semantic information and enhancement of spatial and location information. Then, the multi-branch aggregation module is used to interact with different scale features to enhance the positioning information of the pixels. Finally, the two high-performance branches designed in the Feature Fusion Upsample module are used to deeply extract the semantic information of the image, and the deep information is fused with the shallow information generated by the multi-branch module to improve the ability of the network. Global and local features are used to determine the location distribution of each image category. The experimental results show that the accuracy of the segmentation method in this paper is better than that in the previous detection methods, and has important practical significance for the actual water area segmentation.<\/jats:p>","DOI":"10.3390\/rs14010206","type":"journal-article","created":{"date-parts":[[2022,1,9]],"date-time":"2022-01-09T23:06:15Z","timestamp":1641769575000},"page":"206","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":57,"title":["Multi-Scale Feature Aggregation Network for Water Area Segmentation"],"prefix":"10.3390","volume":"14","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-7181-9935","authenticated-orcid":false,"given":"Kai","family":"Hu","sequence":"first","affiliation":[{"name":"Collaborative Innovation Center on Atmospheric Environment and Equipment Technology, B-DAT, Nanjing University of Information Science and Technology, Nanjing 210044, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3156-9836","authenticated-orcid":false,"given":"Meng","family":"Li","sequence":"additional","affiliation":[{"name":"Collaborative Innovation Center on Atmospheric Environment and Equipment Technology, B-DAT, Nanjing University of Information Science and Technology, Nanjing 210044, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4681-9129","authenticated-orcid":false,"given":"Min","family":"Xia","sequence":"additional","affiliation":[{"name":"Collaborative Innovation Center on Atmospheric Environment and Equipment Technology, B-DAT, Nanjing University of Information Science and Technology, Nanjing 210044, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Haifeng","family":"Lin","sequence":"additional","affiliation":[{"name":"College of Information Science and Technology, Nanjing Forestry University, Nanjing 210037, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2022,1,3]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"104805","DOI":"10.1016\/j.cageo.2021.104805","article-title":"DeepRivWidth: Deep learning based semantic segmentation approach for river identification and width measurement in SAR images of Coastal Karnataka","volume":"154","author":"Verma","year":"2021","journal-title":"Comput. Geosci."},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Zhu, L., Zhang, J.Q., and Pa, L. (2006, January 20\u201324). River change detection based on remote sensing image and vector. Proceedings of the First International Multi-Symposiums on Computer and Computational Sciences (IMSCCS\u201906), Hangzhou, China.","DOI":"10.1109\/IMSCCS.2006.121"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"3485","DOI":"10.1080\/01431161003749477","article-title":"River detection algorithm in SAR images based on edge extraction and ridge tracing techniques","volume":"32","author":"Sun","year":"2011","journal-title":"Int. J. Remote Sens."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"1425","DOI":"10.1080\/01431169608948714","article-title":"The use of the Normalized Difference Water Index (NDWI) in the delineation of open water features","volume":"17","author":"McFeeters","year":"1996","journal-title":"Int. J. Remote Sens."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"4155","DOI":"10.1080\/01431161.2010.484821","article-title":"Texture information-based hybrid methodology for the segmentation of SAR images","volume":"32","author":"Singh","year":"2011","journal-title":"Int. J. Remote Sens."},{"key":"ref_6","first-page":"175","article-title":"Coastline extraction using support vector machine from remote sensing image","volume":"8","author":"Zhang","year":"2013","journal-title":"J. Multim."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"504","DOI":"10.1126\/science.1127647","article-title":"Reducing the dimensionality of data with neural networks","volume":"313","author":"Hinton","year":"2006","journal-title":"Science"},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Long, J., Shelhamer, E., and Darrell, T. (2015, January 7\u201312). Fully convolutional networks for semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298965"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"2481","DOI":"10.1109\/TPAMI.2016.2644615","article-title":"Segnet: A deep convolutional encoder-decoder architecture for image segmentation","volume":"39","author":"Badrinarayanan","year":"2017","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Ronneberger, O., Fischer, P., and Brox, T. (2015). U-net: Convolutional networks for biomedical image segmentation. International Conference on Medical Image Computing and Computer-Assisted Intervention, Proceedings of the 18th International Conference, Munich, Germany, 5\u20139 October 2015, Springer.","DOI":"10.1007\/978-3-319-24574-4_28"},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"196","DOI":"10.1016\/j.eswa.2017.04.018","article-title":"River channel segmentation in polarimetric SAR images: Watershed transform combined with average contrast maximisation","volume":"82","author":"Ciecholewski","year":"2017","journal-title":"Expert Syst. Appl."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"1025","DOI":"10.1109\/JSTARS.2016.2609804","article-title":"River extraction from high-resolution SAR images combining a structural feature set and mathematical morphology","volume":"10","author":"Sghaier","year":"2017","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Zhao, H.S., Shi, J.P., Qi, X.J., Wang, X.G., and Jia, J. (2017, January 21\u201326). Pyramid scene parsing network. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.660"},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Shamsolmoali, P., Chanussot, J., Zareapoor, M., Zhou, H.Y., and Yang, J. (2021). Multipatch Feature Pyramid Network for Weakly Supervised Object Detection in Optical Remote Sensing Images. IEEE Trans. Geosci. Remote Sens., 1\u201313.","DOI":"10.1109\/TGRS.2021.3106442"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Shamsolmoali, P., Zareapoor, M., Chanussot, J., Zhou, H.Y., and Yang, J. (2021). Rotation Equivariant Feature Image Pyramid Network for Object Detection in Optical Remote Sensing Imagery. arXiv.","DOI":"10.1109\/TGRS.2021.3112481"},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"4673","DOI":"10.1109\/TGRS.2020.3016086","article-title":"Road Segmentation for Remote Sensing Images Using Adversarial Spatial Pyramid Networks","volume":"59","author":"Shamsolmoali","year":"2021","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Hoekstra, M., Jiang, M.Z., Clausi, D.A., and Duguay, C. (2020). Lake Ice-Water Classification of RADARSAT-2 Images by Integrating IRGS Segmentation with Pixel-Based Random Forest Labeling. Remote Sens., 12.","DOI":"10.3390\/rs12091425"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Weng, L.G., Xu, Y.M., Xia, M., Zhang, Y.H., Liu, J., and Xu, Y.Q. (2020). Water areas segmentation from remote sensing images using a separable residual segnet network. ISPRS Int. J. Geo-Inf., 9.","DOI":"10.3390\/ijgi9040256"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Lin, T.Y., Doll\u00e1r, P., Girshick, R., He, K.M., Hariharan, B., and Belongie, S. (2017, January 21\u201326). Feature pyramid networks for object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.106"},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"423","DOI":"10.1017\/S0140525X00079577","article-title":"Analyzing vision at the complexity level","volume":"13","author":"Tsotsos","year":"1990","journal-title":"Behav. Brain Sci."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Huang, G., Liu, Z., Van Der Maaten, L., and Weinberger, K.Q. (2017, January 21\u201326). Densely connected convolutional networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.243"},{"key":"ref_22","unstructured":"Simonyan, K., and Zisserman, A. (2014). Very deep convolutional networks for large-scale image recognition. arXiv."},{"key":"ref_23","unstructured":"Howard, A.G., Zhu, M.L., Chen, B., Kalenichenko, D., Wang, W.J., Wey, T., Andreetto, M., and Adam, H. (2017). Mobilenets: Efficient convolutional neural networks for mobile vision applications. arXiv."},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Szegedy, C., Liu, W., Jia, Y.Q., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., and Rabinovich, A. (2015, January 7\u201312). Going deeper with convolutions. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298594"},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Zhang, X.Y., Zhou, X.Y., Lin, M.X., and Sun, J. (2018, January 18\u201323). Shufflenet: An extremely efficient convolutional neural network for mobile devices. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00716"},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Wang, Z.W., Xia, M., Lu, M., Pan, L.L., and Liu, J. (2021). Parameter Identification in Power Transmission Systems Based on Graph Convolution Network. IEEE Trans. Power Deliv.","DOI":"10.1109\/TPWRD.2021.3124528"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Tsotsos, J.K. (2011). A Computational Perspective on Visual Attention, MIT Press.","DOI":"10.7551\/mitpress\/9780262015417.001.0001"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Bello, I., Zoph, B., Vaswani, A., Shlens, J., and Le, Q.V. (2019, January 27\u201328). Attention augmented convolutional networks. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Seoul, Korea.","DOI":"10.1109\/ICCV.2019.00338"},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"104735","DOI":"10.1016\/j.cageo.2021.104735","article-title":"Dual-input attention network for automatic identification of detritus from river sands","volume":"151","author":"Ge","year":"2021","journal-title":"Comput. Geosci."},{"key":"ref_30","first-page":"102597","article-title":"SUACDNet: Attentional change detection network based on siamese U-shaped structure","volume":"105","author":"Song","year":"2021","journal-title":"Int. J. Appl. Earth Obs. Geoinf."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"104940","DOI":"10.1016\/j.cageo.2021.104940","article-title":"Strip pooling channel spatial attention network for the segmentation of cloud and cloud shadow","volume":"157","author":"Qu","year":"2021","journal-title":"Comput. Geosci."},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Hu, J., Shen, L., and Sun, G. (2018, January 18\u201323). Squeeze-and-excitation networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00745"},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Woo, S., Park, J., Lee, J.Y., and Kweon, I.S. (2018, January 8\u201314). Cbam: Convolutional block attention module. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01234-2_1"},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Li, X., Wang, W.H., Hu, X.L., and Yang, J. (2019, January 15\u201320). Selective kernel networks. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00060"},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"2417","DOI":"10.1109\/TIFS.2020.2969552","article-title":"Multi-stage feature constraints learning for age estimation","volume":"15","author":"Xia","year":"2020","journal-title":"IEEE Trans. Inf. Forensics Secur."},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"113669","DOI":"10.1016\/j.eswa.2020.113669","article-title":"Non-intrusive load disaggregation based on composite deep long short-term memory network","volume":"160","author":"Xia","year":"2020","journal-title":"Expert Syst. Appl."},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Boguszewski, A., Batorski, D., Ziemba-Jankowska, N., Zambrzycka, A., and Dziedzic, T. (2020). Landcover. ai: Dataset for automatic mapping of buildings, woodlands and water from aerial imagery. arXiv.","DOI":"10.1109\/CVPRW53098.2021.00121"},{"key":"ref_39","unstructured":"Kingma, D.P., and Ba, J. (2014). Adam: A method for stochastic optimization. arXiv."},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"834","DOI":"10.1109\/TPAMI.2017.2699184","article-title":"Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs","volume":"40","author":"Chen","year":"2017","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Chen, L.C., Zhu, Y.K., Papandreou, G., Schroff, F., and Adam, H. (2018, January 8\u201314). Encoder-decoder with atrous separable convolution for semantic image segmentation. Proceedings of the European conference on computer vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01234-2_49"},{"key":"ref_42","unstructured":"Li, H.C., Xiong, P.F., An, J., and Wang, L. (2018). Pyramid attention network for semantic segmentation. arXiv."},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Dang, B., and Li, Y.S. (2021). MSResNet: Multiscale Residual Network via Self-Supervised Learning for Water-Body Detection in Remote Sensing Imagery. Remote Sens., 13.","DOI":"10.3390\/rs13163122"},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Yu, C.Q., Wang, J.B., Peng, C., Gao, C.X., Yu, G., and Sang, N. (2018, January 18\u201323). Learning a discriminative feature network for semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00199"},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Yu, C.Q., Wang, J.B., Peng, C., Gao, C.X., Yu, G., and Sang, N. (2018, January 8\u201314). Bisenet: Bilateral segmentation network for real-time semantic segmentation. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01261-8_20"},{"key":"ref_46","doi-asserted-by":"crossref","unstructured":"Yang, M., Yu, K., Zhang, C., Li, Z.W., and Yang, K.Y. (2018, January 18\u201323). Denseaspp for semantic segmentation in street scenes. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00388"},{"key":"ref_47","doi-asserted-by":"crossref","unstructured":"Zhang, Z.L., Lu, M., Ji, S.P., Yu, H.F., and Nie, C.H. (2021). Rich CNN Features for Water-Body Segmentation from Very High Resolution Aerial and Satellite Imager. Remote Sens., 13.","DOI":"10.3390\/rs13101912"}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/14\/1\/206\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,13]],"date-time":"2025-10-13T13:35:59Z","timestamp":1760362559000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/14\/1\/206"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,1,3]]},"references-count":47,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2022,1]]}},"alternative-id":["rs14010206"],"URL":"https:\/\/doi.org\/10.3390\/rs14010206","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,1,3]]}}}