{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,2]],"date-time":"2026-01-02T07:11:37Z","timestamp":1767337897294,"version":"build-2065373602"},"reference-count":24,"publisher":"MDPI AG","issue":"6","license":[{"start":{"date-parts":[[2025,6,6]],"date-time":"2025-06-06T00:00:00Z","timestamp":1749168000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["BDCC"],"abstract":"<jats:p>To improve the performance of the image semantic segmentation algorithm and make the algorithm achieve a better balance between accuracy and real-time performance when segmenting images, this paper proposes a real-time image semantic segmentation model based on an improved DeepLabv3+ network. First, the MobileNetV2 model with less computational overhead and number of parameters is selected as the backbone network to improve the segmentation speed; then, the Feature Enhancement Module (FEM) is introduced to several shallow features with different scale sizes in MobileNetV2, and then these shallow features are fused to improve the utilization rate of the model encoder on the edge information, to retain more detailed information and to improve the network\u2019s feature representation ability for complex scenes; finally, to address the problem that the output feature maps of Atrous Spatial Pyramid Pooling (ASPP) module do not pay enough attention to detailed information after merging, the FEM attention mechanism is introduced on the feature maps processed by the ASPP module. The algorithm in this study achieves 76.45% for mean intersection over union (mIoU) accuracy with 29.18 FPS real-time performance in the PASCAL VOC2012 Augmented dataset; and 37.31% mIoU accuracy with 23.31 FPS real-time performance in the ADE20K dataset. The experimental results show that the algorithm in this study achieves a good balance between accuracy and real-time performance, and its image semantic segmentation performance is significantly improved compared to DeepLabv3+ and other existing algorithms.<\/jats:p>","DOI":"10.3390\/bdcc9060152","type":"journal-article","created":{"date-parts":[[2025,6,6]],"date-time":"2025-06-06T12:58:21Z","timestamp":1749214701000},"page":"152","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":2,"title":["Real-Time Image Semantic Segmentation Based on Improved DeepLabv3+ Network"],"prefix":"10.3390","volume":"9","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-2692-2719","authenticated-orcid":false,"given":"Peibo","family":"Li","sequence":"first","affiliation":[{"name":"College of Mechanical Engineering, Donghua University, Shanghai 201620, China"}]},{"ORCID":"https:\/\/orcid.org\/0009-0001-6707-0295","authenticated-orcid":false,"given":"Jiangwu","family":"Zhou","sequence":"additional","affiliation":[{"name":"College of Mechanical Engineering, Donghua University, Shanghai 201620, China"}]},{"given":"Xiaohua","family":"Xu","sequence":"additional","affiliation":[{"name":"College of Mechanical Engineering, Donghua University, Shanghai 201620, China"}]}],"member":"1968","published-online":{"date-parts":[[2025,6,6]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Shu, R., and Zhao, S. (2024). Multi-Resolution Learning and Semantic Edge Enhancement for Super-Resolution Semantic Segmentation of Urban Scene Images. Sensors, 24.","DOI":"10.3390\/s24144522"},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"1865","DOI":"10.1007\/s12524-022-01496-w","article-title":"Intelligent image semantic segmentation: A review through deep learning techniques for remote sensing image analysis","volume":"51","author":"Jiang","year":"2023","journal-title":"J. Indian Soc. Remote Sens."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"103574","DOI":"10.1016\/j.jvcir.2022.103574","article-title":"ADPNet: Attention based dual path network for lane detection","volume":"87","author":"Ren","year":"2022","journal-title":"J. Vis. Commun. Image Represent."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"104554","DOI":"10.1016\/j.imavis.2022.104554","article-title":"Weather-degraded image semantic segmentation with multi-task knowledge distillation","volume":"127","author":"Li","year":"2022","journal-title":"Image Vis. Comput."},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Long, J., Shelhamer, E., and Darrell, T. (2015, January 7\u201312). Fully convolutional networks for semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition 2015, Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298965"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Chen, L.C. (2017). Rethinking atrous convolution for semantic image segmentation. arXiv.","DOI":"10.1007\/978-3-030-01234-2_49"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Chen, L.C., Zhu, Y., Papandreou, G., Schroff, F., and Adam, H. (2018, January 8\u201314). Encoder-decoder with atrous separable convolution for semantic image segmentation. Proceedings of the European Conference on Computer Vision (ECCV) 2018, Munich, Germany.","DOI":"10.1007\/978-3-030-01234-2_49"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"771","DOI":"10.18280\/ts.410218","article-title":"MFP-DeepLabv3+: A Multi-scale Feature Fusion and Parallel Attention Network for Enhanced Bone Metastasis Segmentation","volume":"41","author":"Gu","year":"2024","journal-title":"Trait. Signal"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Hou, X., Chen, P., and Gu, H. (2024). LM-DeeplabV3+: A Lightweight Image Segmentation Algorithm Based on Multi-Scale Feature Interaction. Appl. Sci., 14.","DOI":"10.20944\/preprints202401.1052.v1"},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1007\/s11554-023-01380-x","article-title":"Light-Deeplabv3+: A lightweight real-time semantic segmentation method for complex environment perception","volume":"21","author":"Ding","year":"2024","journal-title":"J. Real-Time Image Process."},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Baban, A., Erep, T.R., and Chaari, L. (2023). mid-DeepLabv3+: A Novel Approach for Image Semantic Segmentation Applied to African Food Dietary Assessments. Sensors, 24.","DOI":"10.3390\/s24010209"},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"2310805","DOI":"10.1080\/23311932.2024.2310805","article-title":"Pear leaf disease segmentation method based on improved DeepLabv3+","volume":"10","author":"Fu","year":"2024","journal-title":"Cogent Food Agric."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Zhao, T., Fu, C., Song, W., and Sham, C.W. (2023). RGGC-UNet: Accurate deep learning framework for signet ring cell semantic segmentation in pathological images. Bioengineering, 11.","DOI":"10.3390\/bioengineering11010016"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"121574","DOI":"10.1016\/j.eswa.2023.121574","article-title":"TranSiam: Aggregating multi-modal visual features with locality for medical image segmentation","volume":"237","author":"Li","year":"2024","journal-title":"Expert Syst. Appl."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"648","DOI":"10.1016\/j.aej.2025.02.035","article-title":"MiM-UNet: An efficient building image segmentation network integrating state space models","volume":"120","author":"Liu","year":"2025","journal-title":"Alex. Eng. J."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition 2016, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Chollet, F. (2017, January 21\u201326). Xception: Deep learning with depthwise separable convolutions. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition 2017, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.195"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Sandler, M., Howard, A., Zhu, M., Zhmoginov, A., and Chen, L.C. (2018, January 18\u201323). Mobilenetv2: Inverted residuals and linear bottlenecks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition 2018, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00474"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"48","DOI":"10.1016\/j.neucom.2021.03.091","article-title":"A review on the attention mechanism of deep learning","volume":"452","author":"Niu","year":"2021","journal-title":"Neurocomputing"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Hou, Q., Zhou, D., and Feng, J. (2021, January 20\u201325). Coordinate attention for efficient mobile network design. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition 2021, Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.01350"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Hu, J., Shen, L., and Sun, G. (2018, January 18\u201323). Squeeze-and-excitation networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition 2018, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00745"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Wang, Y., Yang, L., Liu, X., and Yan, P. (2024). An improved semantic segmentation algorithm for high-resolution remote sensing images based on DeepLabv3+. Sci. Rep., 14.","DOI":"10.1038\/s41598-024-60375-1"},{"key":"ref_23","first-page":"5611215","article-title":"FFCA-YOLO for small object detection in remote sensing images","volume":"62","author":"Zhang","year":"2024","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"108261","DOI":"10.1016\/j.engappai.2024.108261","article-title":"Multi-scale spatial pyramid attention mechanism for image recognition: An effective approach","volume":"133","author":"Yu","year":"2024","journal-title":"Eng. Appl. Artif. Intell."}],"container-title":["Big Data and Cognitive Computing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2504-2289\/9\/6\/152\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,9]],"date-time":"2025-10-09T17:48:02Z","timestamp":1760032082000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2504-2289\/9\/6\/152"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,6,6]]},"references-count":24,"journal-issue":{"issue":"6","published-online":{"date-parts":[[2025,6]]}},"alternative-id":["bdcc9060152"],"URL":"https:\/\/doi.org\/10.3390\/bdcc9060152","relation":{},"ISSN":["2504-2289"],"issn-type":[{"type":"electronic","value":"2504-2289"}],"subject":[],"published":{"date-parts":[[2025,6,6]]}}}