{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T02:26:17Z","timestamp":1760149577553,"version":"build-2065373602"},"reference-count":69,"publisher":"MDPI AG","issue":"15","license":[{"start":{"date-parts":[[2023,8,6]],"date-time":"2023-08-06T00:00:00Z","timestamp":1691280000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001691","name":"Japan Society for the Promotion of Science (JSPS)","doi-asserted-by":"publisher","award":["JP20J2212"],"award-info":[{"award-number":["JP20J2212"]}],"id":[{"id":"10.13039\/501100001691","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>In this paper, we propose the Semantic-Boundary-Conditioned Backbone (SBCB) framework, an effective approach to enhancing semantic segmentation performance, particularly around mask boundaries, while maintaining compatibility with various segmentation architectures. Our objective is to improve existing models by leveraging semantic boundary information as an auxiliary task. The SBCB framework incorporates a complementary semantic boundary detection (SBD) task with a multi-task learning approach. It enhances the segmentation backbone without introducing additional parameters during inference or relying on independent post-processing modules. The SBD head utilizes multi-scale features from the backbone, learning low-level features in early stages and understanding high-level semantics in later stages. This complements common semantic segmentation architectures, where features from later stages are used for classification. Extensive evaluations using popular segmentation heads and backbones demonstrate the effectiveness of the SBCB. It leads to an average improvement of 1.2% in IoU and a 2.6% gain in the boundary F-score on the Cityscapes dataset. The SBCB framework also improves over- and under-segmentation characteristics. Furthermore, the SBCB adapts well to customized backbones and emerging vision transformer models, consistently achieving superior performance. In summary, the SBCB framework significantly boosts segmentation performance, especially around boundaries, without introducing complexity to the models. Leveraging the SBD task as an auxiliary objective, our approach demonstrates consistent improvements on various benchmarks, confirming its potential for advancing the field of semantic segmentation.<\/jats:p>","DOI":"10.3390\/s23156980","type":"journal-article","created":{"date-parts":[[2023,8,6]],"date-time":"2023-08-06T10:01:53Z","timestamp":1691316113000},"page":"6980","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":4,"title":["Boosting Semantic Segmentation by Conditioning the Backbone with Semantic Boundaries"],"prefix":"10.3390","volume":"23","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-1494-3635","authenticated-orcid":false,"given":"Haruya","family":"Ishikawa","sequence":"first","affiliation":[{"name":"Department of Electronics and Electrical Engineering, Facility of Science and Technology, Keio University, 3-14-1, Hiyoshi, Kohoku-ku, Yokohama 223-8522, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7361-0027","authenticated-orcid":false,"given":"Yoshimitsu","family":"Aoki","sequence":"additional","affiliation":[{"name":"Department of Electronics and Electrical Engineering, Facility of Science and Technology, Keio University, 3-14-1, Hiyoshi, Kohoku-ku, Yokohama 223-8522, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2023,8,6]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Cheng, B., Girshick, R.B., Doll\u2019ar, P., Berg, A.C., and Kirillov, A. (2021, January 19\u201325). Boundary IoU: Improving Object-Centric Image Segmentation Evaluation. Proceedings of the 2021 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Virtual.","DOI":"10.1109\/CVPR46437.2021.01508"},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Takikawa, T., Acuna, D., Jampani, V., and Fidler, S. (November, January 27). Gated-SCNN: Gated Shape CNNs for Semantic Segmentation. Proceedings of the 2019 IEEE\/CVF International Conference on Computer Vision (ICCV), Seoul, Republic of Korea.","DOI":"10.1109\/ICCV.2019.00533"},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Li, X., Li, X., Zhang, L., Cheng, G., Shi, J., Lin, Z., Tan, S., and Tong, Y. (2020, January 23\u201328). Improving Semantic Segmentation via Decoupled Body and Edge Supervision. Proceedings of the 2020 IEEE\/CVF European Conference on Computer Vision (ECCV), Virtual.","DOI":"10.1007\/978-3-030-58520-4_26"},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Zhen, M., Wang, J., Zhou, L., Li, S., Shen, T., Shang, J., Fang, T., and Long, Q. (2020, January 13\u201319). Joint Semantic Segmentation and Boundary Detection Using Iterative Pyramid Contexts. Proceedings of the 2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Virtual.","DOI":"10.1109\/CVPR42600.2020.01368"},{"key":"ref_5","unstructured":"Yu, Z., Huang, R., Byeon, W., Liu, S., Liu, G., Breuel, T., Anandkumar, A., and Kautz, J. (2021, January 6\u201314). Coupled Segmentation and Edge Learning via Dynamic Graph Propagation. Proceedings of the Advances in Neural Information Processing Systems (NeurIPS), Virtual."},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Yuan, Y., Xie, J., Chen, X., and Wang, J. (2020, January 23\u201328). SegFix: Model-Agnostic Boundary Refinement for Segmentation. Proceedings of the 2020 IEEE\/CVF European Conference on Computer Vision (ECCV), Virtual.","DOI":"10.1007\/978-3-030-58610-2_29"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Bertasius, G., Shi, J., and Torresani, L. (2015, January 11\u201318). High-for-Low and Low-for-High: Efficient Boundary Detection from Deep Object Features and Its Applications to High-Level Vision. Proceedings of the 2015 IEEE International Conference on Computer Vision (ICCV), Santiago, Chile.","DOI":"10.1109\/ICCV.2015.65"},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Ramamonjisoa, M., Du, Y., and Lepetit, V. (2020, January 13\u201319). Predicting Sharp and Accurate Occlusion Boundaries in Monocular Depth Estimation Using Displacement Fields. Proceedings of the 2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.01466"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Ramalingam, S., Bouaziz, S., Sturm, P.F., and Brand, M. (2010, January 18\u201322). SKYLINE2GPS: Localization in urban canyons using omni-skylines. Proceedings of the 010 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS), Taipei, Taiwan.","DOI":"10.1109\/IROS.2010.5649105"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Long, J., Shelhamer, E., and Darrell, T. (2015, January 7\u201312). Fully convolutional networks for semantic segmentation. Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298965"},{"key":"ref_11","unstructured":"Chen, L.C., Papandreou, G., Schroff, F., and Adam, H. (2017). Rethinking Atrous Convolution for Semantic Image Segmentation. arXiv."},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Zhao, H., Shi, J., Qi, X., Wang, X., and Jia, J. (2017, January 21\u201326). Pyramid Scene Parsing Network. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.660"},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Fu, J., Liu, J., Tian, H., Fang, Z., and Lu, H. (2019, January 15\u201320). Dual Attention Network for Scene Segmentation. Proceedings of the 2019 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00326"},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Zhu, Z., Xu, M., Bai, S., Huang, T., and Bai, X. (November, January 27). Asymmetric Non-Local Neural Networks for Semantic Segmentation. Proceedings of the 2019 IEEE\/CVF International Conference on Computer Vision (ICCV), Seoul, Republic of Korea.","DOI":"10.1109\/ICCV.2019.00068"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Pang, Y., Li, Y., Shen, J., and Shao, L. (November, January 27). Towards Bridging Semantic Gap to Improve Semantic Segmentation. Proceedings of the 2019 IEEE\/CVF International Conference on Computer Vision (ICCV), Seoul, Republic of Korea.","DOI":"10.1109\/ICCV.2019.00433"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Huang, Z., Wang, X., Huang, L., Huang, C., Wei, Y., Shi, H., and Liu, W. (November, January 27). CCNet: Criss-Cross Attention for Semantic Segmentation. Proceedings of the 2019 IEEE\/CVF International Conference on Computer Vision (ICCV), Seoul, Republic of Korea.","DOI":"10.1109\/ICCV.2019.00069"},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"6881","DOI":"10.1109\/TPAMI.2020.3047209","article-title":"Global Context Networks","volume":"45","author":"Cao","year":"2020","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Yin, M., Yao, Z., Cao, Y., Li, X., Zhang, Z., Lin, S., and Hu, H. (2020, January 23\u201328). Disentangled Non-Local Neural Networks. Proceedings of the 2020 IEEE\/CVF European Conference on Computer Vision (ECCV), Virtual.","DOI":"10.1007\/978-3-030-58555-6_12"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"2547","DOI":"10.1109\/TNNLS.2020.3006524","article-title":"Scene Segmentation with Dual Relation-Aware Attention Network","volume":"32","author":"Fu","year":"2021","journal-title":"IEEE Trans. Neural Netw. Learn. Syst."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Wang, X., Girshick, R.B., Gupta, A.K., and He, K. (2018, January 18\u201323). Non-local Neural Networks. Proceedings of the 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00813"},{"key":"ref_21","unstructured":"Vaswani, A., Shazeer, N.M., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, L., and Polosukhin, I. (2017, January 4\u20139). Attention is All you Need. Proceedings of the 21st International Conference in Neural Information Processing Systems (NeurIPS), Long Beach, CA, USA."},{"key":"ref_22","unstructured":"Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner, T., Dehghani, M., Minderer, M., Heigold, G., and Gelly, S. (2020). An Image is Worth 16 \u00d7 16 Words: Transformers for Image Recognition at Scale. arXiv."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Liu, Z., Lin, Y., Cao, Y., Hu, H., Wei, Y., Zhang, Z., Lin, S., and Guo, B. (2021, January 11\u201317). Swin Transformer: Hierarchical Vision Transformer using Shifted Windows. Proceedings of the 2021 IEEE\/CVF International Conference on Computer Vision (ICCV), Montreal, BC, Canada.","DOI":"10.1109\/ICCV48922.2021.00986"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Strudel, R., Pinel, R.G., Laptev, I., and Schmid, C. (2021, January 11\u201317). Segmenter: Transformer for Semantic Segmentation. Proceedings of the 2021 IEEE\/CVF International Conference on Computer Vision (ICCV), Montreal, BC, Canada.","DOI":"10.1109\/ICCV48922.2021.00717"},{"key":"ref_25","unstructured":"Xie, E., Wang, W., Yu, Z., Anandkumar, A., \u00c1lvarez, J.M., and Luo, P. (2021, January 6\u201314). SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers. Proceedings of the Advances in Neural Information Processing Systems (NeurIPS), Virtual."},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Kirillov, A., He, K., Girshick, R.B., Rother, C., and Doll\u00e1r, P. (2018, January 18\u201323). Panoptic Segmentation. Proceedings of the 2019 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2019.00963"},{"key":"ref_27","unstructured":"Cheng, B., Schwing, A.G., and Kirillov, A. (2021, January 6\u201314). Per-Pixel Classification is Not All You Need for Semantic Segmentation. Proceedings of the Neural Information Processing Systems, Virtual."},{"key":"ref_28","unstructured":"Jain, J., Li, J., Chiu, M.C., Hassani, A., Orlov, N., and Shi, H. (2022). OneFormer: One Transformer to Rule Universal Image Segmentation. arXiv."},{"key":"ref_29","unstructured":"Li, F., Zhang, H., Xu, H.S., Liu, S., Zhang, L., Ni, L.M., and Shum, H.Y. (2022). Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation. arXiv."},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Kirillov, A., Mintun, E., Ravi, N., Mao, H., Rolland, C., Gustafson, L., Xiao, T., Whitehead, S., Berg, A.C., and Lo, W.Y. (2023). Segment Anything. arXiv.","DOI":"10.1109\/ICCV51070.2023.00371"},{"key":"ref_31","unstructured":"Zou, X., Yang, J., Zhang, H., Li, F., Li, L., Gao, J., and Lee, Y.J. (2023). Segment Everything Everywhere All at Once. arXiv."},{"key":"ref_32","unstructured":"Li, F., Zhang, H., Sun, P., Zou, X., Liu, S., Yang, J., Li, C., Zhang, L., and Gao, J. (2023). Semantic-SAM: Segment and Recognize Anything at Any Granularity. arXiv."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"3","DOI":"10.1007\/s11263-017-1004-z","article-title":"Holistically-Nested Edge Detection","volume":"125","author":"Xie","year":"2015","journal-title":"Int. J. Comput. Vis."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Liu, Y., Cheng, M.M., Hu, X., Wang, K., and Bai, X. (2017, January 21\u201326). Richer Convolutional Features for Edge Detection. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.622"},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Pu, M., Huang, Y., Liu, Y., Guan, Q., and Ling, H. (2022, January 18\u201324). EDTER: Edge Detection with Transformer. Proceedings of the 2022 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, LA, USA.","DOI":"10.1109\/CVPR52688.2022.00146"},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Yu, Z., Feng, C., Liu, M.Y., and Ramalingam, S. (2017, January 21\u201326). CASENet: Deep Category-Aware Semantic Edge Detection. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.191"},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Hu, Y., Chen, Y., Li, X., and Feng, J. (2019, January 10\u201316). Dynamic Feature Fusion for Semantic Edge Detection. Proceedings of the IJCAI, Macao, China.","DOI":"10.24963\/ijcai.2019\/110"},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"179","DOI":"10.1007\/s11263-021-01539-8","article-title":"Semantic Edge Detection with Diverse Deep Supervision","volume":"130","author":"Liu","year":"2022","journal-title":"Int. J. Comput. Vis."},{"key":"ref_39","unstructured":"Shen, D., Ji, Y., Li, P., Wang, Y., and Lin, D. (2020, January 6\u201312). RANet: Region Attention Network for Semantic Segmentation. Proceedings of the Neural Information Processing Systems, Vancouver, BC, Canada."},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Hu, H., Cui, J., and Zha, H. (2021, January 18\u201321). Boundary-aware Graph Convolution for Semantic Segmentation. Proceedings of the 2020 25th International Conference on Pattern Recognition (ICPR), Taichung, Taiwan.","DOI":"10.1109\/ICPR48806.2021.9412034"},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"25259","DOI":"10.1109\/TITS.2022.3194213","article-title":"BANet: Boundary-Assistant Encoder-Decoder Network for Semantic Segmentation","volume":"23","author":"Zhou","year":"2022","journal-title":"IEEE Trans. Intell. Transp. Syst."},{"key":"ref_42","unstructured":"Tao, A., Sapra, K., and Catanzaro, B. (2020). Hierarchical Multi-Scale Attention for Semantic Segmentation. arXiv."},{"key":"ref_43","doi-asserted-by":"crossref","first-page":"107511","DOI":"10.1016\/j.asoc.2021.107511","article-title":"Semantic boundary enhancement and position attention network with long-range dependency for semantic segmentation","volume":"109","author":"Chen","year":"2021","journal-title":"Appl. Soft Comput."},{"key":"ref_44","doi-asserted-by":"crossref","first-page":"107557","DOI":"10.1016\/j.patcog.2020.107557","article-title":"SEMEDA: Enhancing Segmentation Precision with Semantic Edge Aware Loss","volume":"108","author":"Chen","year":"2020","journal-title":"Pattern Recognit."},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Borse, S., Wang, Y., Zhang, Y., and Porikli, F.M. (2021, January 20\u201325). InverseForm: A Loss Function for Structured Boundary-Aware Segmentation. Proceedings of the 2021 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.00584"},{"key":"ref_46","unstructured":"Wang, C., Zhang, Y., Cui, M., Liu, J., Ren, P., Yang, Y., Xie, X., Hua, X., Bao, H., and Xu, W. (March, January 22). Active Boundary Loss for Semantic Segmentation. Proceedings of the AAAI, Virtual."},{"key":"ref_47","doi-asserted-by":"crossref","unstructured":"Zhou, P., Price, B.L., Cohen, S.D., Wilensky, G., and Davis, L.S. (2020, January 13\u201319). Deepstrip: High-Resolution Boundary Refinement. Proceedings of the 2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Virtual.","DOI":"10.1109\/CVPR42600.2020.01057"},{"key":"ref_48","doi-asserted-by":"crossref","unstructured":"Misra, I., Shrivastava, A., Gupta, A.K., and Hebert, M. (July, January 26). Cross-Stitch Networks for Multi-task Learning. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.433"},{"key":"ref_49","doi-asserted-by":"crossref","unstructured":"Kokkinos, I. (2017, January 21\u201326). UberNet: Training a Universal Convolutional Neural Network for Low-, Mid-, and High-Level Vision Using Diverse Datasets and Limited Memory. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.579"},{"key":"ref_50","doi-asserted-by":"crossref","unstructured":"Xiao, T., Liu, Y., Zhou, B., Jiang, Y., and Sun, J. (2018, January 8\u201314). Unified Perceptual Parsing for Scene Understanding. Proceedings of the 2018 IEEE\/CVF European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01228-1_26"},{"key":"ref_51","doi-asserted-by":"crossref","unstructured":"Xu, D., Ouyang, W., Wang, X., and Sebe, N. (2018, January 18\u201323). PAD-Net: Multi-tasks Guided Prediction-and-Distillation Network for Simultaneous Depth Estimation and Scene Parsing. Proceedings of the 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00077"},{"key":"ref_52","doi-asserted-by":"crossref","unstructured":"Xu, J., Xiong, Z., and Bhattacharyya, S. (2023, January 24\u201331). PIDNet: A Real-time Semantic Segmentation Network Inspired by PID Controllers. Proceedings of the 2023 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA.","DOI":"10.1109\/CVPR52729.2023.01871"},{"key":"ref_53","unstructured":"Tan, H.H., Wu, S., and Pi, J. (2023, January 10\u201316). Semantic Diffusion Network for Semantic Segmentation. Proceedings of the Advances in Neural Information Processing Systems (NeurIPS), New Orleans, LA, USA."},{"key":"ref_54","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (July, January 26). Deep Residual Learning for Image Recognition. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_55","doi-asserted-by":"crossref","unstructured":"Odena, A., Dumoulin, V., and Olah, C. (2016). Deconvolution and Checkerboard Artifacts. Distill.","DOI":"10.23915\/distill.00003"},{"key":"ref_56","doi-asserted-by":"crossref","first-page":"415","DOI":"10.4086\/toc.2012.v008a019","article-title":"Distance Transforms of Sampled Functions","volume":"8","author":"Felzenszwalb","year":"2012","journal-title":"Theory Comput."},{"key":"ref_57","doi-asserted-by":"crossref","unstructured":"Cordts, M., Omran, M., Ramos, S., Rehfeld, T., Enzweiler, M., Benenson, R., Franke, U., Roth, S., and Schiele, B. (July, January 26). The Cityscapes Dataset for Semantic Urban Scene Understanding. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.350"},{"key":"ref_58","doi-asserted-by":"crossref","unstructured":"Yu, Z., Liu, W., Zou, Y., Feng, C., Ramalingam, S., Kumar, B.V.K.V., and Kautz, J. (2018, January 8\u201314). Simultaneous Edge Alignment and Learning. Proceedings of the 2018 IEEE\/CVF European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01219-9_24"},{"key":"ref_59","doi-asserted-by":"crossref","unstructured":"Yu, F., Chen, H., Wang, X., Xian, W., Chen, Y., Liu, F., Madhavan, V., and Darrell, T. (2018, January 18\u201323). BDD100K: A Diverse Driving Dataset for Heterogeneous Multitask Learning. Proceedings of the 2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR42600.2020.00271"},{"key":"ref_60","doi-asserted-by":"crossref","unstructured":"Ros, G., Sellart, L., Materzynska, J., V\u00e1zquez, D., and L\u00f3pez, A.M. (July, January 26). The SYNTHIA Dataset: A Large Collection of Synthetic Images for Semantic Segmentation of Urban Scenes. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.352"},{"key":"ref_61","doi-asserted-by":"crossref","unstructured":"Zhou, B., Zhao, H., Puig, X., Fidler, S., Barriuso, A., and Torralba, A. (2017, January 21\u201326). Scene Parsing through ADE20K Dataset. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.544"},{"key":"ref_62","unstructured":"Zhang, Y., Mehta, S., and Caspi, A. (2021). Rethinking Semantic Segmentation Evaluation for Explainability and Model Selection. arXiv."},{"key":"ref_63","unstructured":"Contributors, M. (2023, July 13). MMSegmentation: OpenMMLab Semantic Segmentation Toolbox and Benchmark. Available online: https:\/\/github.com\/open-mmlab\/mmsegmentation."},{"key":"ref_64","doi-asserted-by":"crossref","unstructured":"Chen, L.C., Zhu, Y., Papandreou, G., Schroff, F., and Adam, H. (2018, January 8\u201314). Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation. Proceedings of the 2018 IEEE\/CVF European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01234-2_49"},{"key":"ref_65","doi-asserted-by":"crossref","unstructured":"Zhao, H., Zhang, Y., Liu, S., Shi, J., Loy, C.C., Lin, D., and Jia, J. (2018, January 8\u201314). PSANet: Point-wise Spatial Attention Network for Scene Parsing. Proceedings of the European Conference on Computer Vision, Munich, Germany.","DOI":"10.1007\/978-3-030-01240-3_17"},{"key":"ref_66","doi-asserted-by":"crossref","unstructured":"Yu, C., Wang, J., Peng, C., Gao, C., Yu, G., and Sang, N. (2018, January 8\u201314). BiSeNet: Bilateral Segmentation Network for Real-time Semantic Segmentation. Proceedings of the European Conference on Computer Vision, Munich, Germany.","DOI":"10.1007\/978-3-030-01261-8_20"},{"key":"ref_67","doi-asserted-by":"crossref","first-page":"3051","DOI":"10.1007\/s11263-021-01515-2","article-title":"BiSeNet V2: Bilateral Network with Guided Aggregation for Real-Time Semantic Segmentation","volume":"129","author":"Yu","year":"2020","journal-title":"Int. J. Comput. Vis."},{"key":"ref_68","doi-asserted-by":"crossref","unstructured":"Fan, M., Lai, S., Huang, J., Wei, X., Chai, Z., Luo, J., and Wei, X. (2021, January 20\u201325). Rethinking BiSeNet For Real-time Semantic Segmentation. Proceedings of the 2021 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.00959"},{"key":"ref_69","doi-asserted-by":"crossref","unstructured":"Liu, Z., Mao, H., Wu, C., Feichtenhofer, C., Darrell, T., and Xie, S. (2022, January 18\u201324). A ConvNet for the 2020s. Proceedings of the 2022 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, LA, USA.","DOI":"10.1109\/CVPR52688.2022.01167"}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/23\/15\/6980\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T20:26:43Z","timestamp":1760128003000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/23\/15\/6980"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,8,6]]},"references-count":69,"journal-issue":{"issue":"15","published-online":{"date-parts":[[2023,8]]}},"alternative-id":["s23156980"],"URL":"https:\/\/doi.org\/10.3390\/s23156980","relation":{},"ISSN":["1424-8220"],"issn-type":[{"type":"electronic","value":"1424-8220"}],"subject":[],"published":{"date-parts":[[2023,8,6]]}}}