{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,25]],"date-time":"2026-02-25T17:08:31Z","timestamp":1772039311351,"version":"3.50.1"},"reference-count":64,"publisher":"MDPI AG","issue":"14","license":[{"start":{"date-parts":[[2023,7,22]],"date-time":"2023-07-22T00:00:00Z","timestamp":1689984000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["51805078"],"award-info":[{"award-number":["51805078"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["N2103011"],"award-info":[{"award-number":["N2103011"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["2022JH6\/100100023"],"award-info":[{"award-number":["2022JH6\/100100023"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["B16009"],"award-info":[{"award-number":["B16009"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Fundamental Research Funds for the Central Universities","award":["51805078"],"award-info":[{"award-number":["51805078"]}]},{"name":"Fundamental Research Funds for the Central Universities","award":["N2103011"],"award-info":[{"award-number":["N2103011"]}]},{"name":"Fundamental Research Funds for the Central Universities","award":["2022JH6\/100100023"],"award-info":[{"award-number":["2022JH6\/100100023"]}]},{"name":"Fundamental Research Funds for the Central Universities","award":["B16009"],"award-info":[{"award-number":["B16009"]}]},{"name":"Central Guidance on Local Science and Technology Development Fund","award":["51805078"],"award-info":[{"award-number":["51805078"]}]},{"name":"Central Guidance on Local Science and Technology Development Fund","award":["N2103011"],"award-info":[{"award-number":["N2103011"]}]},{"name":"Central Guidance on Local Science and Technology Development Fund","award":["2022JH6\/100100023"],"award-info":[{"award-number":["2022JH6\/100100023"]}]},{"name":"Central Guidance on Local Science and Technology Development Fund","award":["B16009"],"award-info":[{"award-number":["B16009"]}]},{"name":"111 Project","award":["51805078"],"award-info":[{"award-number":["51805078"]}]},{"name":"111 Project","award":["N2103011"],"award-info":[{"award-number":["N2103011"]}]},{"name":"111 Project","award":["2022JH6\/100100023"],"award-info":[{"award-number":["2022JH6\/100100023"]}]},{"name":"111 Project","award":["B16009"],"award-info":[{"award-number":["B16009"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>As an important computer vision technique, image segmentation has been widely used in various tasks. However, in some extreme cases, the insufficient illumination would result in a great impact on the performance of the model. So more and more fully supervised methods use multi-modal images as their input. The dense annotated large datasets are difficult to obtain, but the few-shot methods still can have satisfactory results with few pixel-annotated samples. Therefore, we propose the Visible-Depth-Thermal (three-modal) images few-shot semantic segmentation method. It utilizes the homogeneous information of three-modal images and the complementary information of different modal images, which can improve the performance of few-shot segmentation tasks. We constructed a novel indoor dataset VDT-2048-5i for the three-modal images few-shot semantic segmentation task. We also proposed a Self-Enhanced Mixed Attention Network (SEMANet), which consists of a Self-Enhanced module (SE) and a Mixed Attention module (MA). The SE module amplifies the difference between the different kinds of features and strengthens the weak connection for the foreground features. The MA module fuses the three-modal feature to obtain a better feature. Compared with the most advanced methods before, our model improves mIoU by 3.8% and 3.3% in 1-shot and 5-shot settings, respectively, which achieves state-of-the-art performance. In the future, we will solve failure cases by obtaining more discriminative and robust feature representations, and explore achieving high performance with fewer parameters and computational costs.<\/jats:p>","DOI":"10.3390\/s23146612","type":"journal-article","created":{"date-parts":[[2023,7,24]],"date-time":"2023-07-24T03:03:25Z","timestamp":1690167805000},"page":"6612","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":5,"title":["Self-Enhanced Mixed Attention Network for Three-Modal Images Few-Shot Semantic Segmentation"],"prefix":"10.3390","volume":"23","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-7636-3460","authenticated-orcid":false,"given":"Kechen","family":"Song","sequence":"first","affiliation":[{"name":"School of Mechanical Engineering & Automation, Northeastern University, Shenyang 110819, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yiming","family":"Zhang","sequence":"additional","affiliation":[{"name":"School of Mechanical Engineering & Automation, Northeastern University, Shenyang 110819, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yanqi","family":"Bao","sequence":"additional","affiliation":[{"name":"National Key Laboratory for Novel Software Technology, Department of Computer Science and Technology, Nanjing University, Nanjing 210023, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ying","family":"Zhao","sequence":"additional","affiliation":[{"name":"School of Mechanical Engineering & Automation, Northeastern University, Shenyang 110819, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7121-2367","authenticated-orcid":false,"given":"Yunhui","family":"Yan","sequence":"additional","affiliation":[{"name":"School of Mechanical Engineering & Automation, Northeastern University, Shenyang 110819, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2023,7,22]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Kong, Y., Wang, H., Kong, L., Liu, Y., Yao, C., and Yin, B. (2023). Absolute and Relative Depth-Induced Network for RGB-D Salient Object Detection. Sensors, 23.","DOI":"10.3390\/s23073611"},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Li, J., Han, D., Wang, X., Yi, P., Yan, L., and Li, X. (2023). Multi-sensor medical-image fusion technique based on embedding bilateral filter in least squares and salient detection. Sensors, 23.","DOI":"10.3390\/s23073490"},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Jian, M., Jin, H., Liu, X., and Zhang, L. (2022). Multiscale Cascaded Attention Network for Saliency Detection Based on ResNet. Sensors, 22.","DOI":"10.3390\/s22249950"},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Ullah, I., Jian, M., Shaheed, K., Hussain, S., Ma, Y., Xu, L., and Muhammad, K. (2022). AWANet: Attentive-Aware Wide-Kernels Asymmetrical Network with Blended Contour Information for Salient Object Detection. Sensors, 22.","DOI":"10.3390\/s22249667"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Liao, X., Li, J., Li, L., Shangguan, C., and Huang, S. (2022). RGBD Salient Object Detection, Based on Specific Object Imaging. Sensors, 22.","DOI":"10.3390\/s22228973"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Meng, X., Liu, Y., Fan, L., and Fan, J. (2023). YOLOv5s-Fog: An Improved Model Based on YOLOv5s for Object Detection in Foggy Weather Scenarios. Sensors, 23.","DOI":"10.20944\/preprints202305.0729.v1"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Lai, H., Chen, L., Liu, W., Yan, Z., and Ye, S. (2023). STC-YOLO: Small Object Detection Network for Traffic Signs in Complex Environments. Sensors, 23.","DOI":"10.3390\/s23115307"},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Zhang, T., Zhang, Y., Xin, M., Liao, J., and Xie, Q. (2023). A Light-Weight Network for Small Insulator and Defect Detection Using UAV Imaging Based on Improved YOLOv5. Sensors, 23.","DOI":"10.20944\/preprints202305.0796.v1"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Yuan, Y., Cui, J., Liu, Y., and Wu, B. (2023). A Multi-Step Fusion Network for Semantic Segmentation of High-Resolution Aerial Images. Sensors, 23.","DOI":"10.3390\/s23115323"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Wu, B., Cui, J., Cui, W., Yuan, Y., and Ren, X. (2023). Fast Semantic Segmentation of Remote Sensing Images Using a Network That Integrates Global and Local Information. Sensors, 23.","DOI":"10.3390\/s23115310"},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"103306","DOI":"10.1016\/j.jvcir.2021.103306","article-title":"Visible and thermal images fusion architecture for few-shot semantic segmentation","volume":"80","author":"Bao","year":"2021","journal-title":"J. Vis. Commun. Image Represent."},{"key":"ref_12","unstructured":"Yu, F., and Koltun, V. (2015). Multi-scale context aggregation by dilated convolutions. arXiv."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"532","DOI":"10.1016\/j.jvcir.2018.11.020","article-title":"A novel framework for semantic segmentation with generative adversarial network","volume":"58","author":"Zhu","year":"2019","journal-title":"J. Vis. Commun. Image Represent."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"83","DOI":"10.1016\/j.jvcir.2015.01.014","article-title":"Hybrid graphical model for semantic image segmentation","volume":"28","author":"Wang","year":"2015","journal-title":"J. Vis. Commun. Image Represent."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"107483","DOI":"10.1016\/j.sigpro.2020.107483","article-title":"Unsupervised fuzzy model-based image segmentation","volume":"171","author":"Choy","year":"2020","journal-title":"Signal Process."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"105919","DOI":"10.1016\/j.engappai.2023.105919","article-title":"RGB-T image analysis technology and application: A survey","volume":"120","author":"Song","year":"2023","journal-title":"Eng. Appl. Artif. Intell."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"30","DOI":"10.1016\/j.sigpro.2018.08.010","article-title":"Fuzzy bit-plane-dependence image segmentation","volume":"154","author":"Choy","year":"2019","journal-title":"Signal Process."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"2903","DOI":"10.1109\/TNNLS.2020.3046924","article-title":"Generalized zero-shot learning with multiple graph adaptive generative networks","volume":"33","author":"Xie","year":"2021","journal-title":"IEEE Trans. Neural Netw. Learn. Syst."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Xie, G.S., Liu, L., Zhu, F., Zhao, F., Zhang, Z., Yao, Y., Qin, J., and Shao, L. (2020, January 23\u201328). Region graph embedding network for zero-shot learning. Proceedings of the Computer Vision\u2013ECCV 2020: 16th European Conference, Glasgow, UK.","DOI":"10.1007\/978-3-030-58548-8_33"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Xie, G.S., Liu, L., Jin, X., Zhu, F., Zhang, Z., Qin, J., Yao, Y., and Shao, L. (2019, January 15\u201320). Attentive region embedding network for zero-shot learning. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00961"},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"1801","DOI":"10.1109\/TII.2021.3090036","article-title":"Deep metric learning-based for multi-target few-shot pavement distress Classification","volume":"18","author":"Dong","year":"2021","journal-title":"IEEE Trans. Ind. Inform."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"968","DOI":"10.1109\/TMM.2021.3061816","article-title":"Semantically meaningful class prototype learning for one-shot image segmentation","volume":"24","author":"Chen","year":"2021","journal-title":"IEEE Trans. Multimed."},{"key":"ref_23","first-page":"1","article-title":"Triplet-graph reasoning network for few-shot metal generic surface defect segmentation","volume":"70","author":"Bao","year":"2021","journal-title":"IEEE Trans. Instrum. Meas."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"442","DOI":"10.1016\/j.jvcir.2017.03.014","article-title":"Collaborative sparse representation leaning model for RGBD action recognition","volume":"48","author":"Gao","year":"2017","journal-title":"J. Vis. Commun. Image Represent."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"186","DOI":"10.1016\/j.jvcir.2019.01.016","article-title":"RETRACTED: An iterative propagation based co-saliency framework for RGBD images","volume":"59","author":"Xu","year":"2019","journal-title":"J. Vis. Commun. Image Represent."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"206","DOI":"10.1016\/j.inffus.2018.06.005","article-title":"Pedestrian detection with unsupervised multispectral feature learning using deep neural networks","volume":"46","author":"Cao","year":"2019","journal-title":"Inf. Fusion"},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"102881","DOI":"10.1016\/j.jvcir.2020.102881","article-title":"Learning discriminative update adaptive spatial-temporal regularized correlation filter for RGB-T tracking","volume":"72","author":"Feng","year":"2020","journal-title":"J. Vis. Commun. Image Represent."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"102616","DOI":"10.1016\/j.jvcir.2019.102616","article-title":"Scene flow estimation by depth map upsampling and layer assignment for camera-LiDAR system","volume":"64","author":"Zou","year":"2019","journal-title":"J. Vis. Commun. Image Represent."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Zhang, Y., Sidib\u00e9, D., Morel, O., and Meriaudeau, F. (2021, January 10\u201315). Incorporating depth information into few-shot semantic segmentation. Proceedings of the 2020 25th International Conference on Pattern Recognition (ICPR), Milan, Italy.","DOI":"10.1109\/ICPR48806.2021.9412921"},{"key":"ref_30","unstructured":"Zhao, Y., Song, K., Zhang, Y., and Yan, Y. (2023). IEEE Transactions on Circuits and Systems II: Express Briefs, IEEE."},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Shaban, A., Bansal, S., Liu, Z., Essa, I., and Boots, B. (2017). One-shot learning for semantic segmentation. arXiv.","DOI":"10.5244\/C.31.167"},{"key":"ref_32","unstructured":"Song, K., Wang, J., Bao, Y., Huang, L., and Yan, Y. (2022). IEEE\/ASME Transactions on Mechatronics, IEEE."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"924","DOI":"10.1016\/j.engappai.2012.08.009","article-title":"Automatic scene calibration for detecting and tracking people using a single camera","volume":"26","author":"Perdomo","year":"2013","journal-title":"Eng. Appl. Artif. Intell."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Shivakumar, S.S., Rodrigues, N., Zhou, A., Miller, I.D., Kumar, V., and Taylor, C.J. (August, January 31). Pst900: Rgb-thermal calibration, dataset and segmentation network. Proceedings of the 2020 IEEE international conference on robotics and automation (ICRA), Paris, France.","DOI":"10.1109\/ICRA40945.2020.9196831"},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Wang, C., Yang, G., and Papanastasiou, G. (2022). Unsupervised image registration towards enhancing performance and explainability in cardiac and brain image analysis. Sensors, 22.","DOI":"10.3390\/s22062125"},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Xie, J., Jin, X., and Cao, H. (2021, January 5\u20138). SMRD: A Local Feature Descriptor for Multi-modal Image Registration. Proceedings of the 2021 International Conference on Visual Communications and Image Processing (VCIP), Munich, Germany.","DOI":"10.1109\/VCIP53242.2021.9675401"},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Min, J., Kang, D., and Cho, M. (2021, January 11\u201317). Hypercorrelation squeeze for few-shot segmentation. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Montreal, BC, Canada.","DOI":"10.1109\/ICCV48922.2021.00686"},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"521","DOI":"10.1016\/j.irbm.2022.05.002","article-title":"A Review on Convolutional Neural Networks for Brain Tumor Segmentation: Methods, Datasets, Libraries, and Future Directions","volume":"43","author":"Balwant","year":"2022","journal-title":"IRBM"},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Rehman, M.U., Cho, S., Kim, J., and Chong, K.T. (2021). Brainseg-net: Brain tumor mr image segmentation via enhanced encoder\u2013decoder network. Diagnostics, 11.","DOI":"10.3390\/diagnostics11020169"},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"448","DOI":"10.3174\/ajnr.A7419","article-title":"Automated 3D fetal brain segmentation using an optimized deep learning approach","volume":"43","author":"Zhao","year":"2022","journal-title":"Am. J. Neuroradiol."},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"106426","DOI":"10.1016\/j.compbiomed.2022.106426","article-title":"RAAGR2-Net: A brain tumor segmentation network using parallel processing of multiple spatial frames","volume":"152","author":"Rehman","year":"2023","journal-title":"Comput. Biol. Med."},{"key":"ref_42","doi-asserted-by":"crossref","first-page":"103541","DOI":"10.1016\/j.bspc.2022.103541","article-title":"MR brain segmentation based on DE-ResUnet combining texture features and background knowledge","volume":"75","author":"Wu","year":"2022","journal-title":"Biomed. Signal Process. Control"},{"key":"ref_43","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1109\/TIM.2022.3216413","article-title":"Electrical thermal image semantic segmentation: Large-scale dataset and baseline","volume":"71","author":"Wang","year":"2022","journal-title":"IEEE Trans. Instrum. Meas."},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Maheswari, B., and Reeja, S.R. (2023). Thermal infrared image semantic segmentation for night-time driving scenes based on deep learning. Multimed. Tools Appl., 1\u201326.","DOI":"10.1007\/s11042-023-15882-0"},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Wang, F., Ding, Z., Shi, T., and Tang, J. (2023, January 6\u20138). EdgeFormer: Edge-assisted transformer for thermal images semantic segmentation. Proceedings of the Second International Conference on Electronic Information Engineering, Big Data, and Computer Technology (EIBDCT), Xishuangbanna, China.","DOI":"10.1117\/12.2674788"},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"2205","DOI":"10.1109\/LRA.2023.3247175","article-title":"CEKD: Cross-Modal Edge-Privileged Knowledge Distillation for Semantic Scene Understanding Using Only Thermal Images","volume":"8","author":"Feng","year":"2023","journal-title":"IEEE Robot. Autom. Lett."},{"key":"ref_47","doi-asserted-by":"crossref","first-page":"104709","DOI":"10.1016\/j.engappai.2022.104709","article-title":"A novel fuzzy clustering based method for image segmentation in RGB-D images","volume":"111","author":"Yadav","year":"2022","journal-title":"Eng. Appl. Artif. Intell."},{"key":"ref_48","doi-asserted-by":"crossref","unstructured":"Zhang, Q., Zhao, S., Luo, Y., Zhang, D., Huang, N., and Han, J. (2021, January 20\u201325). ABMDRNet: Adaptive-weighted bi-directional modality difference reduction network for RGB-T semantic segmentation. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.00266"},{"key":"ref_49","doi-asserted-by":"crossref","unstructured":"Ha, Q., Watanabe, K., Karasawa, T., Ushiku, Y., and Harada, T. (2017, January 24\u201328). MFNet: Towards real-time semantic segmentation for autonomous vehicles with multi-spectral scenes. Proceedings of the 2017 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS), Vancouver, BC, Canada.","DOI":"10.1109\/IROS.2017.8206396"},{"key":"ref_50","doi-asserted-by":"crossref","unstructured":"Vertens, J., Z\u00fcrn, J., and Burgard, W. (2020\u201324, January 24). Heatnet: Bridging the day-night domain gap in semantic segmentation with thermal images. Proceedings of the 2020 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS), Las Vegas, NV, USA.","DOI":"10.1109\/IROS45743.2020.9341192"},{"key":"ref_51","doi-asserted-by":"crossref","first-page":"5817","DOI":"10.1007\/s10489-021-02687-7","article-title":"MMNet: Multi-modal multi-stage network for RGB-T image semantic segmentation","volume":"52","author":"Lan","year":"2022","journal-title":"Appl. Intell."},{"key":"ref_52","doi-asserted-by":"crossref","unstructured":"Wang, J., Wang, Z., Tao, D., See, S., and Wang, G. (2016, January 11\u201314). Learning common and specific features for RGB-D semantic segmentation with deconvolutional networks. Proceedings of the Computer Vision\u2013ECCV 2016: 14th European Conference, Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46454-1_40"},{"key":"ref_53","unstructured":"Jiang, J., Zheng, L., Luo, F., and Zhang, Z. (2018). Rednet: Residual encoder-decoder network for indoor rgb-d semantic segmentation. arXiv."},{"key":"ref_54","doi-asserted-by":"crossref","unstructured":"Wu, Z., Allibert, G., Stolz, C., Ma, C., and Demonceaux, C. (2022). Depth-adapted CNNs for RGB-D semantic segmentation. arXiv.","DOI":"10.1007\/978-3-030-69538-5_24"},{"key":"ref_55","doi-asserted-by":"crossref","unstructured":"Zhang, C., Lin, G., Liu, F., Yao, R., and Shen, C. (2019, January 15\u201320). Canet: Class-agnostic segmentation networks with iterative refinement and attentive few-shot learning. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00536"},{"key":"ref_56","unstructured":"Zhang, C., Lin, G., Liu, F., Guo, J., Wu, Q., and Yao, R. (November, January 27). Pyramid graph networks with connection attentions for region-based one-shot semantic segmentation. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Seoul, Republic of Korea."},{"key":"ref_57","doi-asserted-by":"crossref","first-page":"1050","DOI":"10.1109\/TPAMI.2020.3013717","article-title":"Prior guided feature enrichment network for few-shot segmentation","volume":"44","author":"Tian","year":"2020","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_58","doi-asserted-by":"crossref","unstructured":"Li, G., Jampani, V., Sevilla-Lara, L., Sun, D., Kim, J., and Kim, J. (2021, January 20\u201325). Adaptive prototype learning and allocation for few-shot segmentation. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.00823"},{"key":"ref_59","doi-asserted-by":"crossref","unstructured":"Kang, D., and Cho, M. (2022, January 18\u201324). Integrative few-shot learning for classification and segmentation. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA.","DOI":"10.1109\/CVPR52688.2022.00974"},{"key":"ref_60","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep residual learning for image recognition. Proceedings of the IEEE conference on computer vision and pattern recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_61","doi-asserted-by":"crossref","unstructured":"Wu, Z., Pan, S., Long, G., Jiang, J., and Zhang, C. (2019). Graph wavenet for deep spatial-temporal graph modeling. arXiv.","DOI":"10.24963\/ijcai.2019\/264"},{"key":"ref_62","doi-asserted-by":"crossref","unstructured":"Deng, J., Dong, W., Socher, R., Li, L.J., Li, K., and Fei-Fei, L. (2009, January 20\u201325). Imagenet: A large-scale hierarchical image database. Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition, Miami, FL, USA.","DOI":"10.1109\/CVPR.2009.5206848"},{"key":"ref_63","first-page":"8026","article-title":"Pytorch: An imperative style, high-performance deep learning library","volume":"32","author":"Paszke","year":"2019","journal-title":"NeurIPS"},{"key":"ref_64","unstructured":"Kingma, D.P., and Ba, J. (2014). Adam: A method for stochastic optimization. arXiv."}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/23\/14\/6612\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T20:17:12Z","timestamp":1760127432000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/23\/14\/6612"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,7,22]]},"references-count":64,"journal-issue":{"issue":"14","published-online":{"date-parts":[[2023,7]]}},"alternative-id":["s23146612"],"URL":"https:\/\/doi.org\/10.3390\/s23146612","relation":{},"ISSN":["1424-8220"],"issn-type":[{"value":"1424-8220","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,7,22]]}}}