{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,18]],"date-time":"2026-02-18T23:46:06Z","timestamp":1771458366835,"version":"3.50.1"},"reference-count":62,"publisher":"Association for Computing Machinery (ACM)","issue":"1","license":[{"start":{"date-parts":[[2023,9,18]],"date-time":"2023-09-18T00:00:00Z","timestamp":1694995200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100012166","name":"National Key R&D Program of China","doi-asserted-by":"crossref","award":["2021ZD0111902"],"award-info":[{"award-number":["2021ZD0111902"]}],"id":[{"id":"10.13039\/501100012166","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"crossref","award":["62072015, U21B2038, U19B2039, 61772209"],"award-info":[{"award-number":["62072015, U21B2038, U19B2039, 61772209"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/501100012245","name":"Science and Technology Planning Project of Guangdong Province","doi-asserted-by":"crossref","award":["2019A050510034"],"award-info":[{"award-number":["2019A050510034"]}],"id":[{"id":"10.13039\/501100012245","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Multimedia Comput. Commun. Appl."],"published-print":{"date-parts":[[2024,1,31]]},"abstract":"<jats:p>Video object segmentation (VOS) exhibits heavy occlusions, large deformation, and severe motion blur. While many remarkable convolutional neural networks are devoted to the VOS task, they often mis-identify background noise as the target or output coarse object boundaries, due to the failure of mining detail information and high-order correlations of pixels within the whole video. In this work, we propose an edge attention gated graph convolutional network (GCN) for VOS. The seed point initialization and graph construction stages construct a spatio-temporal graph of the video by exploring the spatial intra-frame correlation and the temporal inter-frame correlation of superpixels. The node classification stage identifies foreground superpixels by using an edge attention gated GCN which mines higher-order correlations between superpixels and propagates features among different nodes. The segmentation optimization stage optimizes the classification of foreground superpixels and reduces segmentation errors by using a global appearance model which captures the long-term stable feature of objects. In summary, the key contribution of our framework is twofold: (a) the spatio-temporal graph representation can propagate the seed points of the first frame to subsequent frames and facilitate our framework for the semi-supervised VOS task; and (b) the edge attention gated GCN can learn the importance of each node with respect to both the neighboring nodes and the whole task with a small number of layers. Experiments on Davis 2016 and Davis 2017 datasets show that our framework achieves the excellent performance with only small training samples (45 video sequences).<\/jats:p>","DOI":"10.1145\/3611389","type":"journal-article","created":{"date-parts":[[2023,8,1]],"date-time":"2023-08-01T11:54:43Z","timestamp":1690890883000},"page":"1-23","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":2,"title":["Semi-supervised Video Object Segmentation Via an Edge Attention Gated Graph Convolutional Network"],"prefix":"10.1145","volume":"20","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-0015-8874","authenticated-orcid":false,"given":"Yuqing","family":"Zhang","sequence":"first","affiliation":[{"name":"Beijing Key Laboratory of Multimedia and Intelligent Software Technology, Beijing Institute of Artificial Intelligence, Faculty of Information Technology, Beijing University of Technology, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6650-6790","authenticated-orcid":false,"given":"Yong","family":"Zhang","sequence":"additional","affiliation":[{"name":"Beijing Key Laboratory of Multimedia and Intelligent Software Technology, Beijing Institute of Artificial Intelligence, Faculty of Information Technology, Beijing University of Technology, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3045-624X","authenticated-orcid":false,"given":"Shaofan","family":"Wang","sequence":"additional","affiliation":[{"name":"Beijing Key Laboratory of Multimedia and Intelligent Software Technology, Beijing Institute of Artificial Intelligence, Faculty of Information Technology, Beijing University of Technology, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0799-0054","authenticated-orcid":false,"given":"Yun","family":"Liang","sequence":"additional","affiliation":[{"name":"Guangzhou Key Laboratory of Intelligent Agriculture, College of Mathematics and Informatics, South China Agricultural University, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8125-4648","authenticated-orcid":false,"given":"Baocai","family":"Yin","sequence":"additional","affiliation":[{"name":"Beijing Key Laboratory of Multimedia and Intelligent Software Technology, Beijing Institute of Artificial Intelligence, Faculty of Information Technology, Beijing University of Technology, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2023,9,18]]},"reference":[{"key":"e_1_3_2_2_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2012.120"},{"key":"e_1_3_2_3_2","article-title":"Hybrid sequence to sequence model for video object segmentation","volume":"2010","author":"Azimi Fatemeh","year":"2020","unstructured":"Fatemeh Azimi, Stanislav Frolov, Federico Raue, and Andreas Dengel. 2020. Hybrid sequence to sequence model for video object segmentation. CoRR abs\/2010.05069 (2020).","journal-title":"CoRR"},{"key":"e_1_3_2_4_2","first-page":"5977","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Bao Linchao","year":"2018","unstructured":"Linchao Bao, Baoyuan Wu, and Wei Liu. 2018. CNN in MRF: Video object segmentation via inference in a CNN-based higher-order spatio-temporal MRF. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 5977\u20135986."},{"key":"e_1_3_2_5_2","article-title":"Residual gated graph convNets","volume":"1711","author":"Bresson Xavier","year":"2017","unstructured":"Xavier Bresson and Thomas Laurent. 2017. Residual gated graph convNets. CoRR abs\/1711.07553 (2017). arXiv:1711.07553","journal-title":"CoRR"},{"key":"e_1_3_2_6_2","first-page":"221","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Caelles Sergi","year":"2017","unstructured":"Sergi Caelles, Kevis-Kokitsi Maninis, Jordi Pont-Tuset, Laura Leal-Taixe, Daniel Cremers, and Luc Van Gool. 2017. One-shot video object segmentation. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 221\u2013230."},{"key":"e_1_3_2_7_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.1986.4767851"},{"key":"e_1_3_2_8_2","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2018.2890361"},{"issue":"2","key":"e_1_3_2_9_2","article-title":"Appearance-consistent video object segmentation based on a multinomial event model","volume":"15","author":"Chen Yadang","year":"2019","unstructured":"Yadang Chen, Chuanyan Hao, Alex X. Liu, and Enhua Wu. 2019. Appearance-consistent video object segmentation based on a multinomial event model. ACM Transactions on Multimedia Computing Communications and Applications 15, 2 (2019), 40.1\u201340.15.","journal-title":"ACM Transactions on Multimedia Computing Communications and Applications"},{"key":"e_1_3_2_10_2","first-page":"11781","volume-title":"Advances in Neural Information Processing Systems","author":"Cheng Ho Kei","year":"2021","unstructured":"Ho Kei Cheng, Yu-Wing Tai, and Chi-Keung Tang. 2021. Rethinking space-time networks with improved memory coverage for efficient video object segmentation. In Advances in Neural Information Processing Systems, Vol. 34. 11781\u201311794."},{"key":"e_1_3_2_11_2","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2019.2961267"},{"key":"e_1_3_2_12_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2020.03.020"},{"key":"e_1_3_2_13_2","first-page":"4142","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Hu Li","year":"2021","unstructured":"Li Hu, Peng Zhang, Bang Zhang, Pan Pan, Yinghui Xu, and Rong Jin. 2021. Learning position and target consistency for memory-based video object segmentation. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 4142\u20134152."},{"key":"e_1_3_2_14_2","volume-title":"Advances in Neural Information Processing Systems","author":"Hu Yuan-Ting","year":"2017","unstructured":"Yuan-Ting Hu, Jia-Bin Huang, and Alexander Schwing. 2017. MaskRNN: Instance level video object segmentation. In Advances in Neural Information Processing Systems, Vol. 30."},{"key":"e_1_3_2_15_2","first-page":"54","volume-title":"Proceedings of the European Conference on Computer Vision","author":"Hu Yuan-Ting","year":"2018","unstructured":"Yuan-Ting Hu, Jia-Bin Huang, and Alexander Schwing. 2018. VideoMatch: Matching based video object segmentation. In Proceedings of the European Conference on Computer Vision. 54\u201370."},{"key":"e_1_3_2_16_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00890"},{"key":"e_1_3_2_17_2","first-page":"2462","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Ilg Eddy","year":"2017","unstructured":"Eddy Ilg, Nikolaus Mayer, Tonmoy Saikia, Margret Keuper, Alexey Dosovitskiy, and Thomas Brox. 2017. FlowNet 2.0: Evolution of optical flow estimation with deep networks. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 2462\u20132470."},{"key":"e_1_3_2_18_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00916"},{"key":"e_1_3_2_19_2","first-page":"8709","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Li Liulei","year":"2022","unstructured":"Liulei Li, Tianfei Zhou, Wenguan Wang, Lu Yang, Jianwu Li, and Yi Yang. 2022. Locality-aware inter-and intra-video reconstruction for self-supervised correspondence learning. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 8709\u20138720."},{"key":"e_1_3_2_20_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-022-01655-z"},{"key":"e_1_3_2_21_2","first-page":"2100","volume-title":"Proceedings of the AAAI Conference on Artificial Intelligence","volume":"35","author":"Liu Daizong","year":"2021","unstructured":"Daizong Liu, Shuangjie Xu, Xiao-Yang Liu, Zichuan Xu, Wei Wei, and Pan Zhou. 2021. Spatiotemporal graph neural network based mask reconstruction for video object segmentation. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 35. 2100\u20132108."},{"issue":"4","key":"e_1_3_2_22_2","doi-asserted-by":"crossref","first-page":"1607","DOI":"10.1109\/TCSVT.2020.3010293","article-title":"Guided co-segmentation network for fast video object segmentation","volume":"31","author":"Liu Weide","year":"2021","unstructured":"Weide Liu, Guosheng Lin, Tianyi Zhang, and Zichuan Liu. 2021. Guided co-segmentation network for fast video object segmentation. IEEE Transactions on Circuits and Systems for Video Technology 31, 4 (2021), 1607\u20131617.","journal-title":"IEEE Transactions on Circuits and Systems for Video Technology"},{"key":"e_1_3_2_23_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIM.2022.3219307"},{"key":"e_1_3_2_24_2","first-page":"661","volume-title":"Proceedings of the European Conference on Computer Vision","author":"Lu Xiankai","year":"2020","unstructured":"Xiankai Lu, Wenguan Wang, Martin Danelljan, Tianfei Zhou, Jianbing Shen, and Luc Van Gool. 2020. Video object segmentation with episodic graph memory networks. In Proceedings of the European Conference on Computer Vision. 661\u2013679."},{"key":"e_1_3_2_25_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00374"},{"key":"e_1_3_2_26_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00898"},{"key":"e_1_3_2_27_2","first-page":"743","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Maerki Nicolas","year":"2016","unstructured":"Nicolas Maerki, Federico Perazzi, Oliver Wang, and Alexander Sorkine-Hornung. 2016. Bilateral space video segmentation. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 743\u2013751."},{"key":"e_1_3_2_28_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2018.2838670"},{"key":"e_1_3_2_29_2","article-title":"Semi-supervised classification with graph convolutional networks","volume":"1609","author":"N-Kipf Thomas","year":"2016","unstructured":"Thomas N-Kipf and Max Welling. 2016. Semi-supervised classification with graph convolutional networks. CoRR abs\/1609.02907 (2016).","journal-title":"CoRR"},{"key":"e_1_3_2_30_2","first-page":"7376","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Oh Seoung Wug","year":"2018","unstructured":"Seoung Wug Oh, Joon-Young Lee, Kalyan Sunkavalli, and Seon Joo Kim. 2018. Fast video object segmentation by reference-guided mask propagation. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 7376\u20137385."},{"key":"e_1_3_2_31_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00932"},{"key":"e_1_3_2_32_2","first-page":"724","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Perazzi Federico","year":"2016","unstructured":"Federico Perazzi, Jordi Pont-Tuset, Brian McWilliams, Luc Van Gool, Markus Gross, and Alexander Sorkine-Hornung. 2016. A benchmark dataset and evaluation methodology for video object segmentation. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 724\u2013732."},{"key":"e_1_3_2_33_2","first-page":"3227","volume-title":"Proceedings of the IEEE\/CVF International Conference on Computer Vision","author":"Perazzi Federico","year":"2015","unstructured":"Federico Perazzi, Oliver Wang, Markus Gross, and Alexander Sorkine. 2015. Fully connected object proposals for video segmentation. In Proceedings of the IEEE\/CVF International Conference on Computer Vision. 3227\u20133234."},{"key":"e_1_3_2_34_2","article-title":"The 2017 Davis challenge on video object segmentation","author":"Pont-Tuset Jordi","year":"2017","unstructured":"Jordi Pont-Tuset, Federico Perazzi, Sergi Caelles, Pablo Arbelez, Alex Sorkine-Hornung, and Luc Van Gool. 2017. The 2017 Davis challenge on video object segmentation. arXiv preprint arXiv:1704.00675 (2017).","journal-title":"arXiv preprint arXiv:1704.00675"},{"key":"e_1_3_2_35_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00743"},{"key":"e_1_3_2_36_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00474"},{"key":"e_1_3_2_37_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-58542-6_38"},{"key":"e_1_3_2_38_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.01080"},{"issue":"1","key":"e_1_3_2_39_2","first-page":"175","article-title":"Real time video object segmentation in compressed domain","volume":"31","author":"Tan Zhentao","year":"2020","unstructured":"Zhentao Tan, Bin Liu, Qi Chu, Hangshi Zhong, Yue Wu, Weihai Li, and Nenghai Yu. 2020. Real time video object segmentation in compressed domain. IEEE Transactions on Circuits and Systems for Video Technology 31, 1 (2020), 175\u2013188.","journal-title":"IEEE Transactions on Circuits and Systems for Video Technology"},{"key":"e_1_3_2_40_2","first-page":"3899","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Tsai Yihsuan","year":"2016","unstructured":"Yihsuan Tsai, Minghsuan Yang, and Michael Black. 2016. Video segmentation via object flow. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 3899\u20133908."},{"key":"e_1_3_2_41_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00971"},{"key":"e_1_3_2_42_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00142"},{"key":"e_1_3_2_43_2","first-page":"10797","volume-title":"Proceedings of the IEEE\/CVF International Conference on Computer Vision","author":"Wang Tao","year":"2021","unstructured":"Tao Wang, Ning Xu, Kean Chen, and Weiyao Lin. 2021. End-to-end video instance segmentation via spatial-temporal graph neural networks. In Proceedings of the IEEE\/CVF International Conference on Computer Vision. 10797\u201310806."},{"key":"e_1_3_2_44_2","first-page":"9236","volume-title":"Proceedings of the IEEE\/CVF International Conference on Computer Vision","author":"Wang Wenguan","year":"2019","unstructured":"Wenguan Wang, Xiankai Lu, Jianbing Shen, David J. Crandall, and Ling Shao. 2019. Zero-shot video object segmentation via attentive graph neural networks. In Proceedings of the IEEE\/CVF International Conference on Computer Vision. 9236\u20139245."},{"issue":"7","key":"e_1_3_2_45_2","doi-asserted-by":"crossref","first-page":"2413","DOI":"10.1109\/TPAMI.2020.2966453","article-title":"Paying attention to video object pattern understanding","volume":"43","author":"Wang Wenguan","year":"2021","unstructured":"Wenguan Wang, Jianbing Shen, Xiankai Lu, Steven C. H. Hoi, and Haibin Ling. 2021. Paying attention to video object pattern understanding. IEEE Transactions on Pattern Analysis and Machine Intelligence 43, 7 (2021), 2413\u20132428.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"e_1_3_2_46_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2017.2745098"},{"key":"e_1_3_2_47_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2018.2819173"},{"key":"e_1_3_2_48_2","first-page":"1671","volume-title":"Proceedings of the IEEE\/CVF International Conference on Computer Vision","author":"Wang Wenguan","year":"2017","unstructured":"Wenguan Wang, Jianbing Shen, Jianwen Xie, and Fatih Porikli. 2017. Super-trajectory for video segmentation. In Proceedings of the IEEE\/CVF International Conference on Computer Vision. 1671\u20131679."},{"key":"e_1_3_2_49_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2017.2662005"},{"key":"e_1_3_2_50_2","article-title":"A survey on deep learning technique for video segmentation","author":"Wang Wenguan","year":"2021","unstructured":"Wenguan Wang, Tianfei Zhou, Fatih Porikli, David Crandall, and Luc Van Gool. 2021. A survey on deep learning technique for video segmentation. arXiv preprint arXiv:2107.01153 (2021).","journal-title":"arXiv preprint arXiv:2107.01153"},{"issue":"5","key":"e_1_3_2_51_2","first-page":"1205","article-title":"Online meta adaptation for fast video object segmentation","volume":"42","author":"Xiao Huaxin","year":"2019","unstructured":"Huaxin Xiao, Bingyi Kang, Yu Liu, Maojun Zhang, and Jiashi Feng. 2019. Online meta adaptation for fast video object segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence 42, 5 (2019), 1205\u20131217.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"e_1_3_2_52_2","first-page":"1286","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Xie Haozhe","year":"2021","unstructured":"Haozhe Xie, Hongxun Yao, Shangchen Zhou, Shengping Zhang, and Wenxiu Sun. 2021. Efficient regional memory network for video object segmentation. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 1286\u20131295."},{"key":"e_1_3_2_53_2","first-page":"585","volume-title":"Proceedings of the European Conference on Computer Vision","author":"Xu Ning","year":"2018","unstructured":"Ning Xu, Linjie Yang, Yuchen Fan, Jianchao Yang, Dingcheng Yue, Yuchen Liang, Brian Price, Scott Cohen, and Thomas Huang. 2018. YouTube-VOS: Sequence-to-sequence video object segmentation. In Proceedings of the European Conference on Computer Vision. 585\u2013601."},{"key":"e_1_3_2_54_2","first-page":"7157","volume-title":"Proceedings of the IEEE\/CVF International Conference on Computer Vision","author":"Yang Charig","year":"2021","unstructured":"Charig Yang, Hala Lamdouar, Erika Lu, Andrew Zisserman, and Weidi Xie. 2021. Self-supervised video object segmentation by motion grouping. In Proceedings of the IEEE\/CVF International Conference on Computer Vision. 7157\u20137168."},{"key":"e_1_3_2_55_2","first-page":"13964","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Yang Fengting","year":"2020","unstructured":"Fengting Yang, Qian Sun, Hailin Jin, and Zihan Zhou. 2020. Superpixel segmentation with fully convolutional networks. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 13964\u201313973."},{"key":"e_1_3_2_56_2","first-page":"1812","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Yeo Donghun","year":"2017","unstructured":"Donghun Yeo, Jeany Son, and Bohyung Han. 2017. Superpixel-based tracking-by-segmentation using Markov chains. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 1812\u20131821."},{"key":"e_1_3_2_57_2","doi-asserted-by":"publisher","DOI":"10.1364\/OL.37.004994"},{"key":"e_1_3_2_58_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2021.3083269"},{"key":"e_1_3_2_59_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.00866"},{"key":"e_1_3_2_60_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2021.108120"},{"key":"e_1_3_2_61_2","first-page":"1225","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Zhu Lei","year":"2021","unstructured":"Lei Zhu, Qi She, Bin Zhang, Yanye Lu, Zhilin Lu, Duo Li, and Jie Hu. 2021. Learning the superpixel in a non-iterative and lifelong manner. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 1225\u20131234."},{"key":"e_1_3_2_62_2","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2021.3060015"},{"key":"e_1_3_2_63_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2019.2930152"}],"container-title":["ACM Transactions on Multimedia Computing, Communications, and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3611389","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3611389","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T16:37:09Z","timestamp":1750178229000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3611389"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,9,18]]},"references-count":62,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2024,1,31]]}},"alternative-id":["10.1145\/3611389"],"URL":"https:\/\/doi.org\/10.1145\/3611389","relation":{},"ISSN":["1551-6857","1551-6865"],"issn-type":[{"value":"1551-6857","type":"print"},{"value":"1551-6865","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,9,18]]},"assertion":[{"value":"2022-08-09","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-07-17","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-09-18","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}