{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:08:27Z","timestamp":1750219707157,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":45,"publisher":"ACM","license":[{"start":{"date-parts":[[2023,12,6]],"date-time":"2023-12-06T00:00:00Z","timestamp":1701820800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"Meizhou Tobacco Technology Project of Guangdong Province","award":["202304"],"award-info":[{"award-number":["202304"]}]},{"name":"the key R&D project of Guangzhou","award":["202206010091,2023B03J1363"],"award-info":[{"award-number":["202206010091,2023B03J1363"]}]},{"name":"the Science and Technology Planning Project of Guangdong Province","award":["2019A050510034"],"award-info":[{"award-number":["2019A050510034"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2023,12,6]]},"DOI":"10.1145\/3595916.3626445","type":"proceedings-article","created":{"date-parts":[[2024,1,1]],"date-time":"2024-01-01T16:34:41Z","timestamp":1704126881000},"page":"1-7","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["SASSM: Semantic Awareness and Self-Support Matching for Semi-Supervised Video Object Segmentation"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-0799-0054","authenticated-orcid":false,"given":"Yun","family":"Liang","sequence":"first","affiliation":[{"name":"South China Agricultural University, CN"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0008-6748-5643","authenticated-orcid":false,"given":"Ming","family":"Junhui","sequence":"additional","affiliation":[{"name":"South China Agricultural University, CN"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0003-3865-099X","authenticated-orcid":false,"given":"Jintu","family":"Zheng","sequence":"additional","affiliation":[{"name":"Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, CN"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2024,1]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"Proceedings, Part II 16","author":"Bhat Goutam","year":"2020","unstructured":"Goutam Bhat , Felix\u00a0J\u00e4remo Lawin , Martin Danelljan , Andreas Robinson , Michael Felsberg , Luc Van\u00a0Gool , and Radu Timofte . 2020 . Learning what to learn for video object segmentation. In Computer Vision\u2013ECCV 2020: 16th European Conference, Glasgow, UK, August 23\u201328, 2020 , Proceedings, Part II 16 . Springer, 777\u2013794. Goutam Bhat, Felix\u00a0J\u00e4remo Lawin, Martin Danelljan, Andreas Robinson, Michael Felsberg, Luc Van\u00a0Gool, and Radu Timofte. 2020. Learning what to learn for video object segmentation. In Computer Vision\u2013ECCV 2020: 16th European Conference, Glasgow, UK, August 23\u201328, 2020, Proceedings, Part II 16. Springer, 777\u2013794."},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.565"},{"key":"e_1_3_2_1_3_1","volume-title":"Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs","author":"Chen Liang-Chieh","year":"2017","unstructured":"Liang-Chieh Chen , George Papandreou , Iasonas Kokkinos , Kevin Murphy , and Alan\u00a0 L Yuille . 2017 . Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs . IEEE transactions on pattern analysis and machine intelligence 40, 4 (2017), 834\u2013848. Liang-Chieh Chen, George Papandreou, Iasonas Kokkinos, Kevin Murphy, and Alan\u00a0L Yuille. 2017. Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs. IEEE transactions on pattern analysis and machine intelligence 40, 4 (2017), 834\u2013848."},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01234-2_49"},{"key":"e_1_3_2_1_5_1","volume-title":"European Conference on Computer Vision. Springer, 640\u2013658","author":"Cheng Ho\u00a0Kei","year":"2022","unstructured":"Ho\u00a0Kei Cheng and Alexander\u00a0 G Schwing . 2022 . Xmem: Long-term video object segmentation with an atkinson-shiffrin memory model . In European Conference on Computer Vision. Springer, 640\u2013658 . Ho\u00a0Kei Cheng and Alexander\u00a0G Schwing. 2022. Xmem: Long-term video object segmentation with an atkinson-shiffrin memory model. In European Conference on Computer Vision. Springer, 640\u2013658."},{"key":"e_1_3_2_1_6_1","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 5559\u20135568","author":"Cheng Ho\u00a0Kei","year":"2021","unstructured":"Ho\u00a0Kei Cheng , Yu-Wing Tai , and Chi-Keung Tang . 2021 . Modular interactive video object segmentation: Interaction-to-mask, propagation and difference-aware fusion . In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 5559\u20135568 . Ho\u00a0Kei Cheng, Yu-Wing Tai, and Chi-Keung Tang. 2021. Modular interactive video object segmentation: Interaction-to-mask, propagation and difference-aware fusion. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 5559\u20135568."},{"key":"e_1_3_2_1_7_1","first-page":"11781","article-title":"Rethinking space-time networks with improved memory coverage for efficient video object segmentation","volume":"34","author":"Cheng Ho\u00a0Kei","year":"2021","unstructured":"Ho\u00a0Kei Cheng , Yu-Wing Tai , and Chi-Keung Tang . 2021 . Rethinking space-time networks with improved memory coverage for efficient video object segmentation . Advances in Neural Information Processing Systems 34 (2021), 11781 \u2013 11794 . Ho\u00a0Kei Cheng, Yu-Wing Tai, and Chi-Keung Tang. 2021. Rethinking space-time networks with improved memory coverage for efficient video object segmentation. Advances in Neural Information Processing Systems 34 (2021), 11781\u201311794.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_1_8_1","volume-title":"Proceedings of the IEEE conference on computer vision and pattern recognition. 7415\u20137424","author":"Cheng Jingchun","year":"2018","unstructured":"Jingchun Cheng , Yi-Hsuan Tsai , Wei-Chih Hung , Shengjin Wang , and Ming-Hsuan Yang . 2018 . Fast and accurate online video object segmentation via tracking parts . In Proceedings of the IEEE conference on computer vision and pattern recognition. 7415\u20137424 . Jingchun Cheng, Yi-Hsuan Tsai, Wei-Chih Hung, Shengjin Wang, and Ming-Hsuan Yang. 2018. Fast and accurate online video object segmentation via tracking parts. In Proceedings of the IEEE conference on computer vision and pattern recognition. 7415\u20137424."},{"key":"e_1_3_2_1_9_1","unstructured":"Kevin Duarte Yogesh\u00a0S Rawat and Mubarak Shah. [n. d.]. CapsuleVOS: Semi-Supervised Video Object Segmentation Using Capsule Routing. ([n. d.]).  Kevin Duarte Yogesh\u00a0S Rawat and Mubarak Shah. [n. d.]. CapsuleVOS: Semi-Supervised Video Object Segmentation Using Capsule Routing. ([n. d.])."},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00326"},{"key":"e_1_3_2_1_11_1","volume-title":"Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition. 16836\u201316845","author":"Ge Wenbin","year":"2021","unstructured":"Wenbin Ge , Xiankai Lu , and Jianbing Shen . 2021 . Video object segmentation using global and instance embedding learning . In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition. 16836\u201316845 . Wenbin Ge, Xiankai Lu, and Jianbing Shen. 2021. Video object segmentation using global and instance embedding learning. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition. 16836\u201316845."},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.90"},{"key":"e_1_3_2_1_13_1","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 4144\u20134154","author":"Hu Li","year":"2021","unstructured":"Li Hu , Peng Zhang , Bang Zhang , Pan Pan , Yinghui Xu , and Rong Jin . 2021 . Learning position and target consistency for memory-based video object segmentation . In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 4144\u20134154 . Li Hu, Peng Zhang, Bang Zhang, Pan Pan, Yinghui Xu, and Rong Jin. 2021. Learning position and target consistency for memory-based video object segmentation. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 4144\u20134154."},{"key":"e_1_3_2_1_14_1","volume-title":"Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition. 8879\u20138889","author":"Huang Xuhua","year":"2020","unstructured":"Xuhua Huang , Jiarui Xu , Yu-Wing Tai , and Chi-Keung Tang . 2020 . Fast video object segmentation with temporal aggregation network and dynamic template matching . In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition. 8879\u20138889 . Xuhua Huang, Jiarui Xu, Yu-Wing Tai, and Chi-Keung Tang. 2020. Fast video object segmentation with temporal aggregation network and dynamic template matching. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition. 8879\u20138889."},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00069"},{"key":"e_1_3_2_1_16_1","volume-title":"Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980","author":"Kingma P","year":"2014","unstructured":"Diederik\u00a0 P Kingma and Jimmy Ba . 2014 . Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014). Diederik\u00a0P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)."},{"key":"e_1_3_2_1_17_1","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 1332\u20131341","author":"Li Mingxing","year":"2022","unstructured":"Mingxing Li , Li Hu , Zhiwei Xiong , Bang Zhang , Pan Pan , and Dong Liu . 2022 . Recurrent dynamic embedding for video object segmentation . In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 1332\u20131341 . Mingxing Li, Li Hu, Zhiwei Xiong, Bang Zhang, Pan Pan, and Dong Liu. 2022. Recurrent dynamic embedding for video object segmentation. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 1332\u20131341."},{"key":"e_1_3_2_1_18_1","volume-title":"Proceedings, Part X 16","author":"Li Yu","year":"2020","unstructured":"Yu Li , Zhuoran Shen , and Ying Shan . 2020 . Fast video object segmentation using the global context module. In Computer Vision\u2013ECCV 2020: 16th European Conference, Glasgow, UK, August 23\u201328, 2020 , Proceedings, Part X 16 . Springer, 735\u2013750. Yu Li, Zhuoran Shen, and Ying Shan. 2020. Fast video object segmentation using the global context module. In Computer Vision\u2013ECCV 2020: 16th European Conference, Glasgow, UK, August 23\u201328, 2020, Proceedings, Part X 16. Springer, 735\u2013750."},{"key":"e_1_3_2_1_19_1","first-page":"3430","article-title":"Video object segmentation with adaptive feature bank and uncertain-region refinement","volume":"33","author":"Liang Yongqing","year":"2020","unstructured":"Yongqing Liang , Xin Li , Navid Jafari , and Jim Chen . 2020 . Video object segmentation with adaptive feature bank and uncertain-region refinement . Advances in Neural Information Processing Systems 33 (2020), 3430 \u2013 3441 . Yongqing Liang, Xin Li, Navid Jafari, and Jim Chen. 2020. Video object segmentation with adaptive feature bank and uncertain-region refinement. Advances in Neural Information Processing Systems 33 (2020), 3430\u20133441.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_1_20_1","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 1362\u20131372","author":"Lin Zhihui","year":"2022","unstructured":"Zhihui Lin , Tianyu Yang , Maomao Li , Ziyu Wang , Chun Yuan , Wenhao Jiang , and Wei Liu . 2022 . Swem: Towards real-time video object segmentation with sequential weighted expectation-maximization . In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 1362\u20131372 . Zhihui Lin, Tianyu Yang, Maomao Li, Ziyu Wang, Chun Yuan, Wenhao Jiang, and Wei Liu. 2022. Swem: Towards real-time video object segmentation with sequential weighted expectation-maximization. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 1362\u20131372."},{"key":"e_1_3_2_1_21_1","volume-title":"European Conference on Computer Vision. Springer, 468\u2013486","author":"Liu Yong","year":"2022","unstructured":"Yong Liu , Ran Yu , Fei Yin , Xinyuan Zhao , Wei Zhao , Weihao Xia , and Yujiu Yang . 2022 . Learning quality-aware dynamic memory for video object segmentation . In European Conference on Computer Vision. Springer, 468\u2013486 . Yong Liu, Ran Yu, Fei Yin, Xinyuan Zhao, Wei Zhao, Weihao Xia, and Yujiu Yang. 2022. Learning quality-aware dynamic memory for video object segmentation. In European Conference on Computer Vision. Springer, 468\u2013486."},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298965"},{"key":"e_1_3_2_1_23_1","volume-title":"Decoupled weight decay regularization. arXiv preprint arXiv:1711.05101","author":"Loshchilov Ilya","year":"2017","unstructured":"Ilya Loshchilov and Frank Hutter . 2017. Decoupled weight decay regularization. arXiv preprint arXiv:1711.05101 ( 2017 ). Ilya Loshchilov and Frank Hutter. 2017. Decoupled weight decay regularization. arXiv preprint arXiv:1711.05101 (2017)."},{"key":"e_1_3_2_1_24_1","volume-title":"Proceedings, Part III 16","author":"Lu Xiankai","year":"2020","unstructured":"Xiankai Lu , Wenguan Wang , Martin Danelljan , Tianfei Zhou , Jianbing Shen , and Luc Van\u00a0Gool . 2020 . Video object segmentation with episodic graph memory networks. In Computer Vision\u2013ECCV 2020: 16th European Conference, Glasgow, UK, August 23\u201328, 2020 , Proceedings, Part III 16 . Springer, 661\u2013679. Xiankai Lu, Wenguan Wang, Martin Danelljan, Tianfei Zhou, Jianbing Shen, and Luc Van\u00a0Gool. 2020. Video object segmentation with episodic graph memory networks. In Computer Vision\u2013ECCV 2020: 16th European Conference, Glasgow, UK, August 23\u201328, 2020, Proceedings, Part III 16. Springer, 661\u2013679."},{"key":"e_1_3_2_1_25_1","volume-title":"Proceedings of the IEEE\/CVF international conference on computer vision. 9670\u20139679","author":"Mao Yunyao","year":"2021","unstructured":"Yunyao Mao , Ning Wang , Wengang Zhou , and Houqiang Li . 2021 . Joint inductive and transductive learning for video object segmentation . In Proceedings of the IEEE\/CVF international conference on computer vision. 9670\u20139679 . Yunyao Mao, Ning Wang, Wengang Zhou, and Houqiang Li. 2021. Joint inductive and transductive learning for video object segmentation. In Proceedings of the IEEE\/CVF international conference on computer vision. 9670\u20139679."},{"key":"e_1_3_2_1_26_1","volume-title":"Make one-shot video object segmentation efficient again. Advances in neural information processing systems 33","author":"Meinhardt Tim","year":"2020","unstructured":"Tim Meinhardt and Laura Leal-Taix\u00e9 . 2020. Make one-shot video object segmentation efficient again. Advances in neural information processing systems 33 ( 2020 ), 10607\u201310619. Tim Meinhardt and Laura Leal-Taix\u00e9. 2020. Make one-shot video object segmentation efficient again. Advances in neural information processing systems 33 (2020), 10607\u201310619."},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00932"},{"key":"e_1_3_2_1_28_1","volume-title":"Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition. 8405\u20138414","author":"Park Hyojin","year":"2021","unstructured":"Hyojin Park , Jayeon Yoo , Seohyeong Jeong , Ganesh Venkatesh , and Nojun Kwak . 2021 . Learning dynamic network using a reuse gate function in semi-supervised video object segmentation . In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition. 8405\u20138414 . Hyojin Park, Jayeon Yoo, Seohyeong Jeong, Ganesh Venkatesh, and Nojun Kwak. 2021. Learning dynamic network using a reuse gate function in semi-supervised video object segmentation. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition. 8405\u20138414."},{"key":"e_1_3_2_1_29_1","volume-title":"Proceedings of the IEEE conference on computer vision and pattern recognition. 2663\u20132672","author":"Perazzi Federico","year":"2017","unstructured":"Federico Perazzi , Anna Khoreva , Rodrigo Benenson , Bernt Schiele , and Alexander Sorkine-Hornung . 2017 . Learning video object segmentation from static images . In Proceedings of the IEEE conference on computer vision and pattern recognition. 2663\u20132672 . Federico Perazzi, Anna Khoreva, Rodrigo Benenson, Bernt Schiele, and Alexander Sorkine-Hornung. 2017. Learning video object segmentation from static images. In Proceedings of the IEEE conference on computer vision and pattern recognition. 2663\u20132672."},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.85"},{"key":"e_1_3_2_1_31_1","volume-title":"The 2017 davis challenge on video object segmentation. arXiv preprint arXiv:1704.00675","author":"Pont-Tuset Jordi","year":"2017","unstructured":"Jordi Pont-Tuset , Federico Perazzi , Sergi Caelles , Pablo Arbel\u00e1ez , Alex Sorkine-Hornung , and Luc Van\u00a0Gool . 2017. The 2017 davis challenge on video object segmentation. arXiv preprint arXiv:1704.00675 ( 2017 ). Jordi Pont-Tuset, Federico Perazzi, Sergi Caelles, Pablo Arbel\u00e1ez, Alex Sorkine-Hornung, and Luc Van\u00a0Gool. 2017. The 2017 davis challenge on video object segmentation. arXiv preprint arXiv:1704.00675 (2017)."},{"key":"e_1_3_2_1_32_1","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 7406\u20137415","author":"Robinson Andreas","year":"2020","unstructured":"Andreas Robinson , Felix\u00a0Jaremo Lawin , Martin Danelljan , Fahad\u00a0Shahbaz Khan , and Michael Felsberg . 2020 . Learning fast and robust target models for video object segmentation . In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 7406\u20137415 . Andreas Robinson, Felix\u00a0Jaremo Lawin, Martin Danelljan, Fahad\u00a0Shahbaz Khan, and Michael Felsberg. 2020. Learning fast and robust target models for video object segmentation. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 7406\u20137415."},{"key":"e_1_3_2_1_33_1","volume-title":"U-net: Convolutional networks for biomedical image segmentation. In Medical Image Computing and Computer-Assisted Intervention\u2013MICCAI 2015: 18th International Conference","author":"Ronneberger Olaf","year":"2015","unstructured":"Olaf Ronneberger , Philipp Fischer , and Thomas Brox . 2015 . U-net: Convolutional networks for biomedical image segmentation. In Medical Image Computing and Computer-Assisted Intervention\u2013MICCAI 2015: 18th International Conference , Munich, Germany, October 5-9, 2015, Proceedings, Part III 18. Springer , 234\u2013241. Olaf Ronneberger, Philipp Fischer, and Thomas Brox. 2015. U-net: Convolutional networks for biomedical image segmentation. In Medical Image Computing and Computer-Assisted Intervention\u2013MICCAI 2015: 18th International Conference, Munich, Germany, October 5-9, 2015, Proceedings, Part III 18. Springer, 234\u2013241."},{"key":"e_1_3_2_1_34_1","volume-title":"Proceedings, Part XXII 16","author":"Seong Hongje","year":"2020","unstructured":"Hongje Seong , Junhyuk Hyun , and Euntai Kim . 2020 . Kernelized memory network for video object segmentation. In Computer Vision\u2013ECCV 2020: 16th European Conference, Glasgow, UK, August 23\u201328, 2020 , Proceedings, Part XXII 16 . Springer, 629\u2013645. Hongje Seong, Junhyuk Hyun, and Euntai Kim. 2020. Kernelized memory network for video object segmentation. In Computer Vision\u2013ECCV 2020: 16th European Conference, Glasgow, UK, August 23\u201328, 2020, Proceedings, Part XXII 16. Springer, 629\u2013645."},{"key":"e_1_3_2_1_35_1","volume-title":"Proceedings of the IEEE\/CVF International Conference on Computer Vision. 12889\u201312898","author":"Seong Hongje","year":"2021","unstructured":"Hongje Seong , Seoung\u00a0Wug Oh , Joon-Young Lee , Seongwon Lee , Suhyeon Lee , and Euntai Kim . 2021 . Hierarchical memory matching network for video object segmentation . In Proceedings of the IEEE\/CVF International Conference on Computer Vision. 12889\u201312898 . Hongje Seong, Seoung\u00a0Wug Oh, Joon-Young Lee, Seongwon Lee, Suhyeon Lee, and Euntai Kim. 2021. Hierarchical memory matching network for video object segmentation. In Proceedings of the IEEE\/CVF International Conference on Computer Vision. 12889\u201312898."},{"key":"e_1_3_2_1_36_1","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 1296\u20131305","author":"Wang Haochen","year":"2021","unstructured":"Haochen Wang , Xiaolong Jiang , Haibing Ren , Yao Hu , and Song Bai . 2021 . Swiftnet: Real-time video object segmentation . In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 1296\u20131305 . Haochen Wang, Xiaolong Jiang, Haibing Ren, Yao Hu, and Song Bai. 2021. Swiftnet: Real-time video object segmentation. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 1296\u20131305."},{"key":"e_1_3_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00813"},{"key":"e_1_3_2_1_38_1","volume-title":"Proceedings of the IEEE\/CVF international conference on computer vision. 3978\u20133987","author":"Wang Ziqin","year":"2019","unstructured":"Ziqin Wang , Jun Xu , Li Liu , Fan Zhu , and Ling Shao . 2019 . Ranet: Ranking attention network for fast video object segmentation . In Proceedings of the IEEE\/CVF international conference on computer vision. 3978\u20133987 . Ziqin Wang, Jun Xu, Li Liu, Fan Zhu, and Ling Shao. 2019. Ranet: Ranking attention network for fast video object segmentation. In Proceedings of the IEEE\/CVF international conference on computer vision. 3978\u20133987."},{"key":"e_1_3_2_1_39_1","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 1286\u20131295","author":"Xie Haozhe","year":"2021","unstructured":"Haozhe Xie , Hongxun Yao , Shangchen Zhou , Shengping Zhang , and Wenxiu Sun . 2021 . Efficient regional memory network for video object segmentation . In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 1286\u20131295 . Haozhe Xie, Hongxun Yao, Shangchen Zhou, Shengping Zhang, and Wenxiu Sun. 2021. Efficient regional memory network for video object segmentation. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 1286\u20131295."},{"key":"e_1_3_2_1_40_1","volume-title":"Youtube-vos: A large-scale video object segmentation benchmark. arXiv preprint arXiv:1809.03327","author":"Xu Ning","year":"2018","unstructured":"Ning Xu , Linjie Yang , Yuchen Fan , Dingcheng Yue , Yuchen Liang , Jianchao Yang , and Thomas Huang . 2018 . Youtube-vos: A large-scale video object segmentation benchmark. arXiv preprint arXiv:1809.03327 (2018). Ning Xu, Linjie Yang, Yuchen Fan, Dingcheng Yue, Yuchen Liang, Jianchao Yang, and Thomas Huang. 2018. Youtube-vos: A large-scale video object segmentation benchmark. arXiv preprint arXiv:1809.03327 (2018)."},{"key":"e_1_3_2_1_41_1","volume-title":"Proceedings of the AAAI Conference on Artificial Intelligence, Vol.\u00a036","author":"Xu Xiaohao","year":"2022","unstructured":"Xiaohao Xu , Jinglu Wang , Xiao Li , and Yan Lu . 2022 . Reliable propagation-correction modulation for video object segmentation . In Proceedings of the AAAI Conference on Artificial Intelligence, Vol.\u00a036 . 2946\u20132954. Xiaohao Xu, Jinglu Wang, Xiao Li, and Yan Lu. 2022. Reliable propagation-correction modulation for video object segmentation. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol.\u00a036. 2946\u20132954."},{"key":"e_1_3_2_1_42_1","volume-title":"European Conference on Computer Vision. Springer, 332\u2013348","author":"Yang Zongxin","year":"2020","unstructured":"Zongxin Yang , Yunchao Wei , and Yi Yang . 2020 . Collaborative video object segmentation by foreground-background integration . In European Conference on Computer Vision. Springer, 332\u2013348 . Zongxin Yang, Yunchao Wei, and Yi Yang. 2020. Collaborative video object segmentation by foreground-background integration. In European Conference on Computer Vision. Springer, 332\u2013348."},{"key":"e_1_3_2_1_43_1","first-page":"2491","article-title":"Associating objects with transformers for video object segmentation","volume":"34","author":"Yang Zongxin","year":"2021","unstructured":"Zongxin Yang , Yunchao Wei , and Yi Yang . 2021 . Associating objects with transformers for video object segmentation . Advances in Neural Information Processing Systems 34 (2021), 2491 \u2013 2502 . Zongxin Yang, Yunchao Wei, and Yi Yang. 2021. Associating objects with transformers for video object segmentation. Advances in Neural Information Processing Systems 34 (2021), 2491\u20132502.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_1_44_1","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 6949\u20136958","author":"Zhang Yizhuo","year":"2020","unstructured":"Yizhuo Zhang , Zhirong Wu , Houwen Peng , and Stephen Lin . 2020 . A transductive approach for video object segmentation . In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 6949\u20136958 . Yizhuo Zhang, Zhirong Wu, Houwen Peng, and Stephen Lin. 2020. A transductive approach for video object segmentation. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 6949\u20136958."},{"key":"e_1_3_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.00681"}],"event":{"name":"MMAsia '23: ACM Multimedia Asia","sponsor":["SIGMM ACM Special Interest Group on Multimedia"],"location":"Tainan Taiwan","acronym":"MMAsia '23"},"container-title":["ACM Multimedia Asia 2023"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3595916.3626445","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3595916.3626445","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T16:35:56Z","timestamp":1750178156000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3595916.3626445"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,12,6]]},"references-count":45,"alternative-id":["10.1145\/3595916.3626445","10.1145\/3595916"],"URL":"https:\/\/doi.org\/10.1145\/3595916.3626445","relation":{},"subject":[],"published":{"date-parts":[[2023,12,6]]},"assertion":[{"value":"2024-01-01","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}