{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,10]],"date-time":"2026-02-10T18:34:51Z","timestamp":1770748491709,"version":"3.50.0"},"publisher-location":"New York, NY, USA","reference-count":70,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,10,10]],"date-time":"2022-10-10T00:00:00Z","timestamp":1665360000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,10,10]]},"DOI":"10.1145\/3503161.3547824","type":"proceedings-article","created":{"date-parts":[[2022,10,10]],"date-time":"2022-10-10T15:42:35Z","timestamp":1665416555000},"page":"2709-2718","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":21,"title":["Towards Robust Video Object Segmentation with Adaptive Object Calibration"],"prefix":"10.1145","author":[{"given":"Xiaohao","family":"Xu","sequence":"first","affiliation":[{"name":"Huazhong University of Science and Technology, Wuhan, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jinglu","family":"Wang","sequence":"additional","affiliation":[{"name":"Microsoft Research Asia, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Xiang","family":"Ming","sequence":"additional","affiliation":[{"name":"Microsoft Research Asia, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yan","family":"Lu","sequence":"additional","affiliation":[{"name":"Microsoft Research Asia, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2022,10,10]]},"reference":[{"key":"e_1_3_2_2_1_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-7908-2604-3_16"},{"key":"e_1_3_2_2_2_1","volume-title":"Learning OpenCV: Computer vision with the OpenCV library. \"O'Reilly Media","author":"Bradski Gary","unstructured":"Gary Bradski and Adrian Kaehler . 2008. Learning OpenCV: Computer vision with the OpenCV library. \"O'Reilly Media , Inc .\". Gary Bradski and Adrian Kaehler. 2008. Learning OpenCV: Computer vision with the OpenCV library. \"O'Reilly Media, Inc.\"."},{"key":"e_1_3_2_2_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.565"},{"key":"e_1_3_2_2_4_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01234-2_49"},{"key":"e_1_3_2_2_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00130"},{"key":"e_1_3_2_2_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.00551"},{"key":"e_1_3_2_2_7_1","volume-title":"Advances in Neural Information Processing Systems","volume":"34","author":"Cheng Ho Kei","year":"2021","unstructured":"Ho Kei Cheng , Yu-Wing Tai , and Chi-Keung Tang . 2021 b. Rethinking space-time networks with improved memory coverage for efficient video object segmentation . Advances in Neural Information Processing Systems , Vol. 34 (2021). Ho Kei Cheng, Yu-Wing Tai, and Chi-Keung Tang. 2021b. Rethinking space-time networks with improved memory coverage for efficient video object segmentation. Advances in Neural Information Processing Systems, Vol. 34 (2021)."},{"key":"e_1_3_2_2_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.81"},{"key":"e_1_3_2_2_9_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-012-0519-6"},{"key":"e_1_3_2_2_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.00585"},{"key":"e_1_3_2_2_11_1","volume-title":"A learned representation for artistic style. arXiv preprint arXiv:1610.07629","author":"Dumoulin Vincent","year":"2016","unstructured":"Vincent Dumoulin , Jonathon Shlens , and Manjunath Kudlur . 2016. A learned representation for artistic style. arXiv preprint arXiv:1610.07629 ( 2016 ). Vincent Dumoulin, Jonathon Shlens, and Manjunath Kudlur. 2016. A learned representation for artistic style. arXiv preprint arXiv:1610.07629 (2016)."},{"key":"e_1_3_2_2_12_1","volume-title":"Jonas Rauber, Heiko H Sch\u00fctt, Matthias Bethge, and Felix A Wichmann.","author":"Geirhos Robert","year":"2018","unstructured":"Robert Geirhos , Carlos RM Temme , Jonas Rauber, Heiko H Sch\u00fctt, Matthias Bethge, and Felix A Wichmann. 2018 . Generalisation in humans and deep neural networks. Advances in neural information processing systems, Vol. 31 (2018). Robert Geirhos, Carlos RM Temme, Jonas Rauber, Heiko H Sch\u00fctt, Matthias Bethge, and Felix A Wichmann. 2018. Generalisation in humans and deep neural networks. Advances in neural information processing systems, Vol. 31 (2018)."},{"key":"e_1_3_2_2_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.90"},{"key":"e_1_3_2_2_14_1","volume-title":"Benchmarking neural network robustness to common corruptions and perturbations. arXiv preprint arXiv:1903.12261","author":"Hendrycks Dan","year":"2019","unstructured":"Dan Hendrycks and Thomas Dietterich . 2019. Benchmarking neural network robustness to common corruptions and perturbations. arXiv preprint arXiv:1903.12261 ( 2019 ). Dan Hendrycks and Thomas Dietterich. 2019. Benchmarking neural network robustness to common corruptions and perturbations. arXiv preprint arXiv:1903.12261 (2019)."},{"key":"e_1_3_2_2_15_1","volume-title":"Squeeze-and-Excitation Networks. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 7132--7141","author":"Hu Jie","year":"2018","unstructured":"Jie Hu , Li Shen , and Gang Sun . 2018 b. Squeeze-and-Excitation Networks. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 7132--7141 . Jie Hu, Li Shen, and Gang Sun. 2018b. Squeeze-and-Excitation Networks. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 7132--7141."},{"key":"e_1_3_2_2_16_1","volume-title":"Learning Position and Target Consistency for Memory-based Video Object Segmentation. In IEEE Conference on Computer Vision and Pattern Recognition. 4144--4154","author":"Hu Li","year":"2021","unstructured":"Li Hu , Peng Zhang , Bang Zhang , Pan Pan , Yinghui Xu , and Rong Jin . 2021 . Learning Position and Target Consistency for Memory-based Video Object Segmentation. In IEEE Conference on Computer Vision and Pattern Recognition. 4144--4154 . Li Hu, Peng Zhang, Bang Zhang, Pan Pan, Yinghui Xu, and Rong Jin. 2021. Learning Position and Target Consistency for Memory-based Video Object Segmentation. In IEEE Conference on Computer Vision and Pattern Recognition. 4144--4154."},{"key":"e_1_3_2_2_17_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01237-3_4"},{"key":"e_1_3_2_2_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.167"},{"key":"e_1_3_2_2_19_1","volume-title":"MBRS: Enhancing Robustness of DNN-Based Watermarking by Mini-Batch of Real and Simulated JPEG Compression","author":"Jia Zhaoyang","year":"2021","unstructured":"Zhaoyang Jia , Han Fang , and Weiming Zhang . 2021 . MBRS: Enhancing Robustness of DNN-Based Watermarking by Mini-Batch of Real and Simulated JPEG Compression . Association for Computing Machinery , New York, NY, USA , 41--49. Zhaoyang Jia, Han Fang, and Weiming Zhang. 2021. MBRS: Enhancing Robustness of DNN-Based Watermarking by Mini-Batch of Real and Simulated JPEG Compression. Association for Computing Machinery, New York, NY, USA, 41--49."},{"key":"e_1_3_2_2_20_1","volume-title":"Joey Tianyi Zhou, and Peter Szolovits","author":"Jin Di","year":"2019","unstructured":"Di Jin , Zhijing Jin , Joey Tianyi Zhou, and Peter Szolovits . 2019 . Is BERT Really Robust? Natural Language Attack on Text Classification and Entailment . arXiv preprint arXiv:1907.11932 (2019). Di Jin, Zhijing Jin, Joey Tianyi Zhou, and Peter Szolovits. 2019. Is BERT Really Robust? Natural Language Attack on Text Classification and Entailment. arXiv preprint arXiv:1907.11932 (2019)."},{"key":"e_1_3_2_2_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00885"},{"key":"e_1_3_2_2_22_1","volume-title":"International journal of computer vision","author":"Kamann Christoph","year":"2020","unstructured":"Christoph Kamann and Carsten Rother . 2020;2021;. Benchmarking the Robustness of Semantic Segmentation Models with Respect to Common Corruptions . International journal of computer vision , Vol. 129 , 2 ( 2020 ;2021;), 462--483. Christoph Kamann and Carsten Rother. 2020;2021;. Benchmarking the Robustness of Semantic Segmentation Models with Respect to Common Corruptions. International journal of computer vision, Vol. 129, 2 (2020;2021;), 462--483."},{"key":"e_1_3_2_2_23_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-019-01164-6"},{"key":"e_1_3_2_2_24_1","volume-title":"Proceedings of the IEEE\/CVF International Conference on Computer Vision Workshops. 0--8.","author":"Laugros Alfred","year":"2019","unstructured":"Alfred Laugros , Alice Caplier , and Matthieu Ospici . 2019 . Are adversarial robustness and common perturbation robustness independant attributes? . In Proceedings of the IEEE\/CVF International Conference on Computer Vision Workshops. 0--8. Alfred Laugros, Alice Caplier, and Matthieu Ospici. 2019. Are adversarial robustness and common perturbation robustness independant attributes?. In Proceedings of the IEEE\/CVF International Conference on Computer Vision Workshops. 0--8."},{"key":"e_1_3_2_2_25_1","volume-title":"Video Object Segmentation with Dynamic Memory Networks and Adaptive Object Alignment","author":"Liang Shuxian","unstructured":"Shuxian Liang , Xu Shen , Jianqiang Huang , and Xian-Sheng Hua . 2021a. Video Object Segmentation with Dynamic Memory Networks and Adaptive Object Alignment . IEEE , 8045--8054. Shuxian Liang, Xu Shen, Jianqiang Huang, and Xian-Sheng Hua. 2021a. Video Object Segmentation with Dynamic Memory Networks and Adaptive Object Alignment. IEEE, 8045--8054."},{"key":"e_1_3_2_2_26_1","volume-title":"Video Object Segmentation with Dynamic Memory Networks and Adaptive Object Alignment. In 2021 IEEE\/CVF International Conference on Computer Vision (ICCV). 8045--8054","author":"Liang Shuxian","year":"2021","unstructured":"Shuxian Liang , Xu Shen , Jianqiang Huang , and Xian-Sheng Hua . 2021 b. Video Object Segmentation with Dynamic Memory Networks and Adaptive Object Alignment. In 2021 IEEE\/CVF International Conference on Computer Vision (ICCV). 8045--8054 . https:\/\/doi.org\/10.1109\/ICCV48922.2021.00796 Shuxian Liang, Xu Shen, Jianqiang Huang, and Xian-Sheng Hua. 2021b. Video Object Segmentation with Dynamic Memory Networks and Adaptive Object Alignment. In 2021 IEEE\/CVF International Conference on Computer Vision (ICCV). 8045--8054. https:\/\/doi.org\/10.1109\/ICCV48922.2021.00796"},{"key":"e_1_3_2_2_27_1","volume-title":"WaterNet: An adaptive matching pipeline for segmenting water with volatile appearance. Computational Visual Media","author":"Liang Yongqing","year":"2020","unstructured":"Yongqing Liang , Navid Jafari , Xing Luo , Qin Chen , Yanpeng Cao , and Xin Li. 2020a. WaterNet: An adaptive matching pipeline for segmenting water with volatile appearance. Computational Visual Media ( 2020 ), 1--14. Yongqing Liang, Navid Jafari, Xing Luo, Qin Chen, Yanpeng Cao, and Xin Li. 2020a. WaterNet: An adaptive matching pipeline for segmenting water with volatile appearance. Computational Visual Media (2020), 1--14."},{"key":"e_1_3_2_2_28_1","volume-title":"Advances in Neural Information Processing Systems","volume":"33","author":"Liang Yongqing","year":"2020","unstructured":"Yongqing Liang , Xin Li , Navid Jafari , and Jim Chen . 2020 b. Video Object Segmentation with Adaptive Feature Bank and Uncertain-Region Refinement . Advances in Neural Information Processing Systems , Vol. 33 (2020). Yongqing Liang, Xin Li, Navid Jafari, and Jim Chen. 2020b. Video Object Segmentation with Adaptive Feature Bank and Uncertain-Region Refinement. Advances in Neural Information Processing Systems, Vol. 33 (2020)."},{"key":"e_1_3_2_2_29_1","volume-title":"The global k-means clustering algorithm. Pattern recognition","author":"Likas Aristidis","year":"2003","unstructured":"Aristidis Likas , Nikos Vlassis , and Jakob J Verbeek . 2003. The global k-means clustering algorithm. Pattern recognition , Vol. 36 , 2 ( 2003 ), 451--461. Aristidis Likas, Nikos Vlassis, and Jakob J Verbeek. 2003. The global k-means clustering algorithm. Pattern recognition, Vol. 36, 2 (2003), 451--461."},{"key":"e_1_3_2_2_30_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00405"},{"key":"e_1_3_2_2_31_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2020.2990623"},{"key":"e_1_3_2_2_32_1","volume-title":"Video object segmentation with episodic graph memory networks. arXiv preprint arXiv:2007.07020","author":"Lu Xinkai","year":"2020","unstructured":"Xinkai Lu , Wenguan Wang , Martin Danelljan , Tianfei Zhou , Jianbing Shen , and Luc Van Gool . 2020. Video object segmentation with episodic graph memory networks. arXiv preprint arXiv:2007.07020 ( 2020 ). Xinkai Lu, Wenguan Wang, Martin Danelljan, Tianfei Zhou, Jianbing Shen, and Luc Van Gool. 2020. Video object segmentation with episodic graph memory networks. arXiv preprint arXiv:2007.07020 (2020)."},{"key":"e_1_3_2_2_33_1","volume-title":"Asian Conference on Computer Vision. Springer, 565--580","author":"Luiten Jonathon","year":"2018","unstructured":"Jonathon Luiten , Paul Voigtlaender , and Bastian Leibe . 2018 . Premvos: Proposal-generation, refinement and merging for video object segmentation . In Asian Conference on Computer Vision. Springer, 565--580 . Jonathon Luiten, Paul Voigtlaender, and Bastian Leibe. 2018. Premvos: Proposal-generation, refinement and merging for video object segmentation. In Asian Conference on Computer Vision. Springer, 565--580."},{"key":"e_1_3_2_2_34_1","volume-title":"Proceedings of the Advances on Neural Information Processing Systems. 4905--4913","author":"Luo Wenjie","year":"2016","unstructured":"Wenjie Luo , Yujia Li , Raquel Urtasun , and Richard Zemel . 2016 . Understanding the Effective Receptive Field in Deep Convolutional Neural Networks . In Proceedings of the Advances on Neural Information Processing Systems. 4905--4913 . Wenjie Luo, Yujia Li, Raquel Urtasun, and Richard Zemel. 2016. Understanding the Effective Receptive Field in Deep Convolutional Neural Networks. In Proceedings of the Advances on Neural Information Processing Systems. 4905--4913."},{"key":"e_1_3_2_2_35_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.00953"},{"key":"e_1_3_2_2_36_1","unstructured":"Jan H. Metzen Tim Genewein Volker Fischer and Bastian Bischoff. 2017. On Detecting Adversarial Perturbations. (2017).  Jan H. Metzen Tim Genewein Volker Fischer and Bastian Bischoff. 2017. On Detecting Adversarial Perturbations. (2017)."},{"key":"e_1_3_2_2_37_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.00412"},{"key":"e_1_3_2_2_38_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00770"},{"key":"e_1_3_2_2_39_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00932"},{"key":"e_1_3_2_2_40_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00244"},{"key":"e_1_3_2_2_41_1","volume-title":"Pytorch: An imperative style, high-performance deep learning library. arXiv preprint arXiv:1912.01703","author":"Paszke Adam","year":"2019","unstructured":"Adam Paszke , Sam Gross , Francisco Massa , Adam Lerer , James Bradbury , Gregory Chanan , Trevor Killeen , Zeming Lin , Natalia Gimelshein , Luca Antiga , 2019 . Pytorch: An imperative style, high-performance deep learning library. arXiv preprint arXiv:1912.01703 (2019). Adam Paszke, Sam Gross, Francisco Massa, Adam Lerer, James Bradbury, Gregory Chanan, Trevor Killeen, Zeming Lin, Natalia Gimelshein, Luca Antiga, et al. 2019. Pytorch: An imperative style, high-performance deep learning library. arXiv preprint arXiv:1912.01703 (2019)."},{"key":"e_1_3_2_2_42_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.372"},{"key":"e_1_3_2_2_43_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.85"},{"key":"e_1_3_2_2_44_1","volume-title":"The 2017 davis challenge on video object segmentation. arXiv preprint arXiv:1704.00675","author":"Pont-Tuset Jordi","year":"2017","unstructured":"Jordi Pont-Tuset , Federico Perazzi , Sergi Caelles , Pablo Arbel\u00e1ez , Alex Sorkine-Hornung , and Luc Van Gool . 2017a. The 2017 davis challenge on video object segmentation. arXiv preprint arXiv:1704.00675 ( 2017 ). Jordi Pont-Tuset, Federico Perazzi, Sergi Caelles, Pablo Arbel\u00e1ez, Alex Sorkine-Hornung, and Luc Van Gool. 2017a. The 2017 davis challenge on video object segmentation. arXiv preprint arXiv:1704.00675 (2017)."},{"key":"e_1_3_2_2_45_1","volume-title":"The 2017 davis challenge on video object segmentation. arXiv preprint arXiv:1704.00675","author":"Pont-Tuset Jordi","year":"2017","unstructured":"Jordi Pont-Tuset , Federico Perazzi , Sergi Caelles , Pablo Arbel\u00e1ez , Alex Sorkine-Hornung , and Luc Van Gool . 2017b. The 2017 davis challenge on video object segmentation. arXiv preprint arXiv:1704.00675 ( 2017 ). Jordi Pont-Tuset, Federico Perazzi, Sergi Caelles, Pablo Arbel\u00e1ez, Alex Sorkine-Hornung, and Luc Van Gool. 2017b. The 2017 davis challenge on video object segmentation. arXiv preprint arXiv:1704.00675 (2017)."},{"key":"e_1_3_2_2_46_1","volume-title":"Kernelized Memory Network for Video Object Segmentation. In European Conference on Computer Vision. Springer, 629--645","author":"Seong Hongje","year":"2020","unstructured":"Hongje Seong , Junhyuk Hyun , and Euntai Kim . 2020 . Kernelized Memory Network for Video Object Segmentation. In European Conference on Computer Vision. Springer, 629--645 . Hongje Seong, Junhyuk Hyun, and Euntai Kim. 2020. Kernelized Memory Network for Video Object Segmentation. In European Conference on Computer Vision. Springer, 629--645."},{"key":"e_1_3_2_2_47_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.01265"},{"key":"e_1_3_2_2_48_1","volume-title":"Advances in Neural Information Processing Systems","volume":"32","author":"Tramer Florian","year":"2019","unstructured":"Florian Tramer and Dan Boneh . 2019 . Adversarial training and robustness for multiple perturbations . Advances in Neural Information Processing Systems , Vol. 32 (2019). Florian Tramer and Dan Boneh. 2019. Adversarial training and robustness for multiple perturbations. Advances in Neural Information Processing Systems, Vol. 32 (2019)."},{"key":"e_1_3_2_2_49_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00971"},{"key":"e_1_3_2_2_50_1","doi-asserted-by":"publisher","DOI":"10.5244\/C.31.116"},{"key":"e_1_3_2_2_51_1","volume-title":"Boltvos: Box-level tracking for video object segmentation. arXiv preprint arXiv:1904.04552","author":"Voigtlaender Paul","year":"2019","unstructured":"Paul Voigtlaender , Jonathon Luiten , and Bastian Leibe . 2019 b. Boltvos: Box-level tracking for video object segmentation. arXiv preprint arXiv:1904.04552 (2019). Paul Voigtlaender, Jonathon Luiten, and Bastian Leibe. 2019b. Boltvos: Box-level tracking for video object segmentation. arXiv preprint arXiv:1904.04552 (2019)."},{"key":"e_1_3_2_2_52_1","volume-title":"SwiftNet: Real-time Video Object Segmentation. In IEEE Conference on Computer Vision and Pattern Recognition. 1296--1305","author":"Wang Haochen","year":"2021","unstructured":"Haochen Wang , Xiaolong Jiang , Haibing Ren , Yao Hu , and Song Bai . 2021 . SwiftNet: Real-time Video Object Segmentation. In IEEE Conference on Computer Vision and Pattern Recognition. 1296--1305 . Haochen Wang, Xiaolong Jiang, Haibing Ren, Yao Hu, and Song Bai. 2021. SwiftNet: Real-time Video Object Segmentation. In IEEE Conference on Computer Vision and Pattern Recognition. 1296--1305."},{"key":"e_1_3_2_2_53_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00070"},{"key":"e_1_3_2_2_54_1","volume-title":"International journal of computer vision","author":"Wang Zilei","year":"2014","unstructured":"Zilei Wang , Jiashi Feng , and Shuicheng Yan . 2014. Collaborative Linear Coding for Robust Image Classification . International journal of computer vision , Vol. 114 , 2--3 ( 2014 ), 322--333. Zilei Wang, Jiashi Feng, and Shuicheng Yan. 2014. Collaborative Linear Coding for Robust Image Classification. International journal of computer vision, Vol. 114, 2--3 (2014), 322--333."},{"key":"e_1_3_2_2_55_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00408"},{"key":"e_1_3_2_2_56_1","unstructured":"Zhongdao Wang Liang Zheng Yixuan Liu Yali Li and Shengjin Wang. 2019b. Towards Real-Time Multi-Object Tracking. (2019).  Zhongdao Wang Liang Zheng Yixuan Liu Yali Li and Shengjin Wang. 2019b. Towards Real-Time Multi-Object Tracking. (2019)."},{"key":"e_1_3_2_2_57_1","doi-asserted-by":"publisher","DOI":"10.1145\/3394171.3414035"},{"key":"e_1_3_2_2_58_1","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 501--509","author":"Xie Cihang","unstructured":"Cihang Xie , Yuxin Wu , Laurens van der Maaten, Alan L Yuille, and Kaiming He. 2019. Feature denoising for improving adversarial robustness . In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 501--509 . Cihang Xie, Yuxin Wu, Laurens van der Maaten, Alan L Yuille, and Kaiming He. 2019. Feature denoising for improving adversarial robustness. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 501--509."},{"key":"e_1_3_2_2_59_1","volume-title":"Efficient Regional Memory Network for Video Object Segmentation. In IEEE Conference on Computer Vision and Pattern Recognition. 1286--1295","author":"Xie Haozhe","year":"2021","unstructured":"Haozhe Xie , Hongxun Yao , Shangchen Zhou , Shengping Zhang , and Wenxiu Sun . 2021 . Efficient Regional Memory Network for Video Object Segmentation. In IEEE Conference on Computer Vision and Pattern Recognition. 1286--1295 . Haozhe Xie, Hongxun Yao, Shangchen Zhou, Shengping Zhang, and Wenxiu Sun. 2021. Efficient Regional Memory Network for Video Object Segmentation. In IEEE Conference on Computer Vision and Pattern Recognition. 1286--1295."},{"key":"e_1_3_2_2_60_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01228-1_36"},{"key":"e_1_3_2_2_61_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00686"},{"key":"e_1_3_2_2_62_1","volume-title":"Katsaggelos","author":"Yang Linjie","year":"2018","unstructured":"Linjie Yang , Yanran Wang , Xuehan Xiong , Jianchao Yang , and Aggelos K . Katsaggelos . 2018 . Efficient Video Object Segmentation via Network Modulation. CVPR ( 2018). Linjie Yang, Yanran Wang, Xuehan Xiong, Jianchao Yang, and Aggelos K. Katsaggelos. 2018. Efficient Video Object Segmentation via Network Modulation. CVPR (2018)."},{"key":"e_1_3_2_2_63_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-58558-7_20"},{"key":"e_1_3_2_2_64_1","volume-title":"Advances in Neural Information Processing Systems","volume":"34","author":"Yang Zongxin","year":"2021","unstructured":"Zongxin Yang , Yunchao Wei , and Yi Yang . 2021 a. Associating objects with transformers for video object segmentation . Advances in Neural Information Processing Systems , Vol. 34 (2021). Zongxin Yang, Yunchao Wei, and Yi Yang. 2021a. Associating objects with transformers for video object segmentation. Advances in Neural Information Processing Systems, Vol. 34 (2021)."},{"key":"e_1_3_2_2_65_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2021.3081597"},{"key":"e_1_3_2_2_66_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.01181"},{"key":"e_1_3_2_2_67_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00403"},{"key":"e_1_3_2_2_68_1","doi-asserted-by":"publisher","DOI":"10.1145\/2393347.2396316"},{"key":"e_1_3_2_2_69_1","doi-asserted-by":"publisher","DOI":"10.1145\/3394171.3413942"},{"key":"e_1_3_2_2_70_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-58565-5_5"}],"event":{"name":"MM '22: The 30th ACM International Conference on Multimedia","location":"Lisboa Portugal","acronym":"MM '22","sponsor":["SIGMM ACM Special Interest Group on Multimedia"]},"container-title":["Proceedings of the 30th ACM International Conference on Multimedia"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3503161.3547824","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3503161.3547824","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T19:02:35Z","timestamp":1750186955000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3503161.3547824"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,10,10]]},"references-count":70,"alternative-id":["10.1145\/3503161.3547824","10.1145\/3503161"],"URL":"https:\/\/doi.org\/10.1145\/3503161.3547824","relation":{},"subject":[],"published":{"date-parts":[[2022,10,10]]},"assertion":[{"value":"2022-10-10","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}