{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,25]],"date-time":"2026-02-25T17:03:23Z","timestamp":1772039003698,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":43,"publisher":"ACM","license":[{"start":{"date-parts":[[2021,10,17]],"date-time":"2021-10-17T00:00:00Z","timestamp":1634428800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"Zhejiang Lab","award":["2019KB0AB05"],"award-info":[{"award-number":["2019KB0AB05"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,10,17]]},"DOI":"10.1145\/3474085.3475710","type":"proceedings-article","created":{"date-parts":[[2021,10,18]],"date-time":"2021-10-18T06:57:34Z","timestamp":1634540254000},"page":"5646-5654","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":51,"title":["Recursive Fusion and Deformable Spatiotemporal Attention for Video Compression Artifact Reduction"],"prefix":"10.1145","author":[{"given":"Minyi","family":"Zhao","sequence":"first","affiliation":[{"name":"Fudan University, Shanghai, China"}]},{"given":"Yi","family":"Xu","sequence":"additional","affiliation":[{"name":"Fudan University, Shanghai, China"}]},{"given":"Shuigeng","family":"Zhou","sequence":"additional","affiliation":[{"name":"Fudan University, Shanghai, China"}]}],"member":"320","published-online":{"date-parts":[[2021,10,17]]},"reference":[{"key":"e_1_3_2_2_1_1","volume-title":"Video Summarization Using Deep Neural Networks: A Survey. arXiv preprint arXiv:2101.06072","author":"Apostolidis Evlampios","year":"2021","unstructured":"Evlampios Apostolidis , Eleni Adamantidou , Alexandros I Metsai , Vasileios Mezaris , and Ioannis Patras . 2021. Video Summarization Using Deep Neural Networks: A Survey. arXiv preprint arXiv:2101.06072 ( 2021 ). Evlampios Apostolidis, Eleni Adamantidou, Alexandros I Metsai, Vasileios Mezaris, and Ioannis Patras. 2021. Video Summarization Using Deep Neural Networks: A Survey. arXiv preprint arXiv:2101.06072 (2021)."},{"key":"e_1_3_2_2_2_1","volume-title":"5th meeting","author":"Bossen Frank","year":"2011","unstructured":"Frank Bossen . 2011 . Common test conditions and software reference configurations. In Joint Collaborative Team on Video Coding (JCT-VC) of ITU-T SG16 WP3 and ISO\/IEC JTC1\/SC29\/WG11 , 5th meeting , Jan. 2011. Frank Bossen. 2011. Common test conditions and software reference configurations. In Joint Collaborative Team on Video Coding (JCT-VC) of ITU-T SG16 WP3 and ISO\/IEC JTC1\/SC29\/WG11, 5th meeting, Jan. 2011."},{"key":"e_1_3_2_2_3_1","volume-title":"Understanding deformable alignment in video super-resolution. arXiv preprint arXiv:2009.07265","author":"Chan Kelvin CK","year":"2020","unstructured":"Kelvin CK Chan , Xintao Wang , Ke Yu , Chao Dong , and Chen Change Loy . 2020. Understanding deformable alignment in video super-resolution. arXiv preprint arXiv:2009.07265 , Vol. 4 ( 2020 ). Kelvin CK Chan, Xintao Wang, Ke Yu, Chao Dong, and Chen Change Loy. 2020. Understanding deformable alignment in video super-resolution. arXiv preprint arXiv:2009.07265, Vol. 4 (2020)."},{"key":"e_1_3_2_2_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICIP.1994.413553"},{"key":"e_1_3_2_2_5_1","doi-asserted-by":"crossref","unstructured":"Honggang Chen Xiaohai He Linbo Qing Shuhua Xiong and Truong Q Nguyen. 2018. DPW-SDNet: Dual pixel-wavelet domain deep CNNs for soft decoding of JPEG-compressed images. In CVPRW. 711--720. Honggang Chen Xiaohai He Linbo Qing Shuhua Xiong and Truong Q Nguyen. 2018. DPW-SDNet: Dual pixel-wavelet domain deep CNNs for soft decoding of JPEG-compressed images. In CVPRW. 711--720.","DOI":"10.1109\/CVPRW.2018.00114"},{"key":"e_1_3_2_2_6_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-51811-4_3"},{"key":"e_1_3_2_2_7_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v34i07.6697"},{"key":"e_1_3_2_2_8_1","volume-title":"Patch-wise Spatial-Temporal Quality Enhancement for HEVC Compressed Video. TIP","author":"Ding Qing","year":"2021","unstructured":"Qing Ding , Liquan Shen , Liangwei Yu , Hao Yang , and Mai Xu. 2021. Patch-wise Spatial-Temporal Quality Enhancement for HEVC Compressed Video. TIP ( 2021 ). Qing Ding, Liquan Shen, Liangwei Yu, Hao Yang, and Mai Xu. 2021. Patch-wise Spatial-Temporal Quality Enhancement for HEVC Compressed Video. TIP (2021)."},{"key":"e_1_3_2_2_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.73"},{"key":"e_1_3_2_2_10_1","doi-asserted-by":"crossref","unstructured":"L. Galteri L. Seidenari M. Bertini and AD Bimbo. 2017. Deep Generative Adversarial Compression Artifact Removal. (2017). L. Galteri L. Seidenari M. Bertini and AD Bimbo. 2017. Deep Generative Adversarial Compression Artifact Removal. (2017).","DOI":"10.1109\/ICCV.2017.517"},{"key":"e_1_3_2_2_11_1","unstructured":"Z. Guan Q. Xing X. Mai Y. Ren and Z. Wang. 2019. MFQE 2.0: A New Approach for Multi-frame Quality Enhancement on Compressed Video. TPAMI Vol. PP 99 (2019) 1--1. Z. Guan Q. Xing X. Mai Y. Ren and Z. Wang. 2019. MFQE 2.0: A New Approach for Multi-frame Quality Enhancement on Compressed Video. TPAMI Vol. PP 99 (2019) 1--1."},{"key":"e_1_3_2_2_12_1","volume-title":"Building dual-domain representations for compression artifacts reduction","author":"Guo Jun","unstructured":"Jun Guo and Hongyang Chao . 2016. Building dual-domain representations for compression artifacts reduction . In ECCV. Springer , 628--644. Jun Guo and Hongyang Chao. 2016. Building dual-domain representations for compression artifacts reduction. In ECCV. Springer, 628--644."},{"key":"e_1_3_2_2_13_1","doi-asserted-by":"crossref","unstructured":"Jun Guo and Hongyang Chao. 2017. One-to-many network for visually pleasing compression artifacts reduction. In CVPR. 4867--4876. Jun Guo and Hongyang Chao. 2017. One-to-many network for visually pleasing compression artifacts reduction. In CVPR. 4867--4876.","DOI":"10.1109\/CVPR.2017.517"},{"key":"e_1_3_2_2_14_1","volume-title":"Quality Enhancement for Intra Frame Coding Via Cnns: An Adversarial Approach","author":"Jin Zhipeng","unstructured":"Zhipeng Jin , Ping An , Chao Yang , and Liquan Shen . 2018. Quality Enhancement for Intra Frame Coding Via Cnns: An Adversarial Approach . In ICASSP. IEEE , 1368--1372. Zhipeng Jin, Ping An, Chao Yang, and Liquan Shen. 2018. Quality Enhancement for Intra Frame Coding Via Cnns: An Adversarial Approach. In ICASSP. IEEE, 1368--1372."},{"key":"e_1_3_2_2_15_1","volume-title":"Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980","author":"Kingma Diederik P","year":"2014","unstructured":"Diederik P Kingma and Jimmy Ba . 2014 . Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014). Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)."},{"key":"e_1_3_2_2_16_1","volume-title":"An efficient deep convolutional neural networks model for compressed image deblocking","author":"Li Ke","unstructured":"Ke Li , Bahetiyaer Bare , and Bo Yan . 2017. An efficient deep convolutional neural networks model for compressed image deblocking . In ICME. IEEE , 1320--1325. Ke Li, Bahetiyaer Bare, and Bo Yan. 2017. An efficient deep convolutional neural networks model for compressed image deblocking. In ICME. IEEE, 1320--1325."},{"key":"e_1_3_2_2_17_1","doi-asserted-by":"publisher","DOI":"10.5555\/3326943.3327097"},{"key":"e_1_3_2_2_18_1","doi-asserted-by":"crossref","unstructured":"Jie Liu Wenjie Zhang Yuting Tang Jie Tang and Gangshan Wu. 2020. Residual feature aggregation network for image super-resolution. In CVPR. 2359--2368. Jie Liu Wenjie Zhang Yuting Tang Jie Tang and Gangshan Wu. 2020. Residual feature aggregation network for image super-resolution. In CVPR. 2359--2368.","DOI":"10.1109\/CVPR42600.2020.00243"},{"key":"e_1_3_2_2_19_1","doi-asserted-by":"crossref","unstructured":"Guo Lu Wanli Ouyang Dong Xu Xiaoyun Zhang Zhiyong Gao and Ming-Ting Sun. 2018. Deep Kalman Filtering Network for Video Compression Artifact Reduction. In ECCV. 568--584. Guo Lu Wanli Ouyang Dong Xu Xiaoyun Zhang Zhiyong Gao and Ming-Ting Sun. 2018. Deep Kalman Filtering Network for Video Compression Artifact Reduction. In ECCV. 568--584.","DOI":"10.1007\/978-3-030-01264-9_35"},{"key":"e_1_3_2_2_20_1","first-page":"1725","article-title":"Deep Non-Local Kalman Network for Video Compression Artifact Reduction","volume":"29","author":"Lu Guo","year":"2019","unstructured":"Guo Lu , Xiaoyun Zhang , Wanli Ouyang , Dong Xu , Li Chen , and Zhiyong Gao . 2019 . Deep Non-Local Kalman Network for Video Compression Artifact Reduction . TIP , Vol. 29 (2019), 1725 -- 1737 . Guo Lu, Xiaoyun Zhang, Wanli Ouyang, Dong Xu, Li Chen, and Zhiyong Gao. 2019. Deep Non-Local Kalman Network for Video Compression Artifact Reduction. TIP, Vol. 29 (2019), 1725--1737.","journal-title":"TIP"},{"key":"e_1_3_2_2_21_1","unstructured":"Y. Ren X. Mai Z. Wang and T. Li. 2018. Multi-frame Quality Enhancement for Compressed Video. In CVPR. Y. Ren X. Mai Z. Wang and T. Li. 2018. Multi-frame Quality Enhancement for Compressed Video. In CVPR."},{"key":"e_1_3_2_2_22_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-24574-4_28"},{"key":"e_1_3_2_2_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2012.2221191"},{"key":"e_1_3_2_2_24_1","unstructured":"Y Tai J Yang X Liu and C Memnet Xu. [n.d.]. A persistent memory network for image restoration. In ICCV. 4549--4557. Y Tai J Yang X Liu and C Memnet Xu. [n.d.]. A persistent memory network for image restoration. In ICCV. 4549--4557."},{"key":"e_1_3_2_2_25_1","unstructured":"VQEG. [n.d.]. VQEG video datasets and organizations. https:\/\/www.its.bldrdoc.gov\/vqeg\/video-datasets-and-organizations.aspx. VQEG. [n.d.]. VQEG video datasets and organizations. https:\/\/www.its.bldrdoc.gov\/vqeg\/video-datasets-and-organizations.aspx."},{"key":"e_1_3_2_2_26_1","volume-title":"Data Compression Conference","author":"Wang Tingting","year":"2017","unstructured":"Tingting Wang , Mingjin Chen , and Hongyang Chao . 2017 . A novel deep learning-based method of improving coding efficiency from the decoder-end for HEVC . In Data Compression Conference , 2017. IEEE, 410--419. Tingting Wang, Mingjin Chen, and Hongyang Chao. 2017. A novel deep learning-based method of improving coding efficiency from the decoder-end for HEVC. In Data Compression Conference, 2017. IEEE, 410--419."},{"key":"e_1_3_2_2_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2003.819861"},{"key":"e_1_3_2_2_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2003.815165"},{"key":"e_1_3_2_2_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2020.2986861"},{"key":"e_1_3_2_2_30_1","doi-asserted-by":"publisher","DOI":"10.5555\/2969239.2969329"},{"key":"e_1_3_2_2_31_1","unstructured":"Xiph.org. [n.d.]. Xiph.org Video Test Media. https:\/\/media.xiph.org\/video\/derf\/. Xiph.org. [n.d.]. Xiph.org Video Test Media. https:\/\/media.xiph.org\/video\/derf\/."},{"key":"e_1_3_2_2_32_1","doi-asserted-by":"crossref","first-page":"3074","DOI":"10.1609\/aaai.v35i4.16416","article-title":"a. GIF Thumbnails: Attract More Clicks to Your Videos","volume":"35","author":"Xu Yi","year":"2021","unstructured":"Yi Xu , Fan Bai , Yingxuan Shi , Qiuyu Chen , Longwen Gao , Kai Tian , Shuigeng Zhou , and Huyang Sun . 2021 a. GIF Thumbnails: Attract More Clicks to Your Videos . In AAAI , Vol. 35. 3074 -- 3082 . Yi Xu, Fan Bai, Yingxuan Shi, Qiuyu Chen, Longwen Gao, Kai Tian, Shuigeng Zhou, and Huyang Sun. 2021 a. GIF Thumbnails: Attract More Clicks to Your Videos. In AAAI, Vol. 35. 3074--3082.","journal-title":"AAAI"},{"key":"e_1_3_2_2_33_1","doi-asserted-by":"crossref","unstructured":"Yi Xu Longwen Gao Kai Tian Shuigeng Zhou and Huyang Sun. 2019. Non-local ConvLS\u2122 for video compression artifact reduction. In ICCV. 7043--7052. Yi Xu Longwen Gao Kai Tian Shuigeng Zhou and Huyang Sun. 2019. Non-local ConvLS\u2122 for video compression artifact reduction. In ICCV. 7043--7052.","DOI":"10.1109\/ICCV.2019.00714"},{"key":"e_1_3_2_2_34_1","doi-asserted-by":"crossref","unstructured":"Yi Xu Minyi Zhao Jing Liu Xinjian Zhang Longwen Gao Shuigeng Zhou and Huyang Sun. 2021 b. Boosting the performance of video compression artifact reduction with reference frame proposals and frequency domain information. In CVPRW. 213--222. Yi Xu Minyi Zhao Jing Liu Xinjian Zhang Longwen Gao Shuigeng Zhou and Huyang Sun. 2021 b. Boosting the performance of video compression artifact reduction with reference frame proposals and frequency domain information. In CVPRW. 213--222.","DOI":"10.1109\/CVPRW53098.2021.00030"},{"key":"e_1_3_2_2_35_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-018-01144-2"},{"key":"e_1_3_2_2_36_1","volume-title":"NTIRE 2021 challenge on quality enhancement of compressed video: Methods and results. In CVPRW. 647--666","author":"Yang Ren","year":"2021","unstructured":"Ren Yang . 2021 . NTIRE 2021 challenge on quality enhancement of compressed video: Methods and results. In CVPRW. 647--666 . Ren Yang. 2021. NTIRE 2021 challenge on quality enhancement of compressed video: Methods and results. In CVPRW. 647--666."},{"key":"e_1_3_2_2_37_1","volume-title":"Quality-gated convolutional LS\u2122 for enhancing compressed video","author":"Yang Ren","unstructured":"Ren Yang , Xiaoyan Sun , Mai Xu , and Wenjun Zeng . 2019. Quality-gated convolutional LS\u2122 for enhancing compressed video . In ICME. IEEE , 532--537. Ren Yang, Xiaoyan Sun, Mai Xu, and Wenjun Zeng. 2019. Quality-gated convolutional LS\u2122 for enhancing compressed video. In ICME. IEEE, 532--537."},{"key":"e_1_3_2_2_38_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2018.2867568"},{"key":"e_1_3_2_2_39_1","volume-title":"Decoder-side HEVC quality enhancement with scalable convolutional neural network","author":"Yang Ren","unstructured":"Ren Yang , Mai Xu , and Zulin Wang . 2017. Decoder-side HEVC quality enhancement with scalable convolutional neural network . In ICME. IEEE , 817--822. Ren Yang, Mai Xu, and Zulin Wang. 2017. Decoder-side HEVC quality enhancement with scalable convolutional neural network. In ICME. IEEE, 817--822."},{"key":"e_1_3_2_2_40_1","doi-asserted-by":"crossref","unstructured":"Jaeyoung Yoo Sang-ho Lee and Nojun Kwak. 2018. Image Restoration by Estimating Frequency Distribution of Local Patches. In CVPR. 6684--6692. Jaeyoung Yoo Sang-ho Lee and Nojun Kwak. 2018. Image Restoration by Estimating Frequency Distribution of Local Patches. In CVPR. 6684--6692.","DOI":"10.1109\/CVPR.2018.00699"},{"key":"e_1_3_2_2_41_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2017.2662206"},{"key":"e_1_3_2_2_42_1","volume-title":"Residual non-local attention networks for image restoration. ICLR","author":"Zhang Yulun","year":"2019","unstructured":"Yulun Zhang , Kunpeng Li , Kai Li , Bineng Zhong , and Yun Fu. 2019. Residual non-local attention networks for image restoration. ICLR ( 2019 ). Yulun Zhang, Kunpeng Li, Kai Li, Bineng Zhong, and Yun Fu. 2019. Residual non-local attention networks for image restoration. ICLR (2019)."},{"key":"e_1_3_2_2_43_1","volume-title":"A comprehensive study of deep video action recognition. arXiv preprint arXiv:2012.06567","author":"Zhu Yi","year":"2020","unstructured":"Yi Zhu , Xinyu Li , Chunhui Liu , Mohammadreza Zolfaghari , Yuanjun Xiong , Chongruo Wu , Zhi Zhang , Joseph Tighe , R Manmatha , and Mu Li. 2020. A comprehensive study of deep video action recognition. arXiv preprint arXiv:2012.06567 ( 2020 ). Yi Zhu, Xinyu Li, Chunhui Liu, Mohammadreza Zolfaghari, Yuanjun Xiong, Chongruo Wu, Zhi Zhang, Joseph Tighe, R Manmatha, and Mu Li. 2020. A comprehensive study of deep video action recognition. arXiv preprint arXiv:2012.06567 (2020)."}],"event":{"name":"MM '21: ACM Multimedia Conference","location":"Virtual Event China","acronym":"MM '21","sponsor":["SIGMM ACM Special Interest Group on Multimedia"]},"container-title":["Proceedings of the 29th ACM International Conference on Multimedia"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3474085.3475710","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3474085.3475710","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T20:48:25Z","timestamp":1750193305000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3474085.3475710"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,10,17]]},"references-count":43,"alternative-id":["10.1145\/3474085.3475710","10.1145\/3474085"],"URL":"https:\/\/doi.org\/10.1145\/3474085.3475710","relation":{},"subject":[],"published":{"date-parts":[[2021,10,17]]},"assertion":[{"value":"2021-10-17","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}