{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,30]],"date-time":"2026-01-30T03:01:20Z","timestamp":1769742080027,"version":"3.49.0"},"publisher-location":"New York, NY, USA","reference-count":55,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,10,10]],"date-time":"2022-10-10T00:00:00Z","timestamp":1665360000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["62171325"],"award-info":[{"award-number":["62171325"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"name":"National Key R&D Project","award":["2021YFC3320301"],"award-info":[{"award-number":["2021YFC3320301"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,10,10]]},"DOI":"10.1145\/3503161.3547875","type":"proceedings-article","created":{"date-parts":[[2022,10,10]],"date-time":"2022-10-10T15:42:35Z","timestamp":1665416555000},"page":"2145-2153","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":16,"title":["Progressive Spatial-temporal Collaborative Network for Video Frame Interpolation"],"prefix":"10.1145","author":[{"given":"Mengshun","family":"Hu","sequence":"first","affiliation":[{"name":"Wuhan University, Wuhan, China"}]},{"given":"Kui","family":"Jiang","sequence":"additional","affiliation":[{"name":"Wuhan University, Wuhan, China"}]},{"given":"Liang","family":"Liao","sequence":"additional","affiliation":[{"name":"Nanyang Technological University, Singapore, Singapore"}]},{"given":"Zhixiang","family":"Nie","sequence":"additional","affiliation":[{"name":"Wuhan University, Wuhan, China"}]},{"given":"Jing","family":"Xiao","sequence":"additional","affiliation":[{"name":"Wuhan University, Wuhan, China"}]},{"given":"Zheng","family":"Wang","sequence":"additional","affiliation":[{"name":"Wuhan University, Wuhan, China"}]}],"member":"320","published-online":{"date-parts":[[2022,10,10]]},"reference":[{"key":"e_1_3_2_2_1_1","doi-asserted-by":"crossref","unstructured":"Wenbo Bao Wei-Sheng Lai Chao Ma Xiaoyun Zhang Zhiyong Gao and Ming-Hsuan Yang. 2019a. Depth-aware video frame interpolation. In CVPR. 3703--3712.  Wenbo Bao Wei-Sheng Lai Chao Ma Xiaoyun Zhang Zhiyong Gao and Ming-Hsuan Yang. 2019a. Depth-aware video frame interpolation. In CVPR. 3703--3712.","DOI":"10.1109\/CVPR.2019.00382"},{"key":"e_1_3_2_2_2_1","volume-title":"Memc-net: Motion estimation and motion compensation driven neural network for video interpolation and enhancement. PAMI","author":"Bao Wenbo","year":"2019","unstructured":"Wenbo Bao , Wei-Sheng Lai , Xiaoyun Zhang , Zhiyong Gao , and Ming-Hsuan Yang . 2019 b. Memc-net: Motion estimation and motion compensation driven neural network for video interpolation and enhancement. PAMI (2019). Wenbo Bao, Wei-Sheng Lai, Xiaoyun Zhang, Zhiyong Gao, and Ming-Hsuan Yang. 2019b. Memc-net: Motion estimation and motion compensation driven neural network for video interpolation and enhancement. PAMI (2019)."},{"key":"e_1_3_2_2_3_1","volume-title":"Swathikiran Sudhakaran, Brais Martinez, and Georgios Tzimiropoulos.","author":"Bulat Adrian","year":"2021","unstructured":"Adrian Bulat , Juan Manuel Perez Rua , Swathikiran Sudhakaran, Brais Martinez, and Georgios Tzimiropoulos. 2021 . Space-time mixing attention for video transformer. Advances in Neural Information Processing Systems , Vol. 34 (2021). Adrian Bulat, Juan Manuel Perez Rua, Swathikiran Sudhakaran, Brais Martinez, and Georgios Tzimiropoulos. 2021. Space-time mixing attention for video transformer. Advances in Neural Information Processing Systems , Vol. 34 (2021)."},{"key":"e_1_3_2_2_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICIP.1994.413553"},{"key":"e_1_3_2_2_5_1","volume-title":"Proceedings of the IEEE\/CVF International Conference on Computer Vision. 357--366","author":"Richard Chen Chun-Fu","year":"2021","unstructured":"Chun-Fu Richard Chen , Quanfu Fan , and Rameswar Panda . 2021 . Crossvit: Cross-attention multi-scale vision transformer for image classification . In Proceedings of the IEEE\/CVF International Conference on Computer Vision. 357--366 . Chun-Fu Richard Chen, Quanfu Fan, and Rameswar Panda. 2021. Crossvit: Cross-attention multi-scale vision transformer for image classification. In Proceedings of the IEEE\/CVF International Conference on Computer Vision. 357--366."},{"key":"e_1_3_2_2_6_1","volume-title":"A Multi-Scale Position Feature Transform Network for Video Frame Interpolation. TCSVT","author":"Cheng Xianhang","year":"2019","unstructured":"Xianhang Cheng and Zhenzhong Chen . 2019. A Multi-Scale Position Feature Transform Network for Video Frame Interpolation. TCSVT ( 2019 ). Xianhang Cheng and Zhenzhong Chen. 2019. A Multi-Scale Position Feature Transform Network for Video Frame Interpolation. TCSVT (2019)."},{"key":"e_1_3_2_2_7_1","volume-title":"Multiple video frame interpolation via enhanced deformable separable convolution. arXiv preprint arXiv:2006.08070","author":"Cheng Xianhang","year":"2020","unstructured":"Xianhang Cheng and Zhenzhong Chen . 2020a. Multiple video frame interpolation via enhanced deformable separable convolution. arXiv preprint arXiv:2006.08070 ( 2020 ). Xianhang Cheng and Zhenzhong Chen. 2020a. Multiple video frame interpolation via enhanced deformable separable convolution. arXiv preprint arXiv:2006.08070 (2020)."},{"key":"e_1_3_2_2_8_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v34i07.6634"},{"key":"e_1_3_2_2_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2007.893835"},{"key":"e_1_3_2_2_10_1","doi-asserted-by":"crossref","unstructured":"Myungsub Choi Heewon Kim Bohyung Han Ning Xu and Kyoung Mu Lee. 2020. Channel Attention Is All You Need for Video Frame Interpolation.. In AAAI. 10663--10671.  Myungsub Choi Heewon Kim Bohyung Han Ning Xu and Kyoung Mu Lee. 2020. Channel Attention Is All You Need for Video Frame Interpolation.. In AAAI. 10663--10671.","DOI":"10.1609\/aaai.v34i07.6693"},{"key":"e_1_3_2_2_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.01358"},{"key":"e_1_3_2_2_12_1","unstructured":"Shurui Gui Chaoyue Wang Qihua Chen and Dacheng Tao. 2020. FeatureFlow: Robust Video Interpolation via Structure-to-Texture Generation. In CVPR. 14004--14013.  Shurui Gui Chaoyue Wang Qihua Chen and Dacheng Tao. 2020. FeatureFlow: Robust Video Interpolation via Structure-to-Texture Generation. In CVPR. 14004--14013."},{"key":"e_1_3_2_2_13_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2013.06.045"},{"key":"e_1_3_2_2_14_1","doi-asserted-by":"crossref","unstructured":"Jie Hu Li Shen and Gang Sun. 2018. Squeeze-and-excitation networks. In CVPR. 7132--7141.  Jie Hu Li Shen and Gang Sun. 2018. Squeeze-and-excitation networks. In CVPR. 7132--7141.","DOI":"10.1109\/CVPR.2018.00745"},{"key":"e_1_3_2_2_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52688.2022.00356"},{"key":"e_1_3_2_2_16_1","volume-title":"Motion Feedback Design for Video Frame Interpolation","author":"Hu Mengshun","unstructured":"Mengshun Hu , Liang Liao , Jing Xiao , Lin Gu , and Shin'ichi Satoh . 2020. Motion Feedback Design for Video Frame Interpolation . In ICASSP. IEEE , 4347--4351. Mengshun Hu, Liang Liao, Jing Xiao, Lin Gu, and Shin'ichi Satoh. 2020. Motion Feedback Design for Video Frame Interpolation. In ICASSP. IEEE, 4347--4351."},{"key":"e_1_3_2_2_17_1","volume-title":"Fast-Moving Objects: Frame Interpolation via Recurrent Motion Enhancement","author":"Hu Mengshun","year":"2021","unstructured":"Mengshun Hu , Jing Xiao , Liang Liao , Zheng Wang , Chia-Wen Lin , Mi Wang , and Shin'ichi Satoh . 2021. Capturing Small , Fast-Moving Objects: Frame Interpolation via Recurrent Motion Enhancement . IEEE Transactions on Circuits and Systems for Video Technology ( 2021 ). Mengshun Hu, Jing Xiao, Liang Liao, Zheng Wang, Chia-Wen Lin, Mi Wang, and Shin'ichi Satoh. 2021. Capturing Small, Fast-Moving Objects: Frame Interpolation via Recurrent Motion Enhancement. IEEE Transactions on Circuits and Systems for Video Technology (2021)."},{"key":"e_1_3_2_2_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.00488"},{"key":"e_1_3_2_2_19_1","doi-asserted-by":"crossref","unstructured":"Huaizu Jiang Deqing Sun Varun Jampani Ming-Hsuan Yang Erik Learned-Miller and Jan Kautz. 2018. Super slomo: High quality estimation of multiple intermediate frames for video interpolation. In CVPR. 9000--9008.  Huaizu Jiang Deqing Sun Varun Jampani Ming-Hsuan Yang Erik Learned-Miller and Jan Kautz. 2018. Super slomo: High quality estimation of multiple intermediate frames for video interpolation. In CVPR. 9000--9008.","DOI":"10.1109\/CVPR.2018.00938"},{"key":"e_1_3_2_2_20_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2020.107475"},{"key":"e_1_3_2_2_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/TNNLS.2020.3027849"},{"key":"e_1_3_2_2_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.00963"},{"key":"e_1_3_2_2_23_1","volume-title":"Flavr: Flow-agnostic video representations for fast frame interpolation. arXiv preprint arXiv:2012.08512","author":"Kalluri Tarun","year":"2020","unstructured":"Tarun Kalluri , Deepak Pathak , Manmohan Chandraker , and Du Tran . 2020 . Flavr: Flow-agnostic video representations for fast frame interpolation. arXiv preprint arXiv:2012.08512 (2020). Tarun Kalluri, Deepak Pathak, Manmohan Chandraker, and Du Tran. 2020. Flavr: Flow-agnostic video representations for fast frame interpolation. arXiv preprint arXiv:2012.08512 (2020)."},{"key":"e_1_3_2_2_24_1","volume-title":"Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980","author":"Kingma Diederik P","year":"2014","unstructured":"Diederik P Kingma and Jimmy Ba . 2014 . Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014). Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)."},{"key":"e_1_3_2_2_25_1","unstructured":"Hyeongmin Lee Taeoh Kim Tae-young Chung Daehyun Pak Yuseok Ban and Sangyoun Lee. 2020. AdaCoF: Adaptive Collaboration of Flows for Video Frame Interpolation. In CVPR. 5316--5325.  Hyeongmin Lee Taeoh Kim Tae-young Chung Daehyun Pak Yuseok Ban and Sangyoun Lee. 2020. AdaCoF: Adaptive Collaboration of Flows for Video Frame Interpolation. In CVPR. 5316--5325."},{"key":"e_1_3_2_2_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCVW54120.2021.00210"},{"key":"e_1_3_2_2_27_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v34i07.6825"},{"key":"e_1_3_2_2_28_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v33i01.33018794"},{"key":"e_1_3_2_2_29_1","doi-asserted-by":"crossref","unstructured":"Ziwei Liu Raymond A Yeh Xiaoou Tang Yiming Liu and Aseem Agarwala. 2017. Video frame synthesis using deep voxel flow. In ICCV. 4463--4471.  Ziwei Liu Raymond A Yeh Xiaoou Tang Yiming Liu and Aseem Agarwala. 2017. Video frame synthesis using deep voxel flow. In ICCV. 4463--4471.","DOI":"10.1109\/ICCV.2017.478"},{"key":"e_1_3_2_2_30_1","volume-title":"Learning image matching by simply watching video","author":"Long Gucan","unstructured":"Gucan Long , Laurent Kneip , Jose M Alvarez , Hongdong Li , Xiaohu Zhang , and Qifeng Yu. 2016. Learning image matching by simply watching video . In ECCV. Springer , 434--450. Gucan Long, Laurent Kneip, Jose M Alvarez, Hongdong Li, Xiaohu Zhang, and Qifeng Yu. 2016. Learning image matching by simply watching video. In ECCV. Springer, 434--450."},{"key":"e_1_3_2_2_31_1","volume-title":"Deep multi-scale video prediction beyond mean square error. ICLR","author":"Mathieu Michael","year":"2016","unstructured":"Michael Mathieu , Camille Couprie , and Yann LeCun . 2016. Deep multi-scale video prediction beyond mean square error. ICLR ( 2016 ). Michael Mathieu, Camille Couprie, and Yann LeCun. 2016. Deep multi-scale video prediction beyond mean square error. ICLR (2016)."},{"key":"e_1_3_2_2_32_1","doi-asserted-by":"crossref","unstructured":"Simon Niklaus and Feng Liu. 2018. Context-aware synthesis for video frame interpolation. In CVPR. 1701--1710.  Simon Niklaus and Feng Liu. 2018. Context-aware synthesis for video frame interpolation. In CVPR. 1701--1710.","DOI":"10.1109\/CVPR.2018.00183"},{"key":"e_1_3_2_2_33_1","doi-asserted-by":"crossref","unstructured":"Simon Niklaus and Feng Liu. 2020. Softmax Splatting for Video Frame Interpolation. In CVPR. 5437--5446.  Simon Niklaus and Feng Liu. 2020. Softmax Splatting for Video Frame Interpolation. In CVPR. 5437--5446.","DOI":"10.1109\/CVPR42600.2020.00548"},{"key":"e_1_3_2_2_34_1","doi-asserted-by":"crossref","unstructured":"Simon Niklaus Long Mai and Feng Liu. 2017a. Video frame interpolation via adaptive convolution. In CVPR. 670--679.  Simon Niklaus Long Mai and Feng Liu. 2017a. Video frame interpolation via adaptive convolution. In CVPR. 670--679.","DOI":"10.1109\/CVPR.2017.244"},{"key":"e_1_3_2_2_35_1","doi-asserted-by":"crossref","unstructured":"Simon Niklaus Long Mai and Feng Liu. 2017b. Video frame interpolation via adaptive separable convolution. In ICCV. 261--270.  Simon Niklaus Long Mai and Feng Liu. 2017b. Video frame interpolation via adaptive separable convolution. In ICCV. 261--270.","DOI":"10.1109\/ICCV.2017.37"},{"key":"e_1_3_2_2_36_1","volume-title":"Bmbc: Bilateral motion estimation with bilateral cost","author":"Park Junheum","year":"2020","unstructured":"Junheum Park , Keunsoo Ko , Chul Lee , and Chang-Su Kim . 2020 . Bmbc: Bilateral motion estimation with bilateral cost volume for video interpolation. arXiv preprint arXiv: 2007 .12622 (2020). Junheum Park, Keunsoo Ko, Chul Lee, and Chang-Su Kim. 2020. Bmbc: Bilateral motion estimation with bilateral cost volume for video interpolation. arXiv preprint arXiv:2007.12622 (2020)."},{"key":"e_1_3_2_2_37_1","volume-title":"Asymmetric Bilateral Motion Estimation for Video Frame Interpolation. arXiv preprint arXiv:2108.06815","author":"Park Junheum","year":"2021","unstructured":"Junheum Park , Chul Lee , and Chang-Su Kim . 2021. Asymmetric Bilateral Motion Estimation for Video Frame Interpolation. arXiv preprint arXiv:2108.06815 ( 2021 ). Junheum Park, Chul Lee, and Chang-Su Kim. 2021. Asymmetric Bilateral Motion Estimation for Video Frame Interpolation. arXiv preprint arXiv:2108.06815 (2021)."},{"key":"e_1_3_2_2_38_1","volume-title":"Advances in Neural Information Processing Systems","volume":"34","author":"Patrick Mandela","year":"2021","unstructured":"Mandela Patrick , Dylan Campbell , Yuki Asano , Ishan Misra , Florian Metze , Christoph Feichtenhofer , Andrea Vedaldi , and Jo ao F Henriques . 2021 . Keeping your eye on the ball: Trajectory attention in video transformers . Advances in Neural Information Processing Systems , Vol. 34 (2021). Mandela Patrick, Dylan Campbell, Yuki Asano, Ishan Misra, Florian Metze, Christoph Feichtenhofer, Andrea Vedaldi, and Jo ao F Henriques. 2021. Keeping your eye on the ball: Trajectory attention in video transformers. Advances in Neural Information Processing Systems , Vol. 34 (2021)."},{"key":"e_1_3_2_2_39_1","doi-asserted-by":"crossref","unstructured":"Anurag Ranjan and Michael J Black. 2017. Optical flow estimation using a spatial pyramid network. In CVPR. 4161--4170.  Anurag Ranjan and Michael J Black. 2017. Optical flow estimation using a spatial pyramid network. In CVPR. 4161--4170.","DOI":"10.1109\/CVPR.2017.291"},{"key":"e_1_3_2_2_40_1","doi-asserted-by":"crossref","unstructured":"Wang Shen Wenbo Bao Guangtao Zhai Li Chen Xiongkuo Min and Zhiyong Gao. 2020. Blurry video frame interpolation. In CVPR. 5114--5123.  Wang Shen Wenbo Bao Guangtao Zhai Li Chen Xiongkuo Min and Zhiyong Gao. 2020. Blurry video frame interpolation. In CVPR. 5114--5123.","DOI":"10.1109\/CVPR42600.2020.00516"},{"key":"e_1_3_2_2_41_1","unstructured":"Wenzhe Shi Jose Caballero Ferenc Husz\u00e1r Johannes Totz Andrew P Aitken Rob Bishop Daniel Rueckert and Zehan Wang. 2016. Real-time single image and video super-resolution using an efficient sub-pixel convolutional neural network. In CVPR. 1874--1883.  Wenzhe Shi Jose Caballero Ferenc Husz\u00e1r Johannes Totz Andrew P Aitken Rob Bishop Daniel Rueckert and Zehan Wang. 2016. Real-time single image and video super-resolution using an efficient sub-pixel convolutional neural network. In CVPR. 1874--1883."},{"key":"e_1_3_2_2_42_1","volume-title":"Video Frame Interpolation via Generalized Deformable Convolution","author":"Shi Zhihao","year":"2021","unstructured":"Zhihao Shi , Xiaohong Liu , Kangdi Shi , Linhui Dai , and Jun Chen . 2021. Video Frame Interpolation via Generalized Deformable Convolution . IEEE Transactions on Multimedia ( 2021 ). Zhihao Shi, Xiaohong Liu, Kangdi Shi, Linhui Dai, and Jun Chen. 2021. Video Frame Interpolation via Generalized Deformable Convolution. IEEE Transactions on Multimedia (2021)."},{"key":"e_1_3_2_2_43_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.01422"},{"key":"e_1_3_2_2_44_1","volume-title":"Amir Roshan Zamir, and Mubarak Shah","author":"Soomro Khurram","year":"2012","unstructured":"Khurram Soomro , Amir Roshan Zamir, and Mubarak Shah . 2012 . UCF101: A dataset of 101 human actions classes from videos in the wild. arXiv preprint arXiv:1212.0402 (2012). Khurram Soomro, Amir Roshan Zamir, and Mubarak Shah. 2012. UCF101: A dataset of 101 human actions classes from videos in the wild. arXiv preprint arXiv:1212.0402 (2012)."},{"key":"e_1_3_2_2_45_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.01348"},{"key":"e_1_3_2_2_46_1","doi-asserted-by":"crossref","unstructured":"Xiaolong Wang Ross Girshick Abhinav Gupta and Kaiming He. 2018. Non-local neural networks. In CVPR. 7794--7803.  Xiaolong Wang Ross Girshick Abhinav Gupta and Kaiming He. 2018. Non-local neural networks. In CVPR. 7794--7803.","DOI":"10.1109\/CVPR.2018.00813"},{"key":"e_1_3_2_2_47_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2003.819861"},{"key":"e_1_3_2_2_48_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01234-2_1"},{"key":"e_1_3_2_2_49_1","volume-title":"GMFlow: Learning Optical Flow via Global Matching. arXiv preprint arXiv:2111.13680","author":"Xu Haofei","year":"2021","unstructured":"Haofei Xu , Jing Zhang , Jianfei Cai , Hamid Rezatofighi , and Dacheng Tao . 2021. GMFlow: Learning Optical Flow via Global Matching. arXiv preprint arXiv:2111.13680 ( 2021 ). Haofei Xu, Jing Zhang, Jianfei Cai, Hamid Rezatofighi, and Dacheng Tao. 2021. GMFlow: Learning Optical Flow via Global Matching. arXiv preprint arXiv:2111.13680 (2021)."},{"key":"e_1_3_2_2_50_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-018-01144-2"},{"key":"e_1_3_2_2_51_1","volume-title":"A progressive fusion generative adversarial network for realistic and consistent video super-resolution","author":"Yi Peng","year":"2020","unstructured":"Peng Yi , Zhongyuan Wang , Kui Jiang , Junjun Jiang , Tao Lu , and Jiayi Ma. 2020. A progressive fusion generative adversarial network for realistic and consistent video super-resolution . IEEE Transactions on Pattern Analysis and Machine Intelligence ( 2020 ). Peng Yi, Zhongyuan Wang, Kui Jiang, Junjun Jiang, Tao Lu, and Jiayi Ma. 2020. A progressive fusion generative adversarial network for realistic and consistent video super-resolution. IEEE Transactions on Pattern Analysis and Machine Intelligence (2020)."},{"key":"e_1_3_2_2_52_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.00439"},{"key":"e_1_3_2_2_53_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.01458"},{"key":"e_1_3_2_2_54_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00262"},{"key":"e_1_3_2_2_55_1","unstructured":"Xizhou Zhu Han Hu Stephen Lin and Jifeng Dai. 2019. Deformable convnets v2: More deformable better results. In CVPR. 9308--9316.io  Xizhou Zhu Han Hu Stephen Lin and Jifeng Dai. 2019. Deformable convnets v2: More deformable better results. In CVPR. 9308--9316.io"}],"event":{"name":"MM '22: The 30th ACM International Conference on Multimedia","location":"Lisboa Portugal","acronym":"MM '22","sponsor":["SIGMM ACM Special Interest Group on Multimedia"]},"container-title":["Proceedings of the 30th ACM International Conference on Multimedia"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3503161.3547875","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3503161.3547875","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T19:02:35Z","timestamp":1750186955000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3503161.3547875"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,10,10]]},"references-count":55,"alternative-id":["10.1145\/3503161.3547875","10.1145\/3503161"],"URL":"https:\/\/doi.org\/10.1145\/3503161.3547875","relation":{},"subject":[],"published":{"date-parts":[[2022,10,10]]},"assertion":[{"value":"2022-10-10","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}