{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:17:11Z","timestamp":1750220231236,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":32,"publisher":"ACM","license":[{"start":{"date-parts":[[2021,12,1]],"date-time":"2021-12-01T00:00:00Z","timestamp":1638316800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61871226"],"award-info":[{"award-number":["61871226"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,12]]},"DOI":"10.1145\/3469877.3490608","type":"proceedings-article","created":{"date-parts":[[2022,1,10]],"date-time":"2022-01-10T18:24:29Z","timestamp":1641839069000},"page":"1-7","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["Intra- and Inter-frame Iterative Temporal Convolutional Networks for Video Stabilization"],"prefix":"10.1145","author":[{"given":"Haopeng","family":"Xie","sequence":"first","affiliation":[{"name":"Nanjing University of Science and Technology, CN"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Liang","family":"Xiao","sequence":"additional","affiliation":[{"name":"Nanjing University of Science and Technology, CN"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Huicong","family":"Wu","sequence":"additional","affiliation":[{"name":"Nanjing University of Science and Technology, CN"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2022,1,10]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"[n.d.]. A demo of our dataset. ([n. d.]). [Online] Available: hhttps:\/\/www.youtube.com\/watch?v=c9Lv73H_OCE.  [n.d.]. A demo of our dataset. ([n. d.]). [Online] Available: hhttps:\/\/www.youtube.com\/watch?v=c9Lv73H_OCE."},{"key":"e_1_3_2_1_2_1","unstructured":"[n.d.]. An example video of comparison result. ([n. d.]). [Online] Available: https:\/\/www.youtube.com\/watch?v=a5vZuPchmqw.  [n.d.]. An example video of comparison result. ([n. d.]). [Online] Available: https:\/\/www.youtube.com\/watch?v=a5vZuPchmqw."},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2017.2778011"},{"volume-title":"Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001, Vol.\u00a02. II\u2013II.","author":"Buehler Chris","key":"e_1_3_2_1_4_1","unstructured":"Chris Buehler , Michael Bosse , and Leonard McMillan . [n.d.]. Non-metric image-based rendering for video stabilization . In Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001, Vol.\u00a02. II\u2013II. Chris Buehler, Michael Bosse, and Leonard McMillan. [n.d.]. Non-metric image-based rendering for video stabilization. In Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001, Vol.\u00a02. II\u2013II."},{"key":"e_1_3_2_1_5_1","volume-title":"DIFRINT: Deep Iterative Frame Interpolation for Full-Frame Video Stabilization. In 2019 IEEE\/CVF International Conference on Computer Vision Workshop (ICCVW).","author":"Choi J.","year":"2020","unstructured":"J. Choi and I.\u00a0 S. Kweon . 2020 . DIFRINT: Deep Iterative Frame Interpolation for Full-Frame Video Stabilization. In 2019 IEEE\/CVF International Conference on Computer Vision Workshop (ICCVW). J. Choi and I.\u00a0S. Kweon. 2020. DIFRINT: Deep Iterative Frame Interpolation for Full-Frame Video Stabilization. In 2019 IEEE\/CVF International Conference on Computer Vision Workshop (ICCVW)."},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1207\/s15516709cog1402_1"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00138"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/2231816.2231824"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2011.5995525"},{"volume-title":"Non-Uniform Video Time-Lapse Method Based on Motion Scenario and Stabilization Constraint. In ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).","author":"Guo K.","key":"e_1_3_2_1_10_1","unstructured":"K. Guo , N. Kim , D. Seo , I. Kim , and S. Lim . 2020 . Non-Uniform Video Time-Lapse Method Based on Motion Scenario and Stabilization Constraint. In ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). K. Guo, N. Kim, D. Seo, I. Kim, and S. Lim. 2020. Non-Uniform Video Time-Lapse Method Based on Motion Scenario and Stabilization Constraint. In ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)."},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1162\/neco.1997.9.8.1735"},{"key":"e_1_3_2_1_12_1","volume-title":"Proceedings of the IEEE International Conference on Computer Vision. 4038\u20134047","author":"Hyun\u00a0Kim Tae","year":"2017","unstructured":"Tae Hyun\u00a0Kim , Kyoung Mu\u00a0Lee , Bernhard Scholkopf , and Michael Hirsch . 2017 . Online video deblurring via dynamic temporal blending network . In Proceedings of the IEEE International Conference on Computer Vision. 4038\u20134047 . Tae Hyun\u00a0Kim, Kyoung Mu\u00a0Lee, Bernhard Scholkopf, and Michael Hirsch. 2017. Online video deblurring via dynamic temporal blending network. In Proceedings of the IEEE International Conference on Computer Vision. 4038\u20134047."},{"volume-title":"A dataset and evaluation framework for deep learning based video stabilization systems","author":"Ito Maria\u00a0Silvia","key":"e_1_3_2_1_13_1","unstructured":"Maria\u00a0Silvia Ito and Ebroul Izquierdo . 2019. A dataset and evaluation framework for deep learning based video stabilization systems . In IEEE Visual Communications and Image Processing (VCIP) . 1\u20134. Maria\u00a0Silvia Ito and Ebroul Izquierdo. 2019. A dataset and evaluation framework for deep learning based video stabilization systems. In IEEE Visual Communications and Image Processing (VCIP). 1\u20134."},{"key":"e_1_3_2_1_14_1","volume-title":"Adam: A Method for Stochastic Optimization. Computer Science","author":"Kingma D.","year":"2014","unstructured":"D. Kingma and J. Ba . 2014 . Adam: A Method for Stochastic Optimization. Computer Science (2014). D. Kingma and J. Ba. 2014. Adam: A Method for Stochastic Optimization. Computer Science (2014)."},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/1559755.1559758"},{"key":"e_1_3_2_1_16_1","first-page":"1","article-title":"Subspace video stabilization","volume":"30","author":"Liu Feng","year":"2011","unstructured":"Feng Liu , Michael Gleicher , Jue Wang , Hailin Jin , and Aseem Agarwala . 2011 . Subspace video stabilization . ACM Transactions on Graphics (TOG) 30 , 1 (2011), 1 \u2013 10 . Feng Liu, Michael Gleicher, Jue Wang, Hailin Jin, and Aseem Agarwala. 2011. Subspace video stabilization. ACM Transactions on Graphics (TOG) 30, 1 (2011), 1\u201310.","journal-title":"ACM Transactions on Graphics (TOG)"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2017.2697759"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-46466-4_48"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2016.2556587"},{"key":"e_1_3_2_1_20_1","first-page":"1","article-title":"Bundled camera paths for video stabilization","volume":"32","author":"Liu Shuaicheng","year":"2013","unstructured":"Shuaicheng Liu , Lu Yuan , Ping Tan , and Jian Sun . 2013 . Bundled camera paths for video stabilization . ACM Transactions on Graphics (TOG) 32 , 4 (2013), 1 \u2013 10 . Shuaicheng Liu, Lu Yuan, Ping Tan, and Jian Sun. 2013. Bundled camera paths for video stabilization. ACM Transactions on Graphics (TOG) 32, 4 (2013), 1\u201310.","journal-title":"ACM Transactions on Graphics (TOG)"},{"key":"e_1_3_2_1_21_1","volume-title":"Predicting Video-frames Using Encoder-convlstm Combination. In ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).","author":"Mukherjee S.","year":"2019","unstructured":"S. Mukherjee , S. Ghosh , S. Ghosh , P. Kumar , and P.\u00a0 P. Roy . 2019 . Predicting Video-frames Using Encoder-convlstm Combination. In ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). S. Mukherjee, S. Ghosh, S. Ghosh, P. Kumar, and P.\u00a0P. Roy. 2019. Predicting Video-frames Using Encoder-convlstm Combination. In ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)."},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00829"},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-24574-4_28"},{"key":"e_1_3_2_1_24_1","unstructured":"Carlo Tomasi and Takeo Kanade. 1991. Detection and tracking of point features. (1991).  Carlo Tomasi and Takeo Kanade. 1991. Detection and tracking of point features. (1991)."},{"key":"e_1_3_2_1_25_1","volume-title":"International Conference on Machine Learning. PMLR, 1747\u20131756","author":"Van\u00a0Oord Aaron","year":"2016","unstructured":"Aaron Van\u00a0Oord , Nal Kalchbrenner , and Koray Kavukcuoglu . 2016 . Pixel recurrent neural networks . In International Conference on Machine Learning. PMLR, 1747\u20131756 . Aaron Van\u00a0Oord, Nal Kalchbrenner, and Koray Kavukcuoglu. 2016. Pixel recurrent neural networks. In International Conference on Machine Learning. PMLR, 1747\u20131756."},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2018.2884280"},{"volume-title":"The 28th ACM International Conference on Multimedia.","author":"Wang Y.","key":"e_1_3_2_1_27_1","unstructured":"Y. Wang , W.\u00a0 K. Zhang , Q. Liu , Z. Zhang , and X. Sun . 2020. Improving Intra- and Inter-Modality Visual Relation for Image Captioning. In MM \u201920 : The 28th ACM International Conference on Multimedia. Y. Wang, W.\u00a0K. Zhang, Q. Liu, Z. Zhang, and X. Sun. 2020. Improving Intra- and Inter-Modality Visual Relation for Image Captioning. In MM \u201920: The 28th ACM International Conference on Multimedia."},{"key":"e_1_3_2_1_28_1","unstructured":"SHI Xingjian Zhourong Chen Hao Wang Dit-Yan Yeung Wai-Kin Wong and Wang-chun Woo. 2015. Convolutional LSTM network: A machine learning approach for precipitation nowcasting. In Advances in neural information processing systems. 802\u2013810.  SHI Xingjian Zhourong Chen Hao Wang Dit-Yan Yeung Wai-Kin Wong and Wang-chun Woo. 2015. Convolutional LSTM network: A machine learning approach for precipitation nowcasting. In Advances in neural information processing systems. 802\u2013810."},{"volume-title":"Computer Graphics Forum, Vol.\u00a037","author":"Xu Sen-Zhe","key":"e_1_3_2_1_29_1","unstructured":"Sen-Zhe Xu , Jun Hu , Miao Wang , Tai-Jiang Mu , and Shi-Min Hu. 2018. Deep video stabilization using adversarial networks . In Computer Graphics Forum, Vol.\u00a037 . Wiley Online Library , 267\u2013276. Sen-Zhe Xu, Jun Hu, Miao Wang, Tai-Jiang Mu, and Shi-Min Hu. 2018. Deep video stabilization using adversarial networks. In Computer Graphics Forum, Vol.\u00a037. Wiley Online Library, 267\u2013276."},{"volume-title":"The 28th ACM International Conference on Multimedia.","author":"Zhang D","key":"e_1_3_2_1_30_1","unstructured":"D Zhang , W. Zhang , S. Li , Q. Zhu , and G. Zhou . 2020. Modeling both Intra- and Inter-modal Influence for Real-Time Emotion Detection in Conversations. In MM \u201920 : The 28th ACM International Conference on Multimedia. D Zhang, W. Zhang, S. Li, Q. Zhu, and G. Zhou. 2020. Modeling both Intra- and Inter-modal Influence for Real-Time Emotion Detection in Conversations. In MM \u201920: The 28th ACM International Conference on Multimedia."},{"volume-title":"HSA-RNN: Hierarchical Structure-Adaptive RNN for Video Summarization. In 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition.","author":"Zhao B.","key":"e_1_3_2_1_31_1","unstructured":"B. Zhao , X. Li , and X. Lu . 2018 . HSA-RNN: Hierarchical Structure-Adaptive RNN for Video Summarization. In 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition. B. Zhao, X. Li, and X. Lu. 2018. HSA-RNN: Hierarchical Structure-Adaptive RNN for Video Summarization. In 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition."},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2019.2963380"}],"event":{"name":"MMAsia '21: ACM Multimedia Asia","sponsor":["SIGMM ACM Special Interest Group on Multimedia"],"location":"Gold Coast Australia","acronym":"MMAsia '21"},"container-title":["ACM Multimedia Asia"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3469877.3490608","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3469877.3490608","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T19:30:16Z","timestamp":1750188616000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3469877.3490608"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,12]]},"references-count":32,"alternative-id":["10.1145\/3469877.3490608","10.1145\/3469877"],"URL":"https:\/\/doi.org\/10.1145\/3469877.3490608","relation":{},"subject":[],"published":{"date-parts":[[2021,12]]},"assertion":[{"value":"2022-01-10","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}