{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:14:33Z","timestamp":1750220073324,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":23,"publisher":"ACM","license":[{"start":{"date-parts":[[2023,1,6]],"date-time":"2023-01-06T00:00:00Z","timestamp":1672963200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2023,1,6]]},"DOI":"10.1145\/3582649.3582668","type":"proceedings-article","created":{"date-parts":[[2023,4,7]],"date-time":"2023-04-07T16:23:28Z","timestamp":1680884608000},"page":"8-15","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["ARNets: Action Recurrent Networks for Human Action Recognition"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-0368-4984","authenticated-orcid":false,"given":"Guangjun","family":"Zhang","sequence":"first","affiliation":[{"name":"School of Computer Science and Technology, Beijing Institute of Technology, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2938-0177","authenticated-orcid":false,"given":"Xiaobo","family":"Cai","sequence":"additional","affiliation":[{"name":"Intelligent Hardware Department, Shenzhen Youzhichuangxin Technologies Co., Ltd., China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0083-3016","authenticated-orcid":false,"given":"Guangyu","family":"Gao","sequence":"additional","affiliation":[{"name":"School of Computer Science and Technology, Beijing Institute of Technology, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5388-1436","authenticated-orcid":false,"given":"Zihua","family":"Yan","sequence":"additional","affiliation":[{"name":"Intelligent Hardware Department, Shenzhen Youzhichuangxin Technologies Co., Ltd., China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6015-0383","authenticated-orcid":false,"given":"Liang","family":"Shu","sequence":"additional","affiliation":[{"name":"Intelligent Hardware Department, Shenzhen Youzhichuangxin Technologies Co., Ltd., China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4447-5510","authenticated-orcid":false,"given":"Zhihui","family":"Hu","sequence":"additional","affiliation":[{"name":"Intelligent Hardware Department, Shenzhen Youzhichuangxin Technologies Co., Ltd., China"}]}],"member":"320","published-online":{"date-parts":[[2023,4,7]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"Action Recognition? A New Model and the Kinetics Dataset. CVPR","author":"Carreira Jo\u00e3o","year":"2017","unstructured":"Jo\u00e3o Carreira and Andrew Zisserman . 2017. Quo Vadis , Action Recognition? A New Model and the Kinetics Dataset. CVPR ( 2017 ). Jo\u00e3o Carreira and Andrew Zisserman. 2017. Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset. CVPR (2017)."},{"key":"e_1_3_2_1_2_1","volume-title":"Wildes","author":"Feichtenhofer Christoph","year":"2017","unstructured":"Christoph Feichtenhofer , Axel Pinz , and Richard P . Wildes . 2017 . Spatiotemporal Multiplier Networks for Video Action Recognition. In CVPR. Christoph Feichtenhofer, Axel Pinz, and Richard P. Wildes. 2017. Spatiotemporal Multiplier Networks for Video Action Recognition. In CVPR."},{"key":"e_1_3_2_1_3_1","volume-title":"Long Short-Term Memory. Neural Computation 9, 8 (Nov","author":"Hochreiter Sepp","year":"1997","unstructured":"Sepp Hochreiter and J\u00fcrgen Schmidhuber . 1997. Long Short-Term Memory. Neural Computation 9, 8 (Nov 1997 ), 1735\u20131780. Sepp Hochreiter and J\u00fcrgen Schmidhuber. 1997. Long Short-Term Memory. Neural Computation 9, 8 (Nov 1997), 1735\u20131780."},{"key":"e_1_3_2_1_4_1","volume-title":"Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift.. In ICML","author":"Ioffe Sergey","year":"2015","unstructured":"Sergey Ioffe and Christian Szegedy . 2015 . Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift.. In ICML Sergey Ioffe and Christian Szegedy. 2015. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift.. In ICML"},{"key":"e_1_3_2_1_5_1","unstructured":"A. Zisserman K. Simonyan. 2014. Two-Stream Convolutional Networks for Action Recognition in Videos. In NIPS.  A. Zisserman K. Simonyan. 2014. Two-Stream Convolutional Networks for Action Recognition in Videos. In NIPS."},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.cviu.2017.10.011"},{"volume-title":"Moments in Time Dataset: one million videos for event understanding","author":"Monfort Mathew","key":"e_1_3_2_1_7_1","unstructured":"Mathew Monfort , Alex Andonian , Bolei Zhou , 2019. Moments in Time Dataset: one million videos for event understanding . IEEE transactions on pattern analysis and machine intelligence 42 (2), 502\u2013508. Mathew Monfort, Alex Andonian, Bolei Zhou, 2019. Moments in Time Dataset: one million videos for event understanding. IEEE transactions on pattern analysis and machine intelligence 42 (2), 502\u2013508."},{"key":"e_1_3_2_1_8_1","volume-title":"Berg","author":"Park Eunbyung","year":"2016","unstructured":"Eunbyung Park , Xufeng Han , Tamara L. Berg , and Alexander C . Berg . 2016 . Combining multiple sources of knowledge in deep CNNs for action recognition.. In WACV Eunbyung Park, Xufeng Han, Tamara L. Berg, and Alexander C. Berg. 2016. Combining multiple sources of knowledge in deep CNNs for action recognition.. In WACV"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"crossref","unstructured":"Sarabu A Santra A K. 2021. Human Action Recognition in Videos using Convolution Long Short-term Memory Network with Spatio-temporal Networks. Emerging Science Journal 5(1) 25-33.  Sarabu A Santra A K. 2021. Human Action Recognition in Videos using Convolution Long Short-term Memory Network with Spatio-temporal Networks. Emerging Science Journal 5(1) 25-33.","DOI":"10.28991\/esj-2021-01254"},{"volume-title":"ICCV","author":"Sun Lin","key":"e_1_3_2_1_10_1","unstructured":"Lin Sun , Kui Jia , Kevin Chen , Dit-Yan Yeung , Bertram E. Shi , and Silvio Savarese . 2017. Lattice Long Short-Term Memory for Human Action Recognition . In ICCV Lin Sun, Kui Jia, Kevin Chen, Dit-Yan Yeung, Bertram E. Shi, and Silvio Savarese. 2017. Lattice Long Short-Term Memory for Human Action Recognition. In ICCV"},{"key":"e_1_3_2_1_11_1","volume-title":"Shi","author":"Sun Lin","year":"2015","unstructured":"Lin Sun , Kui Jia , Dit-Yan Yeung , and Bertram E . Shi . 2015 . Human Action Recognition Using Factorized Spatio-Temporal Convolutional Networks. In ICCV. Lin Sun, Kui Jia, Dit-Yan Yeung, and Bertram E. Shi. 2015. Human Action Recognition Using Factorized Spatio-Temporal Convolutional Networks. In ICCV."},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"crossref","unstructured":"Du Tran Lubomir Bourdev Rob Fergus Lorenzo Torresani and Manohar Paluri. 2015. Learning Spatiotemporal Features with 3D Convolutional Networks. In ICCV.  Du Tran Lubomir Bourdev Rob Fergus Lorenzo Torresani and Manohar Paluri. 2015. Learning Spatiotemporal Features with 3D Convolutional Networks. In ICCV.","DOI":"10.1109\/ICCV.2015.510"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"crossref","unstructured":"Du Tran Heng Wang Lorenzo Torresani Jamie Ray Yann LeCun and Manohar Paluri. 2018. A Closer Look at Spatiotemporal Convolutions for Action Recognition. In CVPR.  Du Tran Heng Wang Lorenzo Torresani Jamie Ray Yann LeCun and Manohar Paluri. 2018. A Closer Look at Spatiotemporal Convolutions for Action Recognition. In CVPR.","DOI":"10.1109\/CVPR.2018.00675"},{"key":"e_1_3_2_1_14_1","volume-title":"Visualizing Data using t-SNE. Journal of Machine Learning Research 9","author":"van der Maaten Laurens","year":"2008","unstructured":"Laurens van der Maaten and Geoffrey Hinton . 2008. Visualizing Data using t-SNE. Journal of Machine Learning Research 9 ( 2008 ), 2579\u20132605. http: \/\/www.jmlr.org\/papers\/v9\/vandermaaten08a.html Laurens van der Maaten and Geoffrey Hinton. 2008. Visualizing Data using t-SNE. Journal of Machine Learning Research 9 (2008), 2579\u20132605. http: \/\/www.jmlr.org\/papers\/v9\/vandermaaten08a.html"},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"crossref","unstructured":"Limin Wang Yuanjun Xiong Zhe Wang Yu Qiao Dahua Lin Xiaoou Tang and Luc Van Gool. 2016. Temporal Segment Networks: Towards Good Practices for Deep Action Recognition. In ECCV.  Limin Wang Yuanjun Xiong Zhe Wang Yu Qiao Dahua Lin Xiaoou Tang and Luc Van Gool. 2016. Temporal Segment Networks: Towards Good Practices for Deep Action Recognition. In ECCV.","DOI":"10.1007\/978-3-319-46484-8_2"},{"key":"e_1_3_2_1_16_1","unstructured":"SHI Xingjian Zhourong Chen Hao Wang Dit-Yan Yeung Wai-kin Wong and Wang-chun Woo. 2015. Convolutional LSTM network: A machine learning approach for precipitation nowcasting. In NIPS.  SHI Xingjian Zhourong Chen Hao Wang Dit-Yan Yeung Wai-kin Wong and Wang-chun Woo. 2015. Convolutional LSTM network: A machine learning approach for precipitation nowcasting. In NIPS."},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"crossref","unstructured":"Bowen Zhang Limin Wang Zhe Wang Yu Qiao and Hanli Wang. 2016. Realtime Action Recognition with Enhanced Motion Vector CNNs. In CVPR.  Bowen Zhang Limin Wang Zhe Wang Yu Qiao and Hanli Wang. 2016. Realtime Action Recognition with Enhanced Motion Vector CNNs. In CVPR.","DOI":"10.1109\/CVPR.2016.297"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"crossref","unstructured":"Bolei Zhou Alex Andonian Aude Oliva and Antonio Torralba. 2018. Temporal Relational Reasoning in Videos. In ECCV.  Bolei Zhou Alex Andonian Aude Oliva and Antonio Torralba. 2018. Temporal Relational Reasoning in Videos. In ECCV.","DOI":"10.1007\/978-3-030-01246-5_49"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"crossref","unstructured":"Bolei Zhou Aditya Khosla Agata Lapedriza Aude Oliva and Antonio Torralba. 2016. Learning Deep Features for Discriminative Localization. In CVPR.  Bolei Zhou Aditya Khosla Agata Lapedriza Aude Oliva and Antonio Torralba. 2016. Learning Deep Features for Discriminative Localization. In CVPR.","DOI":"10.1109\/CVPR.2016.319"},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.18178\/joig.6.1.21-26"},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.18178\/joig.6.2.174-180"},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.12720\/joig.2.1.28-32"},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.18178\/joig.3.2.96-101"}],"event":{"name":"ICIGP 2023: 2023 The 6th International Conference on Image and Graphics Processing","acronym":"ICIGP 2023","location":"Chongqing China"},"container-title":["Proceedings of the 2023 6th International Conference on Image and Graphics Processing"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3582649.3582668","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3582649.3582668","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T18:09:14Z","timestamp":1750183754000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3582649.3582668"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,1,6]]},"references-count":23,"alternative-id":["10.1145\/3582649.3582668","10.1145\/3582649"],"URL":"https:\/\/doi.org\/10.1145\/3582649.3582668","relation":{},"subject":[],"published":{"date-parts":[[2023,1,6]]},"assertion":[{"value":"2023-04-07","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}