{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,30]],"date-time":"2025-10-30T07:12:07Z","timestamp":1761808327374,"version":"3.41.0"},"publisher-location":"New York, New York, USA","reference-count":16,"publisher":"ACM Press","license":[{"start":{"date-parts":[[2019,1,1]],"date-time":"2019-01-01T00:00:00Z","timestamp":1546300800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2019]]},"DOI":"10.1145\/3368926.3369726","type":"proceedings-article","created":{"date-parts":[[2019,12,20]],"date-time":"2019-12-20T13:30:11Z","timestamp":1576848611000},"page":"298-305","source":"Crossref","is-referenced-by-count":2,"title":["Spatio-temporal Multi-level Fusion for Human Action Recognition"],"prefix":"10.1145","author":[{"given":"Manh-Hung","family":"Lu","sequence":"first","affiliation":[{"name":"DS Lab, SoICT, HUST, Viettel High Technology Industries Corporation (VHT), Viettel Group Hanoi, Vietnam"}]},{"given":"Thi-Oanh","family":"Nguyen","sequence":"additional","affiliation":[{"name":"DS Lab, SoICT, HUST, Hanoi, Vietnam"}]}],"member":"320","reference":[{"key":"key-10.1145\/3368926.3369726-1","unstructured":"M Hasina Banu. 2014. Patterns of Motion Using MEI and MHI. International Conference on Information and Image Processing (ICIIP-2014) (2014), 337--340."},{"key":"key-10.1145\/3368926.3369726-2","doi-asserted-by":"crossref","unstructured":"João Carreira and Andrew Zisserman. 2017. Quo Vadis, action recognition? A new model and the kinetics dataset. Proceedings - 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017 2017-Janua (2017), 4724--4733. 
https:\/\/doi.org\/10.1109\/CVPR.2017.502 arXiv:1705.07750","DOI":"10.1109\/CVPR.2017.502"},{"key":"key-10.1145\/3368926.3369726-3","doi-asserted-by":"crossref","unstructured":"Rizwan Chaudhry, Avinash Ravichandran, Gregory Hager, and René Vidal. 2009. Histograms of oriented optical flow and Binet-Cauchy kernels on nonlinear dynamical systems for the recognition of human actions. 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, CVPR Workshops 2009 2009 IEEE (2009), 1932--1939. https:\/\/doi.org\/10.1109\/CVPRW.2009.5206821","DOI":"10.1109\/CVPRW.2009.5206821"},{"key":"key-10.1145\/3368926.3369726-4","doi-asserted-by":"crossref","unstructured":"Vasileios Choutas, Philippe Weinzaepfel, Jerome Revaud, and Cordelia Schmid. 2018. PoTion: Pose MoTion Representation for Action Recognition. 7024--7033. https:\/\/doi.org\/10.1109\/CVPR.2018.00734","DOI":"10.1109\/CVPR.2018.00734"},{"key":"key-10.1145\/3368926.3369726-5","doi-asserted-by":"crossref","unstructured":"Christoph Feichtenhofer, Axel Pinz, and Richard P. Wildes. 2016. Spatio-temporal Residual Networks for Video Action Recognition. Nips '16 (2016), 9. https:\/\/doi.org\/10.1109\/CVPR.2017.787 arXiv:1611.02155","DOI":"10.1109\/CVPR.2017.787"},{"key":"key-10.1145\/3368926.3369726-6","unstructured":"Samitha Herath, Mehrtash Tafazzoli Harandi, and Fatih Porikli. 2016. Going Deeper into Action Recognition: A Survey. Image Vision Comput. 2016 abs\/1605.04988 (2016). arXiv:arXiv:1605.04988v2"},{"key":"key-10.1145\/3368926.3369726-7","unstructured":"Karen Simonyan and Andrew Zisserman. 2014. Two-Stream Convolutional Networks for Action Recognition in Videos. Advances in Neural Information Processing Systems 27 (NIPS 2014) abs\/1406.2199 (2014), 1--9. arXiv:1406.2199 http:\/\/arxiv.org\/abs\/1406.2199"},{"key":"key-10.1145\/3368926.3369726-8","doi-asserted-by":"crossref","unstructured":"Ivan Sipiran and Benjamin Bustos. 2011. 
Harris 3D: A robust extension of the Harris operator for interest point detection on 3D meshes. Visual Computer 27, 11 (2011), 963--976. https:\/\/doi.org\/10.1007\/s00371-011-0610-y","DOI":"10.1007\/s00371-011-0610-y"},{"key":"key-10.1145\/3368926.3369726-9","unstructured":"Khurram Soomro, Amir Roshan Zamir, and Mubarak Shah. 2012. UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild. CoRR November (2012). arXiv:arXiv:1212.0402v1"},{"key":"key-10.1145\/3368926.3369726-10","doi-asserted-by":"crossref","unstructured":"Shuyang Sun, Zhanghui Kuang, Wanli Ouyang, Lu Sheng, and Wei Zhang. 2017. Optical Flow Guided Feature: A Fast and Robust Motion Representation for Video Action Recognition. The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018 (2017), 1390--1399. https:\/\/doi.org\/10.1109\/CVPR.2018.00151 arXiv:1711.11152","DOI":"10.1109\/CVPR.2018.00151"},{"key":"key-10.1145\/3368926.3369726-11","doi-asserted-by":"crossref","unstructured":"Christian Szegedy, Wei Liu, Yangqing Jia, Pierre Sermanet, Scott Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, and Andrew Rabinovich. 2015. Going deeper with convolutions. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition 07-12-June (2015), 1--9. https:\/\/doi.org\/10.1109\/CVPR.2015.7298594 arXiv:arXiv:1409.4842v1","DOI":"10.1109\/CVPR.2015.7298594"},{"key":"key-10.1145\/3368926.3369726-12","doi-asserted-by":"crossref","unstructured":"Christian Szegedy, Vincent Vanhoucke, Sergey Ioffe, Jon Shlens, and Zbigniew Wojna. 2016. Rethinking the Inception Architecture for Computer Vision. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition 2016-Decem (2016), 2818--2826. 
https:\/\/doi.org\/10.1109\/CVPR.2016.308 arXiv:arXiv:1512.00567v3","DOI":"10.1109\/CVPR.2016.308"},{"key":"key-10.1145\/3368926.3369726-13","doi-asserted-by":"crossref","unstructured":"Du Tran, Lubomir Bourdev, Rob Fergus, Lorenzo Torresani, and Manohar Paluri. 2015. Learning spatiotemporal features with 3D convolutional networks. Proceedings of the IEEE International Conference on Computer Vision 2015 Inter (2015), 4489--4497. https:\/\/doi.org\/10.1109\/ICCV.2015.510 arXiv:1412.0767","DOI":"10.1109\/ICCV.2015.510"},{"key":"key-10.1145\/3368926.3369726-14","unstructured":"Heng Wang and Cordelia Schmid. 2013. Action Recognition with Improved Trajectories. ICCV-IEEE International Conference on Computer Vision December (2013), 3551--3558."},{"key":"key-10.1145\/3368926.3369726-15","unstructured":"Kläser A. Schmid C. et al. Wang, H. 2013. Dense Trajectories and Motion Boundary Descriptors for Action Recognition. International Journal of Computer Vision 103, 1 (2013), 60--79."},{"key":"key-10.1145\/3368926.3369726-16","doi-asserted-by":"crossref","unstructured":"Limin Wang, Yuanjun Xiong, Zhe Wang, Yu Qiao, Dahua Lin, Xiaoou Tang, and Luc Van Gool. 2018. Temporal Segment Networks for Action Recognition in Videos. IEEE Transactions on Pattern Analysis and Machine Intelligence (2018), 1--14. 
https:\/\/doi.org\/10.1109\/TPAMI.2018.2868668 arXiv:1705.02953","DOI":"10.1109\/TPAMI.2018.2868668"}],"event":{"number":"10","sponsor":["SOICT, School of Information and Communication Technology - HUST","NAFOSTED, The National Foundation for Science and Technology Development"],"acronym":"SoICT 2019","name":"the Tenth International Symposium","start":{"date-parts":[[2019,12,4]]},"location":"Hanoi, Ha Long Bay, Viet Nam","end":{"date-parts":[[2019,12,6]]}},"container-title":["Proceedings of the Tenth International Symposium on Information and Communication Technology  - SoICT 2019"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3368926.3369726","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/dl.acm.org\/ft_gateway.cfm?id=3369726&ftid=2101269&dwn=1","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T23:53:05Z","timestamp":1750204385000},"score":1,"resource":{"primary":{"URL":"http:\/\/dl.acm.org\/citation.cfm?doid=3368926.3369726"}},"subtitle":[],"proceedings-subject":"Information and Communication Technology","short-title":[],"issued":{"date-parts":[[2019]]},"references-count":16,"URL":"https:\/\/doi.org\/10.1145\/3368926.3369726","relation":{},"subject":[],"published":{"date-parts":[[2019]]}}}