{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,22]],"date-time":"2026-04-22T19:34:55Z","timestamp":1776886495883,"version":"3.51.2"},"publisher-location":"New York, NY, USA","reference-count":23,"publisher":"ACM","license":[{"start":{"date-parts":[[2023,12,6]],"date-time":"2023-12-06T00:00:00Z","timestamp":1701820800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2023,12,6]]},"DOI":"10.1145\/3595916.3626362","type":"proceedings-article","created":{"date-parts":[[2024,1,1]],"date-time":"2024-01-01T16:34:41Z","timestamp":1704126881000},"page":"1-5","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["Multi-region CNN-Transformer for Micro-gesture Recognition in Face and Upper Body"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0009-0006-7963-3952","authenticated-orcid":false,"given":"Keita","family":"Suzuki","sequence":"first","affiliation":[{"name":"NTT Corporation, JP"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1423-3767","authenticated-orcid":false,"given":"Satoshi","family":"Suzuki","sequence":"additional","affiliation":[{"name":"NTT Corporation, JP"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2415-4149","authenticated-orcid":false,"given":"Ryo","family":"Masumura","sequence":"additional","affiliation":[{"name":"NTT Corporation, JP"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3971-0654","authenticated-orcid":false,"given":"Atsushi","family":"Ando","sequence":"additional","affiliation":[{"name":"NTT Corporation, JP"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7065-315X","authenticated-orcid":false,"given":"Naoki","family":"Makishima","sequence":"additional","affiliation":[{"name":"NTT Corporation, JP"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2024,1]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"Proceedings of the 38th International Conference on Machine Learning. 813\u2013824","author":"Bertasius Gedas","year":"2021","unstructured":"Gedas Bertasius , Heng Wang , and Lorenzo Torresani . 2021 . Is space-time attention all you need for video understanding? . In Proceedings of the 38th International Conference on Machine Learning. 813\u2013824 . Gedas Bertasius, Heng Wang, and Lorenzo Torresani. 2021. Is space-time attention all you need for video understanding?. In Proceedings of the 38th International Conference on Machine Learning. 813\u2013824."},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2009.5206848"},{"key":"e_1_3_2_1_3_1","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 6824\u20136835","author":"Fan Haoqi","year":"2021","unstructured":"Haoqi Fan , Bo Xiong , Karttikeya Mangalam , Yanghao Li , Zhicheng Yan , Jitendra Malik , and Christoph Feichtenhofer . 2021 . Multiscale vision transformers . In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 6824\u20136835 . Haoqi Fan, Bo Xiong, Karttikeya Mangalam, Yanghao Li, Zhicheng Yan, Jitendra Malik, and Christoph Feichtenhofer. 2021. Multiscale vision transformers. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 6824\u20136835."},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/2993148.2997632"},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"crossref","first-page":"87","DOI":"10.1109\/TPAMI.2022.3152247","article-title":"A survey on vision transformer","volume":"45","author":"Han Kai","year":"2022","unstructured":"Kai Han , Yunhe Wang , Hanting Chen , Xinghao Chen , Jianyuan Guo , Zhenhua Liu , Yehui Tang , An Xiao , Chunjing Xu , Yixing Xu , Zhaohui Yang , Yiman Zhang , and Dacheng Tao . 2022 . A survey on vision transformer . IEEE Transactions on Pattern Analysis and Machine Intelligence 45 , 1 (2022), 87 \u2013 110 . Kai Han, Yunhe Wang, Hanting Chen, Xinghao Chen, Jianyuan Guo, Zhenhua Liu, Yehui Tang, An Xiao, Chunjing Xu, Yixing Xu, Zhaohui Yang, Yiman Zhang, and Dacheng Tao. 2022. A survey on vision transformer. IEEE Transactions on Pattern Analysis and Machine Intelligence 45, 1 (2022), 87\u2013110.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"e_1_3_2_1_6_1","volume-title":"Long short-term memory. Neural computation 9, 8","author":"Hochreiter Sepp","year":"1997","unstructured":"Sepp Hochreiter and J\u00fcrgen Schmidhuber . 1997. Long short-term memory. Neural computation 9, 8 ( 1997 ), 1735\u20131780. Sepp Hochreiter and J\u00fcrgen Schmidhuber. 1997. Long short-term memory. Neural computation 9, 8 (1997), 1735\u20131780."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.5555\/1577069.1755843"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/5.726791"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/TAFFC.2020.2981446"},{"key":"e_1_3_2_1_10_1","volume-title":"Proceedings of the 8th International Conference on Learning Representations.","author":"Liu Liyuan","year":"2020","unstructured":"Liyuan Liu , Haoming Jiang , Pengcheng He , Weizhu Chen , Xiaodong Liu , Jianfeng Gao , and Jiawei Han . 2020 . On the Variance of the Adaptive Learning Rate and Beyond . In Proceedings of the 8th International Conference on Learning Representations. Liyuan Liu, Haoming Jiang, Pengcheng He, Weizhu Chen, Xiaodong Liu, Jianfeng Gao, and Jiawei Han. 2020. On the Variance of the Adaptive Learning Rate and Beyond. In Proceedings of the 8th International Conference on Learning Representations."},{"key":"e_1_3_2_1_11_1","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 10631\u201310642","author":"Liu Xin","year":"2021","unstructured":"Xin Liu , Henglin Shi , Haoyu Chen , Zitong Yu , Xiaobai Li , and Guoying Zhao . 2021 . iMiGUE: An Identity-free Video Dataset for Micro-Gesture Understanding and Emotion Analysis . In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 10631\u201310642 . Xin Liu, Henglin Shi, Haoyu Chen, Zitong Yu, Xiaobai Li, and Guoying Zhao. 2021. iMiGUE: An Identity-free Video Dataset for Micro-Gesture Understanding and Emotion Analysis. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 10631\u201310642."},{"key":"e_1_3_2_1_12_1","volume-title":"Proceedings of the 1st International Conference on Machine Learning and Machine Intelligence. 27\u201331","author":"Ly Son\u00a0Thai","year":"2018","unstructured":"Son\u00a0Thai Ly , Guee-Sang Lee , Soo-Hyung Kim , and Hyung-Jeong Yang . 2018 . Emotion Recognition via Body Gesture: Deep Learning Model Coupled with Keyframe Selection . In Proceedings of the 1st International Conference on Machine Learning and Machine Intelligence. 27\u201331 . Son\u00a0Thai Ly, Guee-Sang Lee, Soo-Hyung Kim, and Hyung-Jeong Yang. 2018. Emotion Recognition via Body Gesture: Deep Learning Model Coupled with Keyframe Selection. In Proceedings of the 1st International Conference on Machine Learning and Machine Intelligence. 27\u201331."},{"key":"e_1_3_2_1_13_1","volume-title":"Silent messages. Vol.\u00a08","author":"Mehrabian Albert","unstructured":"Albert Mehrabian . 1971. Silent messages. Vol.\u00a08 . Wadsworth Belmont, CA . Albert Mehrabian. 1971. Silent messages. Vol.\u00a08. Wadsworth Belmont, CA."},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/TAFFC.2018.2874986"},{"key":"e_1_3_2_1_15_1","volume-title":"Attentive Pooling Networks. arXiv preprint arXiv:1602.03609","author":"Santos Cicero\u00a0dos","year":"2016","unstructured":"Cicero\u00a0dos Santos , Ming Tan , Bing Xiang , and Bowen Zhou . 2016. Attentive Pooling Networks. arXiv preprint arXiv:1602.03609 ( 2016 ). Cicero\u00a0dos Santos, Ming Tan, Bing Xiang, and Bowen Zhou. 2016. Attentive Pooling Networks. arXiv preprint arXiv:1602.03609 (2016)."},{"key":"e_1_3_2_1_16_1","first-page":"3200","article-title":"Human Action Recognition From Various Data Modalities","volume":"45","author":"Sun Zehua","year":"2023","unstructured":"Zehua Sun , Qiuhong Ke , Hossein Rahmani , Mohammed Bennamoun , Gang Wang , and Jun Liu . 2023 . Human Action Recognition From Various Data Modalities : A Review. IEEE Transactions on Pattern Analysis and Machine Intelligence 45 , 3 (2023), 3200 \u2013 3225 . Zehua Sun, Qiuhong Ke, Hossein Rahmani, Mohammed Bennamoun, Gang Wang, and Jun Liu. 2023. Human Action Recognition From Various Data Modalities: A Review. IEEE Transactions on Pattern Analysis and Machine Intelligence 45, 3 (2023), 3200\u20133225.","journal-title":"A Review. IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"e_1_3_2_1_17_1","volume-title":"Proceedings of the 36th International Conference on Machine Learning. 6105\u20136114","author":"Tan Mingxing","year":"2019","unstructured":"Mingxing Tan and Quoc Le . 2019 . EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks . In Proceedings of the 36th International Conference on Machine Learning. 6105\u20136114 . Mingxing Tan and Quoc Le. 2019. EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks. In Proceedings of the 36th International Conference on Machine Learning. 6105\u20136114."},{"key":"e_1_3_2_1_18_1","volume-title":"Proceedings of the 31st International Conference on Neural Information Processing Systems. 6000\u20136010","author":"Vaswani Ashish","year":"2017","unstructured":"Ashish Vaswani , Noam Shazeer , Niki Parmar , Jakob Uszkoreit , Llion Jones , Aidan\u00a0 N Gomez , \u0141ukasz Kaiser , and Illia Polosukhin . 2017 . Attention is All you Need . In Proceedings of the 31st International Conference on Neural Information Processing Systems. 6000\u20136010 . Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan\u00a0N Gomez, \u0141ukasz Kaiser, and Illia Polosukhin. 2017. Attention is All you Need. In Proceedings of the 31st International Conference on Neural Information Processing Systems. 6000\u20136010."},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/3136755.3143011"},{"key":"e_1_3_2_1_20_1","volume-title":"A Systematic Review on Affective Computing: Emotion Models, Databases, and Recent Advances. Information Fusion 83\u201384","author":"Wang Yan","year":"2022","unstructured":"Yan Wang , Wei Song , Wei Tao , Antonio Liotta , Dawei Yang , Xinlei Li , Shuyong Gao , Yixuan Sun , Weifeng Ge , Wei Zhang , and Wenqiang Zhang . 2022. A Systematic Review on Affective Computing: Emotion Models, Databases, and Recent Advances. Information Fusion 83\u201384 ( 2022 ), 19\u201352. Yan Wang, Wei Song, Wei Tao, Antonio Liotta, Dawei Yang, Xinlei Li, Shuyong Gao, Yixuan Sun, Weifeng Ge, Wei Zhang, and Wenqiang Zhang. 2022. A Systematic Review on Affective Computing: Emotion Models, Databases, and Recent Advances. Information Fusion 83\u201384 (2022), 19\u201352."},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.75"},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"crossref","first-page":"183","DOI":"10.1016\/j.vrih.2021.05.001","article-title":"Review of dynamic gesture recognition","volume":"3","author":"Yuanyuan SHI","year":"2021","unstructured":"SHI Yuanyuan , LI Yunan , FU Xiaolong , MIAO Kaibin , and MIAO Qiguang . 2021 . Review of dynamic gesture recognition . Virtual Reality & Intelligent Hardware 3 , 3 (2021), 183 \u2013 206 . SHI Yuanyuan, LI Yunan, FU Xiaolong, MIAO Kaibin, and MIAO Qiguang. 2021. Review of dynamic gesture recognition. Virtual Reality & Intelligent Hardware 3, 3 (2021), 183\u2013206.","journal-title":"Virtual Reality & Intelligent Hardware"},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0065-2601(08)60369-X"}],"event":{"name":"MMAsia '23: ACM Multimedia Asia","location":"Tainan Taiwan","acronym":"MMAsia '23","sponsor":["SIGMM ACM Special Interest Group on Multimedia"]},"container-title":["ACM Multimedia Asia 2023"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3595916.3626362","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3595916.3626362","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T22:48:39Z","timestamp":1750286919000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3595916.3626362"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,12,6]]},"references-count":23,"alternative-id":["10.1145\/3595916.3626362","10.1145\/3595916"],"URL":"https:\/\/doi.org\/10.1145\/3595916.3626362","relation":{},"subject":[],"published":{"date-parts":[[2023,12,6]]},"assertion":[{"value":"2024-01-01","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}