{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,3]],"date-time":"2026-04-03T15:13:44Z","timestamp":1775229224542,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":26,"publisher":"ACM","license":[{"start":{"date-parts":[[2018,10,2]],"date-time":"2018-10-02T00:00:00Z","timestamp":1538438400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2018,10,2]]},"DOI":"10.1145\/3242969.3264991","type":"proceedings-article","created":{"date-parts":[[2018,10,2]],"date-time":"2018-10-02T08:09:29Z","timestamp":1538467769000},"page":"640-645","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":35,"title":["Cascade Attention Networks For Group Emotion Recognition with Face, Body and Image Cues"],"prefix":"10.1145","author":[{"given":"Kai","family":"Wang","sequence":"first","affiliation":[{"name":"Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Xiaoxing","family":"Zeng","sequence":"additional","affiliation":[{"name":"Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jianfei","family":"Yang","sequence":"additional","affiliation":[{"name":"Nanyang Technological University, Singapore, Singapore"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Debin","family":"Meng","sequence":"additional","affiliation":[{"name":"Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Kaipeng","family":"Zhang","sequence":"additional","affiliation":[{"name":"National Taiwan University, Taiwan, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Xiaojiang","family":"Peng","sequence":"additional","affiliation":[{"name":"Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yu","family":"Qiao","sequence":"additional","affiliation":[{"name":"Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2018,10,2]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"Shreya Ghosh Jyoti Joshi Jesse Hoey Abhinav Dhall Roland Goecke and Tom Gedeon. 2018. From Individual to Group-level Emotion Recognition: EmotiW 6.0 ICMI. ACM. Shreya Ghosh Jyoti Joshi Jesse Hoey Abhinav Dhall Roland Goecke and Tom Gedeon. 2018. From Individual to Group-level Emotion Recognition: EmotiW 6.0 ICMI. ACM."},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"crossref","unstructured":"Unaiza Ahsan Munmun De Choudhury and Irfan Essa. 2017. Towards using visual attributes to infer image sentiment of social events International Joint Conference on Neural Networks (IJCNN). 1372--1379. Unaiza Ahsan Munmun De Choudhury and Irfan Essa. 2017. Towards using visual attributes to infer image sentiment of social events International Joint Conference on Neural Networks (IJCNN). 1372--1379.","DOI":"10.1109\/IJCNN.2017.7966013"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/2993148.2993165"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"crossref","unstructured":"Zhe Cao Tomas Simon Shih-En Wei and Yaser Sheikh. 2017. Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields CVPR. Zhe Cao Tomas Simon Shih-En Wei and Yaser Sheikh. 2017. Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields CVPR.","DOI":"10.1109\/CVPR.2017.143"},{"key":"e_1_3_2_1_5_1","volume-title":"Imagenet: A large-scale hierarchical image database","author":"Deng Jia","year":"2009"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/2993148.2997638"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"crossref","unstructured":"Abhinav Dhall Jyoti Joshi Karan Sikka Roland Goecke and Nicu Sebe. 2015. The more the merrier: Analysing the affect of a group of people in images Workshops on Automatic Face and Gesture Recognition (FG) Vol. Vol. 1. 1--8. Abhinav Dhall Jyoti Joshi Karan Sikka Roland Goecke and Nicu Sebe. 2015. The more the merrier: Analysing the affect of a group of people in images Workshops on Automatic Face and Gesture Recognition (FG) Vol. Vol. 1. 1--8.","DOI":"10.1109\/FG.2015.7163151"},{"key":"e_1_3_2_1_8_1","unstructured":"Kaiming He Xiangyu Zhang Shaoqing Ren and Jian Sun. 2015. Deep Residual Learning for Image Recognition. CoRR Vol. abs\/1512.03385 (2015). http:\/\/arxiv.org\/abs\/1512.03385 Kaiming He Xiangyu Zhang Shaoqing Ren and Jian Sun. 2015. Deep Residual Learning for Image Recognition. CoRR Vol. abs\/1512.03385 (2015). http:\/\/arxiv.org\/abs\/1512.03385"},{"key":"e_1_3_2_1_9_1","unstructured":"Jie Hu Li Shen and Gang Sun. 2017. Squeeze-and-Excitation Networks. CoRR Vol. abs\/1709.01507 (2017). {arxiv}1709.01507http:\/\/arxiv.org\/abs\/1709.01507 Jie Hu Li Shen and Gang Sun. 2017. Squeeze-and-Excitation Networks. CoRR Vol. abs\/1709.01507 (2017). {arxiv}1709.01507http:\/\/arxiv.org\/abs\/1709.01507"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/2993148.2997636"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"crossref","unstructured":"Zheng Li Jianfei Yang Juan Zha Chang-Dong Wang and Weishi Zheng. 2016. Online visual tracking via correlation filter with convolutional networks Visual Communications and Image Processing (VCIP) 2016. IEEE 1--4. Zheng Li Jianfei Yang Juan Zha Chang-Dong Wang and Weishi Zheng. 2016. Online visual tracking via correlation filter with convolutional networks Visual Communications and Image Processing (VCIP) 2016. IEEE 1--4.","DOI":"10.1109\/VCIP.2016.7805476"},{"key":"e_1_3_2_1_12_1","unstructured":"Weiyang Liu Yandong Wen Zhiding Yu Ming Li Bhiksha Raj and Le Song. 2017. SphereFace: Deep Hypersphere Embedding for Face Recognition. CoRR Vol. abs\/1704.08063 (2017). http:\/\/arxiv.org\/abs\/1704.08063 Weiyang Liu Yandong Wen Zhiding Yu Ming Li Bhiksha Raj and Le Song. 2017. SphereFace: Deep Hypersphere Embedding for Face Recognition. CoRR Vol. abs\/1704.08063 (2017). http:\/\/arxiv.org\/abs\/1704.08063"},{"key":"e_1_3_2_1_13_1","unstructured":"Weiyang Liu Yandong Wen Zhiding Yu and Meng Yang. 2016. Large-Margin Softmax Loss for Convolutional Neural Networks. ICML. 507--516. Weiyang Liu Yandong Wen Zhiding Yu and Meng Yang. 2016. Large-Margin Softmax Loss for Convolutional Neural Networks. ICML. 507--516."},{"key":"e_1_3_2_1_14_1","volume-title":"Workshops on Automatic Face and Gesture Recognition (FG)","volume":"5","author":"Mou Wenxuan","year":"2015"},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"crossref","unstructured":"Tomas Simon Hanbyul Joo Iain Matthews and Yaser Sheikh. 2017. Hand Keypoint Detection in Single Images using Multiview Bootstrapping CVPR. Tomas Simon Hanbyul Joo Iain Matthews and Yaser Sheikh. 2017. Hand Keypoint Detection in Single Images using Multiview Bootstrapping CVPR.","DOI":"10.1109\/CVPR.2017.494"},{"key":"e_1_3_2_1_16_1","unstructured":"Karen Simonyan and Andrew Zisserman. 2014. Very Deep Convolutional Networks for Large-Scale Image Recognition. CoRR Vol. abs\/1409.1556 (2014). http:\/\/arxiv.org\/abs\/1409.1556 Karen Simonyan and Andrew Zisserman. 2014. Very Deep Convolutional Networks for Large-Scale Image Recognition. CoRR Vol. abs\/1409.1556 (2014). http:\/\/arxiv.org\/abs\/1409.1556"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/2993148.2997640"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/3136755.3143008"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/2993148.2997633"},{"key":"e_1_3_2_1_20_1","unstructured":"Shih-En Wei Varun Ramakrishna Takeo Kanade and Yaser Sheikh. 2016. Convolutional pose machines. In CVPR. Shih-En Wei Varun Ramakrishna Takeo Kanade and Yaser Sheikh. 2016. Convolutional pose machines. In CVPR."},{"key":"e_1_3_2_1_21_1","unstructured":"Guoying Zhao Roland Goecke Xiaohua Huang Abhinav Dhall and Matti Pietik\u00e4inen. 2015. Riesz-based Volume Local Binary Pattern and A Novel Group Expression Model for Group Happiness Intensity Analysis. In BMVC. 1--8. Guoying Zhao Roland Goecke Xiaohua Huang Abhinav Dhall and Matti Pietik\u00e4inen. 2015. Riesz-based Volume Local Binary Pattern and A Novel Group Expression Model for Group Happiness Intensity Analysis. In BMVC. 1--8."},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/3242969.3264981"},{"key":"e_1_3_2_1_23_1","unstructured":"Dong Yi Zhen Lei Shengcai Liao and Stan Z. Li. 2014. Learning face representation from scratch. arXiv preprint arXiv:1411.7923 (2014). Dong Yi Zhen Lei Shengcai Liao and Stan Z. Li. 2014. Learning face representation from scratch. arXiv preprint arXiv:1411.7923 (2014)."},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/LSP.2016.2603342"},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-017-1055-1"},{"key":"e_1_3_2_1_26_1","unstructured":"Bolei Zhou Agata Lapedriza Jianxiong Xiao Antonio Torralba and Aude Oliva. 2014. Learning deep features for scene recognition using places database NIPS. 487--495. Bolei Zhou Agata Lapedriza Jianxiong Xiao Antonio Torralba and Aude Oliva. 2014. Learning deep features for scene recognition using places database NIPS. 487--495."}],"event":{"name":"ICMI '18: INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION","location":"Boulder CO USA","acronym":"ICMI '18","sponsor":["SIGCHI Specialist Interest Group in Computer-Human Interaction of the ACM"]},"container-title":["Proceedings of the 20th ACM International Conference on Multimodal Interaction"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3242969.3264991","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3242969.3264991","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,22]],"date-time":"2025-10-22T00:57:12Z","timestamp":1761094632000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3242969.3264991"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,10,2]]},"references-count":26,"alternative-id":["10.1145\/3242969.3264991","10.1145\/3242969"],"URL":"https:\/\/doi.org\/10.1145\/3242969.3264991","relation":{},"subject":[],"published":{"date-parts":[[2018,10,2]]},"assertion":[{"value":"2018-10-02","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}