{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,3]],"date-time":"2026-04-03T15:13:43Z","timestamp":1775229223531,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":25,"publisher":"ACM","license":[{"start":{"date-parts":[[2018,10,2]],"date-time":"2018-10-02T00:00:00Z","timestamp":1538438400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100000038","name":"Natural Sciences and Engineering Research Council of Canada","doi-asserted-by":"publisher","award":["CRSNG RGPIN 2018-04825"],"award-info":[{"award-number":["CRSNG RGPIN 2018-04825"]}],"id":[{"id":"10.13039\/501100000038","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2018,10,2]]},"DOI":"10.1145\/3242969.3264985","type":"proceedings-article","created":{"date-parts":[[2018,10,2]],"date-time":"2018-10-02T12:09:29Z","timestamp":1538482169000},"page":"611-615","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":43,"title":["An Attention Model for Group-Level Emotion Recognition"],"prefix":"10.1145","author":[{"given":"Aarush","family":"Gupta","sequence":"first","affiliation":[{"name":"Indian Institute of Technology Roorkee, Roorkee, India"}]},{"given":"Dakshit","family":"Agrawal","sequence":"additional","affiliation":[{"name":"Indian Institute of Technology Roorkee, Roorkee, India"}]},{"given":"Hardik","family":"Chauhan","sequence":"additional","affiliation":[{"name":"Indian Institute of Technology Roorkee, Roorkee, India"}]},{"given":"Jose","family":"Dolz","sequence":"additional","affiliation":[{"name":"\u00c9cole de Technologie Sup\u00e9rieure Montreal, Montreal, Canada"}]},{"given":"Marco","family":"Pedersoli","sequence":"additional","affiliation":[{"name":"\u00c9cole de Technologie Sup\u00e9rieure Montreal, Montreal, Canada"}]}],"member":"320","published-online":{"date-parts":[[2018,10,2]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/3242969.3264993"},{"key":"e_1_3_2_1_2_1","volume-title":"Neural Machine Translation by Jointly Learning to Align and Translate. CoRR","author":"Bahdanau Dzmitry","year":"2014","unstructured":"Dzmitry Bahdanau , Kyunghyun Cho , and Yoshua Bengio . 2014. Neural Machine Translation by Jointly Learning to Align and Translate. CoRR Vol. abs\/ 1409 .0473 ( 2014 ). Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio . 2014. Neural Machine Translation by Jointly Learning to Align and Translate. CoRR Vol. abs\/1409.0473 (2014)."},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.psychres.2017.02.025"},{"key":"e_1_3_2_1_4_1","volume-title":"ImageNet: A Large-Scale Hierarchical Image Database CVPR09","author":"Deng J.","unstructured":"J. Deng , W. Dong , R. Socher , L.-J. Li , K. Li , and L. Fei-Fei . 2009. ImageNet: A Large-Scale Hierarchical Image Database CVPR09 . J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, and L. Fei-Fei . 2009. ImageNet: A Large-Scale Hierarchical Image Database CVPR09."},{"key":"e_1_3_2_1_5_1","volume-title":"The more the merrier: Analysing the affect of a group of people in images 2015 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition (FG)","author":"Dhall A.","unstructured":"A. Dhall , J. Joshi , K. Sikka , R. Goecke , and N. Sebe . 2015. The more the merrier: Analysing the affect of a group of people in images 2015 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition (FG) , Vol. Vol. 1 . 1--8. A. Dhall, J. Joshi, K. Sikka, R. Goecke, and N. Sebe . 2015. The more the merrier: Analysing the affect of a group of people in images 2015 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition (FG), Vol. Vol. 1. 1--8."},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/2818346.2830596"},{"key":"e_1_3_2_1_7_1","unstructured":"Rohit Girdhar and Deva Ramanan . 2017. Attentional pooling for action recognition. In Advances in Neural Information Processing Systems. 34--45.  Rohit Girdhar and Deva Ramanan . 2017. Attentional pooling for action recognition. In Advances in Neural Information Processing Systems. 34--45."},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/3136755.3143017"},{"key":"e_1_3_2_1_9_1","volume-title":"Weinberger","author":"Huang Gao","year":"2016","unstructured":"Gao Huang , Zhuang Liu , and Kilian Q . Weinberger . 2016 . Densely Connected Convolutional Networks. CoRR Vol . abs\/1608.06993 (2016). showeprint{arxiv}1608.06993deftempurl%http:\/\/arxiv.org\/abs\/1608.06993 tempurl Gao Huang, Zhuang Liu, and Kilian Q. Weinberger . 2016. Densely Connected Convolutional Networks. CoRR Vol. abs\/1608.06993 (2016). showeprint{arxiv}1608.06993deftempurl%http:\/\/arxiv.org\/abs\/1608.06993 tempurl"},{"key":"e_1_3_2_1_10_1","volume-title":"Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift. CoRR","author":"Ioffe Sergey","year":"2015","unstructured":"Sergey Ioffe and Christian Szegedy . 2015 . Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift. CoRR Vol. abs\/ 1502 .03167 (2015). showeprint{arxiv}1502.03167deftempurl%http:\/\/arxiv.org\/abs\/1502.03167 tempurl Sergey Ioffe and Christian Szegedy . 2015. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift. CoRR Vol. abs\/1502.03167 (2015). showeprint{arxiv}1502.03167deftempurl%http:\/\/arxiv.org\/abs\/1502.03167 tempurl"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/ACII.2015.7344575"},{"key":"e_1_3_2_1_12_1","volume-title":"Facial Expression Recognition in the Wild using Rich Deep Features. CoRR","author":"Karali Abubakrelsedik","year":"2016","unstructured":"Abubakrelsedik Karali , Ahmad Bassiouny , and Motaz El-Saban . 2016. Facial Expression Recognition in the Wild using Rich Deep Features. CoRR Vol. abs\/ 1601 .02487 ( 2016 ). showeprint{arxiv}1601.02487deftempurl%http:\/\/arxiv.org\/abs\/1601.02487 tempurl Abubakrelsedik Karali, Ahmad Bassiouny, and Motaz El-Saban . 2016. Facial Expression Recognition in the Wild using Rich Deep Features. CoRR Vol. abs\/1601.02487 (2016). showeprint{arxiv}1601.02487deftempurl%http:\/\/arxiv.org\/abs\/1601.02487 tempurl"},{"key":"e_1_3_2_1_13_1","volume-title":"Kingma and Jimmy Ba","author":"Diederik","year":"2014","unstructured":"Diederik P. Kingma and Jimmy Ba . 2014 . Adam : A Method for Stochastic Optimization. CoRR Vol . abs\/1412.6980 (2014). showeprint{arxiv}1412.6980deftempurl%http:\/\/arxiv.org\/abs\/1412.6980 tempurl Diederik P. Kingma and Jimmy Ba . 2014. Adam: A Method for Stochastic Optimization. CoRR Vol. abs\/1412.6980 (2014). showeprint{arxiv}1412.6980deftempurl%http:\/\/arxiv.org\/abs\/1412.6980 tempurl"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.212"},{"key":"e_1_3_2_1_15_1","volume-title":"SphereFace: Deep Hypersphere Embedding for Face Recognition. CoRR","author":"Liu Weiyang","year":"2017","unstructured":"Weiyang Liu , Yandong Wen , Zhiding Yu , Ming Li , Bhiksha Raj , and Le Song . 2017. SphereFace: Deep Hypersphere Embedding for Face Recognition. CoRR Vol. abs\/ 1704 .08063 ( 2017 ). showeprint{arxiv}1704.08063deftempurl%http:\/\/arxiv.org\/abs\/1704.08063 tempurl Weiyang Liu, Yandong Wen, Zhiding Yu, Ming Li, Bhiksha Raj, and Le Song . 2017. SphereFace: Deep Hypersphere Embedding for Face Recognition. CoRR Vol. abs\/1704.08063 (2017). showeprint{arxiv}1704.08063deftempurl%http:\/\/arxiv.org\/abs\/1704.08063 tempurl"},{"key":"e_1_3_2_1_16_1","volume-title":"Large-margin Softmax Loss for Convolutional Neural Networks Proceedings of the 33rd International Conference on International Conference on Machine Learning -","volume":"48","author":"Liu Weiyang","year":"2016","unstructured":"Weiyang Liu , Yandong Wen , Zhiding Yu , and Meng Yang . 2016 . Large-margin Softmax Loss for Convolutional Neural Networks Proceedings of the 33rd International Conference on International Conference on Machine Learning - Volume 48 (ICML'16). JMLR.org, 507--516. deftempurl%http:\/\/dl.acm.org\/citation.cfm?id=3045390.3045445 tempurl Weiyang Liu, Yandong Wen, Zhiding Yu, and Meng Yang . 2016. Large-margin Softmax Loss for Convolutional Neural Networks Proceedings of the 33rd International Conference on International Conference on Machine Learning - Volume 48 (ICML'16). JMLR.org, 507--516. deftempurl%http:\/\/dl.acm.org\/citation.cfm?id=3045390.3045445 tempurl"},{"key":"e_1_3_2_1_17_1","volume-title":"et almbox","author":"Mnih Volodymyr","year":"2014","unstructured":"Volodymyr Mnih , Nicolas Heess , Alex Graves , et almbox . . 2014 . Recurrent models of visual attention. In Advances in neural information processing systems. 2204--2212. Volodymyr Mnih, Nicolas Heess, Alex Graves, et almbox. . 2014. Recurrent models of visual attention. In Advances in neural information processing systems. 2204--2212."},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.5555\/2627435.2670313"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/3136755.3143008"},{"key":"e_1_3_2_1_20_1","volume-title":"CoRR","author":"Vaswani Ashish","year":"2017","unstructured":"Ashish Vaswani , Noam Shazeer , Niki Parmar , Jakob Uszkoreit , Llion Jones , Aidan N. Gomez , Lukasz Kaiser , and Illia Polosukhin . 2017. Attention Is All You Need. CoRR Vol. abs\/ 1706 .03762 ( 2017 ). showeprint{arxiv}1706.03762deftempurl%http:\/\/arxiv.org\/abs\/1706.03762 tempurl Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, and Illia Polosukhin . 2017. Attention Is All You Need. CoRR Vol. abs\/1706.03762 (2017). showeprint{arxiv}1706.03762deftempurl%http:\/\/arxiv.org\/abs\/1706.03762 tempurl"},{"key":"e_1_3_2_1_21_1","volume-title":"Residual Attention Network for Image Classification Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3156--3164","author":"Wang Fei","year":"2017","unstructured":"Fei Wang , Mengqing Jiang , Chen Qian , Shuo Yang , Cheng Li , Honggang Zhang , Xiaogang Wang , and Xiaoou Tang . 2017 . Residual Attention Network for Image Classification Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3156--3164 . Fei Wang, Mengqing Jiang, Chen Qian, Shuo Yang, Cheng Li, Honggang Zhang, Xiaogang Wang, and Xiaoou Tang . 2017. Residual Attention Network for Image Classification Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3156--3164."},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/3136755.3143014"},{"key":"e_1_3_2_1_23_1","unstructured":"Kelvin Xu Jimmy Ba Ryan Kiros Kyunghyun Cho Aaron Courville Ruslan Salakhudinov Rich Zemel and Yoshua Bengio . 2015. Show attend and tell: Neural image caption generation with visual attention International conference on machine learning. 2048--2057.   Kelvin Xu Jimmy Ba Ryan Kiros Kyunghyun Cho Aaron Courville Ruslan Salakhudinov Rich Zemel and Yoshua Bengio . 2015. Show attend and tell: Neural image caption generation with visual attention International conference on machine learning. 2048--2057."},{"key":"e_1_3_2_1_24_1","volume-title":"Li","author":"Yi Dong","year":"2014","unstructured":"Dong Yi , Zhen Lei , Shengcai Liao , and Stan Z . Li . 2014 . Learning Face Representation from Scratch. CoRR Vol . abs\/1411.7923 (2014). showeprint{arxiv}1411.7923deftempurl%http:\/\/arxiv.org\/abs\/1411.7923 tempurl Dong Yi, Zhen Lei, Shengcai Liao, and Stan Z. Li . 2014. Learning Face Representation from Scratch. CoRR Vol. abs\/1411.7923 (2014). showeprint{arxiv}1411.7923deftempurl%http:\/\/arxiv.org\/abs\/1411.7923 tempurl"},{"key":"e_1_3_2_1_25_1","volume-title":"Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Networks. CoRR","author":"Zhang Kaipeng","year":"2016","unstructured":"Kaipeng Zhang , Zhanpeng Zhang , Zhifeng Li , and Yu Qiao . 2016. Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Networks. CoRR Vol. abs\/ 1604 .02878 ( 2016 ). showeprint{arxiv}1604.02878deftempurl%http:\/\/arxiv.org\/abs\/1604.02878 tempurl Kaipeng Zhang, Zhanpeng Zhang, Zhifeng Li, and Yu Qiao . 2016. Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Networks. CoRR Vol. abs\/1604.02878 (2016). showeprint{arxiv}1604.02878deftempurl%http:\/\/arxiv.org\/abs\/1604.02878 tempurl"}],"event":{"name":"ICMI '18: INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION","location":"Boulder CO USA","acronym":"ICMI '18","sponsor":["SIGCHI Specialist Interest Group in Computer-Human Interaction of the ACM"]},"container-title":["Proceedings of the 20th ACM International Conference on Multimodal Interaction"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3242969.3264985","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3242969.3264985","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T02:06:58Z","timestamp":1750212418000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3242969.3264985"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,10,2]]},"references-count":25,"alternative-id":["10.1145\/3242969.3264985","10.1145\/3242969"],"URL":"https:\/\/doi.org\/10.1145\/3242969.3264985","relation":{},"subject":[],"published":{"date-parts":[[2018,10,2]]},"assertion":[{"value":"2018-10-02","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}