{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,5]],"date-time":"2026-06-05T04:31:36Z","timestamp":1780633896570,"version":"3.54.1"},"reference-count":68,"publisher":"Association for Computing Machinery (ACM)","issue":"ETRA","license":[{"start":{"date-parts":[[2022,5,13]],"date-time":"2022-05-13T00:00:00Z","timestamp":1652400000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"Deutsche Forschungsgemeinschaft","award":["EXC 2075 - 390740016"],"award-info":[{"award-number":["EXC 2075 - 390740016"]}]},{"name":"European Research Council","award":["801708"],"award-info":[{"award-number":["801708"]}]},{"name":"German Ministry for Education and Research","award":["01IS20075"],"award-info":[{"award-number":["01IS20075"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["Proc. ACM Hum.-Comput. Interact."],"published-print":{"date-parts":[[2022,5,13]]},"abstract":"<jats:p>Emotional expressions are inherently multimodal -- integrating facial behavior, speech, and gaze -- but their automatic recognition is often limited to a single modality, e.g. speech during a phone call. While previous work proposed crossmodal emotion embeddings to improve monomodal recognition performance, despite its importance, an explicit representation of gaze was not included. We propose a new approach to emotion recognition that incorporates an explicit representation of gaze in a crossmodal emotion embedding framework. We show that our method outperforms the previous state of the art for both audio-only and video-only emotion classification on the popular One-Minute Gradual Emotion Recognition dataset. Furthermore, we report extensive ablation experiments and provide detailed insights into the performance of different state-of-the-art gaze representations and integration strategies. 
Our results not only underline the importance of gaze for emotion recognition but also demonstrate a practical and highly effective approach to leveraging gaze information for this task.<\/jats:p>","DOI":"10.1145\/3530879","type":"journal-article","created":{"date-parts":[[2022,5,13]],"date-time":"2022-05-13T22:17:43Z","timestamp":1652480263000},"page":"1-18","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":12,"title":["Gaze-enhanced Crossmodal Embeddings for Emotion Recognition"],"prefix":"10.1145","volume":"6","author":[{"given":"Ahmed","family":"Abdou","sequence":"first","affiliation":[{"name":"Technical University of Munich, Munich, Germany"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Ekta","family":"Sood","sequence":"additional","affiliation":[{"name":"University of Stuttgart, Stuttgart, Germany"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Philipp","family":"M\u00fcller","sequence":"additional","affiliation":[{"name":"German Research Center for Artificial Intelligence, Kaiserslautern, Germany"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Andreas","family":"Bulling","sequence":"additional","affiliation":[{"name":"University of Stuttgart, Stuttgart, Germany"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"320","published-online":{"date-parts":[[2022,5,13]]},"reference":[{"key":"e_1_2_2_1_1","volume-title":"Perceived gaze direction and the processing of facial displays of emotion. Psychological science 14, 6","author":"Adams Reginald B","year":"2003","unstructured":"Reginald B Adams Jr and Robert E Kleck . 2003. Perceived gaze direction and the processing of facial displays of emotion. Psychological science 14, 6 ( 2003 ), 644--647. Reginald B Adams Jr and Robert E Kleck. 2003. Perceived gaze direction and the processing of facial displays of emotion. 
Psychological science 14, 6 (2003), 644--647."},{"key":"e_1_2_2_2_1","doi-asserted-by":"publisher","DOI":"10.1037\/1528-3542.5.1.3"},{"key":"e_1_2_2_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/3136755.3137016"},{"key":"e_1_2_2_4_1","volume-title":"Real Time Facial Expression Recognition and Eye Gaze Estimation System. Ph. D. Dissertation","author":"Anwar Suzan A","unstructured":"Suzan A Anwar . 2019. Real Time Facial Expression Recognition and Eye Gaze Estimation System. Ph. D. Dissertation . University of Arkansas at Little Rock. Suzan A Anwar. 2019. Real Time Facial Expression Recognition and Eye Gaze Estimation System. Ph. D. Dissertation. University of Arkansas at Little Rock."},{"key":"e_1_2_2_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/SMC.2015.460"},{"key":"e_1_2_2_6_1","doi-asserted-by":"publisher","DOI":"10.1080\/10494820.2014.908927"},{"key":"e_1_2_2_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCVW.2013.54"},{"key":"e_1_2_2_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/FG.2018.00019"},{"key":"e_1_2_2_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/IJCNN.2018.8489099"},{"key":"e_1_2_2_10_1","doi-asserted-by":"publisher","DOI":"10.1080\/13506280701269318"},{"key":"e_1_2_2_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/3474085.3479213"},{"key":"e_1_2_2_12_1","volume-title":"Proc. ACM Hum.-Comput. Interact.","volume":"6","author":"Busso Carlos","year":"2008","unstructured":"Carlos Busso , Murtaza Bulut , Chi-Chun Lee , Abe Kazemzadeh , Emily Mower , Samuel Kim , Jeannette N Chang , Sungbok Lee , and Shrikanth S Narayanan . 2008 . IEMOCAP: Interactive emotional dyadic motion capture database. Language resources and evaluation 42, 4 (2008), 335--359 . Proc. ACM Hum.-Comput. Interact. , Vol. 6 , No. ETRA, Article 138. Publication date : May 2022. 138:16 Ahmed Abdou et al. Carlos Busso, Murtaza Bulut, Chi-Chun Lee, Abe Kazemzadeh, Emily Mower, Samuel Kim, Jeannette N Chang, Sungbok Lee, and Shrikanth S Narayanan. 2008. 
IEMOCAP: Interactive emotional dyadic motion capture database. Language resources and evaluation 42, 4 (2008), 335--359. Proc. ACM Hum.-Comput. Interact., Vol. 6, No. ETRA, Article 138. Publication date: May 2022. 138:16 Ahmed Abdou et al."},{"key":"e_1_2_2_13_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.isci.2019.05.035"},{"key":"e_1_2_2_14_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-0-387-74161-1_41"},{"key":"e_1_2_2_15_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.lisr.2010.09.010"},{"key":"e_1_2_2_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/2663204.2666277"},{"key":"e_1_2_2_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/3133944.3133949"},{"key":"e_1_2_2_18_1","doi-asserted-by":"publisher","DOI":"10.1142\/S0219720005001004"},{"key":"e_1_2_2_19_1","volume-title":"A review and meta-analysis of multimodal affect detection systems. ACM computing surveys (CSUR) 47, 3","author":"D'mello Sidney K","year":"2015","unstructured":"Sidney K D'mello and Jacqueline Kory . 2015. A review and meta-analysis of multimodal affect detection systems. ACM computing surveys (CSUR) 47, 3 ( 2015 ), 1--36. Sidney K D'mello and Jacqueline Kory. 2015. A review and meta-analysis of multimodal affect detection systems. ACM computing surveys (CSUR) 47, 3 (2015), 1--36."},{"key":"e_1_2_2_20_1","volume-title":"Quantitative comparison of a mobile and a stationary video-based eye-tracker. Behavior research methods 52, 2","author":"Dowiasch Stefan","year":"2020","unstructured":"Stefan Dowiasch , Peter Wolf , and Frank Bremmer . 2020. Quantitative comparison of a mobile and a stationary video-based eye-tracker. Behavior research methods 52, 2 ( 2020 ), 667--680. Stefan Dowiasch, Peter Wolf, and Frank Bremmer. 2020. Quantitative comparison of a mobile and a stationary video-based eye-tracker. 
Behavior research methods 52, 2 (2020), 667--680."},{"key":"e_1_2_2_21_1","doi-asserted-by":"publisher","DOI":"10.1037\/h0030377"},{"key":"e_1_2_2_22_1","volume-title":"The eyes have it: the neuroethology, function and evolution of social gaze. Neuroscience & biobehavioral reviews 24, 6","author":"Emery Nathan J","year":"2000","unstructured":"Nathan J Emery . 2000. The eyes have it: the neuroethology, function and evolution of social gaze. Neuroscience & biobehavioral reviews 24, 6 ( 2000 ), 581--604. Nathan J Emery. 2000. The eyes have it: the neuroethology, function and evolution of social gaze. Neuroscience & biobehavioral reviews 24, 6 (2000), 581--604."},{"key":"e_1_2_2_23_1","doi-asserted-by":"crossref","unstructured":"Florian Eyben Klaus R Scherer Bj\u00f6rn W Schuller Johan Sundberg Elisabeth Andr\u00e9 Carlos Busso Laurence Y Devillers Julien Epps Petri Laukka Shrikanth S Narayanan etal 2015. The Geneva minimalistic acoustic parameter set (GeMAPS) for voice research and affective computing. IEEE transactions on affective computing 7 2 (2015) 190--202.  Florian Eyben Klaus R Scherer Bj\u00f6rn W Schuller Johan Sundberg Elisabeth Andr\u00e9 Carlos Busso Laurence Y Devillers Julien Epps Petri Laukka Shrikanth S Narayanan et al. 2015. The Geneva minimalistic acoustic parameter set (GeMAPS) for voice research and affective computing. IEEE transactions on affective computing 7 2 (2015) 190--202.","DOI":"10.1109\/TAFFC.2015.2457417"},{"key":"e_1_2_2_24_1","doi-asserted-by":"publisher","DOI":"10.1145\/1873951.1874246"},{"key":"e_1_2_2_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/3474085.3479230"},{"key":"e_1_2_2_26_1","volume-title":"Emobed: Strengthening monomodal emotion recognition via training with crossmodal emotion embeddings","author":"Han Jing","year":"2019","unstructured":"Jing Han , Zixing Zhang , Zhao Ren , and Bjoern W Schuller . 2019 . Emobed: Strengthening monomodal emotion recognition via training with crossmodal emotion embeddings . 
IEEE Transactions on Affective Computing ( 2019). Jing Han, Zixing Zhang, Zhao Ren, and Bjoern W Schuller. 2019. Emobed: Strengthening monomodal emotion recognition via training with crossmodal emotion embeddings. IEEE Transactions on Affective Computing (2019)."},{"key":"e_1_2_2_27_1","volume-title":"Eye movements during everyday behavior predict personality traits. Frontiers in human neuroscience 12","author":"Hoppe Sabrina","year":"2018","unstructured":"Sabrina Hoppe , Tobias Loetscher , Stephanie A Morey , and Andreas Bulling . 2018. Eye movements during everyday behavior predict personality traits. Frontiers in human neuroscience 12 ( 2018 ), 105. Sabrina Hoppe, Tobias Loetscher, Stephanie A Morey, and Andreas Bulling. 2018. Eye movements during everyday behavior predict personality traits. Frontiers in human neuroscience 12 (2018), 105."},{"key":"e_1_2_2_28_1","doi-asserted-by":"publisher","DOI":"10.1145\/3136755.3143009"},{"key":"e_1_2_2_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/2808196.2811640"},{"key":"e_1_2_2_30_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.neubiorev.2009.02.004"},{"key":"e_1_2_2_31_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-07221-0_4"},{"key":"e_1_2_2_32_1","volume-title":"Speech Based Affective Analysis of Patients Embedded in Telemedicine Platforms. In 2021 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC). IEEE","author":"Kallipolitis Athanasios","year":"2021","unstructured":"Athanasios Kallipolitis , Michael Galliakis , Andreas Menychtas , and Ilias Maglogiannis . 2021 . Speech Based Affective Analysis of Patients Embedded in Telemedicine Platforms. In 2021 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC). IEEE , 1857--1860. Athanasios Kallipolitis, Michael Galliakis, Andreas Menychtas, and Ilias Maglogiannis. 2021. Speech Based Affective Analysis of Patients Embedded in Telemedicine Platforms. 
In 2021 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC). IEEE, 1857--1860."},{"key":"e_1_2_2_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/2638728.2641695"},{"key":"e_1_2_2_34_1","doi-asserted-by":"publisher","DOI":"10.1037\/0022-3514.68.3.441"},{"key":"e_1_2_2_35_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCE.2019.2897758"},{"key":"e_1_2_2_36_1","doi-asserted-by":"publisher","DOI":"10.1016\/0092-6566(80)90040-9"},{"key":"e_1_2_2_37_1","volume-title":"Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980","author":"Kingma Diederik P","year":"2014","unstructured":"Diederik P Kingma and Jimmy Ba . 2014 . Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014). Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)."},{"key":"e_1_2_2_38_1","doi-asserted-by":"publisher","DOI":"10.2196\/24191"},{"key":"e_1_2_2_39_1","volume-title":"Synchrony in psychotherapy: A review and an integrative framework for the therapeutic alliance. Frontiers in psychology 7","author":"Koole Sander L","year":"2016","unstructured":"Sander L Koole and Wolfgang Tschacher . 2016. Synchrony in psychotherapy: A review and an integrative framework for the therapeutic alliance. Frontiers in psychology 7 ( 2016 ), 862. Sander L Koole and Wolfgang Tschacher. 2016. Synchrony in psychotherapy: A review and an integrative framework for the therapeutic alliance. Frontiers in psychology 7 (2016), 862."},{"key":"e_1_2_2_40_1","volume-title":"Emotional gaze: The effects of gaze direction on the perception of facial emotions. Frontiers in Psychology 12","author":"Liang Jing","year":"2021","unstructured":"Jing Liang , Yu-Qing Zou , Si-Yi Liang , Yu-Wei Wu , and Wen-Jing Yan . 2021. Emotional gaze: The effects of gaze direction on the perception of facial emotions. Frontiers in Psychology 12 ( 2021 ). 
Jing Liang, Yu-Qing Zou, Si-Yi Liang, Yu-Wei Wu, and Wen-Jing Yan. 2021. Emotional gaze: The effects of gaze direction on the perception of facial emotions. Frontiers in Psychology 12 (2021)."},{"key":"e_1_2_2_41_1","volume-title":"Are you looking at me? Eye gaze and person perception. Psychological science 13, 5","author":"Macrae C Neil","year":"2002","unstructured":"C Neil Macrae , Bruce M Hood , Alan B Milne , Angela C Rowe , and Malia F Mason . 2002. Are you looking at me? Eye gaze and person perception. Psychological science 13, 5 ( 2002 ), 460--464. C Neil Macrae, Bruce M Hood, Alan B Milne, Angela C Rowe, and Malia F Mason. 2002. Are you looking at me? Eye gaze and person perception. Psychological science 13, 5 (2002), 460--464."},{"key":"e_1_2_2_42_1","doi-asserted-by":"publisher","DOI":"10.1037\/a0022901"},{"key":"e_1_2_2_43_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2017.7952552"},{"key":"e_1_2_2_44_1","volume-title":"Emergent Leadership Detection Across Datasets. In 2019 International Conference on Multimodal Interaction. 274--278","author":"M\u00fcller Philipp","year":"2019","unstructured":"Philipp M\u00fcller and Andreas Bulling . 2019 . Emergent Leadership Detection Across Datasets. In 2019 International Conference on Multimodal Interaction. 274--278 . Philipp M\u00fcller and Andreas Bulling. 2019. Emergent Leadership Detection Across Datasets. In 2019 International Conference on Multimodal Interaction. 274--278."},{"key":"e_1_2_2_45_1","doi-asserted-by":"publisher","DOI":"10.1145\/3474085.3479219"},{"key":"e_1_2_2_46_1","volume-title":"Proc. ACM International Conference on Intelligent User Interfaces (IUI). 153--164","author":"M\u00fcller Philipp","year":"2018","unstructured":"Philipp M\u00fcller , Michael Xuelin Huang , and Andreas Bulling . 2018 . Detecting Low Rapport During Natural Interactions in Small Groups from Non-Verbal Behavior . In Proc. ACM International Conference on Intelligent User Interfaces (IUI). 153--164 . 
https:\/\/doi.org\/10.1145\/3172944.3172969 10.1145\/3172944.3172969 Philipp M\u00fcller, Michael Xuelin Huang, and Andreas Bulling. 2018. Detecting Low Rapport During Natural Interactions in Small Groups from Non-Verbal Behavior. In Proc. ACM International Conference on Intelligent User Interfaces (IUI). 153--164. https:\/\/doi.org\/10.1145\/3172944.3172969"},{"key":"e_1_2_2_47_1","doi-asserted-by":"publisher","DOI":"10.1145\/3379155.3391332"},{"key":"e_1_2_2_48_1","doi-asserted-by":"publisher","DOI":"10.1109\/ACII.2015.7344640"},{"key":"e_1_2_2_49_1","volume-title":"Affective computing using speech and eye gaze: a review and bimodal system proposal for continuous affect prediction. arXiv preprint arXiv:1805.06652","author":"O'Dwyer Jonny","year":"2018","unstructured":"Jonny O'Dwyer , Niall Murray , and Ronan Flynn . 2018. Affective computing using speech and eye gaze: a review and bimodal system proposal for continuous affect prediction. arXiv preprint arXiv:1805.06652 ( 2018 ). Jonny O'Dwyer, Niall Murray, and Ronan Flynn. 2018. Affective computing using speech and eye gaze: a review and bimodal system proposal for continuous affect prediction. arXiv preprint arXiv:1805.06652 (2018)."},{"key":"e_1_2_2_50_1","volume-title":"Eye-based Continuous Affect Prediction. In 2019 8th International Conference on Affective Computing and Intelligent Interaction (ACII). IEEE, 137--143","author":"O'Dwyer Jonny","year":"2019","unstructured":"Jonny O'Dwyer , Niall Murray , and Ronan Flynn . 2019 . Eye-based Continuous Affect Prediction. In 2019 8th International Conference on Affective Computing and Intelligent Interaction (ACII). IEEE, 137--143 . Jonny O'Dwyer, Niall Murray, and Ronan Flynn. 2019. Eye-based Continuous Affect Prediction. In 2019 8th International Conference on Affective Computing and Intelligent Interaction (ACII). 
IEEE, 137--143."},{"key":"e_1_2_2_51_1","doi-asserted-by":"publisher","DOI":"10.1145\/1101149.1101299"},{"key":"e_1_2_2_52_1","doi-asserted-by":"publisher","DOI":"10.5244\/C.29.41"},{"key":"e_1_2_2_53_1","volume-title":"Robust Latent Representations Via Cross-Modal Translation and Alignment. In ICASSP 2021--2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 4315--4319","author":"Rajan Vandana","year":"2021","unstructured":"Vandana Rajan , Alessio Brutti , and Andrea Cavallaro . 2021 . Robust Latent Representations Via Cross-Modal Translation and Alignment. In ICASSP 2021--2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 4315--4319 . Vandana Rajan, Alessio Brutti, and Andrea Cavallaro. 2021. Robust Latent Representations Via Cross-Modal Translation and Alignment. In ICASSP 2021--2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 4315--4319."},{"key":"e_1_2_2_54_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.patrec.2014.11.007"},{"key":"e_1_2_2_55_1","doi-asserted-by":"publisher","DOI":"10.1109\/FG.2013.6553805"},{"key":"e_1_2_2_56_1","volume-title":"Deep learning for human affect recognition: Insights and new developments","author":"Rouast Philipp V","year":"2019","unstructured":"Philipp V Rouast , Marc Adam , and Raymond Chiong . 2019. Deep learning for human affect recognition: Insights and new developments . IEEE Transactions on Affective Computing ( 2019 ). Philipp V Rouast, Marc Adam, and Raymond Chiong. 2019. Deep learning for human affect recognition: Insights and new developments. IEEE Transactions on Affective Computing (2019)."},{"key":"e_1_2_2_57_1","volume-title":"Thirteenth Annual Conference of the International Speech Communication Association.","author":"Viktor","year":"2012","unstructured":"Viktor Rozgi?, Sankaranarayanan Ananthakrishnan , Shirin Saleem , Rohit Kumar , Aravind Namandi Vembu , and Rohit Prasad . 2012 . 
Emotion recognition using acoustic and lexical features . In Thirteenth Annual Conference of the International Speech Communication Association. Viktor Rozgi?, Sankaranarayanan Ananthakrishnan, Shirin Saleem, Rohit Kumar, Aravind Namandi Vembu, and Rohit Prasad. 2012. Emotion recognition using acoustic and lexical features. In Thirteenth Annual Conference of the International Speech Communication Association."},{"key":"e_1_2_2_58_1","doi-asserted-by":"publisher","DOI":"10.1109\/IROS.2018.8593580"},{"key":"e_1_2_2_59_1","volume-title":"Proc. ACM Hum.-Comput. Interact.","volume":"6","author":"Schoneveld Liam","year":"2021","unstructured":"Liam Schoneveld , Alice Othmani , and Hazem Abdelkawy . 2021 . Leveraging recent advances in deep learning for audio-Visual emotion recognition. Pattern Recognition Letters (2021) . Proc. ACM Hum.-Comput. Interact. , Vol. 6 , No. ETRA, Article 138. Publication date : May 2022. 138:18 Ahmed Abdou et al. Liam Schoneveld, Alice Othmani, and Hazem Abdelkawy. 2021. Leveraging recent advances in deep learning for audio-Visual emotion recognition. Pattern Recognition Letters (2021). Proc. ACM Hum.-Comput. Interact., Vol. 6, No. ETRA, Article 138. Publication date: May 2022. 138:18 Ahmed Abdou et al."},{"key":"e_1_2_2_60_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPRW53098.2021.00352"},{"key":"e_1_2_2_61_1","doi-asserted-by":"publisher","DOI":"10.1145\/3240925.3240943"},{"key":"e_1_2_2_62_1","volume-title":"Proceedings of the 3rd International Conference on Machine Learning and Soft Computing. 166--169","author":"Huynh Thong Van","year":"2019","unstructured":"Thong Van Huynh , Hyung-Jeong Yang , Guee-Sang Lee , Soo-Hyung Kim , and In-Seop Na . 2019 . Emotion recognition by integrating eye movement analysis and facial expression model . In Proceedings of the 3rd International Conference on Machine Learning and Soft Computing. 166--169 . Thong Van Huynh, Hyung-Jeong Yang, Guee-Sang Lee, Soo-Hyung Kim, and In-Seop Na. 2019. 
Emotion recognition by integrating eye movement analysis and facial expression model. In Proceedings of the 3rd International Conference on Machine Learning and Soft Computing. 166--169."},{"key":"e_1_2_2_63_1","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0168307"},{"key":"e_1_2_2_64_1","volume-title":"Faces in context: a review and systematization of contextual influences on affective face processing. Frontiers in psychology 3","author":"Wieser Matthias J","year":"2012","unstructured":"Matthias J Wieser and Tobias Brosch . 2012. Faces in context: a review and systematization of contextual influences on affective face processing. Frontiers in psychology 3 ( 2012 ), 471. Matthias J Wieser and Tobias Brosch. 2012. Faces in context: a review and systematization of contextual influences on affective face processing. Frontiers in psychology 3 (2012), 471."},{"key":"e_1_2_2_65_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.428"},{"key":"e_1_2_2_66_1","volume-title":"Survey on audiovisual emotion recognition: databases, features, and data fusion strategies. APSIPA transactions on signal and information processing 3","author":"Wu Chung-Hsien","year":"2014","unstructured":"Chung-Hsien Wu , Jen-Chun Lin , and Wen-Li Wei . 2014. Survey on audiovisual emotion recognition: databases, features, and data fusion strategies. APSIPA transactions on signal and information processing 3 ( 2014 ). Chung-Hsien Wu, Jen-Chun Lin, and Wen-Li Wei. 2014. Survey on audiovisual emotion recognition: databases, features, and data fusion strategies. APSIPA transactions on signal and information processing 3 (2014)."},{"key":"e_1_2_2_67_1","volume-title":"A survey of affect recognition methods: Audio, visual, and spontaneous expressions","author":"Zeng Zhihong","year":"2008","unstructured":"Zhihong Zeng , Maja Pantic , Glenn I Roisman , and Thomas S Huang . 2008. A survey of affect recognition methods: Audio, visual, and spontaneous expressions . 
IEEE transactions on pattern analysis and machine intelligence 31, 1 ( 2008 ), 39--58. Zhihong Zeng, Maja Pantic, Glenn I Roisman, and Thomas S Huang. 2008. A survey of affect recognition methods: Audio, visual, and spontaneous expressions. IEEE transactions on pattern analysis and machine intelligence 31, 1 (2008), 39--58."},{"key":"e_1_2_2_68_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7299081"}],"container-title":["Proceedings of the ACM on Human-Computer Interaction"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3530879","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3530879","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T18:09:26Z","timestamp":1750183766000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3530879"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,5,13]]},"references-count":68,"journal-issue":{"issue":"ETRA","published-print":{"date-parts":[[2022,5,13]]}},"alternative-id":["10.1145\/3530879"],"URL":"https:\/\/doi.org\/10.1145\/3530879","relation":{},"ISSN":["2573-0142"],"issn-type":[{"value":"2573-0142","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,5,13]]},"assertion":[{"value":"2022-05-13","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}