{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,3]],"date-time":"2026-03-03T16:05:38Z","timestamp":1772553938665,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":33,"publisher":"ACM","license":[{"start":{"date-parts":[[2018,10,2]],"date-time":"2018-10-02T00:00:00Z","timestamp":1538438400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2018,10,2]]},"DOI":"10.1145\/3242969.3264980","type":"proceedings-article","created":{"date-parts":[[2018,10,2]],"date-time":"2018-10-02T12:09:29Z","timestamp":1538482169000},"page":"589-593","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":29,"title":["An Occam's Razor View on Learning Audiovisual Emotion Recognition with Small Training Sets"],"prefix":"10.1145","author":[{"given":"Valentin","family":"Vielzeuf","sequence":"first","affiliation":[{"name":"Orange Labs &amp; Normandie Univ., UNICAEN, ENSICAEN, CNRS, Cesson-S\u00e9vign\u00e9 &amp; Caen, France"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Corentin","family":"Kervadec","sequence":"additional","affiliation":[{"name":"Orange Labs, Cesson-S\u00e9vign\u00e9, France"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"St\u00e9phane","family":"Pateux","sequence":"additional","affiliation":[{"name":"Orange Labs, Cesson-S\u00e9vign\u00e9, France"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Alexis","family":"Lechervy","sequence":"additional","affiliation":[{"name":"Normandie University, UNICAEN, ENSICAEN, CNRS, Caen, France"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Fr\u00e9d\u00e9ric","family":"Jurie","sequence":"additional","affiliation":[{"name":"Normandie University, UNICAEN, ENSICAEN, CNRS, Caen, France"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2018,10,2]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/3242969.3264993"},{"key":"e_1_3_2_1_2_1","volume-title":"Covariance Pooling for Facial Expression Recognition. arXiv preprint arXiv:1805.04855","author":"Acharya Dinesh","year":"2018","unstructured":"Dinesh Acharya , Zhiwu Huang , Danda Paudel , and Luc Van Gool . 2018. Covariance Pooling for Facial Expression Recognition. arXiv preprint arXiv:1805.04855 ( 2018 ). Dinesh Acharya, Zhiwu Huang, Danda Paudel, and Luc Van Gool. 2018. Covariance Pooling for Facial Expression Recognition. arXiv preprint arXiv:1805.04855 (2018)."},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPRW.2016.105"},{"key":"e_1_3_2_1_4_1","volume-title":"Soundnet: Learning sound representations from unlabeled video Advances in Neural Information Processing Systems. 892--900.","author":"Aytar Yusuf","year":"2016","unstructured":"Yusuf Aytar , Carl Vondrick , and Antonio Torralba . 2016 . Soundnet: Learning sound representations from unlabeled video Advances in Neural Information Processing Systems. 892--900. Yusuf Aytar, Carl Vondrick, and Antonio Torralba. 2016. Soundnet: Learning sound representations from unlabeled video Advances in Neural Information Processing Systems. 892--900."},{"key":"e_1_3_2_1_5_1","volume-title":"Russell","author":"Barrett Lisa Feldman","year":"1999","unstructured":"Lisa Feldman Barrett and James A . Russell . 1999 . The structure of current affect: Controversies and emerging consensus. Current directions in psychological science, Vol. 8 , 1 (1999), 10--14. Lisa Feldman Barrett and James A. Russell. 1999. The structure of current affect: Controversies and emerging consensus. Current directions in psychological science, Vol. 8, 1 (1999), 10--14."},{"key":"e_1_3_2_1_6_1","volume-title":"Martinez","author":"Benitez-Quiroz C. Fabian","year":"2017","unstructured":"C. Fabian Benitez-Quiroz , Ramprakash Srinivasan , Qianli Feng , Yan Wang , and Aleix M . Martinez . 2017 . EmotioNet Challenge : Recognition of facial expressions of emotion in the wild. arXiv preprint arXiv:1703.01210 (2017). C. Fabian Benitez-Quiroz, Ramprakash Srinivasan, Qianli Feng, Yan Wang, and Aleix M. Martinez. 2017. EmotioNet Challenge: Recognition of facial expressions of emotion in the wild. arXiv preprint arXiv:1703.01210 (2017)."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1010933404324"},{"key":"e_1_3_2_1_8_1","volume-title":"Narayanan","author":"Busso Carlos","year":"2008","unstructured":"Carlos Busso , Murtaza Bulut , Chi-Chun Lee , Abe Kazemzadeh , Emily Mower , Samuel Kim , Jeannette N. Chang , Sungbok Lee , and Shrikanth S . Narayanan . 2008 . IEMOCAP : Interactive emotional dyadic motion capture database. Language resources and evaluation Vol. 42 , 4 (2008), 335. Carlos Busso, Murtaza Bulut, Chi-Chun Lee, Abe Kazemzadeh, Emily Mower, Samuel Kim, Jeannette N. Chang, Sungbok Lee, and Shrikanth S. Narayanan. 2008. IEMOCAP: Interactive emotional dyadic motion capture database. Language resources and evaluation Vol. 42, 4 (2008), 335."},{"key":"e_1_3_2_1_9_1","volume-title":"Taylor","author":"DeVries Terrance","year":"2017","unstructured":"Terrance DeVries and Graham W . Taylor . 2017 . Improved regularization of convolutional neural networks with cutout. arXiv preprint arXiv:1708.04552 (2017). Terrance DeVries and Graham W. Taylor. 2017. Improved regularization of convolutional neural networks with cutout. arXiv preprint arXiv:1708.04552 (2017)."},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCVW.2011.6130508"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/MMUL.2012.26"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/2818346.2829994"},{"key":"e_1_3_2_1_13_1","volume-title":"Friesen","author":"Ekman Paul","year":"1977","unstructured":"Paul Ekman and Wallace V . Friesen . 1977 . Facial action coding system. (1977). Paul Ekman and Wallace V. Friesen. 1977. Facial action coding system. (1977)."},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/1873951.1874246"},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/2993148.2997632"},{"key":"e_1_3_2_1_17_1","volume-title":"Aaron Courville, Mehdi Mirza, Ben Hamner, Will Cukierski, Yichuan Tang, David Thaler, Dong-Hyun Lee, et al.","author":"Goodfellow Ian J.","year":"2013","unstructured":"Ian J. Goodfellow , Dumitru Erhan , Pierre Luc Carrier , Aaron Courville, Mehdi Mirza, Ben Hamner, Will Cukierski, Yichuan Tang, David Thaler, Dong-Hyun Lee, et al. 2013 . Challenges in representation learning: A report on three machine learning contests International Conference on Neural Information Processing. Springer , 117--124. Ian J. Goodfellow, Dumitru Erhan, Pierre Luc Carrier, Aaron Courville, Mehdi Mirza, Ben Hamner, Will Cukierski, Yichuan Tang, David Thaler, Dong-Hyun Lee, et al. 2013. Challenges in representation learning: A report on three machine learning contests International Conference on Neural Information Processing. Springer, 117--124."},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.90"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/3136755.3143009"},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1007\/s12193-015-0202-7"},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1007\/s12193-015-0209-0"},{"key":"e_1_3_2_1_22_1","volume-title":"2018 13th IEEE International Conference on. IEEE, 692--696","author":"Knyazev Boris","year":"2018","unstructured":"Boris Knyazev , Roman Shvetsov , Natalia Efremova , and Artem Kuharenko . 2018 . Leveraging large face recognition data for emotion classification Automatic Face & Gesture Recognition (FG 2018) , 2018 13th IEEE International Conference on. IEEE, 692--696 . Boris Knyazev, Roman Shvetsov, Natalia Efremova, and Artem Kuharenko. 2018. Leveraging large face recognition data for emotion classification Automatic Face & Gesture Recognition (FG 2018), 2018 13th IEEE International Conference on. IEEE, 692--696."},{"key":"e_1_3_2_1_23_1","volume-title":"Reliable Crowdsourcing and Deep Locality-Preserving Learning for Expression Recognition in the Wild 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, 2584--2593","author":"Li Shan","year":"2017","unstructured":"Shan Li , Weihong Deng , and JunPing Du . 2017 . Reliable Crowdsourcing and Deep Locality-Preserving Learning for Expression Recognition in the Wild 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, 2584--2593 . Shan Li, Weihong Deng, and JunPing Du. 2017. Reliable Crowdsourcing and Deep Locality-Preserving Learning for Expression Recognition in the Wild 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, 2584--2593."},{"key":"e_1_3_2_1_24_1","volume-title":"Learnable pooling with Context Gating for video classification. arXiv preprint arXiv:1706.06905","author":"Miech Antoine","year":"2017","unstructured":"Antoine Miech , Ivan Laptev , and Josef Sivic . 2017. Learnable pooling with Context Gating for video classification. arXiv preprint arXiv:1706.06905 ( 2017 ). Antoine Miech, Ivan Laptev, and Josef Sivic. 2017. Learnable pooling with Context Gating for video classification. arXiv preprint arXiv:1706.06905 (2017)."},{"key":"e_1_3_2_1_25_1","volume-title":"Mahoor","author":"Mollahosseini Ali","year":"2017","unstructured":"Ali Mollahosseini , Behzad Hasani , and Mohammad H . Mahoor . 2017 . Affectnet : A database for facial expression, valence, and arousal computing in the wild. arXiv preprint arXiv:1708.03985 (2017). Ali Mollahosseini, Behzad Hasani, and Mohammad H. Mahoor. 2017. Affectnet: A database for facial expression, valence, and arousal computing in the wild. arXiv preprint arXiv:1708.03985 (2017)."},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/3136755.3143006"},{"key":"e_1_3_2_1_27_1","volume-title":"Theories of emotion","author":"Plutchik Robert","unstructured":"Robert Plutchik and Henry Kellerman . 2013. Theories of emotion . Vol. Vol. 1 . Academic Press . Robert Plutchik and Henry Kellerman. 2013. Theories of emotion. Vol. Vol. 1. Academic Press."},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1145\/3133944.3133953"},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.5555\/2627435.2670313"},{"key":"e_1_3_2_1_30_1","volume-title":"2017 12th IEEE International Conference on. IEEE, 839--847","author":"Valstar Michel F.","year":"2017","unstructured":"Michel F. Valstar , Enrique S\u00e1nchez-Lozano , Jeffrey F. Cohn , L\u00e1szl\u00f3 A. Jeni , Jeffrey M. Girard , Zheng Zhang , Lijun Yin , and Maja Pantic . 2017 . Fera 2017-addressing head pose in the third facial expression recognition and analysis challenge Automatic Face & Gesture Recognition (FG 2017) , 2017 12th IEEE International Conference on. IEEE, 839--847 . Michel F. Valstar, Enrique S\u00e1nchez-Lozano, Jeffrey F. Cohn, L\u00e1szl\u00f3 A. Jeni, Jeffrey M. Girard, Zheng Zhang, Lijun Yin, and Maja Pantic. 2017. Fera 2017-addressing head pose in the third facial expression recognition and analysis challenge Automatic Face & Gesture Recognition (FG 2017), 2017 12th IEEE International Conference on. IEEE, 839--847."},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/3136755.3143011"},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1145\/2818346.2830585"},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/2818346.2830595"},{"key":"e_1_3_2_1_34_1","unstructured":"Xingyu Zeng Wanli Ouyang Junjie Yan Hongsheng Li Tong Xiao Kun Wang Yu Liu Yucong Zhou Bin Yang Zhe Wang etal 2017. Crafting gbd-net for object detection. IEEE transactions on pattern analysis and machine intelligence (2017).  Xingyu Zeng Wanli Ouyang Junjie Yan Hongsheng Li Tong Xiao Kun Wang Yu Liu Yucong Zhou Bin Yang Zhe Wang et al. 2017. Crafting gbd-net for object detection. IEEE transactions on pattern analysis and machine intelligence (2017)."}],"event":{"name":"ICMI '18: INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION","location":"Boulder CO USA","acronym":"ICMI '18","sponsor":["SIGCHI Specialist Interest Group in Computer-Human Interaction of the ACM"]},"container-title":["Proceedings of the 20th ACM International Conference on Multimodal Interaction"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3242969.3264980","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3242969.3264980","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T02:06:58Z","timestamp":1750212418000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3242969.3264980"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,10,2]]},"references-count":33,"alternative-id":["10.1145\/3242969.3264980","10.1145\/3242969"],"URL":"https:\/\/doi.org\/10.1145\/3242969.3264980","relation":{},"subject":[],"published":{"date-parts":[[2018,10,2]]},"assertion":[{"value":"2018-10-02","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}