{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,18]],"date-time":"2026-06-18T16:10:24Z","timestamp":1781799024843,"version":"3.54.5"},"publisher-location":"New York, NY, USA","reference-count":53,"publisher":"ACM","license":[{"start":{"date-parts":[[2020,10,21]],"date-time":"2020-10-21T00:00:00Z","timestamp":1603238400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2020,10,21]]},"DOI":"10.1145\/3382507.3417967","type":"proceedings-article","created":{"date-parts":[[2020,10,22]],"date-time":"2020-10-22T10:04:35Z","timestamp":1603361075000},"page":"858-867","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":18,"title":["X-AWARE: ConteXt-AWARE Human-Environment Attention Fusion for Driver Gaze Prediction in the Wild"],"prefix":"10.1145","author":[{"given":"Lukas","family":"Stappen","sequence":"first","affiliation":[{"name":"University of Augsburg, Augsburg, Germany"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Georgios","family":"Rizos","sequence":"additional","affiliation":[{"name":"Imperial College London, London, United Kingdom"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Bj\u00f6rn","family":"Schuller","sequence":"additional","affiliation":[{"name":"Imperial College London, London, United Kingdom"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"320","published-online":{"date-parts":[[2020,10,22]]},"reference":[{"key":"e_1_3_2_2_1_1","volume-title":"ACM International Conference on Multimodal Interaction. ACM.","author":"Abhinav Dhall Roland Goecke","year":"2020","unstructured":"Roland Goecke Abhinav Dhall , Garima Sharma and Tom Gedeon . 2020 . 
EmotiW 2020: Driver Gaze, Group Emotion, Student Engagement and Physiological Signal based Challenges . In ACM International Conference on Multimodal Interaction. ACM. Roland Goecke Abhinav Dhall, Garima Sharma and Tom Gedeon. 2020. EmotiW 2020: Driver Gaze, Group Emotion, Student Engagement and Physiological Signal based Challenges. In ACM International Conference on Multimodal Interaction. ACM."},{"key":"e_1_3_2_2_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/TITS.2013.2247759"},{"key":"e_1_3_2_2_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/SPMB.2013.6736770"},{"key":"e_1_3_2_2_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/JTEHM.2013.2289879"},{"key":"e_1_3_2_2_5_1","volume-title":"Face-from-depth for head pose estimation on depth images","author":"Borghi Guido","year":"2018","unstructured":"Guido Borghi , Matteo Fabbri , Roberto Vezzani , Rita Cucchiara , et al. 2018 . Face-from-depth for head pose estimation on depth images. IEEE Transactions on Pattern Analysis and Machine Intelligence ( 2018). Guido Borghi, Matteo Fabbri, Roberto Vezzani, Rita Cucchiara, et al. 2018. Face-from-depth for head pose estimation on depth images. IEEE Transactions on Pattern Analysis and Machine Intelligence (2018)."},{"key":"e_1_3_2_2_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.583"},{"key":"e_1_3_2_2_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.195"},{"key":"e_1_3_2_2_8_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01249-6_21"},{"key":"e_1_3_2_2_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/MIS.2016.47"},{"key":"e_1_3_2_2_10_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00138-016-0776-4"},{"key":"e_1_3_2_2_11_1","volume-title":"Speak2Label: Using Domain Knowledge for Creating a Large Scale Driver Gaze Zone Estimation Dataset. arXiv preprint arXiv:2004.05973","author":"Ghosh Shreya","year":"2020","unstructured":"Shreya Ghosh , Abhinav Dhall , Garima Sharma , Sarthak Gupta , and Nicu Sebe . 2020. 
Speak2Label: Using Domain Knowledge for Creating a Large Scale Driver Gaze Zone Estimation Dataset. arXiv preprint arXiv:2004.05973 ( 2020 ). Shreya Ghosh, Abhinav Dhall, Garima Sharma, Sarthak Gupta, and Nicu Sebe. 2020. Speak2Label: Using Domain Knowledge for Creating a Large Scale Driver Gaze Zone Estimation Dataset. arXiv preprint arXiv:2004.05973 (2020)."},{"key":"e_1_3_2_2_12_1","first-page":"258","article-title":"The effects of age on crash risk associated with driver distraction","volume":"46","author":"Guo Feng","year":"2017","unstructured":"Feng Guo , Sheila G Klauer , Youjia Fang , Jonathan M Hankey , Jonathan F Antin , Miguel A Perez , Suzanne E Lee , and Thomas A Dingus . 2017 . The effects of age on crash risk associated with driver distraction . International Journal of Epidemiology , Vol. 46 , 1 (2017), 258 -- 265 . Feng Guo, Sheila G Klauer, Youjia Fang, Jonathan M Hankey, Jonathan F Antin, Miguel A Perez, Suzanne E Lee, and Thomas A Dingus. 2017. The effects of age on crash risk associated with driver distraction. International Journal of Epidemiology, Vol. 
46, 1 (2017), 258--265.","journal-title":"International Journal of Epidemiology"},{"key":"e_1_3_2_2_13_1","doi-asserted-by":"publisher","DOI":"10.5555\/1061935.1649098"},{"key":"e_1_3_2_2_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.123"},{"key":"e_1_3_2_2_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.90"},{"key":"e_1_3_2_2_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/TII.2018.2884211"},{"key":"e_1_3_2_2_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00745"},{"key":"e_1_3_2_2_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/3204493.3204529"},{"key":"e_1_3_2_2_19_1","doi-asserted-by":"publisher","DOI":"10.3390\/s19010216"},{"key":"e_1_3_2_2_20_1","doi-asserted-by":"publisher","DOI":"10.1038\/s41598-020-59251-5"},{"key":"e_1_3_2_2_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICIP.2007.4379556"},{"key":"e_1_3_2_2_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/TITS.2015.2506602"},{"key":"e_1_3_2_2_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2015.2482819"},{"key":"e_1_3_2_2_24_1","volume-title":"Head pose estimation in computer vision: A survey","author":"Murphy-Chutorian Erik","year":"2008","unstructured":"Erik Murphy-Chutorian and Mohan Manubhai Trivedi . 2008. Head pose estimation in computer vision: A survey . IEEE transactions on pattern analysis and machine intelligence, Vol. 31 , 4 ( 2008 ), 607--626. Erik Murphy-Chutorian and Mohan Manubhai Trivedi. 2008. Head pose estimation in computer vision: A survey. IEEE transactions on pattern analysis and machine intelligence, Vol. 31, 4 (2008), 607--626."},{"key":"e_1_3_2_2_25_1","volume-title":"Global status report on road safety","author":"World Health Organization","year":"2018","unstructured":"World Health Organization . 2018. Global status report on road safety 2018 . Geneva : World Health Organization . World Health Organization. 2018. Global status report on road safety 2018. 
Geneva: World Health Organization."},{"key":"e_1_3_2_2_26_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-30499-9_103"},{"key":"e_1_3_2_2_27_1","unstructured":"Niki Parmar Prajit Ramachandran Ashish Vaswani Irwan Bello Anselm Levskaya and Jon Shlens. 2019. Stand-alone self-attention in vision models. In Advances in Neural Information Processing Systems. 68--80.  Niki Parmar Prajit Ramachandran Ashish Vaswani Irwan Bello Anselm Levskaya and Jon Shlens. 2019. Stand-alone self-attention in vision models. In Advances in Neural Information Processing Systems. 68--80."},{"key":"e_1_3_2_2_28_1","volume-title":"PyTorch: An Imperative Style","author":"Paszke Adam","unstructured":"Adam Paszke , Sam Gross , Francisco Massa , Adam Lerer , James Bradbury , Gregory Chanan , Trevor Killeen , Zeming Lin , Natalia Gimelshein , Luca Antiga , Alban Desmaison , Andreas Kopf , Edward Yang , Zachary DeVito , Martin Raison , Alykhan Tejani , Sasank Chilamkurthy , Benoit Steiner , Lu Fang , Junjie Bai , and Soumith Chintala . 2019. PyTorch: An Imperative Style , High-Performance Deep Learning Library . In Advances in Neural Information Processing Systems 32. Curran Associates, 8024--8035. Adam Paszke, Sam Gross, Francisco Massa, Adam Lerer, James Bradbury, Gregory Chanan, Trevor Killeen, Zeming Lin, Natalia Gimelshein, Luca Antiga, Alban Desmaison, Andreas Kopf, Edward Yang, Zachary DeVito, Martin Raison, Alykhan Tejani, Sasank Chilamkurthy, Benoit Steiner, Lu Fang, Junjie Bai, and Soumith Chintala. 2019. PyTorch: An Imperative Style, High-Performance Deep Learning Library. In Advances in Neural Information Processing Systems 32. 
Curran Associates, 8024--8035."},{"key":"e_1_3_2_2_29_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2017.06.009"},{"key":"e_1_3_2_2_30_1","doi-asserted-by":"publisher","DOI":"10.1243\/09596518JSCE218"},{"key":"e_1_3_2_2_31_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2015.01.095"},{"key":"e_1_3_2_2_32_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v34i07.6865"},{"key":"e_1_3_2_2_33_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.160"},{"key":"e_1_3_2_2_34_1","volume-title":"Yolov3: An incremental improvement. arXiv preprint arXiv:1804.02767","author":"Redmon Joseph","year":"2018","unstructured":"Joseph Redmon and Ali Farhadi . 2018. Yolov3: An incremental improvement. arXiv preprint arXiv:1804.02767 ( 2018 ). Joseph Redmon and Ali Farhadi. 2018. Yolov3: An incremental improvement. arXiv preprint arXiv:1804.02767 (2018)."},{"key":"e_1_3_2_2_35_1","volume-title":"Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556","author":"Simonyan Karen","year":"2014","unstructured":"Karen Simonyan and Andrew Zisserman . 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 ( 2014 ). Karen Simonyan and Andrew Zisserman. 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014)."},{"key":"e_1_3_2_2_36_1","volume-title":"MuSe 2020--The First International Multimodal Sentiment Analysis in Real-life Media Challenge and Workshop. arXiv preprint arXiv:2004.14858","author":"Stappen Lukas","year":"2020","unstructured":"Lukas Stappen , Alice Baird , Georgios Rizos , Panagiotis Tzirakis , Xinchen Du , Felix Hafner , Lea Schumann , Adria Mallol-Ragolta , Bj\u00f6rn W Schuller , Iulia Lefter , et al. 2020 a. MuSe 2020--The First International Multimodal Sentiment Analysis in Real-life Media Challenge and Workshop. arXiv preprint arXiv:2004.14858 ( 2020 ). 
Lukas Stappen, Alice Baird, Georgios Rizos, Panagiotis Tzirakis, Xinchen Du, Felix Hafner, Lea Schumann, Adria Mallol-Ragolta, Bj\u00f6rn W Schuller, Iulia Lefter, et al. 2020 a. MuSe 2020--The First International Multimodal Sentiment Analysis in Real-life Media Challenge and Workshop. arXiv preprint arXiv:2004.14858 (2020)."},{"key":"e_1_3_2_2_37_1","volume-title":"Optical Car Part Recognition and Detection: Collection, Insights, and Applications. arXiv preprint arXiv:2006.08521","author":"Stappen Lukas","year":"2020","unstructured":"Lukas Stappen , Xinchen Du , Vincent Karas , Stefan M\u00fcller , and Bj\u00f6rn W Schuller . 2020 b. Go-CaRD--Generic , Optical Car Part Recognition and Detection: Collection, Insights, and Applications. arXiv preprint arXiv:2006.08521 ( 2020 ). Lukas Stappen, Xinchen Du, Vincent Karas, Stefan M\u00fcller, and Bj\u00f6rn W Schuller. 2020 b. Go-CaRD--Generic, Optical Car Part Recognition and Detection: Collection, Insights, and Applications. arXiv preprint arXiv:2006.08521 (2020)."},{"key":"e_1_3_2_2_38_1","doi-asserted-by":"publisher","DOI":"10.1109\/MMSP.2019.8901779"},{"key":"e_1_3_2_2_39_1","volume-title":"AAAI Conference on Artificial Intelligence. AAAI.","author":"Szegedy Christian","year":"2017","unstructured":"Christian Szegedy , Sergey Ioffe , Vincent Vanhoucke , and Alexander A Alemi . 2017 . Inception-v4, inception-resnet and the impact of residual connections on learning . In AAAI Conference on Artificial Intelligence. AAAI. Christian Szegedy, Sergey Ioffe, Vincent Vanhoucke, and Alexander A Alemi. 2017. Inception-v4, inception-resnet and the impact of residual connections on learning. In AAAI Conference on Artificial Intelligence. 
AAAI."},{"key":"e_1_3_2_2_40_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.308"},{"key":"e_1_3_2_2_41_1","doi-asserted-by":"publisher","DOI":"10.1109\/TITS.2013.2247760"},{"key":"e_1_3_2_2_42_1","doi-asserted-by":"publisher","DOI":"10.1109\/TITS.2015.2396031"},{"key":"e_1_3_2_2_43_1","doi-asserted-by":"publisher","DOI":"10.1145\/3194085.3194094"},{"key":"e_1_3_2_2_44_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIV.2018.2843120"},{"key":"e_1_3_2_2_45_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2018.01.031"},{"key":"e_1_3_2_2_46_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2018.05.083"},{"key":"e_1_3_2_2_47_1","doi-asserted-by":"publisher","DOI":"10.3390\/s19061287"},{"key":"e_1_3_2_2_48_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01234-2_1"},{"key":"e_1_3_2_2_49_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11042-012-1220-z"},{"key":"e_1_3_2_2_50_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2019.2943753"},{"key":"e_1_3_2_2_51_1","volume-title":"Tensor fusion network for multimodal sentiment analysis. arXiv preprint arXiv:1707.07250","author":"Zadeh Amir","year":"2017","unstructured":"Amir Zadeh , Minghai Chen , Soujanya Poria , Erik Cambria , and Louis-Philippe Morency . 2017. Tensor fusion network for multimodal sentiment analysis. arXiv preprint arXiv:1707.07250 ( 2017 ). Amir Zadeh, Minghai Chen, Soujanya Poria, Erik Cambria, and Louis-Philippe Morency. 2017. Tensor fusion network for multimodal sentiment analysis. 
arXiv preprint arXiv:1707.07250 (2017)."},{"key":"e_1_3_2_2_52_1","doi-asserted-by":"publisher","DOI":"10.1109\/MIS.2016.94"},{"key":"e_1_3_2_2_53_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7299081"}],"event":{"name":"ICMI '20: INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION","location":"Virtual Event Netherlands","acronym":"ICMI '20","sponsor":["SIGCHI ACM Special Interest Group on Computer-Human Interaction"]},"container-title":["Proceedings of the 2020 International Conference on Multimodal Interaction"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3382507.3417967","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3382507.3417967","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T22:38:26Z","timestamp":1750199906000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3382507.3417967"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,10,21]]},"references-count":53,"alternative-id":["10.1145\/3382507.3417967","10.1145\/3382507"],"URL":"https:\/\/doi.org\/10.1145\/3382507.3417967","relation":{},"subject":[],"published":{"date-parts":[[2020,10,21]]},"assertion":[{"value":"2020-10-22","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}