{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,3]],"date-time":"2026-04-03T15:37:03Z","timestamp":1775230623385,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":48,"publisher":"ACM","license":[{"start":{"date-parts":[[2021,10,18]],"date-time":"2021-10-18T00:00:00Z","timestamp":1634515200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,10,18]]},"DOI":"10.1145\/3462244.3479902","type":"proceedings-article","created":{"date-parts":[[2021,10,15]],"date-time":"2021-10-15T15:01:58Z","timestamp":1634310118000},"page":"512-520","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":14,"title":["What\u2019s This? A Voice and Touch Multimodal Approach for Ambiguity Resolution in Voice Assistants"],"prefix":"10.1145","author":[{"given":"Jaewook","family":"Lee","sequence":"first","affiliation":[{"name":"University of Illinois at Urbana-Champaign, USA"}]},{"given":"Sebastian S.","family":"Rodriguez","sequence":"additional","affiliation":[{"name":"University of Illinois at Urbana-Champaign, USA"}]},{"given":"Raahul","family":"Natarrajan","sequence":"additional","affiliation":[{"name":"Vanderbilt University, USA"}]},{"given":"Jacqueline","family":"Chen","sequence":"additional","affiliation":[{"name":"University of Illinois at Urbana-Champaign, USA"}]},{"given":"Harsh","family":"Deep","sequence":"additional","affiliation":[{"name":"University of Illinois at Urbana-Champaign, USA"}]},{"given":"Alex","family":"Kirlik","sequence":"additional","affiliation":[{"name":"University of Illinois at Urbana-Champaign, USA"}]}],"member":"320","published-online":{"date-parts":[[2021,10,18]]},"reference":[{"key":"e_1_3_2_2_1_1","volume-title":"Search, and IoT: How people (really) use voice assistants. ACM Transactions on Computer-Human Interaction 26","author":"Ammari Tawfiq","year":"2019","unstructured":"Tawfiq Ammari , Jofish Kaye , Janice\u00a0 Y. Tsai , and Frank Bentley . 2019. Music , Search, and IoT: How people (really) use voice assistants. ACM Transactions on Computer-Human Interaction 26 ( 2019 ). Issue 3. https:\/\/doi.org\/10.1145\/3311956 Tawfiq Ammari, Jofish Kaye, Janice\u00a0Y. Tsai, and Frank Bentley. 2019. Music, Search, and IoT: How people (really) use voice assistants. ACM Transactions on Computer-Human Interaction 26 (2019). Issue 3. https:\/\/doi.org\/10.1145\/3311956"},{"key":"e_1_3_2_2_2_1","doi-asserted-by":"publisher","DOI":"10.1177\/1071181319631031"},{"key":"e_1_3_2_2_3_1","first-page":"3","article-title":"Determining What Individual SUS Scores Mean: Adding an Adjective Rating Scale","volume":"4","author":"Bangor Aaron","year":"2009","unstructured":"Aaron Bangor , Philip Kortum , and James Miller . 2009 . Determining What Individual SUS Scores Mean: Adding an Adjective Rating Scale . J. Usability Studies 4 , 3 (May 2009), 114\u2013123. Aaron Bangor, Philip Kortum, and James Miller. 2009. Determining What Individual SUS Scores Mean: Adding an Adjective Rating Scale. J. Usability Studies 4, 3 (May 2009), 114\u2013123.","journal-title":"J. Usability Studies"},{"key":"e_1_3_2_2_4_1","doi-asserted-by":"publisher","DOI":"10.7551\/mitpress\/9780262014113.001.0001"},{"key":"e_1_3_2_2_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/3264901"},{"key":"e_1_3_2_2_6_1","volume-title":"SUS: A \u2019Quick and Dirty","author":"Brooke J.","year":"1996","unstructured":"J. Brooke . 1996 . SUS: A \u2019Quick and Dirty \u2019 Usability Scale . J. Brooke. 1996. SUS: A \u2019Quick and Dirty\u2019 Usability Scale."},{"key":"e_1_3_2_2_7_1","volume-title":"SUS: a retrospective. Journal of usability studies 8","author":"Brooke John","year":"2013","unstructured":"John Brooke . 2013. SUS: a retrospective. Journal of usability studies 8 ( 2013 ). Issue 2. John Brooke. 2013. SUS: a retrospective. Journal of usability studies 8 (2013). Issue 2."},{"key":"e_1_3_2_2_8_1","volume-title":"Proceedings of Second Colloquium on Discurse Anaphora and Anaphor Resolution (DAARC2). http:\/\/hdl.handle.net\/1802\/1456","author":"Byron D\u00a0K","year":"1998","unstructured":"D\u00a0K Byron and J\u00a0F Allen . 1998 . Resolving Demostrative Anaphora in the TRAINS93 Corpus . In Proceedings of Second Colloquium on Discurse Anaphora and Anaphor Resolution (DAARC2). http:\/\/hdl.handle.net\/1802\/1456 D\u00a0K Byron and J\u00a0F Allen. 1998. Resolving Demostrative Anaphora in the TRAINS93 Corpus. In Proceedings of Second Colloquium on Discurse Anaphora and Anaphor Resolution (DAARC2). http:\/\/hdl.handle.net\/1802\/1456"},{"key":"e_1_3_2_2_9_1","doi-asserted-by":"crossref","unstructured":"Emna Ch\u00e9rif and Jean-Fran\u00e7ois Lemoine. 2019. Anthropomorphic virtual assistants and the reactions of Internet users: An experiment on the assistant\u2019s voice. Recherche et Applications en Marketing (English Edition) 34 1(2019) 28\u201347. https:\/\/doi.org\/10.1177\/2051570719829432  Emna Ch\u00e9rif and Jean-Fran\u00e7ois Lemoine. 2019. Anthropomorphic virtual assistants and the reactions of Internet users: An experiment on the assistant\u2019s voice. Recherche et Applications en Marketing (English Edition) 34 1(2019) 28\u201347. https:\/\/doi.org\/10.1177\/2051570719829432","DOI":"10.1177\/2051570719829432"},{"key":"e_1_3_2_2_10_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2020.113193"},{"key":"e_1_3_2_2_11_1","volume-title":"Understanding and using context. Personal and Ubiquitous Computing 5","author":"Dey K.","year":"2001","unstructured":"Anind\u00a0 K. Dey . 2001. Understanding and using context. Personal and Ubiquitous Computing 5 ( 2001 ). Issue 1. https:\/\/doi.org\/10.1007\/s007790170019 Anind\u00a0K. Dey. 2001. Understanding and using context. Personal and Ubiquitous Computing 5 (2001). Issue 1. https:\/\/doi.org\/10.1007\/s007790170019"},{"key":"e_1_3_2_2_12_1","unstructured":"IBM\u00a0Cloud Education. 2020. Conversational AI. https:\/\/www.ibm.com\/cloud\/learn\/conversational-ai.  IBM\u00a0Cloud Education. 2020. Conversational AI. https:\/\/www.ibm.com\/cloud\/learn\/conversational-ai."},{"key":"e_1_3_2_2_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/2401836.2401848"},{"key":"e_1_3_2_2_14_1","volume-title":"On seeing human: A three-factor theory of anthropomorphism.Psychological Review 114, 4 (Oct","author":"Epley Nicholas","year":"2007","unstructured":"Nicholas Epley , Adam Waytz , and John\u00a0 T. Cacioppo . 2007. On seeing human: A three-factor theory of anthropomorphism.Psychological Review 114, 4 (Oct 2007 ), 864\u2013886. https:\/\/doi.org\/10.1037\/0033-295x.114.4.864 Nicholas Epley, Adam Waytz, and John\u00a0T. Cacioppo. 2007. On seeing human: A three-factor theory of anthropomorphism.Psychological Review 114, 4 (Oct 2007), 864\u2013886. https:\/\/doi.org\/10.1037\/0033-295x.114.4.864"},{"key":"e_1_3_2_2_15_1","unstructured":"Timnit Gebru and Emily Denton. 2020. Fairness Accountability Transparency and Ethics in Computer Vision. https:\/\/sites.google.com\/view\/fatecv-tutorial\/home?authuser=0.  Timnit Gebru and Emily Denton. 2020. Fairness Accountability Transparency and Ethics in Computer Vision. https:\/\/sites.google.com\/view\/fatecv-tutorial\/home?authuser=0."},{"key":"e_1_3_2_2_16_1","unstructured":"Google. 2017. Google Assistant SDK. https:\/\/developers.google.com\/assistant\/sdk\/.  Google. 2017. Google Assistant SDK. https:\/\/developers.google.com\/assistant\/sdk\/."},{"key":"e_1_3_2_2_17_1","volume-title":"Proceedings of the 5th Discourse Anaphora and Anaphor Resolution Colloquium (DAARC","author":"Gundel K","year":"2004","unstructured":"Jeanette\u00a0 K Gundel , Nancy Hedberg , and Ron Zacharski . 2004 . Demonstrative pronouns in natural discourse . In Proceedings of the 5th Discourse Anaphora and Anaphor Resolution Colloquium (DAARC 2004). Jeanette\u00a0K Gundel, Nancy Hedberg, and Ron Zacharski. 2004. Demonstrative pronouns in natural discourse. In Proceedings of the 5th Discourse Anaphora and Anaphor Resolution Colloquium (DAARC 2004)."},{"key":"e_1_3_2_2_18_1","volume-title":"Halliday\u2019s Introduction to Functional Grammar","author":"Halliday A\u00a0K","unstructured":"M\u00a0 A\u00a0K Halliday and C.M.I.M. Matthiessen . 2013. Halliday\u2019s Introduction to Functional Grammar . Taylor & Francis . https:\/\/books.google.com\/books?id=odUqAAAAQBAJ M\u00a0A\u00a0K Halliday and C.M.I.M. Matthiessen. 2013. Halliday\u2019s Introduction to Functional Grammar. Taylor & Francis. https:\/\/books.google.com\/books?id=odUqAAAAQBAJ"},{"key":"e_1_3_2_2_19_1","doi-asserted-by":"publisher","DOI":"10.1037\/e577632012-009"},{"key":"e_1_3_2_2_20_1","doi-asserted-by":"crossref","unstructured":"Sandra\u00a0G. Hart and Lowell\u00a0E. Staveland. 1988. Development of NASA-TLX (Task Load Index): Results of Empirical and Theoretical Research. In Human Mental Workload Peter\u00a0A. Hancock and Najmedin Meshkati (Eds.). Advances in Psychology Vol.\u00a052. North-Holland 139\u2013183. https:\/\/doi.org\/10.1016\/S0166-4115(08)62386-9  Sandra\u00a0G. Hart and Lowell\u00a0E. Staveland. 1988. Development of NASA-TLX (Task Load Index): Results of Empirical and Theoretical Research. In Human Mental Workload Peter\u00a0A. Hancock and Najmedin Meshkati (Eds.). Advances in Psychology Vol.\u00a052. North-Holland 139\u2013183. https:\/\/doi.org\/10.1016\/S0166-4115(08)62386-9","DOI":"10.1016\/S0166-4115(08)62386-9"},{"key":"e_1_3_2_2_21_1","unstructured":"Drew Harwell. 2018. The Accent Gap. https:\/\/www.washingtonpost.com\/graphics\/2018\/business\/alexa-does-not-understand-your-accent\/.  Drew Harwell. 2018. The Accent Gap. https:\/\/www.washingtonpost.com\/graphics\/2018\/business\/alexa-does-not-understand-your-accent\/."},{"key":"e_1_3_2_2_22_1","first-page":"1","article-title":"Reference values and subscale patterns for the task load index (TLX): a meta-analytic review","volume":"0","author":"Hertzum Morten","year":"2021","unstructured":"Morten Hertzum . 2021 . Reference values and subscale patterns for the task load index (TLX): a meta-analytic review . Ergonomics 0 , 0 (2021), 1 \u2013 10 . https:\/\/doi.org\/10.1080\/00140139.2021.1876927 arXiv:https:\/\/doi.org\/10.1080\/00140139.2021.1876927PMID: 33463402. Morten Hertzum. 2021. Reference values and subscale patterns for the task load index (TLX): a meta-analytic review. Ergonomics 0, 0 (2021), 1\u201310. https:\/\/doi.org\/10.1080\/00140139.2021.1876927 arXiv:https:\/\/doi.org\/10.1080\/00140139.2021.1876927PMID: 33463402.","journal-title":"Ergonomics"},{"key":"e_1_3_2_2_23_1","unstructured":"PTC Inc. 2015. Vuforia. https:\/\/developer.vuforia.com\/.  PTC Inc. 2015. Vuforia. https:\/\/developer.vuforia.com\/."},{"key":"e_1_3_2_2_24_1","doi-asserted-by":"publisher","DOI":"10.1145\/123078.128728"},{"key":"e_1_3_2_2_25_1","unstructured":"Bret Kinsella. 2018. What People Ask Their Smart Speakers. https:\/\/voicebot.ai\/2018\/08\/01\/what-people-ask-their-smart-speakers\/.  Bret Kinsella. 2018. What People Ask Their Smart Speakers. https:\/\/voicebot.ai\/2018\/08\/01\/what-people-ask-their-smart-speakers\/."},{"key":"e_1_3_2_2_26_1","unstructured":"Fedor Kitashov Elizaveta Svitanko and Debojyoti Dutta. 2018. Foreign English Accent Adjustment by Learning Phonetic Patterns. arxiv:1807.03625\u00a0[cs.SD]  Fedor Kitashov Elizaveta Svitanko and Debojyoti Dutta. 2018. Foreign English Accent Adjustment by Learning Phonetic Patterns. arxiv:1807.03625\u00a0[cs.SD]"},{"key":"e_1_3_2_2_27_1","doi-asserted-by":"publisher","DOI":"10.1073\/pnas.1915768117"},{"key":"e_1_3_2_2_28_1","doi-asserted-by":"crossref","unstructured":"Ludovic Le Bigot Lo\u00efc Caroux Christine Ros Agn\u00e8s Lacroix and Val\u00e9rie Botherel. 2013. Investigating memory constraints on recall of options in interactive voice response system messages. 106\u2013116\u00a0pages. https:\/\/doi.org\/10.1080\/0144929X.2011.563800  Ludovic Le Bigot Lo\u00efc Caroux Christine Ros Agn\u00e8s Lacroix and Val\u00e9rie Botherel. 2013. Investigating memory constraints on recall of options in interactive voice response system messages. 106\u2013116\u00a0pages. https:\/\/doi.org\/10.1080\/0144929X.2011.563800","DOI":"10.1080\/0144929X.2011.563800"},{"key":"e_1_3_2_2_29_1","volume-title":"UIST 2020 - Adjunct Publication of the 33rd Annual ACM Symposium on User Interface Software and Technology. Association for Computing Machinery, Inc, 162\u2013168","author":"Jia\u00a0Jun Li Toby","year":"2020","unstructured":"Toby Jia\u00a0Jun Li . 2020 . Multi-Modal Interactive Task Learning from Demonstrations and Natural Language Instructions . In UIST 2020 - Adjunct Publication of the 33rd Annual ACM Symposium on User Interface Software and Technology. Association for Computing Machinery, Inc, 162\u2013168 . https:\/\/doi.org\/10.1145\/3379350.3415803 Toby Jia\u00a0Jun Li. 2020. Multi-Modal Interactive Task Learning from Demonstrations and Natural Language Instructions. In UIST 2020 - Adjunct Publication of the 33rd Annual ACM Symposium on User Interface Software and Technology. Association for Computing Machinery, Inc, 162\u2013168. https:\/\/doi.org\/10.1145\/3379350.3415803"},{"key":"e_1_3_2_2_30_1","volume-title":"Universal methods of design: 100 ways to research complex problems, develop innovative ideas, and design effective solutions (digital ed ed.)","author":"Martin Bella","unstructured":"Bella Martin and Bruce\u00a0 M. Hanington . 2012. Universal methods of design: 100 ways to research complex problems, develop innovative ideas, and design effective solutions (digital ed ed.) . Rockport Publishers . Bella Martin and Bruce\u00a0M. Hanington. 2012. Universal methods of design: 100 ways to research complex problems, develop innovative ideas, and design effective solutions (digital ed ed.). Rockport Publishers."},{"key":"e_1_3_2_2_31_1","volume-title":"Enhancing Mobile Voice Assistants with WorldGaze. In Conference on Human Factors in Computing Systems - Proceedings. Association for Computing Machinery. https:\/\/doi.org\/10","author":"Mayer Sven","year":"2020","unstructured":"Sven Mayer , Gierad Laput , and Chris Harrison . 2020 . Enhancing Mobile Voice Assistants with WorldGaze. In Conference on Human Factors in Computing Systems - Proceedings. Association for Computing Machinery. https:\/\/doi.org\/10 .1145\/3313831.3376479 Sven Mayer, Gierad Laput, and Chris Harrison. 2020. Enhancing Mobile Voice Assistants with WorldGaze. In Conference on Human Factors in Computing Systems - Proceedings. Association for Computing Machinery. https:\/\/doi.org\/10.1145\/3313831.3376479"},{"key":"e_1_3_2_2_32_1","unstructured":"Rich McCormick. 2017. Please don\u2019t make me talk to voice assistants anymore. https:\/\/www.theverge.com\/2017\/6\/6\/15744106\/voice-assistants-siri-dont-make-me-talk.  Rich McCormick. 2017. Please don\u2019t make me talk to voice assistants anymore. https:\/\/www.theverge.com\/2017\/6\/6\/15744106\/voice-assistants-siri-dont-make-me-talk."},{"key":"e_1_3_2_2_33_1","unstructured":"Microsoft. 2019. Voice Report. https:\/\/advertiseonbing-blob.azureedge.net\/blob\/bingads\/media\/insight\/whitepapers\/2019\/04%20apr\/voice-report\/bingads_2019_voicereport.pdf.  Microsoft. 2019. Voice Report. https:\/\/advertiseonbing-blob.azureedge.net\/blob\/bingads\/media\/insight\/whitepapers\/2019\/04%20apr\/voice-report\/bingads_2019_voicereport.pdf."},{"key":"e_1_3_2_2_34_1","volume-title":"32nd AAAI Conference on Artificial Intelligence, AAAI","author":"Naik Vishal\u00a0Ishwar","year":"2018","unstructured":"Vishal\u00a0Ishwar Naik , Angeliki Metallinou , and Rahul Goel . 2018 . Context aware conversational understanding for intelligent agents with a screen . In 32nd AAAI Conference on Artificial Intelligence, AAAI 2018. 5325\u20135332. www.aaai.org Vishal\u00a0Ishwar Naik, Angeliki Metallinou, and Rahul Goel. 2018. Context aware conversational understanding for intelligent agents with a screen. In 32nd AAAI Conference on Artificial Intelligence, AAAI 2018. 5325\u20135332. www.aaai.org"},{"key":"e_1_3_2_2_35_1","doi-asserted-by":"publisher","DOI":"10.1145\/1229863.1229885"},{"key":"e_1_3_2_2_36_1","volume-title":"The design of everyday things","author":"Norman A.","unstructured":"Donald\u00a0 A. Norman . 2013. The design of everyday things . Basic Books . Donald\u00a0A. Norman. 2013. The design of everyday things. Basic Books."},{"key":"e_1_3_2_2_37_1","unstructured":"S. O\u2019Dea. 2020. Smartphone users worldwide 2016-2023. https:\/\/www.statista.com\/statistics\/330695\/number-of-smartphone-users-worldwide\/.  S. O\u2019Dea. 2020. Smartphone users worldwide 2016-2023. https:\/\/www.statista.com\/statistics\/330695\/number-of-smartphone-users-worldwide\/."},{"key":"e_1_3_2_2_38_1","first-page":"1","article-title":"Hey, voice assistant!","volume":"39","author":"Patrizi Michela","year":"2021","unstructured":"Michela Patrizi , Maria Vernuccio , and Alberto Pastore . 2021 . \u201c Hey, voice assistant! \u201d How do users perceive you? An exploratory study. Sinergie Italian Journal of Management 39 , 1 (Feb 2021), 173\u2013192. https:\/\/doi.org\/10.7433\/s114.2021.10 Michela Patrizi, Maria Vernuccio, and Alberto Pastore. 2021. \u201cHey, voice assistant!\u201d How do users perceive you? An exploratory study. Sinergie Italian Journal of Management 39, 1 (Feb 2021), 173\u2013192. https:\/\/doi.org\/10.7433\/s114.2021.10","journal-title":"An exploratory study. Sinergie Italian Journal of Management"},{"key":"e_1_3_2_2_39_1","volume-title":"International Conference on Intelligent User Interfaces, Proceedings IUI. 20\u201329","author":"Prasov Zahar","year":"2008","unstructured":"Zahar Prasov and Joyce\u00a0 Y. Chai . 2008 . What\u2019s in a Gaze? The role of eye-gaze in reference resolution in multimodal conversational interfaces . In International Conference on Intelligent User Interfaces, Proceedings IUI. 20\u201329 . https:\/\/doi.org\/10.1145\/1378773.1378777 Zahar Prasov and Joyce\u00a0Y. Chai. 2008. What\u2019s in a Gaze? The role of eye-gaze in reference resolution in multimodal conversational interfaces. In International Conference on Intelligent User Interfaces, Proceedings IUI. 20\u201329. https:\/\/doi.org\/10.1145\/1378773.1378777"},{"key":"e_1_3_2_2_40_1","unstructured":"PwC. 2018. Consumer Intelligence Series: Prepare for the voice revolution. https:\/\/www.pwc.com\/us\/en\/advisory-services\/publications\/consumer-intelligence-series\/voice-assistants.pdf.  PwC. 2018. Consumer Intelligence Series: Prepare for the voice revolution. https:\/\/www.pwc.com\/us\/en\/advisory-services\/publications\/consumer-intelligence-series\/voice-assistants.pdf."},{"key":"e_1_3_2_2_41_1","doi-asserted-by":"crossref","unstructured":"Tony Russell-Rose and Tyler Tate. 2013. Chapter 6 - Displaying and Manipulating Results. In Designing the Search Experience Tony Russell-Rose and Tyler Tate (Eds.). Morgan Kaufmann 129\u2013166. https:\/\/doi.org\/10.1016\/B978-0-12-396981-1.00006-9  Tony Russell-Rose and Tyler Tate. 2013. Chapter 6 - Displaying and Manipulating Results. In Designing the Search Experience Tony Russell-Rose and Tyler Tate (Eds.). Morgan Kaufmann 129\u2013166. https:\/\/doi.org\/10.1016\/B978-0-12-396981-1.00006-9","DOI":"10.1016\/B978-0-12-396981-1.00006-9"},{"key":"e_1_3_2_2_42_1","doi-asserted-by":"publisher","DOI":"10.1007\/11555261_24"},{"key":"e_1_3_2_2_43_1","doi-asserted-by":"crossref","unstructured":"Rachael Tatman. 2017. Gender and Dialect Bias in YouTube\u2019s Automatic Captions. (2017) 53\u201359. https:\/\/doi.org\/10.18653\/v1\/w17-1606  Rachael Tatman. 2017. Gender and Dialect Bias in YouTube\u2019s Automatic Captions. (2017) 53\u201359. https:\/\/doi.org\/10.18653\/v1\/w17-1606","DOI":"10.18653\/v1\/W17-1606"},{"key":"e_1_3_2_2_44_1","doi-asserted-by":"crossref","unstructured":"Rachael Tatman and C. Kasten. 2017. Effects of Talker Dialect Gender & Race on Accuracy of Bing Speech and YouTube Automatic Captions. In INTERSPEECH.  Rachael Tatman and C. Kasten. 2017. Effects of Talker Dialect Gender & Race on Accuracy of Bing Speech and YouTube Automatic Captions. In INTERSPEECH.","DOI":"10.21437\/Interspeech.2017-1746"},{"key":"e_1_3_2_2_45_1","unstructured":"Unity Technologies. 2005. Unity. https:\/\/unity.com\/.  Unity Technologies. 2005. Unity. https:\/\/unity.com\/."},{"key":"e_1_3_2_2_46_1","volume-title":"Smart speaker installed base worldwide 2020 and","author":"Vailshery S.","year":"2024","unstructured":"Lionel\u00a0 S. Vailshery . 2020. Smart speaker installed base worldwide 2020 and 2024 . https:\/\/www.statista.com\/statistics\/878650\/worldwide-smart-speaker-installed-base-by-country\/. Lionel\u00a0S. Vailshery. 2020. Smart speaker installed base worldwide 2020 and 2024. https:\/\/www.statista.com\/statistics\/878650\/worldwide-smart-speaker-installed-base-by-country\/."},{"key":"e_1_3_2_2_47_1","volume-title":"Comparing Intelligent Personal Assistant Use for Native and Non-Native Language Speakers. 22nd International Conference on Human-Computer Interaction with Mobile Devices and Services(2020)","author":"Wu Yunhan","year":"2020","unstructured":"Yunhan Wu , Daniel Rough , Anna Bleakley , Justin Edwards , Orla Cooney , Philip\u00a0 R. Doyle , Leigh Clark , and Benjamin\u00a0 R. Cowan . 2020 . See What I\u2019m Saying? Comparing Intelligent Personal Assistant Use for Native and Non-Native Language Speakers. 22nd International Conference on Human-Computer Interaction with Mobile Devices and Services(2020) . https:\/\/doi.org\/10.1145\/3379503.3403563 Yunhan Wu, Daniel Rough, Anna Bleakley, Justin Edwards, Orla Cooney, Philip\u00a0R. Doyle, Leigh Clark, and Benjamin\u00a0R. Cowan. 2020. See What I\u2019m Saying? Comparing Intelligent Personal Assistant Use for Native and Non-Native Language Speakers. 22nd International Conference on Human-Computer Interaction with Mobile Devices and Services(2020). https:\/\/doi.org\/10.1145\/3379503.3403563"},{"key":"e_1_3_2_2_48_1","volume-title":"International Conference on Information and Knowledge Management, Proceedings, Vol.\u00a010","author":"Zhang Yongfeng","unstructured":"Yongfeng Zhang , Xu Chen , Qingyao Ai , Liu Yang , and W. Bruce Croft . 2018. Towards conversational search and recommendation: System Ask, user respond . In International Conference on Information and Knowledge Management, Proceedings, Vol.\u00a010 . ACM, 177\u2013186. https:\/\/doi.org\/10.1145\/3269206.3271776 Yongfeng Zhang, Xu Chen, Qingyao Ai, Liu Yang, and W. Bruce Croft. 2018. Towards conversational search and recommendation: System Ask, user respond. In International Conference on Information and Knowledge Management, Proceedings, Vol.\u00a010. ACM, 177\u2013186. https:\/\/doi.org\/10.1145\/3269206.3271776"}],"event":{"name":"ICMI '21: INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION","location":"Montr\u00e9al QC Canada","acronym":"ICMI '21","sponsor":["SIGCHI ACM Special Interest Group on Computer-Human Interaction"]},"container-title":["Proceedings of the 2021 International Conference on Multimodal Interaction"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3462244.3479902","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3462244.3479902","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T20:48:54Z","timestamp":1750193334000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3462244.3479902"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,10,18]]},"references-count":48,"alternative-id":["10.1145\/3462244.3479902","10.1145\/3462244"],"URL":"https:\/\/doi.org\/10.1145\/3462244.3479902","relation":{},"subject":[],"published":{"date-parts":[[2021,10,18]]},"assertion":[{"value":"2021-10-18","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}