{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,22]],"date-time":"2026-04-22T19:04:17Z","timestamp":1776884657655,"version":"3.51.2"},"publisher-location":"New York, NY, USA","reference-count":23,"publisher":"ACM","license":[{"start":{"date-parts":[[2018,10,2]],"date-time":"2018-10-02T00:00:00Z","timestamp":1538438400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"Japan Science and Technology Agency","award":["JPMJER1401"],"award-info":[{"award-number":["JPMJER1401"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2018,10,2]]},"DOI":"10.1145\/3242969.3242994","type":"proceedings-article","created":{"date-parts":[[2018,10,2]],"date-time":"2018-10-02T12:09:29Z","timestamp":1538482169000},"page":"78-86","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":20,"title":["Evaluation of Real-time Deep Learning Turn-taking Models for Multiple Dialogue Scenarios"],"prefix":"10.1145","author":[{"given":"Divesh","family":"Lala","sequence":"first","affiliation":[{"name":"Kyoto University, Kyoto, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Koji","family":"Inoue","sequence":"additional","affiliation":[{"name":"Kyoto University, Kyoto, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Tatsuya","family":"Kawahara","sequence":"additional","affiliation":[{"name":"Kyoto University, Kyoto, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2018,10,2]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"crossref","unstructured":"Zakaria Aldeneh Dimitrios Dimitriadis and Emily Mower Provost . 2018. Improving End-of-turn Detection In Spoken Dialogues By Detecting Speaker Intentions As A Secondary Task. In ICAASP.  Zakaria Aldeneh Dimitrios Dimitriadis and Emily Mower Provost . 2018. Improving End-of-turn Detection In Spoken Dialogues By Detecting Speaker Intentions As A Secondary Task. In ICAASP.","DOI":"10.1109\/ICASSP.2018.8461997"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/3136755.3136760"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2014.6854199"},{"key":"e_1_3_2_1_4_1","unstructured":"Chuan Guo Geoff Pleiss Yu Sun and Kilian Q Weinberger . 2017. On calibration of modern neural networks. arXiv preprint arXiv:1706.04599 (2017).  Chuan Guo Geoff Pleiss Yu Sun and Kilian Q Weinberger . 2017. On calibration of modern neural networks. arXiv preprint arXiv:1706.04599 (2017)."},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"crossref","unstructured":"Kohei Hara Koji Inoue Katsuya Takanashi and Tatsuya Kawahara . 2018. Prediction of Turn-taking Using Multitask Learning with Prediction of Backchannels and Fillers. In INTERSPEECH. To appear.  Kohei Hara Koji Inoue Katsuya Takanashi and Tatsuya Kawahara . 2018. Prediction of Turn-taking Using Multitask Learning with Prediction of Backchannels and Fillers. In INTERSPEECH. To appear.","DOI":"10.21437\/Interspeech.2018-1442"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.123"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/2663204.2663271"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.21437\/Interspeech.2017-837"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"crossref","unstructured":"Kristiina Jokinen Kazuaki Harada Masafumi Nishida and Seiichi Yamamoto . 2010. Turn-alignment using eye-gaze and speech in conversational interaction Eleventh Annual Conference of the International Speech Communication Association.  Kristiina Jokinen Kazuaki Harada Masafumi Nishida and Seiichi Yamamoto . 2010. Turn-alignment using eye-gaze and speech in conversational interaction Eleventh Annual Conference of the International Speech Communication Association.","DOI":"10.21437\/Interspeech.2010-571"},{"key":"e_1_3_2_1_10_1","volume-title":"Thirteenth Annual Conference of the International Speech Communication Association.","author":"Kawahara Tatsuya","year":"2012"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"crossref","unstructured":"Yoon Kim . 2014. Convolutional neural networks for sentence classification. arXiv preprint arXiv:1408.5882 (2014).  Yoon Kim . 2014. Convolutional neural networks for sentence classification. arXiv preprint arXiv:1408.5882 (2014).","DOI":"10.3115\/v1\/D14-1181"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.21437\/Interspeech.2017-965"},{"key":"e_1_3_2_1_13_1","volume-title":"Towards Deep End-of-Turn Prediction for Situated Spoken Dialogue Systems Proceedings of INTERSPEECH","author":"Maier Angelika","year":"2017"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.21437\/Interspeech.2017-651"},{"key":"e_1_3_2_1_15_1","unstructured":"Tomas Mikolov Kai Chen Greg Corrado and Jeffrey Dean . 2013. Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013).  Tomas Mikolov Kai Chen Greg Corrado and Jeffrey Dean . 2013. Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013)."},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"crossref","unstructured":"Antoine Raux and Maxine Eskenazi . 2009. A finite-state turn-taking model for spoken dialog systems Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics. Association for Computational Linguistics 629--637.   Antoine Raux and Maxine Eskenazi . 2009. A finite-state turn-taking model for spoken dialog systems Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics. Association for Computational Linguistics 629--637.","DOI":"10.3115\/1620754.1620846"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/2168748.2168749"},{"key":"e_1_3_2_1_18_1","volume-title":"Interaction: The infrastructure for social institutions, the natural ecological niche for language, and the arena in which culture is enacted. In Roots of Human Sociality, bibfieldeditor","author":"Schegloff Emanuel A.","year":"2006"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/W17-5527"},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1073\/pnas.0903616106"},{"key":"e_1_3_2_1_21_1","unstructured":"L. ten Bosch N. Oostdijk and Jan de Ruiter . 2004. Turn-taking in social talk dialogues: temporal formal and functional aspects SPECOM 2004.  L. ten Bosch N. Oostdijk and Jan de Ruiter . 2004. Turn-taking in social talk dialogues: temporal formal and functional aspects SPECOM 2004."},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2018.8462576"},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1609\/aimag.v37i4.2687"}],"event":{"name":"ICMI '18: INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION","location":"Boulder CO USA","acronym":"ICMI '18","sponsor":["SIGCHI Specialist Interest Group in Computer-Human Interaction of the ACM"]},"container-title":["Proceedings of the 20th ACM International Conference on Multimodal Interaction"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3242969.3242994","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3242969.3242994","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3242969.3242994","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T01:39:25Z","timestamp":1750210765000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3242969.3242994"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,10,2]]},"references-count":23,"alternative-id":["10.1145\/3242969.3242994","10.1145\/3242969"],"URL":"https:\/\/doi.org\/10.1145\/3242969.3242994","relation":{},"subject":[],"published":{"date-parts":[[2018,10,2]]},"assertion":[{"value":"2018-10-02","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}