{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,1]],"date-time":"2026-04-01T05:02:45Z","timestamp":1775019765389,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":49,"publisher":"ACM","funder":[{"name":"ANR TAPAS","award":["ANR19-JSTS-0001-01"],"award-info":[{"award-number":["ANR19-JSTS-0001-01"]}]},{"name":"ANR Enhancer","award":["ANR-22-EXEN-0004"],"award-info":[{"award-number":["ANR-22-EXEN-0004"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2025,9,16]]},"DOI":"10.1145\/3717511.3747082","type":"proceedings-article","created":{"date-parts":[[2025,10,8]],"date-time":"2025-10-08T14:30:52Z","timestamp":1759933852000},"page":"1-10","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["Greta 2.0: Social Interactive Agent system, optimized for neural network integration"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0009-0003-7824-5016","authenticated-orcid":false,"given":"Takeshi","family":"Saga","sequence":"first","affiliation":[{"name":"ISIR - Sorbonne University, Paris, France"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4682-6011","authenticated-orcid":false,"given":"Lucie","family":"Galland","sequence":"additional","affiliation":[{"name":"ISIR - Sorbonne University, Paris, France"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0004-2587-043X","authenticated-orcid":false,"given":"Nezih","family":"Younsi","sequence":"additional","affiliation":[{"name":"ISIR - Sorbonne University, Paris, France"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1008-0799","authenticated-orcid":false,"given":"Catherine","family":"Pelachaud","sequence":"additional","affiliation":[{"name":"CNRS - ISIR - Sorbonne University, Paris, France"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2025,10,8]]},"reference":[{"key":"e_1_3_3_2_2_2","doi-asserted-by":"crossref","unstructured":"Jens Allwood Joakim Nivre and Elisabeth Ahls\u00e9n. 1992. On the semantics and pragmatics of linguistic feedback. Journal of semantics 9 1 (1992) 1\u201326.","DOI":"10.1093\/jos\/9.1.1"},{"key":"e_1_3_3_2_3_2","first-page":"476","volume-title":"Lecture Notes in Computer Science","author":"Anderson Keith","year":"2013","unstructured":"Keith Anderson, Elisabeth Andr\u00e9, T Baur, Sara Bernardini, M Chollet, E Chryssafidou, I Damian, C Ennis, A Egges, P Gebhard, H Jones, M Ochs, C Pelachaud, Ka\u015bka Porayska-Pomsta, P Rizzo, and Nicolas Sabouret. 2013. The TARDIS framework: Intelligent virtual agents for social coaching in job interviews. In Lecture Notes in Computer Science. Springer International Publishing, Cham, 476\u2013491."},{"key":"e_1_3_3_2_4_2","doi-asserted-by":"crossref","unstructured":"Elisabetta Bevacqua Etienne de Sevin Sylwia\u00a0Julia Hyniewska and Catherine Pelachaud. 2012. A listener model: introducing personality traits. J. Multimodal User Interfaces 6 1-2 (July 2012) 27\u201338.","DOI":"10.1007\/s12193-012-0094-8"},{"key":"e_1_3_3_2_5_2","doi-asserted-by":"publisher","DOI":"10.1145\/3136755.3136780"},{"key":"e_1_3_3_2_6_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-662-08373-4_4"},{"key":"e_1_3_3_2_7_2","first-page":"1","volume-title":"Proc. IWSDS","author":"Chiba Yuya","year":"2024","unstructured":"Yuya Chiba, Koh Mitsuda, Akinobu Lee, and Ryuichiro Higashinaka. 2024. The Remdis toolkit: Building advanced real-time multimodal dialogue systems with incremental processing and large language models. In Proc. IWSDS. Springer Nature, New York, NY, USA, 1\u20136."},{"key":"e_1_3_3_2_8_2","doi-asserted-by":"publisher","DOI":"10.21437\/Interspeech.2022-10955"},{"key":"e_1_3_3_2_9_2","unstructured":"Maximilian\u00a0C Fink Seth\u00a0A Robinson and Bernhard Ertl. 2024. AI-based avatars are changing the way we learn and teach: benefits and challenges. Online."},{"key":"e_1_3_3_2_10_2","doi-asserted-by":"publisher","DOI":"10.1145\/3717511.3747062"},{"key":"e_1_3_3_2_11_2","unstructured":"Lucie Galland Catherine Pelachaud and Florian Pecune. 2024. EMMI\u2013Empathic Multimodal Motivational Interviews Dataset: Analyses and Annotations."},{"key":"e_1_3_3_2_12_2","doi-asserted-by":"publisher","DOI":"10.1145\/3652988.3673923"},{"key":"e_1_3_3_2_13_2","unstructured":"Lucie Galland Catherine Pelachaud and Florian Pecune. 2025. Tailored Conversations beyond LLMs: A RL-Based Dialogue Manager. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2506.19652 (2025)."},{"key":"e_1_3_3_2_14_2","doi-asserted-by":"publisher","DOI":"10.1145\/1082473.1082478"},{"key":"e_1_3_3_2_15_2","doi-asserted-by":"publisher","DOI":"10.1145\/3514197.3549671"},{"key":"e_1_3_3_2_16_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-40415-3_33"},{"key":"e_1_3_3_2_17_2","doi-asserted-by":"publisher","DOI":"10.1145\/1082473.1082640"},{"key":"e_1_3_3_2_18_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-85483-8_28"},{"key":"e_1_3_3_2_19_2","doi-asserted-by":"crossref","unstructured":"Kobin\u00a0H Kendrick Judith Holler and Stephen\u00a0C Levinson. 2023. Turn-taking in human face-to-face interaction is multimodal: gaze direction and manual gestures aid the coordination of turn transitions. Philosophical transactions of the royal society B 378 1875 (2023) 20210473.","DOI":"10.1098\/rstb.2021.0473"},{"key":"e_1_3_3_2_20_2","first-page":"10","volume-title":"Proceedings of the AAMAS Workshop on Intelligent Conversation Agents in Home and Geriatric Care Applications co-located with the Federated AI Meeting","volume":"2338","author":"Kopp S","year":"2018","unstructured":"S Kopp, Mara Brandt, Hendrik Buschmeier, Katharina Cyra, F Freigang, N Kr\u00e4mer, F Kummert, Christiane Opfermann, K Pitsch, Lars Schillingmann, Carolin Stra\u00dfmann, Eduard Wall, and Ramin Yaghoubzadeh. 2018. Conversational assistants for elderly users - the importance of socially cooperative dialogue. In Proceedings of the AAMAS Workshop on Intelligent Conversation Agents in Home and Geriatric Care Applications co-located with the Federated AI Meeting , Vol.\u00a02338. ACM, New York, NY, USA, 10\u201317."},{"key":"e_1_3_3_2_21_2","first-page":"205","volume-title":"Lecture Notes in Computer Science","author":"Kopp Stefan","year":"2006","unstructured":"Stefan Kopp, Brigitte Krenn, Stacy Marsella, Andrew\u00a0N Marshall, Catherine Pelachaud, Hannes Pirker, Kristinn\u00a0R Th\u00f3risson, and Hannes Vilhj\u00e1lmsson. 2006. Towards a common framework for multimodal generation: The behavior markup language. In Lecture Notes in Computer Science. Springer Berlin Heidelberg, Berlin, Heidelberg, 205\u2013217."},{"key":"e_1_3_3_2_22_2","doi-asserted-by":"crossref","unstructured":"Stefan Kopp Herwin van Welbergen Ramin Yaghoubzadeh and Hendrik Buschmeier. 2014. An architecture for fluid real-time conversational agents: integrating incremental output generation and input processing. Journal on Multimodal User Interfaces 8 (2014) 97\u2013108.","DOI":"10.1007\/s12193-013-0130-3"},{"key":"e_1_3_3_2_23_2","doi-asserted-by":"publisher","DOI":"10.1145\/3382507.3418815"},{"key":"e_1_3_3_2_24_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/W17-5516"},{"key":"e_1_3_3_2_25_2","doi-asserted-by":"publisher","DOI":"10.5281\/zenodo.10427369"},{"key":"e_1_3_3_2_26_2","volume-title":"The Handbook on Socially Interactive Agents: 20 years of Research on Embodied Conversational Agents, Intelligent Virtual Agents, and Social Robotics Volume 2: Interactivity, Platforms, Application (1 ed.)","author":"Lugrin Birgit","year":"2022","unstructured":"Birgit Lugrin, Catherine Pelachaud, and David Traum (Eds.). 2022. The Handbook on Socially Interactive Agents: 20 years of Research on Embodied Conversational Agents, Intelligent Virtual Agents, and Social Robotics Volume 2: Interactivity, Platforms, Application (1 ed.). Vol.\u00a048. Association for Computing Machinery (ACM), New York, NY, USA."},{"key":"e_1_3_3_2_27_2","volume-title":"Proc. of the Workshop on FML at AAMAS","author":"Mancini Maurizio","year":"2008","unstructured":"Maurizio Mancini and Catherine Pelachaud. 2008. The fml-apml language. In Proc. of the Workshop on FML at AAMAS , Vol.\u00a08. Association for Computing Machinery (ACM), Estril."},{"key":"e_1_3_3_2_28_2","unstructured":"William\u00a0R Miller Theresa\u00a0B Moyers Denise Ernst and Paul Amrhein. 2003. Manual for the motivational interviewing skill code (MISC). (2003). Albuquerque: Center on Alcoholism Substance Abuse and Addictions University of New Mexico."},{"key":"e_1_3_3_2_29_2","series-title":"(AAMAS \u201909)","first-page":"1399","volume-title":"Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 2","author":"Niewiadomski Radoslaw","year":"2009","unstructured":"Radoslaw Niewiadomski, Elisabetta Bevacqua, Maurizio Mancini, and Catherine Pelachaud. 2009. Greta: an interactive expressive ECA system. In Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 2 (Budapest, Hungary) (AAMAS \u201909). International Foundation for Autonomous Agents and Multiagent Systems, Richland, SC, 1399\u20131400."},{"key":"e_1_3_3_2_30_2","unstructured":"Nvidia. 2025. NVIDIA AI Blueprint: Digital Human for Customer Service. online. https:\/\/github.com\/NVIDIA-AI-Blueprints\/digital-human Last accessed: 16 April 2025."},{"key":"e_1_3_3_2_31_2","doi-asserted-by":"crossref","unstructured":"Daniel\u00a0C O\u2019Connell Sabine Kowal and Erika Kaltenbacher. 1990. Turn-taking: A critical analysis of the research tradition. J. Psycholinguist. Res. 19 6 (Nov. 1990) 345\u2013373.","DOI":"10.1007\/BF01068884"},{"key":"e_1_3_3_2_32_2","doi-asserted-by":"publisher","DOI":"10.1145\/3623809.3623837"},{"key":"e_1_3_3_2_33_2","doi-asserted-by":"publisher","DOI":"10.1002\/0470854626"},{"key":"e_1_3_3_2_34_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-23974-8_25"},{"key":"e_1_3_3_2_35_2","doi-asserted-by":"crossref","unstructured":"Simon Provoost Ho\u00a0Ming Lau Jeroen Ruwaard and Heleen Riper. 2017. Embodied conversational agents in clinical psychology: A scoping review. J. Med. Internet Res. 19 5 (May 2017) e151.","DOI":"10.2196\/jmir.6553"},{"key":"e_1_3_3_2_36_2","volume-title":"AAAI Spring Symposia","author":"Ribeiro Tiago","year":"2016","unstructured":"Tiago Ribeiro, Andr\u00e9 Pereira, Eugenio Di\u00a0Tullio, and Ana Paiva. 2016. The SERA Ecosystem: Socially Expressive Robotics Architecture for Autonomous Human-Robot Interaction. In AAAI Spring Symposia. AAAI, Palo Alto."},{"key":"e_1_3_3_2_37_2","first-page":"67","volume-title":"AAAI spring symposium on artificial intelligence and interactive entertainment","author":"Rickel Jeff","year":"2001","unstructured":"Jeff Rickel, Jonathan Gratch, Randall Hill, Stacy Marsella, and William Swartout. 2001. Steve goes to Bosnia: Towards a new generation of virtual humans for interactive experiences. In AAAI spring symposium on artificial intelligence and interactive entertainment. AAAI Press, Menlo Park, CA, 67\u201371."},{"key":"e_1_3_3_2_38_2","doi-asserted-by":"crossref","unstructured":"Harvey Sacks Emanuel\u00a0A Schegloff and Gail Jefferson. 1974. A simplest systematics for the organization of turn-taking for conversation. Language (Baltim.) 50 4 (1974) 696\u2013735.","DOI":"10.1353\/lan.1974.0010"},{"key":"e_1_3_3_2_39_2","unstructured":"Takeshi Saga and Catherine Pelachaud. 2025. Voice Activity Projection Model with Multimodal Encoders. arxiv:https:\/\/arXiv.org\/abs\/2506.03980\u00a0[cs.CL] https:\/\/arxiv.org\/abs\/2506.03980"},{"key":"e_1_3_3_2_40_2","doi-asserted-by":"publisher","DOI":"10.1145\/3610661.3620662"},{"key":"e_1_3_3_2_41_2","doi-asserted-by":"crossref","unstructured":"Gabriel Skantze. 2021. Turn-taking in conversational systems and human-robot interaction: a review. Computer Speech & Language 67 (2021) 101178.","DOI":"10.1016\/j.csl.2020.101178"},{"key":"e_1_3_3_2_42_2","doi-asserted-by":"crossref","unstructured":"Tanya Stivers N Enfield P Brown C Englert M Hayashi Trine Heinemann G Hoymann F Rossano Jan\u00a0Peter De\u00a0Ruiter Kyung-Eun Yoon S Levinson P Kay and K. Y. 2009. Universals and cultural variation in turn-taking in conversation. Proc. Natl. Acad. Sci. U. S. A. 106 (June 2009) 10587\u201310592.","DOI":"10.1073\/pnas.0903616106"},{"key":"e_1_3_3_2_43_2","unstructured":"William\u00a0R Swartout Jonathan Gratch Randall\u00a0W Hill\u00a0Jr Eduard Hovy Stacy Marsella Jeff Rickel and David Traum. 2006. Toward virtual humans. AI Magazine 27 2 (2006) 96\u201396."},{"key":"e_1_3_3_2_44_2","series-title":"(AAMAS \u201908)","first-page":"151","volume-title":"Proceedings of the 7th International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 1","author":"Thiebaux Marcus","year":"2008","unstructured":"Marcus Thiebaux, Stacy Marsella, Andrew\u00a0N. Marshall, and Marcelo Kallmann. 2008. SmartBody: behavior realization for embodied conversational agents. In Proceedings of the 7th International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 1 (Estoril, Portugal) (AAMAS \u201908). International Foundation for Autonomous Agents and Multiagent Systems, Richland, SC, 151\u2013158."},{"key":"e_1_3_3_2_45_2","first-page":"52","volume-title":"Intelligent Virtual Agents","author":"Van\u00a0Welbergen Herwin","year":"2013","unstructured":"Herwin Van\u00a0Welbergen, Timo Baumann, Stefan Kopp, and David Schlangen. 2013. Incremental, Adaptive and Interruptive Speech Realization for Fluent Conversation with ECAs. In Intelligent Virtual Agents , Vol.\u00a08108. Springer, New York, NY, USA, 52\u201353."},{"key":"e_1_3_3_2_46_2","doi-asserted-by":"crossref","unstructured":"Herwin van Welbergen Dennis Reidsma Zs\u00f3fia\u00a0M Ruttkay and Job Zwiers. 2009. Elckerlyc: A BML realizer for continuous multimodal interaction with a virtual human. Journal on Multimodal User Interfaces 3 4 (2009) 271\u2013284.","DOI":"10.1007\/s12193-010-0051-3"},{"key":"e_1_3_3_2_47_2","doi-asserted-by":"publisher","DOI":"10.1145\/3570945.3607326"},{"key":"e_1_3_3_2_48_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP43922.2022.9746035"},{"key":"e_1_3_3_2_49_2","unstructured":"Nezih Younsi Catherine Pelachaud and Laurence Chaby. 2025. Dyadic Adaptation of Facial Expressions Using Diffusion Models. (Under revision) IEEE Transactions on Affective Computing (2025)."},{"key":"e_1_3_3_2_50_2","doi-asserted-by":"crossref","unstructured":"Nezih Younsi Catherine Pelachaud and Laurence Chaby. 2025. MODIFF-8 to better motivate: Live Adaptive Human-Socially Interactive Agent Interaction. (Under revision) International Journal of Human-Computer Studies (2025).","DOI":"10.2139\/ssrn.5125133"}],"event":{"name":"IVA '25: ACM International Conference on Intelligent Virtual Agents","location":"Berlin Germany","acronym":"IVA '25","sponsor":["SIGAI ACM Special Interest Group on Artificial Intelligence"]},"container-title":["Proceedings of the 25th ACM International Conference on Intelligent Virtual Agents"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3717511.3747082","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,1,9]],"date-time":"2026-01-09T17:43:16Z","timestamp":1767980596000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3717511.3747082"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,9,16]]},"references-count":49,"alternative-id":["10.1145\/3717511.3747082","10.1145\/3717511"],"URL":"https:\/\/doi.org\/10.1145\/3717511.3747082","relation":{},"subject":[],"published":{"date-parts":[[2025,9,16]]},"assertion":[{"value":"2025-10-08","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}