{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,14]],"date-time":"2026-04-14T16:43:04Z","timestamp":1776184984899,"version":"3.50.1"},"reference-count":64,"publisher":"Springer Science and Business Media LLC","issue":"12","license":[{"start":{"date-parts":[[2025,7,21]],"date-time":"2025-07-21T00:00:00Z","timestamp":1753056000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2025,7,21]],"date-time":"2025-07-21T00:00:00Z","timestamp":1753056000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Int J of Soc Robotics"],"published-print":{"date-parts":[[2025,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>Robot assistants for older adults and people with disabilities need to perform collaborative tasks with users effectively. The core component of these systems is an interaction manager whose job is to observe and assess the task and infer the state of the human and their intent for the robot to choose the best course of action. Due to the sparseness of the data in this domain, the policy for such multimodal systems is often crafted by hand; as the complexity of interactions grows, this process is not scalable. This paper proposes a reinforcement learning (RL) approach to automatically generate the multimodal policy of the robot. Our system focuses on a realistic scenario where a robot assists a user in locating objects within a home environment, managing multimodal signals, including language and physical actions, to select the best action. In contrast to traditional dialog systems, our agent is trained with a simulator that uses human data and can deal with multiple modalities. We use a simple high-level reward function that needs no fine-tuning and enforce some preconditions to speed up the training process. A human study evaluating the system in a real-world setting demonstrates promising results, indicating high usability and effective task completion. This RL-based approach offers a scalable and interpretable alternative for designing interaction managers in multimodal human-robot collaborations.<\/jats:p>","DOI":"10.1007\/s12369-025-01287-6","type":"journal-article","created":{"date-parts":[[2025,7,21]],"date-time":"2025-07-21T09:54:03Z","timestamp":1753091643000},"page":"3003-3025","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":1,"title":["Multimodal Reinforcement Learning for Robots Collaborating with Humans"],"prefix":"10.1007","volume":"17","author":[{"ORCID":"https:\/\/orcid.org\/0009-0007-9554-0681","authenticated-orcid":false,"given":"Afagh","family":"Mehri Shervedani","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Siyu","family":"Li","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Natawut","family":"Monaikul","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Bahareh","family":"Abbasi","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Milo\u0161","family":"\u017defran","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Barbara","family":"Di Eugenio","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2025,7,21]]},"reference":[{"key":"1287_CR1","doi-asserted-by":"publisher","unstructured":"Abbasi B, Monaikul N, Rysbek Z, Di Eugenio B, \u017defran M (2019) A multimodal human-robot interaction manager for assistive robots. In 2019 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS), 6756\u20136762). https:\/\/doi.org\/10.1109\/IROS40897.2019.8968505","DOI":"10.1109\/IROS40897.2019.8968505"},{"key":"1287_CR2","doi-asserted-by":"publisher","unstructured":"Monaikul N, Abbasi B, Rysbek Z, Di Eugenio B, \u017defran M (2020) Role switching in task-oriented multimodal human-robot collaboration. In 2020 29th IEEE International Conference on Robot and Human Interactive Communication (RO-MAN), 1150\u20131156). https:\/\/doi.org\/10.1109\/RO-MAN47096.2020.9223461","DOI":"10.1109\/RO-MAN47096.2020.9223461"},{"key":"1287_CR3","doi-asserted-by":"publisher","unstructured":"Chen L, Javaid M, Di Eugenio B, \u017defran M (2015) The roles and recognition of haptic-ostensive actions in collaborative multimodal human\u2013human dialogues. Comput Speech Language 34(1):201\u2013231. https:\/\/doi.org\/10.1016\/j.csl.2015.03.010","DOI":"10.1016\/j.csl.2015.03.010"},{"key":"1287_CR4","unstructured":"Ouyang L, Wu J, Jiang X, Almeida D, Wainwright C, Mishkin P, Zhang C, Agarwal S, Slama K, Ray A et al. (2022) Training language models to follow instructions with human feedback. In Advances in Neural Information Processing Systems, vol 35. 27730\u201327744). https:\/\/proceedings.neurips.cc\/paperfiles\/paper\/2022\/file\/b1efde53be364a73914f58805a001731-Paper-Conference.pdf"},{"key":"1287_CR5","first-page":"1877","volume":"33","author":"T Brown","year":"2020","unstructured":"Brown T, Mann B, Ryder N, Subbiah M, Kaplan JD, Dhariwal P, Neelakantan A, Shyam P, Sastry G, Askell A et al. (2020) Language models are few-shot learners. Adv Neural Inf Process Syst 33:1877\u20131901. https:\/\/proceedings.neurips.cc\/paperfiles\/paper\/2020\/file\/1457c0d6bfcb4967418bfb8ac142f64a-Paper.pdf","journal-title":"Adv Neural Inf Process Syst"},{"key":"1287_CR6","unstructured":"OpenAI: OpenAI. (2023)"},{"key":"1287_CR7","unstructured":"Brohan A, Chebotar Y, Finn C, Hausman K, Herzog A, Ho D, Ibarz J, Irpan A, Jang E, Julian R et al. (2023) Do as i can, not as i say: grounding language in robotic affordances. In Conference on Robot Learning, PMLR, pp. 287\u2013318). https:\/\/proceedings.mlr.press\/v205\/ichter23a.html"},{"key":"1287_CR8","unstructured":"Driess D, Xia F, Sajjadi MS, Lynch C, Chowdhery A, Ichter B, Wahid A, Tompson J, Vuong Q, Yu T et al. (2023) Palm-e: an embodied multimodal language model. In International Conference on Machine Learning, PMLR, pp. 8469\u20138488). https:\/\/proceedings.mlr.press\/v202\/driess23a.html"},{"key":"1287_CR9","doi-asserted-by":"publisher","unstructured":"Wu J, Gan W, Chen Z, Wan S, Yu PS (2023) Multimodal large language models: a survey. In 2023 IEEE International Conference on Big Data (BigData), 2247\u20132256). https:\/\/doi.org\/10.1109\/BigData59044.2023.10386743","DOI":"10.1109\/BigData59044.2023.10386743"},{"key":"1287_CR10","doi-asserted-by":"publisher","unstructured":"Chen Y, Arkin J, Zhang Y, Roy N, Fan C (2024) Scalable multi-robot collaboration with large language models: centralized or decentralized systems? In 2024 IEEE International Conference on Robotics and Automation (ICRA), 4311\u20134317). https:\/\/doi.org\/10.1109\/ICRA57147.2024.10610676","DOI":"10.1109\/ICRA57147.2024.10610676"},{"key":"1287_CR11","doi-asserted-by":"publisher","unstructured":"Li X, Zhang M, Geng Y, Geng H, Long Y, Shen Y, Zhang R, Liu J, Dong H (2024) Manipllm: Embodied multimodal large language model for objectcentric robotic manipulation. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 18061\u201318070). https:\/\/doi.org\/10.1109\/CVPR52733.2024.01710","DOI":"10.1109\/CVPR52733.2024.01710"},{"key":"1287_CR12","unstructured":"Wu S, Fei H, Qu L, Ji W, Chua T-S (2024) NExt-GPT: any-to-any multimodal LLM. In Forty-first International Conference on Machine Learning. https:\/\/openreview.net\/forum?id=NZQkumsNlf"},{"key":"1287_CR13","doi-asserted-by":"publisher","unstructured":"Caffagni D, Cocchi F, Barsellotti L, Moratelli N, Sarto S, Baraldi L, Cornia M, Cucchiara R (2024) The revolution of multimodal large language models: a survey. Findings of the association for computational linguistics ACL 13590\u201313618 (2024) https:\/\/doi.org\/10.18653\/v1\/2024findings-acl.807","DOI":"10.18653\/v1\/2024findings-acl.807"},{"issue":"2","key":"1287_CR14","doi-asserted-by":"publisher","first-page":"229","DOI":"10.1017\/S0263574799271172","volume":"17","author":"RS Sutton","year":"1999","unstructured":"Sutton RS, Barto AG (1999) Reinforcement learning: an introduction. Robotica 17(2):229\u2013235. https:\/\/doi.org\/10.1017\/S0263574799271172","journal-title":"Robotica"},{"key":"1287_CR15","first-page":"1063","volume-title":"The journal of machine learning research","author":"B Sallans","year":"2004","unstructured":"Sallans B, Hinton GE (2004) Reinforcement learning with factored states and actions. In: The journal of machine learning research, vol 5. JMLR. org, 1063\u20131088. https:\/\/dl.acm.org\/doi\/10.5555\/1005332.1016794"},{"key":"1287_CR16","doi-asserted-by":"publisher","unstructured":"Mehri Shervedani A, Li S, Monaikul N, Abbasi B, Di Eugenio B, \u017defran M (2023) An end-to-end human simulator for task-oriented multimodal human-robot collaboration. In 2023 32nd IEEE International Conference on Robot and Human Interactive Communication (RO-MAN), 614\u2013620). https:\/\/doi.org\/10.1109\/RO-MAN57019.2023.10309444","DOI":"10.1109\/RO-MAN57019.2023.10309444"},{"key":"1287_CR17","doi-asserted-by":"publisher","unstructured":"Bratko I, Urban\u010di\u010d T, Sammut C (1995) Behavioural cloning: phenomena, results and problems. In IFAC Proceedings, vol 28(21), 143\u2013149). https:\/\/doi.org\/10.1016\/S1474-6670(17)46716-4","DOI":"10.1016\/S1474-6670(17)46716-4"},{"key":"1287_CR18","doi-asserted-by":"publisher","unstructured":"Torabi F, Warnell G, Stone P (2018) Behavioral cloning from observation. In Proceedings of the 27th International Joint Conference on Artificial Intelligence, 4950\u20134957). https:\/\/doi.org\/10.24963\/ijcai.2018\/687","DOI":"10.24963\/ijcai.2018\/687"},{"issue":"3","key":"1287_CR19","doi-asserted-by":"publisher","first-page":"167","DOI":"10.1016\/S0921-8890(02)00373-1","volume":"42","author":"C Breazeal","year":"2003","unstructured":"Breazeal C (2003) Toward sociable robots. Robot Auton Syst 42(3):167\u2013175. https:\/\/doi.org\/10.1016\/S0921-8890(02)00373-1","journal-title":"Robot Auton Syst"},{"key":"1287_CR20","doi-asserted-by":"publisher","unstructured":"Ishiguro H, Ono T, Imai M, Maeda T, Kanda T, Nakatsu R (2001) Robovie: an interactive humanoid robot. Ind Robot: An Int J 28. https:\/\/doi.org\/10.1108\/01439910110410051","DOI":"10.1108\/01439910110410051"},{"key":"1287_CR21","doi-asserted-by":"publisher","unstructured":"Ishiguro H, Ono T, Imai M, Kanda T (2003) Development of an interactive humanoid robot \u201crobovie\u201d \u2014 an interdisciplinary approach. In: Robotics research. Springer, Berlin, Heidelberg, 179\u2013191. https:\/\/doi.org\/10.1007\/3-540-36460-9_12","DOI":"10.1007\/3-540-36460-9_12"},{"key":"1287_CR22","doi-asserted-by":"crossref","unstructured":"Takayuki Kanda DE, T, Ishiguro H (2004) Interactive robots as social partners and peer tutors for children: a field trial. In: Human\u2013computer interaction, vol 19. Taylor & Francis, 61\u201384. https:\/\/www.tandfonline.com\/doi\/abs\/10.1080\/07370024.2004.9667340","DOI":"10.1207\/s15327051hci1901&2_4"},{"key":"1287_CR23","unstructured":"Erol K, Nau D, Hendler J (1993) Toward a general framework for hierarchical tasknetwork planning. methods 1(t1 2"},{"key":"1287_CR24","first-page":"1123","volume":"94","author":"K Erol","year":"1994","unstructured":"Erol K, Hendler J, Nau DS (1994) HTN planning: complexity and expressivity. AAAI 94:1123\u20131128. https:\/\/cdn.aaai.org\/AAAI\/1994\/AAAI94-173.pdf","journal-title":"AAAI"},{"key":"1287_CR25","unstructured":"Nau D, Cao Y, Lotem A, Munoz-Avila H (1999) Shop: simple hierarchical ordered planner. In Proceedings of the 16th International Joint Conference on Artificial Intelligence-Volume 2, 968\u2013973). https:\/\/www.ijcai.org\/Proceedings\/99-2\/Papers\/043.pdf"},{"key":"1287_CR26","doi-asserted-by":"publisher","unstructured":"Nau DS, Au T-C, Ilghami O, Kuter U, Murdock JW, Wu D, Yaman F (2003) Shop2: an HTN planning system. J Artif Intell Res 20:379\u2013404. https:\/\/doi.org\/10.1613\/jair.1141","DOI":"10.1613\/jair.1141"},{"key":"1287_CR27","first-page":"381","volume":"14","author":"B Nooraei","year":"2014","unstructured":"Nooraei B, Rich C, Sidner CL (2014) A real-time architecture for embodied conversational agents: beyond turn-taking. ACHI 14:381\u2013388","journal-title":"ACHI"},{"key":"1287_CR28","doi-asserted-by":"crossref","unstructured":"Mohseni-Kabir A, Rich C, Chernova S, Sidner CL, Miller D (2015) Interactive hierarchical task learning from a single demonstration. In Proceedings of the Tenth Annual ACM\/IEEE International Conference on Human-Robot Interaction, 205\u2013212). https:\/\/ieeexplore.ieee.org\/abstract\/document\/8520663","DOI":"10.1145\/2696454.2696474"},{"key":"1287_CR29","volume-title":"Refereed workshop planning for human-robot interaction: shared autonomy and collaborative robotics at robotics: science and systems","author":"AS Clair","year":"2016","unstructured":"Clair AS, Saldanha C, Boteanu A, Chernova S (2016) Interactive hierarchical task learning via crowdsourcing for robot adaptability. In: Refereed workshop planning for human-robot interaction: shared autonomy and collaborative robotics at robotics: science and systems. Ann Arbor. RSS, Michigan"},{"key":"1287_CR30","doi-asserted-by":"crossref","unstructured":"Xiao Z, Herzig A, Perrussel L, Wan H, Su X (2017) Hierarchical task network planning with task insertion and state constraints. In 26th International Joint Conference on Artificial Intelligence (IJCAI), 4463\u20134469). https:\/\/www.ijcai.org\/proceedings\/2017\/0623.pdf","DOI":"10.24963\/ijcai.2017\/623"},{"issue":"3","key":"1287_CR31","doi-asserted-by":"publisher","first-page":"441","DOI":"10.1287\/moor.12.3.441","volume":"12","author":"CH Papadimitriou","year":"1987","unstructured":"Papadimitriou CH, Tsitsiklis JN (1987) The complexity of markov decision processes. Math Oper Res 12(3):441\u2013450","journal-title":"Math Oper Res"},{"key":"1287_CR32","unstructured":"Gopalan N, Tellex S (2015) Modeling and solving human-robot collaborative tasks using pomdps. In RSS Workshop on Model Learning for Human-Robot Communication, vol 32. 590\u2013628"},{"key":"1287_CR33","doi-asserted-by":"publisher","unstructured":"Kurniawati H (2022) Partially observable markov decision processes and robotics. Annual review of control, robotics, and autonomous systems 5(1):253\u2013277. https:\/\/doi.org\/10.1146\/annurev-control-042920-092451. Annual Reviews","DOI":"10.1146\/annurev-control-042920-092451"},{"key":"1287_CR34","first-page":"279","volume":"8","author":"CJ Watkins","year":"1992","unstructured":"Watkins CJ, Dayan P (1992) Q-learning. Mach Learn 8:279\u2013292","journal-title":"Mach Learn"},{"key":"1287_CR35","doi-asserted-by":"publisher","unstructured":"Hester T, Vecerik M, Pietquin O, Lanctot M, Schaul T, Piot B, Horgan D, Quan J, Sendonaris A, Osband I et al. (2018) Deep q-learning from demonstrations. In Proceedings of the AAAI Conference on Artificial Intelligence, vol 32. https:\/\/doi.org\/10.1609\/aaai.v32i1.11757","DOI":"10.1609\/aaai.v32i1.11757"},{"key":"1287_CR36","unstructured":"Mnih V, Badia AP, Mirza M, Graves A, Lillicrap T, Harley T, Silver D, Kavukcuoglu K (2016) Asynchronous methods for deep reinforcement learning. In International Conference on Machine Learning, PMLR, pp. 1928\u20131937). https:\/\/proceedings.mlr.press\/v48\/mniha16.html"},{"key":"1287_CR37","unstructured":"Schulman J, Wolski F, Dhariwal P, Radford A, Klimov O (2017) Proximal policy optimization algorithms"},{"key":"1287_CR38","unstructured":"Woodworth B, Ferrari F, Zosa TE, Riek LD (2018). PMLR. Preference learning in assistive robotics: observational repeated inverse reinforcement learning. In Machine Learning for Healthcare Conference, 420\u2013439). http:\/\/proceedings.mlr.press\/v85\/woodworth18a\/woodworth18a.pdf"},{"key":"1287_CR39","doi-asserted-by":"publisher","unstructured":"Liu B, T\u00fcr G, Hakkani-T\u00fcr D, Shah P, Heck L: (2018) Dialogue learning with human teaching and feedback in end-to-end trainable task-oriented dialogue systems. In 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL HLT 2018, 2060\u20132069). https:\/\/doi.org\/10.18653\/v1\/N18-1187. Association for Computational Linguistics (ACL)","DOI":"10.18653\/v1\/N18-1187"},{"issue":"2","key":"1287_CR40","doi-asserted-by":"publisher","first-page":"25","DOI":"10.1145\/3166054.3166058","volume":"19","author":"H Chen","year":"2017","unstructured":"Chen H, Liu X, Yin D, Tang J (2017) A survey on dialogue systems: recent advances and new frontiers. Acm Sigkdd Explorations Newsl 19(2):25\u201335. https:\/\/doi.org\/10.1145\/3166054.3166058","journal-title":"Acm Sigkdd Explorations Newsl"},{"key":"1287_CR41","doi-asserted-by":"crossref","unstructured":"Zhao T, Eskenazi M (2016) Towards end-to-end learning for dialog state tracking and management using deep reinforcement learning. In Proceedings of the 17th Annual Meeting of the Special Interest Group on Discourse and Dialogue, 1\u201310). https:\/\/aclanthology.org\/W16-3601.pdf","DOI":"10.18653\/v1\/W16-3601"},{"key":"1287_CR42","unstructured":"Li X, Chen Y-N, Li L, Gao J, Celikyilmaz A (2017) End-to-end task-completion neural dialogue systems. In Proceedings of the Eighth International Joint Conference on Natural Language Processing (Volume 1: Long Papers), 733\u2013743). https:\/\/aclanthology.org\/I17-1074"},{"key":"1287_CR43","doi-asserted-by":"publisher","unstructured":"Park HW, Grover I, Spaulding S, Gomez L, Breazeal C (2019) A model-free affective reinforcement learning approach to personalization of an autonomous social robot companion for early literacy education. In Proceedings of the AAAI Conference on Artificial Intelligence, vol 33. 687\u2013694). https:\/\/doi.org\/10.1609\/aaai.v33i01.3301687","DOI":"10.1609\/aaai.v33i01.3301687"},{"key":"1287_CR44","unstructured":"Jang Y, Lee J, Kim K-E (2022) GPT-critic: offline reinforcement learning for endto- end task-oriented dialogue systems. In International Conference on Learning Representations. https:\/\/openreview.net\/forum?id=qaxhBG1UUaS"},{"key":"1287_CR45","doi-asserted-by":"publisher","unstructured":"Shea R, Yu Z (2023) Building persona consistent dialogue agents with offline reinforcement learning. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, Singapore, 1778\u20131795). https:\/\/doi.org\/10.18653\/v1\/2023.emnlp-main.110. https:\/\/aclanthology.org\/2023.emnlp-main.110","DOI":"10.18653\/v1\/2023.emnlp-main.110"},{"key":"1287_CR46","doi-asserted-by":"crossref","unstructured":"Ye G, Zhao H, Zhang Z, Zha X, Jiang Z (2024) Lstdial: enhancing dialogue generation via long-and short-term measurement feedback. In Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 5857\u20135871). https:\/\/aclanthology.org\/2024.naacl-long.326.pdf","DOI":"10.18653\/v1\/2024.naacl-long.326"},{"key":"1287_CR47","unstructured":"Gupta D, Chow Y, Tulepbergenov A, Ghavamzadeh M, Boutilier C (2023) Offline reinforcement learning for mixture-of-expert dialogue management. In: Oh A, Naumann T, Globerson A, Saenko K, Hardt M, Levine S (eds) Advances in Neural Information Processing Systems, vol 36. Curran Associates, Inc, pp. 5912\u20135935). https:\/\/proceedings.neurips.cc\/paperfiles\/paper\/2023\/file\/12bcf58a1c09a0fcb5310f3589291ab4-Paper-Conference.pdf"},{"issue":"4","key":"1287_CR48","doi-asserted-by":"publisher","first-page":"3055","DOI":"10.1007\/s10462-022-10248-8","volume":"56","author":"J Ni","year":"2023","unstructured":"Ni J, Young T, Pandelea V, Xue F, Cambria E (2023) Recent advances in deep learning based dialogue systems: a systematic survey. Artif Intell Rev 56(4):3055\u20133155. https:\/\/doi.org\/10.1007\/s10462-022-10248-8","journal-title":"Artif Intell Rev"},{"key":"1287_CR49","doi-asserted-by":"publisher","unstructured":"Thomaz AL, Hoffman G, Breazeal C (2006) Reinforcement learning with human teachers: understanding how people want to teach robots. In ROMAN 2006 - The 15th IEEE International Symposium on Robot and Human Interactive Communication, 352\u2013357). https:\/\/doi.org\/10.1109\/ROMAN.2006.314459","DOI":"10.1109\/ROMAN.2006.314459"},{"key":"1287_CR50","first-page":"98","volume":"4","author":"A-N Sharkawy","year":"2021","unstructured":"Sharkawy A-N (2021) Human-robot interaction: applications. Automation, Robot & Commun For Ind 498","journal-title":"Automation, Robot & Commun For Ind"},{"key":"1287_CR51","unstructured":"Paszke A, Gross S, Massa F, Lerer A, Bradbury J, Chanan G, Killeen T, Lin Z, Gimelshein N, Antiga L, Desmaison A, Kopf A, Yang E, DeVito Z, Raison M, Tejani A, Chilamkurthy S, Steiner B, Fang L, Bai J, Chintala S (2019) Pytorch: an imperative style, high-performance deep learning library. Adv Neural Inf Process Syst 32. https:\/\/proceedings.neurips.cc\/paperfiles\/paper\/2019\/file\/bdbca288fee7f92f2bfa9f7012727740-Paper.pdf"},{"issue":"7540","key":"1287_CR52","doi-asserted-by":"publisher","first-page":"529","DOI":"10.1038\/nature14236","volume":"518","author":"V Mnih","year":"2015","unstructured":"Mnih V, Kavukcuoglu K, Silver D, Rusu AA, Veness J, Bellemare MG, Graves A, Riedmiller M, Fidjeland AK, Ostrovski G, Petersen S, Beattie C, Sadik A, Antonoglou I, King H, Kumaran D, Wierstra D, Legg S, Hassabis D (2015) Human-level control through deep reinforcement learning. Nature 518(7540):529\u2013533","journal-title":"Nature"},{"key":"1287_CR53","unstructured":"Ross S, Gordon G, Bagnell D (2011) A reduction of imitation learning and structured prediction to no-regret online learning. In Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, JMLR Workshop and Conference Proceedings, pp. 627\u2013635). http:\/\/proceedings.mlr.press\/v15\/ross11a\/ross11a.pdf"},{"key":"1287_CR54","unstructured":"https:\/\/cloud.google.com\/speech-to-text"},{"key":"1287_CR55","unstructured":"Lan Z, Chen M, Goodman S, Gimpel K, Sharma P, Soricut R (2019) Albert: a lite bert for self-supervised learning of language representations. In International Conference on Learning Representations"},{"key":"1287_CR56","volume-title":"Natural language processing with Python: analyzing text with the natural language toolkit","author":"S Bird","year":"2009","unstructured":"Bird S, Klein E, Loper E (2009) Natural language processing with Python: analyzing text with the natural language toolkit. O\u2019Reilly Media, Inc"},{"key":"1287_CR57","unstructured":"https:\/\/pyttsx3.readthedocs.io\/en\/latest\/"},{"issue":"6","key":"1287_CR58","doi-asserted-by":"publisher","first-page":"80","DOI":"10.2307\/3001968","volume":"1","author":"F Wilcoxon","year":"1945","unstructured":"Wilcoxon F (1945) Individual comparisons by ranking methods. Biom Bull 1(6):80\u201383. https:\/\/doi.org\/10.2307\/3001968","journal-title":"Biom Bull"},{"issue":"1","key":"1287_CR59","doi-asserted-by":"publisher","first-page":"50","DOI":"10.1214\/aoms\/1177730491","volume":"18","author":"HB Mann","year":"1947","unstructured":"Mann HB, Whitney DR (1947) On a test of whether one of two random variables is stochastically larger than the other. Ann Math Stat 18(1):50\u201360. https:\/\/doi.org\/10.1214\/aoms\/1177730491","journal-title":"Ann Math Stat"},{"key":"1287_CR60","doi-asserted-by":"publisher","unstructured":"Shapiro SS, Wilk MB (1965) An analysis of variance test for normality (complete samples). Biometrika 52(3\u20134):591\u2013611. https:\/\/doi.org\/10.1093\/biomet\/52.3-4.591","DOI":"10.1093\/biomet\/52.3-4.591"},{"issue":"1","key":"1287_CR61","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/3412374","volume":"10","author":"G Hoffman","year":"2020","unstructured":"Hoffman G, Zhao X (2020) A primer for conducting experiments in human\u2013robot interaction. ACM Trans on Hum Rob Interact (THRI) 10(1):1\u201331. https:\/\/doi.org\/10.1145\/3412374","journal-title":"ACM Trans on Hum Rob Interact (THRI)"},{"issue":"3","key":"1287_CR62","doi-asserted-by":"publisher","first-page":"33","DOI":"10.1145\/3572784","volume":"12","author":"M Schrum","year":"2023","unstructured":"Schrum M, Ghuy M, Hedlund-Botti E, Natarajan M, Johnson M, Gombolay M (2023) Concerning trends in likert scale usage in human-robot interaction: towards improving best practices. ACM Trans On Hum Rob Interact 12(3):33\u201313332. https:\/\/doi.org\/10.1145\/3572784","journal-title":"ACM Trans On Hum Rob Interact"},{"issue":"3","key":"1287_CR63","doi-asserted-by":"publisher","first-page":"297","DOI":"10.1007\/BF02310555","volume":"16","author":"LJ Cronbach","year":"1951","unstructured":"Cronbach LJ (1951) Coefficient alpha and the internal structure of tests. Psychometrika 16(3):297\u2013334. https:\/\/doi.org\/10.1007\/BF02310555","journal-title":"Psychometrika"},{"key":"1287_CR64","doi-asserted-by":"publisher","unstructured":"Rysbek Z, Li S, Mehri Shervedani A, \u017defran M (2024) Proactive robot control for collaborative manipulation using human intent. In 2024 IEEE International Conference on Robotics and Automation (ICRA), 3176\u20133182). https:\/\/doi.org\/10.1109\/ICRA57147.2024.10610909","DOI":"10.1109\/ICRA57147.2024.10610909"}],"container-title":["International Journal of Social Robotics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s12369-025-01287-6.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s12369-025-01287-6","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s12369-025-01287-6.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,12,24]],"date-time":"2025-12-24T08:40:45Z","timestamp":1766565645000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s12369-025-01287-6"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,7,21]]},"references-count":64,"journal-issue":{"issue":"12","published-print":{"date-parts":[[2025,12]]}},"alternative-id":["1287"],"URL":"https:\/\/doi.org\/10.1007\/s12369-025-01287-6","relation":{},"ISSN":["1875-4791","1875-4805"],"issn-type":[{"value":"1875-4791","type":"print"},{"value":"1875-4805","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,7,21]]},"assertion":[{"value":"2 July 2025","order":1,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"21 July 2025","order":2,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"All experimental protocols were approved by the University of Illinois Chicago Institutional Review Board (IRB).","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics Approval and Consent to Participate"}},{"value":"All authors have approved the manuscript prior to submission, including the names and order of the authors.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for Publication"}},{"value":"Informed consent was obtained from all participants involved in experiments using the University of Illinois Chicago Institutional Review Board (IRB) Social, Behavioral, and Educational Research Informed Consent Template.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Informed Consent Statement"}},{"value":"The methods described in this study were carried out in accordance with the guidelines and regulations provided by the University of Illinois Chicago Institutional Review Board (IRB).","order":5,"name":"Ethics","group":{"name":"EthicsHeading","label":"Accordance Statement"}},{"value":"The authors have no relevant financial or non-financial interests to disclose.","order":6,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of Interest\/Competing Interests"}}]}}