{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,27]],"date-time":"2026-01-27T09:30:32Z","timestamp":1769506232737,"version":"3.49.0"},"publisher-location":"New York, NY, USA","reference-count":50,"publisher":"ACM","funder":[{"name":"ARCIDUCA,EPSRC","award":["EP\/W001632\/1"],"award-info":[{"award-number":["EP\/W001632\/1"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2025,10,13]]},"DOI":"10.1145\/3716553.3756015","type":"proceedings-article","created":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T13:13:16Z","timestamp":1760188396000},"page":"682-692","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["Talking-to-Build: How LLM-Assisted Interface Shapes Player Performance and Experience in Minecraft"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-8188-7576","authenticated-orcid":false,"given":"Xin","family":"Sun","sequence":"first","affiliation":[{"name":"Social and Behavioural Science, University of Amsterdam, Amsterdam, Netherlands and National Institute of Informatics (NII), Tokyo, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0004-3464-5533","authenticated-orcid":false,"given":"Lei","family":"Wang","sequence":"additional","affiliation":[{"name":"Vrije Universiteit Amsterdam, Amsterdam, Netherlands and Universiteit Utrecht, Utrecht, Netherlands"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5624-7235","authenticated-orcid":false,"given":"Yue","family":"Li","sequence":"additional","affiliation":[{"name":"Social AI, Vrije Universiteit Amsterdam, Amsterdam, Netherlands"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6791-104X","authenticated-orcid":false,"given":"Jie","family":"Li","sequence":"additional","affiliation":[{"name":"Human-AI Symbiosis, Alliance, Zurich, Switzerland"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8469-2072","authenticated-orcid":false,"given":"Massimo","family":"Poesio","sequence":"additional","affiliation":[{"name":"Universiteit Utrecht, Utrecht, Netherlands and Queen Mary University of London, London, United Kingdom"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8783-7783","authenticated-orcid":false,"given":"Julian","family":"Frommel","sequence":"additional","affiliation":[{"name":"Universiteit Utrecht, Utrecht, Netherlands"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5707-5236","authenticated-orcid":false,"given":"Koen","family":"Hindriks","sequence":"additional","affiliation":[{"name":"Vrije Universiteit Amsterdam, Amsterdam, Netherlands"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6951-8340","authenticated-orcid":false,"given":"Jiahuan","family":"Pei","sequence":"additional","affiliation":[{"name":"Vrije Universiteit Amsterdam, Amsterdam, Netherlands"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2025,10,12]]},"reference":[{"key":"e_1_3_3_2_2_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-04898-2132"},{"key":"e_1_3_3_2_3_2","unstructured":"Anthony Brohan Noah Brown Justice Carbajal Yevgen Chebotar Xi Chen Krzysztof Choromanski Tianli Ding Danny Driess Avinava Dubey Chelsea Finn et\u00a0al. 2023. Rt-2: Vision-language-action models transfer web knowledge to robotic control. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2307.15818 (2023)."},{"key":"e_1_3_3_2_4_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.emnlp-main.637"},{"key":"e_1_3_3_2_5_2","unstructured":"Anthony Costarelli Mat Allen Roman Hauksson Grace Sodunke Suhas Hariharan Carlson Cheng Wenjie Li Joshua Clymer and Arjun Yadav. 2024. Gamebench: Evaluating strategic reasoning abilities of llm agents. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2406.06613 (2024)."},{"key":"e_1_3_3_2_6_2","doi-asserted-by":"crossref","unstructured":"Wentao Deng Jiahuan Pei Zhaochun Ren Zhumin Chen and Pengjie Ren. 2023. Intent-calibrated self-training for answer selection in open-domain dialogues. Transactions of the Association for Computational Linguistics 11 (2023) 1232\u20131249.","DOI":"10.1162\/tacl_a_00599"},{"key":"e_1_3_3_2_7_2","doi-asserted-by":"crossref","unstructured":"Alena Denisova Paul Cairns Christian Guckelsberger and David Zendle. 2020. Measuring perceived challenge in digital games: Development & validation of the challenge originating from recent gameplay interaction scale (CORGIS). International Journal of Human-Computer Studies 137 (2020) 102383.","DOI":"10.1016\/j.ijhcs.2019.102383"},{"key":"e_1_3_3_2_8_2","doi-asserted-by":"crossref","unstructured":"Satu Elo and Helvi Kyng\u00e4s. 2008. The qualitative content analysis process. Journal of advanced nursing 62 1 (2008) 107\u2013115.","DOI":"10.1111\/j.1365-2648.2007.04569.x"},{"key":"e_1_3_3_2_9_2","unstructured":"Linxi Fan Guanzhi Wang Yunfan Jiang Ajay Mandlekar Yuncong Yang Haoyi Zhu Andrew Tang De-An Huang Yuke Zhu and Anima Anandkumar. 2022. Minedojo: Building open-ended embodied agents with internet-scale knowledge. Advances in Neural Information Processing Systems 35 (2022) 18343\u201318362."},{"key":"e_1_3_3_2_10_2","doi-asserted-by":"crossref","unstructured":"Franz Faul Edgar Erdfelder Albert-Georg Lang and Axel Buchner. 2007. G*Power 3: a flexible statistical power analysis program for the social behavioral and biomedical sciences. Behav. Res. Methods 39 2 (May 2007) 175\u2013191.","DOI":"10.3758\/BF03193146"},{"key":"e_1_3_3_2_11_2","unstructured":"LLM Multi-Agent Framework. 2024. VillagerBench: Benchmarking Multi-Agent Collaboration in Minecraft. (2024)."},{"key":"e_1_3_3_2_12_2","unstructured":"Koya Kudo\u00a0Ian Frank. 2024. An LLM Chatbot in Minecraft with Educational Applications. (2024)."},{"key":"e_1_3_3_2_13_2","doi-asserted-by":"publisher","DOI":"10.4135\/9781412983419"},{"key":"e_1_3_3_2_14_2","unstructured":"ATLAS.ti Scientific Software\u00a0Development GmbH.2023. ATLAS.Ti. https:\/\/atlasti.com."},{"key":"e_1_3_3_2_15_2","unstructured":"Jonathan Gray Kavya Srinet Yacine Jernite Haonan Yu Zhuoyuan Chen Demi Guo Siddharth Goyal C\u00a0Lawrence Zitnick and Arthur Szlam. 2019. Craftassist: A framework for dialogue-enabled interactive agents. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/1907.08584 (2019)."},{"key":"e_1_3_3_2_16_2","doi-asserted-by":"publisher","DOI":"10.1109\/CoG60054.2024.10645539"},{"key":"e_1_3_3_2_17_2","unstructured":"Shuo Huang Muhammad\u00a0Umair Nasir Steven James and Julian Togelius. 2025. Word2Minecraft: Generating 3D Game Levels through Large Language Models. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2503.16536 (2025)."},{"key":"e_1_3_3_2_18_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2023.ijcnlp-srw.10"},{"key":"e_1_3_3_2_19_2","volume-title":"The Game Experience Questionnaire","author":"IJsselsteijn W.A.","year":"2013","unstructured":"W.A. IJsselsteijn, Y.A.W. de Kort, and K. Poels. 2013. The Game Experience Questionnaire. Technische Universiteit Eindhoven."},{"key":"e_1_3_3_2_20_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.acl-main.232"},{"key":"e_1_3_3_2_21_2","first-page":"4246","volume-title":"Ijcai","author":"Johnson Matthew","year":"2016","unstructured":"Matthew Johnson, Katja Hofmann, Tim Hutton, and David Bignell. 2016. The Malmo Platform for Artificial Intelligence Experimentation.. In Ijcai , Vol.\u00a016. 4246\u20134247."},{"key":"e_1_3_3_2_22_2","first-page":"146","volume-title":"NeurIPS 2021 Competitions and Demonstrations Track","author":"Kiseleva Julia","year":"2022","unstructured":"Julia Kiseleva, Ziming Li, Mohammad Aliannejadi, Shrestha Mohanty, Maartje ter Hoeve, Mikhail Burtsev, Alexey Skrynnik, Artem Zholus, Aleksandr Panov, Kavya Srinet, et\u00a0al. 2022. Interactive grounded language understanding in a collaborative environment: Iglu 2021. In NeurIPS 2021 Competitions and Demonstrations Track. PMLR, 146\u2013161."},{"key":"e_1_3_3_2_23_2","doi-asserted-by":"crossref","unstructured":"Chalamalasetti Kranti Sherzod Hakimov and David Schlangen. 2024. Retrieval-augmented code generation for situated action generation: A case study on minecraft. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2406.17553 (2024).","DOI":"10.18653\/v1\/2024.findings-emnlp.652"},{"key":"e_1_3_3_2_24_2","doi-asserted-by":"publisher","DOI":"10.1145\/2470654.2481287"},{"key":"e_1_3_3_2_25_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52733.2024.01554"},{"key":"e_1_3_3_2_26_2","unstructured":"Shunyu Liu Yaoru Li Kongcheng Zhang Zhenyu Cui Wenkai Fang Yuxuan Zheng Tongya Zheng and Mingli Song. 2024. Odyssey: Empowering Minecraft Agents with Open-World Skills. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2407.15325 (2024)."},{"key":"e_1_3_3_2_27_2","doi-asserted-by":"crossref","unstructured":"Yuanxing Liu Jiahuan Pei Wei-Nan Zhang Ming Li Wanxiang Che and Maarten De\u00a0Rijke. 2025. Augmentation with Neighboring Information for Conversational Recommendation. ACM Transactions on Information Systems 43 3 (2025) 1\u201349.","DOI":"10.1145\/3712588"},{"key":"e_1_3_3_2_28_2","doi-asserted-by":"crossref","unstructured":"Zhihan Lv Fabio Poiesi Qi Dong Jaime Lloret and Houbing Song. 2022. Deep learning for intelligent human\u2013computer interaction. Applied Sciences 12 22 (2022) 11457.","DOI":"10.3390\/app122211457"},{"key":"e_1_3_3_2_29_2","volume-title":"Proceedings of the 28th Workshop on the Semantics and Pragmatics of Dialogue","author":"Madge Chris","year":"2024","unstructured":"Chris Madge and Massimo Poesio. 2024. A LLM Benchmark based on the Minecraft Builder Dialog Agent Task. In Proceedings of the 28th Workshop on the Semantics and Pragmatics of Dialogue."},{"key":"e_1_3_3_2_30_2","first-page":"233","volume-title":"Content and Complexity","author":"Mirel Barbara","year":"2014","unstructured":"Barbara Mirel. 2014. Dynamic usability: Designing usefulness into systems for complex tasks. In Content and Complexity. Routledge, 233\u2013261."},{"key":"e_1_3_3_2_31_2","unstructured":"Shrestha Mohanty Negar Arabzadeh Milagro Teruel Yuxuan Sun Artem Zholus Alexey Skrynnik Mikhail Burtsev Kavya Srinet Aleksandr Panov Arthur Szlam et\u00a0al. 2022. Collecting interactive multi-modal datasets for grounded language understanding. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2211.06552 (2022)."},{"key":"e_1_3_3_2_32_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-1537"},{"key":"e_1_3_3_2_33_2","first-page":"7084","volume-title":"Proceedings of the Twelfth Language Resources and Evaluation Conference","author":"Ogawa Haruna","year":"2020","unstructured":"Haruna Ogawa, Hitoshi Nishikawa, Takenobu Tokunaga, and Hikaru Yokono. 2020. Gamification platform for collecting task-oriented dialogue data. In Proceedings of the Twelfth Language Resources and Evaluation Conference. 7084\u20137093."},{"key":"e_1_3_3_2_34_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2024.findings-acl.240"},{"key":"e_1_3_3_2_35_2","doi-asserted-by":"crossref","unstructured":"Jiahuan Pei Guojun Yan Maarten De\u00a0Rijke and Pengjie Ren. 2024. Mixture-of-Languages Routing for Multilingual Dialogues. ACM Transactions on Information Systems 42 6 (2024) 1\u201333.","DOI":"10.1145\/3676956"},{"key":"e_1_3_3_2_36_2","unstructured":"Sudha Rao Weijia Xu Michael Xu Jorge Leandro Ken Lobb Gabriel DesGarennes Chris Brockett and Bill Dolan. 2024. Collaborative Quest Completion with LLM-driven Non-Player Characters in Minecraft. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2407.03460 (2024)."},{"key":"e_1_3_3_2_37_2","unstructured":"Ferran Sanchez\u00a0Llado. 2024. Controlling Agents Behaviours through LLMs."},{"key":"e_1_3_3_2_38_2","doi-asserted-by":"publisher","unstructured":"S.\u00a0S. SHAPIRO and M.\u00a0B. WILK. 1965. An analysis of variance test for normality (complete samples). Biometrika 52 3-4 (dec 1965) 591\u2013611. 10.1093\/biomet\/52.3-4.591","DOI":"10.1093\/biomet\/52.3-4.591"},{"key":"e_1_3_3_2_39_2","volume-title":"Practical business statistics","author":"Siegel Andrew\u00a0F","year":"2016","unstructured":"Andrew\u00a0F Siegel. 2016. Practical business statistics. Academic Press."},{"key":"e_1_3_3_2_40_2","unstructured":"Aarohi Srivastava Abhinav Rastogi Abhishek Rao Abu Awal\u00a0Md Shoeb Abubakar Abid Adam Fisch Adam\u00a0R Brown Adam Santoro Aditya Gupta Adri\u00e0 Garriga-Alonso et\u00a0al. 2022. Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models. Transactions on Machine learning Research (2022)."},{"key":"e_1_3_3_2_41_2","unstructured":"Arthur Szlam Jonathan Gray Kavya Srinet Yacine Jernite Armand Joulin Gabriel Synnaeve Douwe Kiela Haonan Yu Zhuoyuan Chen Siddharth Goyal et\u00a0al. 2019. Why build an assistant in minecraft? arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/1907.09273 (2019)."},{"key":"e_1_3_3_2_42_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D19-1062"},{"key":"e_1_3_3_2_43_2","unstructured":"Guanzhi Wang Yuqi Xie Yunfan Jiang Ajay Mandlekar Chaowei Xiao Yuke Zhu Linxi Fan and Anima Anandkumar. 2023. Voyager: An open-ended embodied agent with large language models. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2305.16291 (2023)."},{"key":"e_1_3_3_2_44_2","unstructured":"Zihao Wang Shaofei Cai Guanzhou Chen Anji Liu Xiaojian\u00a0Shawn Ma and Yitao Liang. 2023. Describe explain plan and select: interactive planning with llms enables open-world multi-task agents. Advances in Neural Information Processing Systems 36 (2023) 34153\u201334189."},{"key":"e_1_3_3_2_45_2","unstructured":"Jason Wei Xuezhi Wang Dale Schuurmans Maarten Bosma Fei Xia Ed Chi Quoc\u00a0V Le Denny Zhou et\u00a0al. 2022. Chain-of-thought prompting elicits reasoning in large language models. Advances in neural information processing systems 35 (2022) 24824\u201324837."},{"key":"e_1_3_3_2_46_2","unstructured":"Jules White Quchen Fu Sam Hays Michael Sandborn Carlos Olea Henry Gilbert Ashraf Elnashar Jesse Spencer-Smith and Douglas\u00a0C Schmidt. 2023. A prompt pattern catalog to enhance prompt engineering with chatgpt. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2302.11382 (2023)."},{"key":"e_1_3_3_2_47_2","volume-title":"The Twelfth International Conference on Learning Representations","author":"Wu Yue","year":"2024","unstructured":"Yue Wu, Xuan Tang, Tom Mitchell, and Yuanzhi Li. 2024. SmartPlay: A Benchmark for LLMs as Intelligent Agents. In The Twelfth International Conference on Learning Representations."},{"key":"e_1_3_3_2_48_2","volume-title":"International Conference on Learning Representations (ICLR)","author":"Yao Shunyu","year":"2023","unstructured":"Shunyu Yao, Jeffrey Zhao, Dian Yu, Nan Du, Izhak Shafran, Karthik Narasimhan, and Yuan Cao. 2023. React: Synergizing reasoning and acting in language models. In International Conference on Learning Representations (ICLR)."},{"key":"e_1_3_3_2_49_2","unstructured":"Eray Yapa\u011fc\u0131 Yavuz Alp\u00a0Sencer \u00d6zt\u00fcrk and Eray T\u00fcz\u00fcn. 2025. BugCraft: End-to-End Crash Bug Reproduction Using LLM Agents in Minecraft. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2503.20036 (2025)."},{"key":"e_1_3_3_2_50_2","unstructured":"Xizhou Zhu Yuntao Chen Hao Tian Chenxin Tao Weijie Su Chenyu Yang Gao Huang Bin Li Lewei Lu Xiaogang Wang et\u00a0al. [n. d.]. Ghost in the Minecraft: Hierarchical Agents for Minecraft via Large Language Models with Text-based Knowledge and Memory. ([n. d.])."},{"key":"e_1_3_3_2_51_2","unstructured":"Xizhou Zhu Yuntao Chen Hao Tian Chenxin Tao Weijie Su Chenyu Yang Gao Huang Bin Li Lewei Lu Xiaogang Wang et\u00a0al. 2023. Ghost in the minecraft: Generally capable agents for open-world environments via large language models with text-based knowledge and memory. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2305.17144 (2023)."}],"event":{"name":"ICMI '25: International Conference on Multimodal Interaction","location":"Canberra Australia","acronym":"ICMI '25","sponsor":["SIGCHI ACM Special Interest Group on Computer-Human Interaction"]},"container-title":["Proceedings of the 27th International Conference on Multimodal Interaction"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3716553.3756015","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,1,26]],"date-time":"2026-01-26T22:26:06Z","timestamp":1769466366000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3716553.3756015"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,10,12]]},"references-count":50,"alternative-id":["10.1145\/3716553.3756015","10.1145\/3716553"],"URL":"https:\/\/doi.org\/10.1145\/3716553.3756015","relation":{},"subject":[],"published":{"date-parts":[[2025,10,12]]},"assertion":[{"value":"2025-10-12","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}