{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,24]],"date-time":"2026-03-24T22:59:01Z","timestamp":1774393141199,"version":"3.50.1"},"reference-count":129,"publisher":"Association for Computing Machinery (ACM)","issue":"6","license":[{"start":{"date-parts":[[2024,10,22]],"date-time":"2024-10-22T00:00:00Z","timestamp":1729555200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Inf. Syst."],"published-print":{"date-parts":[[2024,11,30]]},"abstract":"<jats:p>We consider multilingual dialogue systems and ask how the performance of a dialogue system can be improved by using information that is available in other languages than the language in which a conversation is being conducted. We adopt a collaborative chair-experts framework, where each expert agent can be either monolingual or cross-lingual, and a chair agent follows a mixture-of-experts procedure for globally optimizing multilingual task-oriented dialogue systems. We propose a mixture-of-languages routing framework that includes four functional components, i.e., input embeddings of multilingual dialogues, language model, pairwise alignment between the representation of every two languages, and mixture-of-languages. We quantify language characteristics of unity and diversity using a number of similarity metrics, i.e., genetic similarity and word and sentence similarity based on embeddings. Our main finding is that the performance of multilingual task-oriented dialogue systems can be greatly impacted by three key aspects, i.e., data sufficiency, language characteristics, and model design in a mixture-of-languages routing framework.<\/jats:p>","DOI":"10.1145\/3676956","type":"journal-article","created":{"date-parts":[[2024,8,5]],"date-time":"2024-08-05T15:53:42Z","timestamp":1722873222000},"page":"1-33","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":5,"title":["Mixture-of-Languages Routing for Multilingual Dialogues"],"prefix":"10.1145","volume":"42","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-6951-8340","authenticated-orcid":false,"given":"Jiahuan","family":"Pei","sequence":"first","affiliation":[{"name":"Vrije Universiteit Amsterdam, Amsterdam, The Netherlands"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3682-1041","authenticated-orcid":false,"given":"Guojun","family":"Yan","sequence":"additional","affiliation":[{"name":"China Academy of Engineering Physics, Mianyang, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1086-0202","authenticated-orcid":false,"given":"Maarten","family":"De Rijke","sequence":"additional","affiliation":[{"name":"University of Amsterdam, Amsterdam, The Netherlands"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2964-6422","authenticated-orcid":false,"given":"Pengjie","family":"Ren","sequence":"additional","affiliation":[{"name":"Shandong University, Qingdao, China"}]}],"member":"320","published-online":{"date-parts":[[2024,10,22]]},"reference":[{"key":"e_1_3_4_2_2","doi-asserted-by":"crossref","unstructured":"Yejin Bang Samuel Cahyawijaya Nayeon Lee Wenliang Dai Dan Su Bryan Wilie Holy Lovenia Ziwei Ji Tiezheng Yu Willy Chung Quyet V. Do Yan Xu and Pascale Fung. 2023. A multitask multilingual multimodal evaluation of chatgpt on reasoning hallucination and interactivity. arXiv:2302.04023. Retrieved from https:\/\/arxiv.org\/abs\/2302.04023","DOI":"10.18653\/v1\/2023.ijcnlp-main.45"},{"issue":"3","key":"e_1_3_4_3_2","doi-asserted-by":"crossref","first-page":"571","DOI":"10.1162\/coli_a_00382","article-title":"Semantic drift in multilingual representations","volume":"46","author":"Beinborn Lisa","year":"2020","unstructured":"Lisa Beinborn and Rochelle Choenni. 2020. Semantic drift in multilingual representations. Computational Linguistics 46, 3 (2020), 571\u2013603.","journal-title":"Computational Linguistics"},{"key":"e_1_3_4_4_2","first-page":"883","volume-title":"Proceedings of the 6th International Joint Conference on Natural Language Processing","author":"Beinborn Lisa","year":"2013","unstructured":"Lisa Beinborn, Torsten Zesch, and Iryna Gurevych. 2013. Cognate production using character-based machine translation. In Proceedings of the 6th International Joint Conference on Natural Language Processing, 883\u2013891."},{"issue":"2","key":"e_1_3_4_5_2","doi-asserted-by":"crossref","first-page":"381","DOI":"10.1162\/coli_a_00351","article-title":"What do language representations really represent?","volume":"45","author":"Bjerva Johannes","year":"2019","unstructured":"Johannes Bjerva, Robert \u00d6stling, Maria Han Veiga, J\u00f6rg Tiedemann, and Isabelle Augenstein. 2019. What do language representations really represent? Computational Linguistics 45, 2 (2019), 381\u2013389.","journal-title":"Computational Linguistics"},{"key":"e_1_3_4_6_2","doi-asserted-by":"crossref","first-page":"537","DOI":"10.1007\/978-3-662-04230-4_39","volume-title":"Verbmobil: Foundations of Speech-to-speech Translation","author":"Burger Susanne","year":"2000","unstructured":"Susanne Burger, Karl Weilhammer, Florian Schiel, and Hans G. Tillmann. 2000. Verbmobil data collection and annotation. In Verbmobil: Foundations of Speech-to-speech Translation. Springer, 537\u2013549."},{"key":"e_1_3_4_7_2","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1016\/j.neunet.2015.02.012","article-title":"Multilingual part-of-speech tagging with weightless neural networks","volume":"66","author":"Carneiro Hugo C. C.","year":"2015","unstructured":"Hugo C. C. Carneiro, Felipe M. G. Fran\u00e7a, and Priscila M. V. Lima. 2015. Multilingual part-of-speech tagging with weightless neural networks. Neural Networks 66 (2015), 11\u201321.","journal-title":"Neural Networks"},{"issue":"2","key":"e_1_3_4_8_2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3440755","article-title":"Evolution of semantic similarity\u2014a survey","volume":"54","author":"Chandrasekaran Dhivya","year":"2021","unstructured":"Dhivya Chandrasekaran and Vijay Mago. 2021. Evolution of semantic similarity\u2014a survey. ACM Computing Surveys 54, 2 (2021), 1\u201337.","journal-title":"ACM Computing Surveys"},{"key":"e_1_3_4_9_2","volume-title":"Proceedings of Interspeech","author":"Chao Guan-Lin","year":"2019","unstructured":"Guan-Lin Chao and Ian Lane. 2019. BERT-DST: Scalable end-to-end dialogue state tracking with bidirectional encoder representations from transformer. Proceedings of Interspeech."},{"issue":"2","key":"e_1_3_4_10_2","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1145\/3166054.3166058","article-title":"A survey on dialogue systems: Recent advances and new frontiers","volume":"19","author":"Chen Hongshen","year":"2017","unstructured":"Hongshen Chen, Xiaorui Liu, Dawei Yin, and Jiliang Tang. 2017. A survey on dialogue systems: Recent advances and new frontiers. ACM SIGKDD Explorations Newsletter 19, 2 (2017), 25\u201335.","journal-title":"ACM SIGKDD Explorations Newsletter"},{"key":"e_1_3_4_11_2","first-page":"414","volume-title":"Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP)","author":"Chen Wenhu","year":"2018","unstructured":"Wenhu Chen, Jianshu Chen, Yu Su, Xin Wang, Dong Yu, Xifeng Yan, and William Yang Wang. 2018. XL-NBT: A cross-lingual neural belief tracking framework. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 414\u2013424."},{"key":"e_1_3_4_12_2","doi-asserted-by":"crossref","first-page":"8440","DOI":"10.18653\/v1\/2020.acl-main.747","volume-title":"Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL)","author":"Conneau Alexis","year":"2020","unstructured":"Alexis Conneau, Kartikay Khandelwal, Naman Goyal, Vishrav Chaudhary, Guillaume Wenzek, Francisco Guzm\u00e1n, \u00c9douard Grave, Myle Ott, Luke Zettlemoyer, and Veselin Stoyanov. 2020. Unsupervised cross-lingual representation learning at scale. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL), 8440\u20138451."},{"key":"e_1_3_4_13_2","first-page":"7059","volume-title":"Proceedings of the 33rd International Conference on Neural Information Processing Systems","author":"Conneau Alexis","year":"2019","unstructured":"Alexis Conneau and Guillaume Lample. 2019. Cross-lingual language model pretraining. In Proceedings of the 33rd International Conference on Neural Information Processing Systems, 7059\u20137069."},{"key":"e_1_3_4_14_2","unstructured":"Alexis Conneau Guillaume Lample Marc\u2019Aurelio Ranzato Ludovic Denoyer and Herv\u00e9 J\u00e9gou. 2017. Word translation without parallel data. arXiv:1710.04087. Retrieved from https:\/\/arxiv.org\/abs\/1710.04087"},{"issue":"1","key":"e_1_3_4_15_2","doi-asserted-by":"crossref","first-page":"3","DOI":"10.1017\/S0266078408000023","article-title":"Two thousand million?","volume":"24","author":"Crystal David","year":"2008","unstructured":"David Crystal. 2008. Two thousand million? English Today 24, 1 (2008), 3\u20136.","journal-title":"English Today"},{"key":"e_1_3_4_16_2","unstructured":"Richard Csaky and Gabor Recski. 2020. The gutenberg dialogue dataset. arXiv:2004.12752. Retrieved from https:\/\/arxiv.org\/abs\/2004.12752"},{"key":"e_1_3_4_17_2","volume-title":"Approaches to Measuring Linguistic Differences","author":"Cysouw Michael","year":"2013","unstructured":"Michael Cysouw. 2013. Predicting language-learning difficulty. In Approaches to Measuring Linguistic Differences. De Gruyter."},{"key":"e_1_3_4_18_2","unstructured":"Raj Dabre Aizhan Imankulova Masahiro Kaneko and Abhisek Chakrabarty. 2021. Simultaneous multi-pivot neural machine translation. arXiv:2104.07410. Retrieved from https:\/\/arxiv.org\/abs\/2104.07410"},{"key":"e_1_3_4_19_2","first-page":"852","article-title":"What exactly is universal grammar, and has anyone seen it?","volume":"6","author":"Dabrowska Ewa","year":"2015","unstructured":"Ewa Dabrowska. 2015. What exactly is universal grammar, and has anyone seen it? Frontiers in Psychology 6 (2015), 852.","journal-title":"Frontiers in Psychology"},{"key":"e_1_3_4_20_2","volume-title":"The Oxford Handbook of Linguistic Typology","author":"Daniel Michael","year":"2011","unstructured":"Michael Daniel. 2011. Linguistic typology and the study of language. In The Oxford Handbook of Linguistic Typology. Oxford University Press."},{"key":"e_1_3_4_21_2","first-page":"4171","volume-title":"Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies","volume":"1","author":"Devlin Jacob","year":"2019","unstructured":"Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Vol. 1, Long and Short Papers, 4171\u20134186."},{"key":"e_1_3_4_22_2","first-page":"1639","volume-title":"Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics","volume":"1","author":"Ding Bosheng","year":"2022","unstructured":"Bosheng Ding, Junjie Hu, Lidong Bing, Mahani Aljunied, Shafiq Joty, Luo Si, and Chunyan Miao. 2022. GlobalWoZ: Globalizing multiwoz to develop multilingual task-oriented dialogue systems. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics, Vol. 1, Long Papers, 1639\u20131657."},{"key":"e_1_3_4_23_2","volume-title":"I Am a Linguist: With a Foreword by Peter Matthews","author":"Dixon Robert M.W.","year":"2010","unstructured":"Robert M.W. Dixon. 2010. I Am a Linguist: With a Foreword by Peter Matthews. Brill."},{"key":"e_1_3_4_24_2","first-page":"1126","volume-title":"Proceedings of the 34th International Conference on Machine Learning (ICML)","author":"Finn Chelsea","year":"2017","unstructured":"Chelsea Finn, Pieter Abbeel, and Sergey Levine. 2017. Model-agnostic meta-learning for fast adaptation of deep networks. In Proceedings of the 34th International Conference on Machine Learning (ICML). PMLR, 1126\u20131135."},{"issue":"1563","key":"e_1_3_4_25_2","doi-asserted-by":"crossref","first-page":"376","DOI":"10.1098\/rstb.2010.0223","article-title":"Unity and diversity in human language","volume":"366","author":"Fitch W. Tecumseh","year":"2011","unstructured":"W. Tecumseh Fitch. 2011. Unity and diversity in human language. Philosophical Transactions of the Royal Society B: Biological Sciences 366, 1563 (2011), 376\u2013388.","journal-title":"Philosophical Transactions of the Royal Society B: Biological Sciences"},{"key":"e_1_3_4_26_2","first-page":"161","volume-title":"The Routledge Handbook of Historical Linguistics","author":"Fran\u00e7ois Alexandre","year":"2015","unstructured":"Alexandre Fran\u00e7ois. 2015. Trees, waves and linkages: Models of language diversification. In The Routledge Handbook of Historical Linguistics. Routledge, 161\u2013189."},{"issue":"3","key":"e_1_3_4_27_2","doi-asserted-by":"crossref","first-page":"89","DOI":"10.1109\/MSP.2008.918417","article-title":"Multilingual spoken language processing","volume":"25","author":"Fung Pascale","year":"2008","unstructured":"Pascale Fung and Tanja Schultz. 2008. Multilingual spoken language processing. IEEE Signal Processing Magazine 25, 3 (2008), 89\u201397.","journal-title":"IEEE Signal Processing Magazine"},{"key":"e_1_3_4_28_2","first-page":"371","volume-title":"Proceedings of the International Conference on Findings of the Association for Computational Linguistics (ACL-IJCNLP \u201921)","author":"Gritta Milan","year":"2021","unstructured":"Milan Gritta and Ignacio Iacobacci. 2021. XeroAlign: Zero-shot cross-lingual transformer alignment. In Proceedings of the International Conference on Findings of the Association for Computational Linguistics (ACL-IJCNLP \u201921), 371\u2013381."},{"key":"e_1_3_4_29_2","author":"Hadi Muhammad Usman","year":"2023","unstructured":"Muhammad Usman Hadi, Rizwan Qureshi, Abbas Shah, Muhammad Irfan, Anas Zafar, Muhammad Bilal Shaikh, Naveed Akhtar, Jia Wu, Seyedali Mirjalili, and Mubarak Shah. 2023. A Survey on Large Language Models: Applications, Challenges, Limitations, and Practical Usage. TechRxiv.","journal-title":"A Survey on Large Language Models: Applications, Challenges, Limitations, and Practical Usage"},{"issue":"1","key":"e_1_3_4_30_2","first-page":"209","article-title":"How hopeless is genealogical linguistics, and how advanced is areal linguistics?","volume":"28","author":"Haspelmath Martin","year":"2004","unstructured":"Martin Haspelmath. 2004. How hopeless is genealogical linguistics, and how advanced is areal linguistics? Studies in Language 28, 1 (2004), 209\u2013223.","journal-title":"Studies in Language"},{"issue":"4","key":"e_1_3_4_31_2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3465272","article-title":"Conversational search and recommendation: Introduction to the special issue","volume":"39","author":"Hauff Claudia","year":"2021","unstructured":"Claudia Hauff, Julia Kiseleva, Mark Sanderson, Hamed Zamani, and Yongfeng Zhang. 2021. Conversational search and recommendation: Introduction to the special issue. ACM Transactions on Information Systems 39, 4 (2021), 1\u20136.","journal-title":"ACM Transactions on Information Systems"},{"key":"e_1_3_4_32_2","unstructured":"Hiyouga. 2023. LLaMA Factory. Retrieved from https:\/\/github.com\/hiyouga\/LLaMA-Factory."},{"key":"e_1_3_4_33_2","first-page":"20179","article-title":"A simple language model for task-oriented dialogue","volume":"33","author":"Hosseini-Asl Ehsan","year":"2020","unstructured":"Ehsan Hosseini-Asl, Bryan McCann, Chien-Sheng Wu, Semih Yavuz, and Richard Socher. 2020. A simple language model for task-oriented dialogue. Advances in Neural Information Processing Systems 33 (2020), 20179\u201320191.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_4_34_2","volume-title":"Multilingual Information Management: Current Levels and Future Abilities","author":"Hovy Eduard","year":"2001","unstructured":"Eduard Hovy, Nancy Ide, Robert Frederking, Joseph Mariani, and Antonio Zampolli. 2001. Multilingual Information Management: Current Levels and Future Abilities. Istituti Editoriali e Poligrafici Internazionali, Pisa."},{"key":"e_1_3_4_35_2","volume-title":"Proceedings of the International Conference on Learning Representations","author":"Hu Edward J.","year":"2022","unstructured":"Edward J. Hu, Yelong Shen, Phillip Wallis, Zeyuan Allen-Zhu, Yuanzhi Li, Shean Wang, Lu Wang, and Weizhu Chen. 2022. LoRA: Low-Rank adaptation of large language models. In Proceedings of the International Conference on Learning Representations. OpenReview.net. Retrieved from https:\/\/openreview.net\/forum?id=nZeVKeeFYf9"},{"issue":"3","key":"e_1_3_4_36_2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3383123","article-title":"Challenges in building intelligent open-domain dialog systems","volume":"38","author":"Huang Minlie","year":"2020","unstructured":"Minlie Huang, Xiaoyan Zhu, and Jianfeng Gao. 2020. Challenges in building intelligent open-domain dialog systems. ACM Transactions on Information Systems 38, 3 (2020), 1\u201332.","journal-title":"ACM Transactions on Information Systems"},{"key":"e_1_3_4_37_2","unstructured":"Chia-Chien Hung Anne Lauscher Ivan Vuli\u0107 Simone Paolo Ponzetto and Goran Glava\u0161. 2022. Multi2WOZ: A robust multilingual dataset and conversational pretraining for task-oriented dialog. arXiv:2205.10400. Retrieved from https:\/\/arxiv.org\/abs\/2205.10400"},{"key":"e_1_3_4_38_2","first-page":"583","volume-title":"Proceedings of the International Conference on Electrical, Electronics, Communication, Computer, and Optimization Techniques (ICEECCOT)","author":"Jayarao Pratik","year":"2018","unstructured":"Pratik Jayarao and Aman Srivastava. 2018. Intent detection for code-mix utterances in task oriented dialogue systems. In Proceedings of the International Conference on Electrical, Electronics, Communication, Computer, and Optimization Techniques (ICEECCOT). IEEE, 583\u2013587."},{"key":"e_1_3_4_39_2","unstructured":"Albert Q Jiang Alexandre Sablayrolles Antoine Roux Arthur Mensch Blanche Savary Chris Bamford Devendra Singh Chaplot Diego de las Casas Emma Bou Hanna Florian Bressand Gianna Lengyel Guillaume Bour Guillaume Lample L\u00e9lio Renard Lavaud Lucile Saulnier Marie-Anne Lachaux Pierre Stock Sandeep Subramanian Sophia Yang Szymon Antoniak Teven Le Scao Th\u00e9ophile Gervet Thibaut Lavril Thomas Wang Timoth\u00e9e Lacroix and William El Sayed. 2024. Mixtral of experts. arXiv:2401.04088."},{"key":"e_1_3_4_40_2","doi-asserted-by":"crossref","first-page":"2979","DOI":"10.18653\/v1\/D18-1330","volume-title":"Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP)","author":"Joulin Armand","year":"2018","unstructured":"Armand Joulin, Piotr Bojanowski, Tom\u00e1\u0161 Mikolov, Herv\u00e9 J\u00e9gou, and \u00c9douard Grave. 2018. Loss in Translation: Learning Bilingual Word Mapping with a Retrieval Criterion. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2979\u20132984."},{"key":"e_1_3_4_41_2","unstructured":"Prabhu Kaliamoorthi Aditya Siddhant Edward Li and Melvin Johnson. 2021. Distilling large language models into tiny and effective students using pQRNN. arXiv:2101.08890. Retrieved from https:\/\/arxiv.org\/abs\/2101.08890"},{"key":"e_1_3_4_42_2","first-page":"511","volume-title":"Proceedings of the EEE Spoken Language Technology Workshop (SLT Workshop)","author":"Kim Seokhwan","year":"2016","unstructured":"Seokhwan Kim, Luis Fernando D\u2019Haro, Rafael E. Banchs, Jason D. Williams, Matthew Henderson, and Koichiro Yoshino. 2016. The fifth dialog state tracking challenge. In Proceedings of the EEE Spoken Language Technology Workshop (SLT Workshop). IEEE, 511\u2013517."},{"key":"e_1_3_4_43_2","doi-asserted-by":"crossref","first-page":"12","DOI":"10.18653\/v1\/W19-4203","volume-title":"Proceedings of the 16th Workshop on Computational Research in Phonetics, Phonology, and Morphology","author":"Kondratyuk Dan","year":"2019","unstructured":"Dan Kondratyuk. 2019. Cross-lingual lemmatization and morphology tagging with two-stage multilingual BERT fine-tuning. In Proceedings of the 16th Workshop on Computational Research in Phonetics, Phonology, and Morphology, 12\u201318."},{"key":"e_1_3_4_44_2","doi-asserted-by":"crossref","first-page":"211","DOI":"10.18653\/v1\/2021.mrl-1.18","volume-title":"Proceedings of the 1st Workshop on Multilingual Representation Learning (MRL Workshop)","author":"Krishnan Jitin","year":"2021","unstructured":"Jitin Krishnan, Antonios Anastasopoulos, Hemant Purohit, and Huzefa Rangwala. 2021. Multilingual code-switching for zero-shot cross-lingual intent prediction and slot filling. In Proceedings of the 1st Workshop on Multilingual Representation Learning (MRL Workshop). 211\u2013223."},{"key":"e_1_3_4_45_2","first-page":"8107","volume-title":"Proceedings of the AAAI Conference on Artificial Intelligence","volume":"34","author":"Kumar Adarsh","year":"2020","unstructured":"Adarsh Kumar, Peter Ku, Anuj Goyal, Angeliki Metallinou, and Dilek Hakkani-Tur. 2020. MA-DST: Multi-attention-based scalable dialog state tracking. Proceedings of the AAAI Conference on Artificial Intelligence 34 (2020), 8107\u20138114."},{"key":"e_1_3_4_46_2","first-page":"8034","volume-title":"Proceedings of the EEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","author":"Lai Tuan Manh","year":"2020","unstructured":"Tuan Manh Lai, Quan Hung Tran, Trung Bui, and Daisuke Kihara. 2020. A simple but effective bert model for dialog state tracking on resource-limited systems. In Proceedings of the EEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 8034\u20138038."},{"key":"e_1_3_4_47_2","unstructured":"Teven Le Scao Angela Fan Christopher Akiki Ellie Pavlick Suzana Ilic Daniel Hesslow Roman Castagn\u00e9 Alexandra Sasha Luccioni Fran\u00e7ois Yvon Matthias Gall\u00e9 Jonathan Tow Alexander M. Rush Stella Biderman Albert Webson Pawan Sasanka Ammanamanchi Thomas Wang Beno\u00eet Sagot Niklas Muennighoff Albert Villanova del Moral Olatunji Ruwase Rachel Bawden Stas Bekman Angelina McMillan-Major Iz Beltagy Huu Nguyen Lucile Saulnier Samson Tan Pedro Ortiz Suarez Victor Sanh Hugo Lauren\u00e7on Yacine Jernite Julien Launay Margaret Mitchell Colin Raffel Aaron Gokaslan Adi Simhi Aitor Soroa Alham Fikri Aji Amit Alfassy Anna Rogers Ariel Kreisberg Nitzav Canwen Xu Chenghao Mou Chris Emezue Christopher Klamm Colin Leong Daniel van Strien David Ifeoluwa Adelani Dragomir Radev Eduardo Gonz\u00e1lez Ponferrada Efrat Levkovizh Ethan Kim Eyal Bar Natan Francesco De Toni G\u00e9rard Dupont Germ\u00e1n Kruszewski Giada Pistilli Hady Elsahar Hamza Benyamina Hieu Tran Ian Yu Idris Abdulmumin Isaac Johnson Itziar Gonzalez-Dios Javier de la Rosa Jenny Chim Jesse Dodge Jian Zhu Jonathan Chang J\u00f6rg Frohberg Joseph Tobing Joydeep Bhattacharjee Khalid Almubarak Kimbo Chen Kyle Lo Leandro Von Werra Leon Weber Long Phan Loubna Ben allal Ludovic Tanguy Manan Dey Manuel Romero Mu\u223cnoz Maraim Masoud Mar\u00eda Grandury Mario \u0160a\u0161ko Max Huang Maximin Coavoux Mayank Singh Mike Tian-Jian Jiang Minh Chien Vu Mohammad A. Jauhar Mustafa Ghaleb Nishant Subramani Nora Kassner Nurulaqilla Khamis Olivier Nguyen Omar Espejel Ona de Gibert and Paulo Villegas. 2022. BLOOM: A 176b-parameter open-access multilingual language model. arXiv.2211.05100. Retrieved from https:\/\/arxiv.org\/abs\/2211.05100"},{"key":"e_1_3_4_48_2","first-page":"5478","volume-title":"Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL)","author":"Lee Hwaran","year":"2019","unstructured":"Hwaran Lee, Jinsik Lee, and Tae-Yoon Kim. 2019. SUMBT: Slot-utterance matching for universal and scalable belief tracking. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL), 5478\u20135483."},{"key":"e_1_3_4_49_2","first-page":"2950","volume-title":"Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics (EACL)","author":"Li Haoran","year":"2021","unstructured":"Haoran Li, Abhinav Arora, Shuohui Chen, Anchit Gupta, Sonal Gupta, and Yashar Mehdad. 2021a. MTOP: A comprehensive multilingual task-oriented semantic parsing benchmark. In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2950\u20132962."},{"issue":"4","key":"e_1_3_4_50_2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3453183","article-title":"Dialogue history matters! Personalized response selection in multi-turn retrieval-based chatbots","volume":"39","author":"Li Juntao","year":"2021","unstructured":"Juntao Li, Chang Liu, Chongyang Tao, Zhangming Chan, Dongyan Zhao, Min Zhang, and Rui Yan. 2021b. Dialogue history matters! Personalized response selection in multi-turn retrieval-based chatbots. ACM Transactions on Information Systems 39, 4 (2021), 1\u201325.","journal-title":"ACM Transactions on Information Systems"},{"key":"e_1_3_4_51_2","unstructured":"Tomasz Limisiewicz and David Mare\u010dek. 2020. Syntax representation in word embeddings and neural networks \u2013 A survey. arXiv:2010.01063."},{"key":"e_1_3_4_52_2","first-page":"7890","volume-title":"Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP)","author":"Lin Zhaojiang","year":"2021","unstructured":"Zhaojiang Lin, Bing Liu, Andrea Madotto, Seungwhan Moon, Zhenpeng Zhou, Paul A Crook, Zhiguang Wang, Zhou Yu, Eunjoon Cho, Rajen Subba, and Pascale Fung. 2021a. Zero-shot dialogue state tracking via cross-task transfer. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP), 7890\u20137900."},{"key":"e_1_3_4_53_2","first-page":"102","volume-title":"Proceedings of the 3rd Workshop on Natural Language Processing for Conversational AI","author":"Lin Zhaojiang","year":"2021","unstructured":"Zhaojiang Lin, Zihan Liu, Genta Indra Winata, Samuel Cahyawijaya, Andrea Madotto, Yejin Bang, Etsuko Ishii, and Pascale Fung. 2021b. XPersona: Evaluating multilingual personalized chatbot. In Proceedings of the 3rd Workshop on Natural Language Processing for Conversational AI, 102\u2013112."},{"key":"e_1_3_4_54_2","unstructured":"Zhaojiang Lin Andrea Madotto Genta Indra Winata Peng Xu Feijun Jiang Yuxiang Hu Chen Shi and Pascale Fung. 2021c. Bitod: A bilingual multi-domain dataset for task-oriented dialogue modeling. arXiv:2106.02787. Retrieved from https:\/\/arxiv.org\/abs\/2106.02787"},{"issue":"1","key":"e_1_3_4_55_2","first-page":"2","article-title":"Generating relevant and informative questions for open-domain conversations","volume":"41","author":"Ling Yanxiang","year":"2023","unstructured":"Yanxiang Ling, Fei Cai, Jun Liu, Honghui Chen, and Maarten de Rijke. 2023. Generating relevant and informative questions for open-domain conversations. ACM Transactions on Information Systems 41, 1 (2023), Article 2.","journal-title":"ACM Transactions on Information Systems"},{"key":"e_1_3_4_56_2","doi-asserted-by":"crossref","first-page":"726","DOI":"10.1162\/tacl_a_00343","article-title":"Multilingual denoising pre-training for neural machine translation","volume":"8","author":"Liu Yinhan","year":"2020","unstructured":"Yinhan Liu, Jiatao Gu, Naman Goyal, Xian Li, Sergey Edunov, Marjan Ghazvininejad, Mike Lewis, and Luke Zettlemoyer. 2020a. Multilingual denoising pre-training for neural machine translation. Transactions of the Association for Computational Linguistics 8 (2020), 726\u2013742.","journal-title":"Transactions of the Association for Computational Linguistics"},{"key":"e_1_3_4_57_2","first-page":"1297","volume-title":"Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)","author":"Liu Zihan","year":"2019","unstructured":"Zihan Liu, Jamin Shin, Yan Xu, Genta Indra Winata, Peng Xu, Andrea Madotto, and Pascale Fung. 2019. Zero-shot Cross-lingual dialogue systems with transferable latent variables. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), 1297\u20131303."},{"key":"e_1_3_4_58_2","first-page":"13461","volume-title":"Proceedings of the AAAI Conference on Artificial Intelligence","volume":"35","author":"Liu Zihan","year":"2021","unstructured":"Zihan Liu, Genta I Winata, Samuel Cahyawijaya, Andrea Madotto, Zhaojiang Lin, and Pascale Fung. 2021. On the importance of word order information in cross-lingual sequence labeling. Proceedings of the AAAI Conference on Artificial Intelligence 35 (2021), 13461\u201313469."},{"key":"e_1_3_4_59_2","first-page":"8433","volume-title":"Proceedings of the 34th AAAI Conference on Artificial Intelligence (AAAI-20)","volume":"34","author":"Liu Zihan","year":"2020","unstructured":"Zihan Liu, Genta Indra Winata, Zhaojiang Lin, Peng Xu, and Pascale Fung. 2020b. Attention-informed mixed-language training for zero-shot cross-lingual task-oriented dialogue systems. In Proceedings of the 34th AAAI Conference on Artificial Intelligence (AAAI-20), 34, 8433\u20138440."},{"key":"e_1_3_4_60_2","volume-title":"Proceedings of the International Conference on Learning Representations (ICLR)","author":"Loshchilov Ilya","year":"2018","unstructured":"Ilya Loshchilov and Frank Hutter. 2018. Decoupled weight decay regularization. In Proceedings of the International Conference on Learning Representations (ICLR)."},{"key":"e_1_3_4_61_2","first-page":"167","volume-title":"Proceedings of the 34th Pacific Asia Conference on Language, Information and Computation (PACLIC)","author":"Louvan Samuel","year":"2020","unstructured":"Samuel Louvan and Bernardo Magnini. 2020. Simple is better! Lightweight data augmentation for low resource slot filling and intent classification. In Proceedings of the 34th Pacific Asia Conference on Language, Information and Computation (PACLIC), 167\u2013177."},{"issue":"6","key":"e_1_3_4_62_2","first-page":"59","article-title":"Identification of English functional noun phrases using CRFs combining the semantic information","volume":"30","author":"Ma Jianjun","year":"2016","unstructured":"Jianjun Ma, Jiahuan Pei, and Degen Huang. 2016. Identification of English functional noun phrases using CRFs combining the semantic information. Journal of Chinese Information Processing 30, 6 (2016), 59\u201366.","journal-title":"Journal of Chinese Information Processing"},{"issue":"1","key":"e_1_3_4_63_2","first-page":"126","article-title":"Syntactic parsing of clause constituents for statistical machine translation","volume":"17","author":"Ma Jianjun","year":"2018","unstructured":"Jianjun Ma, Jiahuan Pei, Degen Huang, and Dingxin Song. 2018. Syntactic parsing of clause constituents for statistical machine translation. International Journal of Computational Science and Engineering 17, 1 (2018), 126\u2013132.","journal-title":"International Journal of Computational Science and Engineering"},{"issue":"1","key":"e_1_3_4_64_2","first-page":"1","article-title":"Unstructured text enhanced open-domain dialogue system: A systematic survey","volume":"40","author":"Ma Longxuan","year":"2021","unstructured":"Longxuan Ma, Mingda Li, Wei-Nan Zhang, Jiapeng Li, and Ting Liu. 2021. Unstructured text enhanced open-domain dialogue system: A systematic survey. ACM Transactions on Information Systems 40, 1 (2021), 1\u201344.","journal-title":"ACM Transactions on Information Systems"},{"key":"e_1_3_4_65_2","first-page":"50","volume-title":"Handbook of Bilingualism: Psycholinguistic Approaches","author":"MacWhinney Brian","year":"2005","unstructured":"Brian MacWhinney. 2005. A unified model of language acquisition. In Handbook of Bilingualism: Psycholinguistic Approaches. Judith F. Kroll and Annette M.B. de Groot (Eds.), Vol. 4967, Oxford University Press, 50\u201370."},{"key":"e_1_3_4_66_2","unstructured":"Andrea Madotto Zhaojiang Lin Genta Indra Winata and Pascale Fung. 2021. Few-shot bot: Prompt-based learning for dialogue systems. arXiv:2110.08118. Retrieved from https:\/\/arxiv.org\/abs\/2110.08118"},{"key":"e_1_3_4_67_2","first-page":"6297","volume-title":"Proceedings of the 31st International Conference on Neural Information Processing Systems","author":"McCann Bryan","year":"2017","unstructured":"Bryan McCann, James Bradbury, Caiming Xiong, and Richard Socher. 2017. Learned in translation: Contextualized word vectors. In Proceedings of the 31st International Conference on Neural Information Processing Systems, 6297\u20136308."},{"key":"e_1_3_4_68_2","first-page":"29","article-title":"UMAP: Uniform manifold approximation and projection","volume":"3","author":"McInnes Leland","year":"2018","unstructured":"Leland McInnes, John Healy, Nathaniel Saul, and Lukas Gro\u00dfberger. 2018. UMAP: Uniform manifold approximation and projection. Journal of Open Source Software 3, 29 (2018), 861.","journal-title":"Journal of Open Source Software"},{"key":"e_1_3_4_69_2","doi-asserted-by":"crossref","first-page":"155","DOI":"10.3389\/fpsyg.2018.00155","article-title":"ULTRA: Universal grammar as a universal parser","volume":"9","author":"Medeiros David P","year":"2018","unstructured":"David P Medeiros. 2018. ULTRA: Universal grammar as a universal parser. Frontiers in Psychology 9 (2018), 155.","journal-title":"Frontiers in Psychology"},{"key":"e_1_3_4_70_2","first-page":"1777","volume-title":"Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (ACL)","volume":"1","author":"Mrk\u0161i\u0107 Nikola","year":"2017","unstructured":"Nikola Mrk\u0161i\u0107, Diarmuid O S\u00e9aghdha, Tsung-Hsien Wen, Blaise Thomson, and Steve Young. 2017a. Neural belief tracker: Data-driven dialogue state tracking. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (ACL), Vol. 1, Long Papers, 1777\u20131788."},{"key":"e_1_3_4_71_2","first-page":"108","volume-title":"Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (ACL)","volume":"2","author":"Mrk\u0161i\u0107 Nikola","year":"2018","unstructured":"Nikola Mrk\u0161i\u0107 and Ivan Vuli\u0107. 2018. Fully statistical neural belief tracking. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (ACL), Vol. 2, Short Papers, 108\u2013113."},{"key":"e_1_3_4_72_2","doi-asserted-by":"crossref","first-page":"309","DOI":"10.1162\/tacl_a_00063","article-title":"Semantic specialization of distributional word vector spaces using monolingual and cross-Lingual constraints","volume":"5","author":"Mrk\u0161i\u0107 Nikola","year":"2017","unstructured":"Nikola Mrk\u0161i\u0107, Ivan Vuli\u0107, Diarmuid \u00d3 S\u00e9aghdha, Ira Leviant, Roi Reichart, Milica Ga\u0161i\u0107, Anna Korhonen, and Steve Young. 2017b. Semantic specialization of distributional word vector spaces using monolingual and cross-Lingual constraints. Transactions of the Association for Computational Linguistics 5 (2017), 309\u2013324.","journal-title":"Transactions of the Association for Computational Linguistics"},{"key":"e_1_3_4_73_2","unstructured":"Andr\u00e9 M\u00fcller S\u00f8ren Wichmann Viveka Velupillai Cecil H. Brown Pamela Brown Sebastian Sauppe Eric W. Holman Dik Bakker Johann-Mattis List Dmitri Egorov Oleg Belyaev Robert Mailhammer Matthias Urban Helen Geyer and Anthony Grant. 2010. ASJP World Language Tree of Lexical Similarity: Version 3. Retrieved from https:\/\/asjp.clld.org\/static\/WorldLanguageTree-003.pdf"},{"key":"e_1_3_4_74_2","doi-asserted-by":"crossref","first-page":"341","DOI":"10.1613\/jair.2843","article-title":"Multilingual part-of-speech tagging: Two unsupervised approaches","volume":"36","author":"Naseem Tahira","year":"2009","unstructured":"Tahira Naseem, Benjamin Snyder, Jacob Eisenstein, and Regina Barzilay. 2009. Multilingual part-of-speech tagging: Two unsupervised approaches. Journal of Artificial Intelligence Research 36 (2009), 341\u2013385.","journal-title":"Journal of Artificial Intelligence Research"},{"key":"e_1_3_4_75_2","unstructured":"Jinjie Ni Tom Young Vlad Pandelea Fuzhao Xue Vinay Adiga and Erik Cambria. 2021. Recent advances in deep learning based dialogue systems: A systematic survey. arXiv:2105.04387. Retrieved from https:\/\/arxiv.org\/abs\/2105.04387"},{"key":"e_1_3_4_76_2","doi-asserted-by":"crossref","first-page":"3","DOI":"10.1007\/978-3-319-18111-0_1","volume-title":"Proceedings of the International Conference on Computational Linguistics and Intelligent Text Processing (CICLing)","author":"Nivre Joakim","year":"2015","unstructured":"Joakim Nivre. 2015. Towards a universal grammar for natural language processing. In Proceedings of the International Conference on Computational Linguistics and Intelligent Text Processing (CICLing). Springer, 3\u201316."},{"key":"e_1_3_4_77_2","unstructured":"Elnaz Nouri and Ehsan Hosseini-Asl. 2018. Toward scalable neural dialogue state tracking model. arXiv:1812.00899. Retrieved from https:\/\/arxiv.org\/abs\/1812.00899"},{"key":"e_1_3_4_78_2","doi-asserted-by":"crossref","first-page":"27","DOI":"10.18653\/v1\/W16-1905","volume-title":"Proceedings of the 7th Workshop on Cognitive Aspects of Computational Language Learning (CogACLL Workshop)","author":"Nouri Javad","year":"2016","unstructured":"Javad Nouri and Roman Yangarber. 2016. From alignment of etymological data to phylogenetic inference via population genetics. In Proceedings of the 7th Workshop on Cognitive Aspects of Computational Language Learning (CogACLL Workshop), 27\u201337."},{"key":"e_1_3_4_79_2","first-page":"1","volume-title":"Proceedings of the International Conference Oriental COCOSDA Held Jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA\/CASLRE).","author":"Oco Nathaniel","year":"2013","unstructured":"Nathaniel Oco, Leif Romeritch Syliongka, Rachel Edita Roxas, and Joel Ilao. 2013. Dice's coefficient on trigram profiles as metric for language similarity. In Proceedings of the International Conference Oriental COCOSDA Held Jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA\/CASLRE). IEEE, 1\u20134."},{"key":"e_1_3_4_80_2","first-page":"1297","volume-title":"Proceedings of the 26th International Conference on Computational Linguistics: COLING Technical Papers","author":"O\u2019Horan Helen","year":"2016","unstructured":"Helen O\u2019Horan, Yevgeni Berzak, Ivan Vuli\u0107, Roi Reichart, and Anna Korhonen. 2016. Survey on the use of typological information in natural language processing. In Proceedings of the 26th International Conference on Computational Linguistics: COLING Technical Papers, 1297\u20131308."},{"key":"e_1_3_4_81_2","first-page":"30","volume-title":"Proceedings of the 3rd Workshop on NLP for Conversational AI","author":"Panda Subhadarshi","year":"2021","unstructured":"Subhadarshi Panda, Caglar Tirkaz, Tobias Falke, and Patrick Lehnen. 2021. Multilingual paraphrase generation for bootstrapping new features in task-oriented dialog systems. In Proceedings of the 3rd Workshop on NLP for Conversational AI, 30\u201339."},{"key":"e_1_3_4_82_2","doi-asserted-by":"crossref","first-page":"261","DOI":"10.1162\/tacl_a_00365","article-title":"Morphology matters: A multilingual language modeling analysis","volume":"9","author":"Park Hyunji Hayley","year":"2021","unstructured":"Hyunji Hayley Park, Katherine J Zhang, Coleman Haley, Kenneth Steimel, Han Liu, and Lane Schwartz. 2021. Morphology matters: A multilingual language modeling analysis. Transactions of the Association for Computational Linguistics 9 (2021), 261\u2013276.","journal-title":"Transactions of the Association for Computational Linguistics"},{"issue":"6","key":"e_1_3_4_83_2","doi-asserted-by":"crossref","first-page":"724","DOI":"10.1089\/cmb.2007.R012","article-title":"Efficiently computing the Robinson-Foulds metric","volume":"14","author":"Pattengale Nicholas D.","year":"2007","unstructured":"Nicholas D. Pattengale, Eric J. Gottlieb, and Bernard M. E. Moret. 2007. Efficiently computing the Robinson-Foulds metric. Journal of Computational Biology 14, 6 (2007), 724\u2013735.","journal-title":"Journal of Computational Biology"},{"issue":"4","key":"e_1_3_4_84_2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/2505126","article-title":"How to choose the best pivot language for automatic translation of low-resource languages","volume":"12","author":"Paul Michael","year":"2013","unstructured":"Michael Paul, Andrew Finch, and Eiichrio Sumita. 2013. How to choose the best pivot language for automatic translation of low-resource languages. ACM Transactions on Asian Language Information Processing 12, 4 (2013), 1\u201317.","journal-title":"ACM Transactions on Asian Language Information Processing"},{"key":"e_1_3_4_85_2","volume-title":"Proceedings of the International Conference on SIGIR Workshop on Conversational Interaction Systems","author":"Pei Jiahuan","year":"2019","unstructured":"Jiahuan Pei, Pengjie Ren, and Maarten de Rijke. 2019. A modular task-oriented dialogue system using a neural mixture-of-experts. In Proceedings of the International Conference on SIGIR Workshop on Conversational Interaction Systems."},{"key":"e_1_3_4_86_2","first-page":"1552","volume-title":"Proceedings of the Web Conference","author":"Pei Jiahuan","year":"2021","unstructured":"Jiahuan Pei, Pengjie Ren, and Maarten de Rijke. 2021. A cooperative memory network for personalized task-oriented dialogue systems with incomplete user profiles. In Proceedings of the Web Conference, 1552\u20131561."},{"key":"e_1_3_4_87_2","first-page":"2148","volume-title":"Proceedings of the 24th European Conference on Artificial Intelligence (ECAI \u201920)","author":"Pei Jiahuan","year":"2020","unstructured":"Jiahuan Pei, Pengjie Ren, Christof Monz, and Maarten de Rijke. 2020. Retrospective and prospective mixture-of-generators for task-oriented dialogue response generation. In Proceedings of the 24th European Conference on Artificial Intelligence (ECAI \u201920), 2148\u20132155."},{"key":"e_1_3_4_88_2","doi-asserted-by":"crossref","DOI":"10.1007\/978-3-642-23008-0","volume-title":"Multilingual Information Retrieval","author":"Peters Carol","year":"2012","unstructured":"Carol Peters, Martin Braschler, and Paul Clough. 2012. Multilingual Information Retrieval. Springer-Verlag, Berlin."},{"key":"e_1_3_4_89_2","first-page":"3853","volume-title":"Proceedings of the 29th International Joint Conference on Artificial Intelligence (IJCAI)","author":"Qin Libo","year":"2021","unstructured":"Libo Qin, Minheng Ni, Yue Zhang, and Wanxiang Che. 2021. CoSDA-ML: multi-lingual code-switching data augmentation for zero-shot cross-lingual NLP. In Proceedings of the 29th International Joint Conference on Artificial Intelligence (IJCAI), 3853\u20133860."},{"issue":"8","key":"e_1_3_4_90_2","first-page":"9","article-title":"Language models are unsupervised multitask learners","volume":"1","author":"Radford Alec","year":"2019","unstructured":"Alec Radford, Jeffrey Wu, Rewon Child, David Luan, Dario Amodei, and Ilya Sutskever. 2019. Language models are unsupervised multitask learners. OpenAI Blog 1, 8 (2019), 9.","journal-title":"OpenAI Blog"},{"key":"e_1_3_4_91_2","first-page":"1","article-title":"Exploring the limits of transfer learning with a unified text-to-text transformer","volume":"21","author":"Raffel Colin","year":"2020","unstructured":"Colin Raffel, Noam Shazeer, Adam Roberts, Katherine Lee, Sharan Narang, Michael Matena, Yanqi Zhou, Wei Li, and Peter J Liu. 2020. Exploring the limits of transfer learning with a unified text-to-text transformer. Journal of Machine Learning Research 21 (2020), 1\u201367.","journal-title":"Journal of Machine Learning Research"},{"key":"e_1_3_4_92_2","unstructured":"Evgeniia Razumovskaia Goran Glava\u0161 Olga Majewska Anna Korhonen and Ivan Vulic. 2021. Crossing the conversational chasm: A primer on multilingual task-oriented dialogue systems. arXiv:2104.08570. Retrieved from https:\/\/arxiv.org\/abs\/2104.08570"},{"key":"e_1_3_4_93_2","doi-asserted-by":"crossref","first-page":"44","DOI":"10.18653\/v1\/2022.acl-tutorials.8","volume-title":"Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics: Tutorial Abstracts","author":"Razumovskaia Evgeniia","year":"2022","unstructured":"Evgeniia Razumovskaia, Goran Glava\u0161, Olga Majewska, Edoardo Ponti, and Ivan Vuli\u0107. 2022a. Natural language processing for multilingual task-oriented dialogue. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics: Tutorial Abstracts, 44\u201350."},{"key":"e_1_3_4_94_2","doi-asserted-by":"crossref","first-page":"2017","DOI":"10.18653\/v1\/2022.findings-acl.160","volume-title":"Proceedings of the Findings of the Association for Computational Linguistics (ACL \u201922)","author":"Razumovskaia Evgeniia","year":"2022","unstructured":"Evgeniia Razumovskaia, Ivan Vuli\u0107, and Anna Korhonen. 2022b. Data augmentation and learned layer aggregation for improved multilingual language understanding in dialogue. In Proceedings of the Findings of the Association for Computational Linguistics (ACL \u201922), 2017\u20132033."},{"key":"e_1_3_4_95_2","first-page":"2780","volume-title":"Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP)","author":"Ren Liliang","year":"2018","unstructured":"Liliang Ren, Kaige Xie, Lu Chen, and Kai Yu. 2018. Towards universal dialogue state tracking. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2780\u20132786."},{"issue":"4","key":"e_1_3_4_96_2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3432726","article-title":"Conversations with search engines: SERP-based conversational response generation","volume":"39","author":"Ren Pengjie","year":"2021","unstructured":"Pengjie Ren, Zhumin Chen, Zhaochun Ren, Evangelos Kanoulas, Christof Monz, and Maarten de Rijke. 2021. Conversations with search engines: SERP-based conversational response generation. ACM Transactions on Information Systems 39, 4 (2021), 1\u201329.","journal-title":"ACM Transactions on Information Systems"},{"key":"e_1_3_4_97_2","doi-asserted-by":"crossref","first-page":"569","DOI":"10.1613\/jair.1.11640","article-title":"A survey of cross-lingual word embedding models","volume":"65","author":"Ruder Sebastian","year":"2019","unstructured":"Sebastian Ruder, Ivan Vuli\u0107, and Anders S\u00f8gaard. 2019. A survey of cross-lingual word embedding models. Journal of Artificial Intelligence Research 65 (2019), 569\u2013631.","journal-title":"Journal of Artificial Intelligence Research"},{"key":"e_1_3_4_98_2","doi-asserted-by":"crossref","DOI":"10.1007\/978-1-4020-8825-4","volume-title":"Universals of Language Today","author":"Scalise Sergio","year":"2009","unstructured":"Sergio Scalise, Elisabetta Magni, and Antonietta Bisetto. 2009. Universals of Language Today. Springer."},{"key":"e_1_3_4_99_2","first-page":"3795","volume-title":"Proceedings of NAACL-HLT","author":"Schuster Sebastian","year":"2019","unstructured":"Sebastian Schuster, Sonal Gupta, Rushin Shah, and Mike Lewis. 2019. Cross-lingual transfer learning for multilingual task oriented dialog. In Proceedings of NAACL-HLT, 3795\u20133805."},{"issue":"1","key":"e_1_3_4_100_2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.5087\/dad.2018.101","article-title":"A survey of available corpora for building data-driven dialogue systems","volume":"9","author":"Serban Iulian Vlad","year":"2018","unstructured":"Iulian Vlad Serban, Ryan Lowe, Peter Henderson, Laurent Charlin, and Joelle Pineau. 2018. A survey of available corpora for building data-driven dialogue systems. Dialogue & Discourse 9, 1 (2018), 1\u201349.","journal-title":"Dialogue & Discourse"},{"issue":"6","key":"e_1_3_4_101_2","doi-asserted-by":"crossref","first-page":"68005","DOI":"10.1209\/0295-5075\/81\/68005","article-title":"Indo-European languages tree by Levenshtein distance","volume":"81","author":"Serva Maurizio","year":"2008","unstructured":"Maurizio Serva and Filippo Petroni. 2008. Indo-European languages tree by Levenshtein distance. EPL (Europhysics Letters) 81, 6 (2008), Article 68005.","journal-title":"EPL (Europhysics Letters)"},{"key":"e_1_3_4_102_2","first-page":"8854","volume-title":"Proceedings of the AAAI Conference on Artificial Intelligence","volume":"34","author":"Siddhant Aditya","year":"2020","unstructured":"Aditya Siddhant, Melvin Johnson, Henry Tsai, Naveen Ari, Jason Riesa, Ankur Bapna, Orhan Firat, and Karthik Raman. 2020. Evaluating the cross-lingual effectiveness of massively multilingual neural machine translation. Proceedings of the AAAI Conference on Artificial Intelligence 34 (2020), 8854\u20138861."},{"issue":"2","key":"e_1_3_4_103_2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1007\/978-3-031-02171-8","article-title":"Cross-lingual word embeddings","volume":"12","author":"S\u00f8gaard Anders","year":"2019","unstructured":"Anders S\u00f8gaard, Ivan Vuli\u0107, Sebastian Ruder, and Manaal Faruqui. 2019. Cross-lingual word embeddings. Synthesis Lectures on Human Language Technologies 12, 2 (2019), 1\u2013132.","journal-title":"Synthesis Lectures on Human Language Technologies"},{"key":"e_1_3_4_104_2","doi-asserted-by":"crossref","unstructured":"Georgios P Spithourakis Ivan Vuli\u0107 Micha Lis I\u223cnigo Casanueva and Pawe Budzianowski. 2022. Evi: Multilingual spoken dialogue tasks and dataset for knowledge-based enrolment verification and identification. arXiv:2204.13496. Retrieved from https:\/\/arxiv.org\/abs\/2204.13496","DOI":"10.18653\/v1\/2022.findings-naacl.124"},{"issue":"1","key":"e_1_3_4_105_2","doi-asserted-by":"crossref","first-page":"223","DOI":"10.1075\/avt.24.21tan","article-title":"Mutual intelligibility and similarity of Chinese dialects: Predicting judgments from objective measures","volume":"24","author":"Tang Chaoju","year":"2007","unstructured":"Chaoju Tang and Vincent J. van Heuven. 2007. Mutual intelligibility and similarity of Chinese dialects: Predicting judgments from objective measures. Linguistics in the Netherlands 24, 1 (2007), 223\u2013234.","journal-title":"Linguistics in the Netherlands"},{"key":"e_1_3_4_106_2","volume-title":"Language Typology and Syntactic Description","author":"Thompson Sandra A.","year":"2007","unstructured":"Sandra A. Thompson, Robert E. Longacre, Shin Ja J. Hwang, and Timothy Shopen. 2007. Language Typology and Syntactic Description. Cambridge University Press."},{"key":"e_1_3_4_107_2","unstructured":"Hugo Touvron Louis Martin Kevin Stone Peter Albert Amjad Almahairi Yasmine Babaei Nikolay Bashlykov Soumya Batra Prajjwal Bhargava Shruti Bhosale Dan Bikel Lukas Blecher Cristian Canton Ferrer Moya Chen Guillem Cucurull David Esiobu Jude Fernandes Jeremy Fu Wenyin Fu Brian Fuller Cynthia Gao Vedanuj Goswami Naman Goyal Anthony Hartshorn Saghar Hosseini Rui Hou Hakan Inan Marcin Kardas Viktor Kerkez Madian Khabsa Isabel Kloumann Artem Korenev Punit Singh Koura Marie-Anne Lachaux Thibaut Lavril Jenya Lee Diana Liskovich Yinghai Lu Yuning Mao Xavier Martinet Todor Mihaylov Pushkar Mishra Igor Molybog Yixin Nie Andrew Poulton Jeremy Reizenstein Rashi Rungta Kalyan Saladi Alan Schelten Ruan Silva Eric Michael Smith Ranjan Subramanian Xiaoqing Ellen Tan Binh Tang Ross Taylor Adina Williams Jian Xiang Kuan Puxin Xu Zheng Yan Iliyan Zarov Yuchen Zhang Angela Fan Melanie Kambadur Sharan Narang Aurelien Rodriguez Robert Stojnic Sergey Edunov and Thomas Scialom. 2023. Llama 2: Open foundation and fine-tuned chat models. arXiv:2307.09288."},{"key":"e_1_3_4_108_2","first-page":"6034","volume-title":"Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).","author":"Upadhyay Shyam","year":"2018","unstructured":"Shyam Upadhyay, Manaal Faruqui, Gokhan T\u00fcr, Hakkani-T\u00fcr Dilek, and Larry Heck. 2018. (Almost) zero-shot cross-lingual spoken language understanding. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 6034\u20136038."},{"key":"e_1_3_4_109_2","unstructured":"Phi Nguyen Van Tung Cao Hoang Dung Nguyen Manh Quan Nguyen Minh and Long Tran Quoc. 2022. ViWOZ: A multi-domain task-oriented dialogue systems dataset for low-resource language. arXiv:2203.07742. Retrieved from https:\/\/arxiv.org\/abs\/2203.07742"},{"key":"e_1_3_4_110_2","doi-asserted-by":"crossref","DOI":"10.1075\/z.141","volume-title":"Unity and Diversity of Languages","author":"van Sterkenburg Piet","year":"2008","unstructured":"Piet van Sterkenburg (Ed.). 2008. Unity and Diversity of Languages. John Benjamins Publishing."},{"key":"e_1_3_4_111_2","first-page":"56","volume-title":"Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics","volume":"1","author":"Vuli\u0107 Ivan","year":"2017","unstructured":"Ivan Vuli\u0107, Nikola Mrk\u0161i\u0107, Roi Reichart, Diarmuid \u00d3 S\u00e9aghdha, Steve Young, and Anna Korhonen. 2017. Morph-fitting: Fine-tuning word vector spaces with simple language-specific rules. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, Vol. 1, Long Papers, 56\u201368."},{"key":"e_1_3_4_112_2","unstructured":"Guan Wang Sijie Cheng Xianyuan Zhan Xiangang Li Sen Song and Yang Liu. 2023. Openchat: Advancing open-source language models with mixed-quality data. arXiv:2309.11235. Retrieved from https:\/\/arxiv.org\/abs\/2309.11235"},{"issue":"9","key":"e_1_3_4_113_2","doi-asserted-by":"crossref","first-page":"421","DOI":"10.3390\/info11090421","article-title":"Measurement of text similarity: A survey","volume":"11","author":"Wang Jiapeng","year":"2020","unstructured":"Jiapeng Wang and Yihong Dong. 2020. Measurement of text similarity: A survey. Information 11, 9 (2020), 421.","journal-title":"Information"},{"key":"e_1_3_4_114_2","first-page":"438","volume-title":"Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics","volume":"1","author":"Wen Tsung-Hsien","year":"2017","unstructured":"Tsung-Hsien Wen, David Vandyke, Nikola Mrk\u0161i\u0107, Milica Gasic, Lina M Rojas Barahona, Pei-Hao Su, Stefan Ultes, and Steve Young. 2017. A network-based end-to-end trainable task-oriented dialogue system. In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, Vol. 1, Long Papers, 438\u2013449."},{"key":"e_1_3_4_115_2","volume-title":"Introduction to Typology: The Unity and Diversity of Language","author":"Whaley Lindsay J.","year":"1996","unstructured":"Lindsay J. Whaley. 1996. Introduction to Typology: The Unity and Diversity of Language. SAGE Publications."},{"key":"e_1_3_4_116_2","first-page":"808","volume-title":"Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL)","author":"Wu Chien-Sheng","year":"2019","unstructured":"Chien-Sheng Wu, Andrea Madotto, Ehsan Hosseini-Asl, Caiming Xiong, Richard Socher, and Pascale Fung. 2019. Transferable multi-domain state generator for task-oriented dialogue systems. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL), 808\u2013819."},{"issue":"6","key":"e_1_3_4_117_2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3457571","article-title":"Robust cross-lingual task-oriented dialogue","volume":"20","author":"Xiang Lu","year":"2021","unstructured":"Lu Xiang, Junnan Zhu, Yang Zhao, Yu Zhou, and Chengqing Zong. 2021. Robust cross-lingual task-oriented dialogue. Transactions on Asian and Low-Resource Language Information Processing 20, 6 (2021), 1\u201324.","journal-title":"Transactions on Asian and Low-Resource Language Information Processing"},{"key":"e_1_3_4_118_2","first-page":"1448","volume-title":"Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics","volume":"1","author":"Xu Puyang","year":"2018","unstructured":"Puyang Xu and Qi Hu. 2018. An end-to-end approach for handling unknown slot values in dialogue state tracking. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, Vol. 1, Long Papers, 1448\u20131457."},{"issue":"4","key":"e_1_3_4_119_2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3462207","article-title":"Response ranking with multi-types of deep interactive representations in retrieval-based dialogues","volume":"39","author":"Xu Ruijian","year":"2021","unstructured":"Ruijian Xu, Chongyang Tao, Jiazhan Feng, Wei Wu, Rui Yan, and Dongyan Zhao. 2021. Response ranking with multi-types of deep interactive representations in retrieval-based dialogues. ACM Transactions on Information Systems 39, 4 (2021), 1\u201328.","journal-title":"ACM Transactions on Information Systems"},{"key":"e_1_3_4_120_2","first-page":"5052","volume-title":"Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)","author":"Xu Weijia","year":"2020","unstructured":"Weijia Xu, Batool Haider, and Saab Mansour. 2020. End-to-end slot alignment and recognition for cross-lingual NLU. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), 5052\u20135063."},{"key":"e_1_3_4_121_2","volume-title":"Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT)","author":"Xue Linting","year":"2021","unstructured":"Linting Xue, Noah Constant, Adam Roberts, Mihir Kale, Rami Al-Rfou, Aditya Siddhant, Aditya Barua, and Colin Raffel. 2021. mT5: A massively multilingual pre-trained text-to-text transformer. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT)."},{"key":"e_1_3_4_122_2","first-page":"3013","volume-title":"Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval","author":"Yan Guojun","year":"2022","unstructured":"Guojun Yan, Jiahuan Pei, Pengjie Ren, Zhaochun Ren, Xin Xin, Huasheng Liang, Maarten de Rijke, and Zhumin Chen. 2022. ReMeDi: Resources for multi-domain, multi-service, medical dialogues. In Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, 3013\u20133024."},{"issue":"4","key":"e_1_3_4_123_2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3470450","article-title":"Multi-response awareness for retrieval-based conversations: Respond with diversity via dynamic representation learning","volume":"39","author":"Yan Rui","year":"2021","unstructured":"Rui Yan, Weiheng Liao, Dongyan Zhao, and Ji-Rong Wen. 2021. Multi-response awareness for retrieval-based conversations: Respond with diversity via dynamic representation learning. ACM Transactions on Information Systems 39, 4 (2021), 1\u201329.","journal-title":"ACM Transactions on Information Systems"},{"key":"e_1_3_4_124_2","first-page":"9474","volume-title":"Proceedings of the AAAI Conference on Artificial Intelligence","volume":"34","author":"Yin Yichun","year":"2020","unstructured":"Yichun Yin, Lifeng Shang, Xin Jiang, Xiao Chen, and Qun Liu. 2020. Dialog state tracking with reinforced data augmentation. Proceedings of the AAAI Conference on Artificial Intelligence 34 (2020), 9474\u20139481."},{"issue":"10","key":"e_1_3_4_125_2","doi-asserted-by":"crossref","first-page":"2011","DOI":"10.1007\/s11431-020-1692-3","article-title":"Recent advances and challenges in task-oriented dialog systems","volume":"63","author":"Zhang Zheng","year":"2020","unstructured":"Zheng Zhang, Ryuichi Takanobu, Qi Zhu, MinLie Huang, and XiaoYan Zhu. 2020. Recent advances and challenges in task-oriented dialog systems. Science China Technological Sciences 63, 10 (2020), 2011\u20132027.","journal-title":"Science China Technological Sciences"},{"key":"e_1_3_4_126_2","unstructured":"Wayne Xin Zhao Kun Zhou Junyi Li Tianyi Tang Xiaolei Wang Yupeng Hou Yingqian Min Beichen Zhang Junjie Zhang Zican Dong Yifan Du Chen Yang Yushuo Chen Zhipeng Chen Jinhao Jiang Ruiyang Ren Yifan Li Xinyu Tang Zikang Liu Peiyu Liu Jian-Yun Nie and Ji-Rong Wen. 2023. A survey of large language models. arXiv:2303.18223. Retrieved from https:\/\/arxiv.org\/abs\/2303.18223"},{"key":"e_1_3_4_127_2","first-page":"3637","volume-title":"Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)","author":"Zhao Zijian","year":"2019","unstructured":"Zijian Zhao, Su Zhu, and Kai Yu. 2019. Data augmentation with atomic templates for spoken language understanding. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), 3637\u20133643."},{"key":"e_1_3_4_128_2","first-page":"1458","volume-title":"Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics","volume":"1","author":"Zhong Victor","year":"2018","unstructured":"Victor Zhong, Caiming Xiong, and Richard Socher. 2018. Global-locally self-attentive encoder for dialogue state tracking. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, Vol. 1, Long Papers, 1458\u20131467."},{"key":"e_1_3_4_129_2","doi-asserted-by":"crossref","unstructured":"Han Zhou Ignacio Iacobacci and Pasquale Minervini. 2022. XQA-DST: Multi-domain and multi-lingual dialogue state tracking. arXiv:2204.05895. Retrieved from https:\/\/arxiv.org\/abs\/2204.05895","DOI":"10.18653\/v1\/2023.findings-eacl.73"},{"key":"e_1_3_4_130_2","unstructured":"Lei Zuo Kun Qian Bowen Yang and Zhou Yu. 2021. AllWOZ: Towards multilingual task-oriented dialog systems for all. arXiv:2112.08333. Retrieved from https:\/\/arxiv.org\/abs\/2112.08333"}],"container-title":["ACM Transactions on Information Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3676956","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3676956","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T01:19:12Z","timestamp":1750295952000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3676956"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,10,22]]},"references-count":129,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2024,11,30]]}},"alternative-id":["10.1145\/3676956"],"URL":"https:\/\/doi.org\/10.1145\/3676956","relation":{},"ISSN":["1046-8188","1558-2868"],"issn-type":[{"value":"1046-8188","type":"print"},{"value":"1558-2868","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,10,22]]},"assertion":[{"value":"2023-03-10","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2024-06-24","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2024-10-22","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}