{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,10]],"date-time":"2026-04-10T21:43:32Z","timestamp":1775857412740,"version":"3.50.1"},"reference-count":160,"publisher":"Association for Computing Machinery (ACM)","issue":"10","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Comput. Surv."],"published-print":{"date-parts":[[2026,7,31]]},"abstract":"<jats:p>Recent advancements in conversational systems have significantly enhanced human-machine interactions across various domains. However, training these systems is challenging due to the scarcity of specialized dialogue data. Traditionally, conversational datasets were created through crowdsourcing, but this method has proven costly, limited in scale, and labor-intensive. As a solution, the development of synthetic dialogue data has emerged, utilizing techniques to augment existing datasets or convert textual resources into conversational formats, providing a more efficient and scalable approach to dataset creation. In this survey, we offer a systematic and comprehensive review of multi-turn conversational data generation, focusing on three types of dialogue systems: open domain, task-oriented, and information-seeking. We categorize the existing research based on key components like seed data creation, utterance generation, and quality filtering methods, and introduce a general framework that outlines the main principles of conversation data generation systems. Additionally, we examine the evaluation metrics and methods for assessing synthetic conversational data, address current challenges in the field, and explore potential directions for future research. Our goal is to accelerate progress for researchers and practitioners by presenting an overview of state-of-the-art methods and highlighting opportunities to further research in this area.<\/jats:p>","DOI":"10.1145\/3795686","type":"journal-article","created":{"date-parts":[[2026,2,19]],"date-time":"2026-02-19T11:52:12Z","timestamp":1771501932000},"page":"1-37","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["A Survey on Recent Advances in Conversational Data Generation"],"prefix":"10.1145","volume":"58","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-0393-8662","authenticated-orcid":false,"given":"Heydar","family":"Soudani","sequence":"first","affiliation":[{"name":"Radboud Universiteit","place":["Nijmegen, Netherlands"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2617-205X","authenticated-orcid":false,"given":"Roxana","family":"Petcu","sequence":"additional","affiliation":[{"name":"University of Amsterdam","place":["Amsterdam, Netherlands"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8312-0694","authenticated-orcid":false,"given":"Evangelos","family":"Kanoulas","sequence":"additional","affiliation":[{"name":"University of Amsterdam","place":["Amsterdam, Netherlands"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0006-9986-482X","authenticated-orcid":false,"given":"Faegheh","family":"Hasibi","sequence":"additional","affiliation":[{"name":"Radboud Universiteit","place":["Nijmegen, Netherlands"]}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2026,4,1]]},"reference":[{"key":"e_1_3_3_2_2","doi-asserted-by":"publisher","DOI":"10.1145\/3726302.3730316"},{"key":"e_1_3_3_3_2","doi-asserted-by":"publisher","DOI":"10.1145\/3616855.3635856"},{"key":"e_1_3_3_4_2","doi-asserted-by":"publisher","DOI":"10.1162\/TACL_A_00471"},{"key":"e_1_3_3_5_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2023.findings-emnlp.166"},{"key":"e_1_3_3_6_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-1620"},{"key":"e_1_3_3_7_2","doi-asserted-by":"publisher","DOI":"10.1145\/3626772.3657860"},{"key":"e_1_3_3_8_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.naacl-main.44"},{"key":"e_1_3_3_9_2","doi-asserted-by":"crossref","unstructured":"Arian Askari Roxana Petcu Chuan Meng Mohammad Aliannejadi Amin Abolghasemi Evangelos Kanoulas and Suzan Verberne. 2025. SOLID: Self-seeding and multi-intent self-instructing LLMs for generating intent-aware information-seeking dialogs. NAACL 2025 (2025) 6375\u20136395.","DOI":"10.18653\/v1\/2025.findings-naacl.357"},{"key":"e_1_3_3_10_2","doi-asserted-by":"publisher","DOI":"10.1145\/3726302.3730348"},{"key":"e_1_3_3_11_2","doi-asserted-by":"publisher","DOI":"10.1561\/1500000098"},{"key":"e_1_3_3_12_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2023.acl-long.608"},{"key":"e_1_3_3_13_2","doi-asserted-by":"publisher","DOI":"10.1145\/3539618.3591883"},{"key":"e_1_3_3_14_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D18-1547"},{"key":"e_1_3_3_15_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D18-1547"},{"key":"e_1_3_3_16_2","doi-asserted-by":"publisher","DOI":"10.1145\/3539618.3591881"},{"key":"e_1_3_3_17_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2023.findings-eacl.63"},{"key":"e_1_3_3_18_2","doi-asserted-by":"publisher","unstructured":"Maximillian Chen Alexandros Papangelis Chenyang Tao Andy Rosenbaum Seokhwan Kim Yang Liu Zhou Yu and Dilek Hakkani-Tur. 2022. Weakly supervised data augmentation through prompting for dialogue understanding. CoRR abs\/2210.14169 (2022). 10.48550\/ARXIV.2210.14169","DOI":"10.48550\/ARXIV.2210.14169"},{"key":"e_1_3_3_19_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2023.acl-short.82"},{"key":"e_1_3_3_20_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D18-1241"},{"key":"e_1_3_3_21_2","doi-asserted-by":"crossref","unstructured":"Haixing Dai Zhengliang Liu Wenxiong Liao Xiaoke Huang Yihan Cao Zihao Wu Lin Zhao Shaochen Xu Fang Zeng Wei Liu Ninghao Liu Sheng Li Dajiang Zhu Hongmin Cai Lichao Sun Quanzheng Li Dinggang Shen Tianming Liu and Xiang Li. 2025. AugGPT: Leveraging ChatGPT for text data augmentation. IEEE Trans. Big Data 11 3 (2025) 907\u2013918.","DOI":"10.1109\/TBDATA.2025.3536934"},{"key":"e_1_3_3_22_2","first-page":"4558","volume-title":"Proceedings of the 39th International Conference on Machine Learning (ICML\u201922) (Proceedings of Machine Learning Research, Vol. 162).","author":"Dai Zhuyun","year":"2022","unstructured":"Zhuyun Dai, Arun Tejasvi Chaganty, Vincent Y. Zhao, Aida Amini, Qazi Mamunur Rashid, Mike Green, and Kelvin Guu. 2022. Dialog inpainting: Turning documents into dialogs. In Proceedings of the 39th International Conference on Machine Learning (ICML\u201922) (Proceedings of Machine Learning Research, Vol. 162).4558\u20134586."},{"key":"e_1_3_3_23_2","volume-title":"Proceedings of the 29th Text Retrieval Conference, TREC (NIST Special Publication, Vol. 1266).","author":"Dalton Jeffrey","year":"2020","unstructured":"Jeffrey Dalton, Chenyan Xiong, and Jamie Callan. 2020. CAsT 2020: The conversational assistance track overview. In Proceedings of the 29th Text Retrieval Conference, TREC (NIST Special Publication, Vol. 1266).National Institute of Standards and Technology (NIST)."},{"key":"e_1_3_3_24_2","doi-asserted-by":"publisher","DOI":"10.1145\/3397271.3401206"},{"key":"e_1_3_3_25_2","unstructured":"Mar\u00eda \u00c0ngels de Luis Balaguer Vinamra Benara Renato Luiz de Freitas Cunha Roberto de M. Estev\u00e3o Filho Todd Hendry Daniel Holstein Jennifer Marsman Nick Mecklenburg Sara Malvar Leonardo O. Nunes et\u00a0al. 2024. RAG vs fine-tuning: Pipelines tradeoffs and a case study on agriculture. arXiv:2401.08406. Retrieved from https:\/\/arxiv.org\/abs\/2401.08406"},{"key":"e_1_3_3_26_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2023.acl-tutorials.1"},{"key":"e_1_3_3_27_2","doi-asserted-by":"crossref","unstructured":"Yang Deng Wenqiang Lei Lizi Liao and Tat-Seng Chua. 2023. Prompting and evaluating large language models for proactive dialogues: Clarification target-guided and non-collaboration. In Findings of the Association for Computational Linguistics: EMNLP (Findings of ACL Vol. EMNLP 2023). Association for Computational Linguistics 10602\u201310621.","DOI":"10.18653\/v1\/2023.findings-emnlp.711"},{"key":"e_1_3_3_28_2","doi-asserted-by":"publisher","DOI":"10.1145\/3715097"},{"key":"e_1_3_3_29_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10462-020-09866-x"},{"key":"e_1_3_3_30_2","doi-asserted-by":"publisher","DOI":"10.1145\/3731120.3744588"},{"key":"e_1_3_3_31_2","doi-asserted-by":"crossref","unstructured":"Emily Dinan Varvara Logacheva Valentin Malykh Alexander H. Miller Kurt Shuster Jack Urbanek Douwe Kiela Arthur Szlam Iulian Serban Ryan Lowe et\u00a0al. 2019. The second conversational intelligence challenge (ConvAI2). arXiv:1902.00098. Retrieved from https:\/\/arxiv.org\/abs\/1902.00098","DOI":"10.1007\/978-3-030-29135-8_7"},{"key":"e_1_3_3_32_2","volume-title":"Proceedings of the 7th International Conference on Learning Representations","author":"Dinan Emily","year":"2019","unstructured":"Emily Dinan, Stephen Roller, Kurt Shuster, Angela Fan, Michael Auli, and Jason Weston. 2019. Wizard of wikipedia: Knowledge-powered conversational agents. In Proceedings of the 7th International Conference on Learning Representations."},{"key":"e_1_3_3_33_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2023.emnlp-main.183"},{"key":"e_1_3_3_34_2","first-page":"580","volume-title":"Proceedings of the COLING","author":"Do Xuan Long","year":"2022","unstructured":"Xuan Long Do, Bowei Zou, Liangming Pan, Nancy F. Chen, Shafiq Joty, and Ai Ti Aw. 2022. CoHS-CQG: Context and history selection for conversational question generation. In Proceedings of the COLING. 580\u2013591."},{"key":"e_1_3_3_35_2","unstructured":"Yihao Fang Xianzhi Li Stephen W. Thomas and Xiaodan Zhu. 2023. ChatGPT as Data Augmentation for Compositional Generalization: A Case Study in Open Intent Detection. arXiv:2308.13517. Retrieved from https:\/\/arxiv.org\/abs\/2308.13517"},{"key":"e_1_3_3_36_2","doi-asserted-by":"crossref","unstructured":"Ryan Fellows Hisham Ihshaish Steve Battle Ciaran Haines Peter Mayhew and Juan Ignacio Deza. 2021. Task-oriented dialogue systems: Performance vs. quality-optima a review. arXiv:2112.11176. Retrieved from https:\/\/arxiv.org\/abs\/2112.11176","DOI":"10.5121\/csit.2022.121306"},{"key":"e_1_3_3_37_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v34i09.7089"},{"key":"e_1_3_3_38_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.emnlp-main.498"},{"key":"e_1_3_3_39_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-1480"},{"key":"e_1_3_3_40_2","doi-asserted-by":"publisher","DOI":"10.21437\/Interspeech.2006-160"},{"key":"e_1_3_3_41_2","doi-asserted-by":"publisher","DOI":"10.1145\/3409256.3409834"},{"key":"e_1_3_3_42_2","doi-asserted-by":"publisher","DOI":"10.1037\/0003-066X.48.1.26"},{"key":"e_1_3_3_43_2","doi-asserted-by":"publisher","DOI":"10.21437\/Interspeech.2019-3079"},{"key":"e_1_3_3_44_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.eacl-main.177"},{"key":"e_1_3_3_45_2","unstructured":"Aman Gupta Anup Shirgaonkar Angels de Luis Balaguer Bruno Silva Daniel Holstein Dawei Li Jennifer Marsman Leonardo O. Nunes Mahsa Rouzbahman Morris Sharp et\u00a0al. 2024. RAG vs fine-tuning: Pipelines tradeoffs and a case study on agriculture. arXiv:2401.08406. Retrieved from https:\/\/arxiv.org\/abs\/2401.08406"},{"key":"e_1_3_3_46_2","doi-asserted-by":"publisher","DOI":"10.1109\/SLT.2018.8639652"},{"key":"e_1_3_3_47_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D18-1256"},{"key":"e_1_3_3_48_2","first-page":"263","volume-title":"Proceedings of the 15th Annual Meeting of the Special Interest Group on Discourse and Dialogue","author":"Henderson Matthew","year":"2014","unstructured":"Matthew Henderson, Blaise Thomson, and Jason D. Williams. 2014. The second dialog state tracking challenge. In Proceedings of the 15th Annual Meeting of the Special Interest Group on Discourse and Dialogue. 263\u2013272."},{"key":"e_1_3_3_49_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2023.sigdial-1.34"},{"key":"e_1_3_3_50_2","doi-asserted-by":"publisher","DOI":"10.1145\/3703155"},{"key":"e_1_3_3_51_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.aacl-short.22"},{"key":"e_1_3_3_52_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.aacl-short.22"},{"key":"e_1_3_3_53_2","first-page":"1636","volume-title":"Proceedings of the COLING","author":"Hwang Seonjeong","year":"2022","unstructured":"Seonjeong Hwang and Gary Lee. 2022. Conversational QA dataset generation with answer revision. In Proceedings of the COLING. 1636\u20131644."},{"key":"e_1_3_3_54_2","volume-title":"Proceedings of the International Conference on Learning Representations","author":"Jaderberg Max","year":"2017","unstructured":"Max Jaderberg, Volodymyr Mnih, Wojciech Marian Czarnecki, Tom Schaul, Joel Z. Leibo, David Silver, and Koray Kavukcuoglu. 2017. Reinforcement learning with unsupervised auxiliary tasks. In Proceedings of the International Conference on Learning Representations."},{"key":"e_1_3_3_55_2","doi-asserted-by":"crossref","unstructured":"Pegah Jandaghi XiangHai Sheng Xinyi Bai Jay Pujara and Hakim Sidahmed. 2024. Faithful persona-based conversational dataset generation with large language models. In Findings of the Association for Computational Linguistics ACL (Findings of ACL Vol. ACL 2024). Association for Computational Linguistics 15245\u201315270.","DOI":"10.18653\/v1\/2024.findings-acl.904"},{"key":"e_1_3_3_56_2","doi-asserted-by":"publisher","DOI":"10.1145\/3626772.3657815"},{"key":"e_1_3_3_57_2","doi-asserted-by":"publisher","DOI":"10.1145\/3511808.3557667"},{"key":"e_1_3_3_58_2","doi-asserted-by":"publisher","unstructured":"Hideaki Joko and Faegheh Hasibi. 2025. FACE: A fine-grained reference free evaluator for conversational recommender systems. CoRR abs\/2506.00314 (2025). 10.48550\/ARXIV.2506.00314","DOI":"10.48550\/ARXIV.2506.00314"},{"key":"e_1_3_3_59_2","doi-asserted-by":"publisher","DOI":"10.1145\/3404835.3463258"},{"key":"e_1_3_3_60_2","doi-asserted-by":"publisher","DOI":"10.1145\/3404835.3462859"},{"key":"e_1_3_3_61_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.emnlp-main.151"},{"key":"e_1_3_3_62_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2023.emnlp-main.799"},{"key":"e_1_3_3_63_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.emnlp-main.344"},{"key":"e_1_3_3_64_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.acl-long.287"},{"key":"e_1_3_3_65_2","first-page":"435","volume-title":"Proceedings of the International Workshop on Spoken Dialogue Systems","author":"Kim Seokhwan","year":"2016","unstructured":"Seokhwan Kim, Luis Fernando D\u2019Haro, Rafael E. Banchs, Jason D. Williams, and Matthew Henderson. 2016. The fourth dialog state tracking challenge. In Proceedings of the International Workshop on Spoken Dialogue Systems. 435\u2013449."},{"key":"e_1_3_3_66_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/W18-5007"},{"key":"e_1_3_3_67_2","first-page":"29","volume-title":"Proceedings of the 1st Workshop on Customized Chat Grounding Persona and Knowledge","author":"Lee Young-Jun","year":"2022","unstructured":"Young-Jun Lee, Chae-Gyun Lim, Yunsu Choi, Ji-Hui Lm, and Ho-Jin Choi. 2022. PERSONACHATGEN: Generating personalized dialogues using GPT-3. In Proceedings of the 1st Workshop on Customized Chat Grounding Persona and Knowledge. 29\u201348."},{"key":"e_1_3_3_68_2","unstructured":"Megan Leszczynski Shu Zhang Ravi Ganti Krisztian Balog Filip Radlinski Fernando Pereira and Arun Tejasvi Chaganty. 2023. Talk the walk: Synthetic data generation for conversational music recommendation. arXiv:2301.11489. Retrieved from https:\/\/arxiv.org\/abs\/2301.11489"},{"key":"e_1_3_3_69_2","volume-title":"Proceedings of the 2nd Workshop on Interactive Learning for Natural Language Processing at NeurIPS","author":"Leszczynski Megan Eileen","year":"2022","unstructured":"Megan Eileen Leszczynski, Ravi Ganti, Shu Zhang, Krisztian Balog, Filip Radlinski, Fernando Pereira, and Arun Tejasvi Chaganty. 2022. Conversational music retrieval with synthetic data. In Proceedings of the 2nd Workshop on Interactive Learning for Natural Language Processing at NeurIPS."},{"key":"e_1_3_3_70_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.acl-main.703"},{"key":"e_1_3_3_71_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/N16-1014"},{"key":"e_1_3_3_72_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2023.findings-acl.224"},{"key":"e_1_3_3_73_2","unstructured":"Xiujun Li Zachary C. Lipton Bhuwan Dhingra Lihong Li Jianfeng Gao and Yun-Nung Chen. 2016. A user simulator for task-completion dialogues. arXiv:1612.05688. Retrieved from https:\/\/arxiv.org\/abs\/1612.05688"},{"key":"e_1_3_3_74_2","first-page":"986","volume-title":"Proceedings of the 8th International Joint Conference on Natural Language Processing","author":"Li Yanran","year":"2017","unstructured":"Yanran Li, Hui Su, Xiaoyu Shen, Wenjie Li, Ziqiang Cao, and Shuzi Niu. 2017. DailyDialog: A manually labelled multi-turn dialogue dataset. In Proceedings of the 8th International Joint Conference on Natural Language Processing. 986\u2013995."},{"key":"e_1_3_3_75_2","doi-asserted-by":"publisher","unstructured":"Yiming Li and Zhao Zhang. 2025. The first place solution of WSDM Cup 2024: Leveraging large language models for conversational Multi-Doc QA. CoRR abs\/2402.18385 (2024). 10.48550\/ARXIV.2402.18385","DOI":"10.48550\/ARXIV.2402.18385"},{"key":"e_1_3_3_76_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.findings-emnlp.318"},{"key":"e_1_3_3_77_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.sigdial-1.47"},{"key":"e_1_3_3_78_2","doi-asserted-by":"crossref","unstructured":"I-Fan Lin Faegheh Hasibi and Suzan Verberne. 2024. Generate then refine: Data augmentation for zero-shot intent detection. In Findings of the Association for Computational Linguistics: EMNLP (Findings of ACL Vol. EMNLP 2024). Association for Computational Linguistics 13138\u201313146.","DOI":"10.18653\/v1\/2024.findings-emnlp.768"},{"key":"e_1_3_3_79_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.sigdial-1.3"},{"key":"e_1_3_3_80_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2023.findings-acl.18"},{"key":"e_1_3_3_81_2","first-page":"2401","volume-title":"Proceedings of the Web Conference Companion","author":"Liu Ben","year":"2025","unstructured":"Ben Liu, Jihai Zhang, Fangquan Lin, Xu Jia, and Min Peng. 2025. One size doesn\u2019t fit all: A personalized conversational tutoring agent for mathematics instruction. In Proceedings of the Web Conference Companion. 2401\u20132410."},{"key":"e_1_3_3_82_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.acl-long.269"},{"key":"e_1_3_3_83_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2023.emnlp-main.153"},{"key":"e_1_3_3_84_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.emnlp-main.356"},{"key":"e_1_3_3_85_2","volume-title":"Proceedings of the Advances in Neural Information Processing Systems 36","author":"Madaan Aman","year":"2023","unstructured":"Aman Madaan, Niket Tandon, Prakhar Gupta, Skyler Hallinan, Luyu Gao, Sarah Wiegreffe, Uri Alon, Nouha Dziri, Shrimai Prabhumoye, Yiming Yang, et\u00a0al. 2023. Self-refine: Iterative refinement with self-feedback. In Proceedings of the Advances in Neural Information Processing Systems 36."},{"key":"e_1_3_3_86_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2023.acl-long.546"},{"key":"e_1_3_3_87_2","first-page":"1","volume-title":"Proceedings of the Joint Workshop on Interoperable Semantic Annotation","author":"Manuvirakurike Ramesh","year":"2018","unstructured":"Ramesh Manuvirakurike, Jacqueline Brixey, Trung Bui, Walter Chang, Ron Artstein, and Kallirroi Georgila. 2018. DialEdit: Annotations for spoken conversational image editing. In Proceedings of the Joint Workshop on Interoperable Semantic Annotation. 1\u20139."},{"key":"e_1_3_3_88_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.emnlp-main.190"},{"key":"e_1_3_3_89_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.acl-main.64"},{"key":"e_1_3_3_90_2","doi-asserted-by":"publisher","DOI":"10.1109\/TSA.2003.814380"},{"key":"e_1_3_3_91_2","doi-asserted-by":"publisher","DOI":"10.1145\/3589335.3651940"},{"key":"e_1_3_3_92_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.findings-emnlp.103"},{"key":"e_1_3_3_93_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2025.acl-long.504"},{"key":"e_1_3_3_94_2","doi-asserted-by":"publisher","DOI":"10.1145\/511446.511457"},{"key":"e_1_3_3_95_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10462-022-10248-8"},{"key":"e_1_3_3_96_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10462-022-10248-8"},{"key":"e_1_3_3_97_2","doi-asserted-by":"crossref","unstructured":"Alexandra Olteanu Jean Garcia-Gathright Maarten de Rijke Michael D. Ekstrand Adam Roegiest Aldo Lipani Alex Beutel Ana Lucic Ana-Andreea Stoica Anubrata Das Asia Biega Bart Voorn Claudia Hauff Damiano Spina David D. Lewis Douglas W. Oard Emine Yilmaz Faegheh Hasibi Gabriella Kazai Graham McDonald Hinda Haned Iadh Ounis Ilse van der Linden Joris Baan Kamuela N. Lau Krisztian Balog Mahmoud F. Sayed Maria Panteli Mark Sanderson Matthew Lease Preethi Lahoti and Toshihiro Kamishima. 2019. FACTS-IR: Fairness accountability confidentiality transparency and safety in information retrieval. SIGIR Forum 53 2 (2019) 20\u201343.","DOI":"10.1145\/3458553.3458556"},{"key":"e_1_3_3_98_2","unstructured":"Long Ouyang Jeffrey Wu Xu Jiang Diogo Almeida Carroll L. Wainwright Pamela Mishkin Chong Zhang Sandhini Agarwal Katarina Slama Alex Ray John Schulman Jacob Hilton Fraser Kelton Luke Miller Maddie Simens Amanda Askell Peter Welinder Paul F. Christiano Jan Leike and Ryan Lowe. 2022. Training language models to follow instructions with human feedback. In Processing of the Advances in Neural Information Systems 35: Annual Conference on Neural Information Processing Systems."},{"key":"e_1_3_3_99_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2024.emnlp-main.15"},{"key":"e_1_3_3_100_2","volume-title":"Proceedings of the 31st Text REtrieval Conference, TREC (NIST Special Publication, Vol. 500-338).","author":"Owoicho Paul","year":"2022","unstructured":"Paul Owoicho, Jeff Dalton, Mohammad Aliannejadi, Leif Azzopardi, Johanne R. Trippas, and Svitlana Vakulenko. 2022. TREC CAsT 2022: Going beyond user ask and system retrieve with initiative and response generation. In Proceedings of the 31st Text REtrieval Conference, TREC (NIST Special Publication, Vol. 500-338).National Institute of Standards and Technology (NIST)."},{"key":"e_1_3_3_101_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-1203"},{"key":"e_1_3_3_102_2","volume-title":"Proceedings of the Advances in Neural Information Processing Systems","author":"Panickssery Arjun","year":"2024","unstructured":"Arjun Panickssery, Samuel R. Bowman, and Shi Feng. 2024. LLM evaluators recognize and favor their own generations. In Proceedings of the Advances in Neural Information Processing Systems."},{"key":"e_1_3_3_103_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/N18-1162"},{"key":"e_1_3_3_104_2","unstructured":"Gustavo Penha Alexandru Balan and Claudia Hauff. 2019. Introducing MANtIS: A novel multi-domain information seeking dialogues dataset. arXiv:1912.04639. Retrieved from https:\/\/arxiv.org\/abs\/1912.04639"},{"key":"e_1_3_3_105_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2023.findings-acl.385"},{"key":"e_1_3_3_106_2","doi-asserted-by":"publisher","DOI":"10.1145\/3209978.3210124"},{"key":"e_1_3_3_107_2","first-page":"140:1\u2013140:67","article-title":"Exploring the limits of transfer learning with a unified text-to-text transformer","volume":"21","author":"Raffel Colin","year":"2020","unstructured":"Colin Raffel, Noam Shazeer, Adam Roberts, Katherine Lee, Sharan Narang, Michael Matena, Yanqi Zhou, Wei Li, and Peter J. Liu. 2020. Exploring the limits of transfer learning with a unified text-to-text transformer. Journal of Machine Learning Research 21 (2020), 140:1\u2013140:67. https:\/\/jmlr.org\/papers\/v21\/20-074.html","journal-title":"Journal of Machine Learning Research"},{"key":"e_1_3_3_108_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-1534"},{"key":"e_1_3_3_109_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v34i05.6394"},{"key":"e_1_3_3_110_2","doi-asserted-by":"publisher","DOI":"10.1162\/TACL_A_00266"},{"key":"e_1_3_3_111_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D19-1410"},{"key":"e_1_3_3_112_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2023.acl-long.425"},{"key":"e_1_3_3_113_2","unstructured":"Chris Samarinas Pracha Promthaw Atharva Nijasure Hansi Zeng Julian Killingback and Hamed Zamani. 2024. Simulating task-oriented dialogues with state transition graphs and large language models. arXiv:2404.14772. Retrieved from https:\/\/arxiv.org\/abs\/2404.14772"},{"key":"e_1_3_3_114_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.nlp4convai-1.10"},{"key":"e_1_3_3_115_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2007.sigdial-1.48"},{"key":"e_1_3_3_116_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/N19-1380"},{"key":"e_1_3_3_117_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.acl-long.194"},{"key":"e_1_3_3_118_2","unstructured":"Pararth Shah Dilek Hakkani-T\u00fcr G\u00f6khan T\u00fcr Abhinav Rastogi Ankur Bapna Neha Nayak and Larry P. Heck. 2018. Building a conversational agent overnight with dialogue self-play. CoRR abs\/1801.04871. http:\/\/arxiv.org\/abs\/1801.04871"},{"key":"e_1_3_3_119_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.nlp4convai-1.8"},{"key":"e_1_3_3_120_2","doi-asserted-by":"publisher","DOI":"10.1145\/3726302.3730130"},{"key":"e_1_3_3_121_2","doi-asserted-by":"publisher","DOI":"10.1145\/3583780.3615291"},{"key":"e_1_3_3_122_2","doi-asserted-by":"publisher","DOI":"10.1145\/3673791.3698415"},{"key":"e_1_3_3_123_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2025.findings-acl.852"},{"key":"e_1_3_3_124_2","unstructured":"Heydar Soudani Hamed Zamani and Faegheh Hasibi. 2025. Uncertainty quantification for retrieval-augmented reasoning. arXiv:2510.11483. Retrieved from https:\/\/arxiv.org\/abs\/2510.11483"},{"key":"e_1_3_3_125_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-1565"},{"key":"e_1_3_3_126_2","doi-asserted-by":"publisher","unstructured":"Silvia Terragni Modestas Filipavicius Nghia Khau Bruna Guedes Andr\u00e9 Manso and Roland Mathis. 2023. In-context learning user simulators for task-oriented dialog systems. CoRR abs\/2306.00774. 10.48550\/ARXIV.2306.00774","DOI":"10.48550\/ARXIV.2306.00774"},{"key":"e_1_3_3_127_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.acl-long.13"},{"key":"e_1_3_3_128_2","doi-asserted-by":"publisher","DOI":"10.1162\/tacl_a_00737"},{"key":"e_1_3_3_129_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.findings-emnlp.277"},{"key":"e_1_3_3_130_2","unstructured":"Ben Wang and Aran Komatsuzaki. 2021. GPT-J-6B: A 6 Billion Parameter Autoregressive Language Model."},{"key":"e_1_3_3_131_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2023.emnlp-main.72"},{"key":"e_1_3_3_132_2","unstructured":"Weiyan Wang Yuxiang Wu Yu Zhang Zhongqi Lu Kaixiang Mo and Qiang Yang. 2017. Integrating user and agent models: A deep task-oriented dialogue system. arXiv:1711.03697. Retrieved from https:\/\/arxiv.org\/abs\/1711.03697"},{"key":"e_1_3_3_133_2","unstructured":"Koki Wataoka Tsubasa Takahashi and Ryokan Ri. 2024. Self-preference bias in LLM-as-a-judge. arXiv:2410.21819. Retrieved from https:\/\/arxiv.org\/abs\/2410.21819"},{"key":"e_1_3_3_134_2","doi-asserted-by":"publisher","DOI":"10.1145\/365153.365168"},{"key":"e_1_3_3_135_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/E17-1042"},{"key":"e_1_3_3_136_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/E17-1042"},{"key":"e_1_3_3_137_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/E17-1042"},{"key":"e_1_3_3_138_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.naacl-main.341"},{"key":"e_1_3_3_139_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.sigdial-1.21"},{"key":"e_1_3_3_140_2","doi-asserted-by":"crossref","unstructured":"Zeqiu Wu Ryu Parish Hao Cheng Sewon Min Prithviraj Ammanabrolu Mari Ostendorf and Hannaneh Hajishirzi. 2023. INSCIT: Information-seeking conversations with mixed-initiative interactions. Trans. Assoc. Comput. Linguistics 11 (2023) 453\u2013468.","DOI":"10.1162\/tacl_a_00559"},{"key":"e_1_3_3_141_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2023.emnlp-main.385"},{"key":"e_1_3_3_142_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2023.acl-long.758"},{"key":"e_1_3_3_143_2","doi-asserted-by":"publisher","DOI":"10.1145\/3209978.3210011"},{"key":"e_1_3_3_144_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.emnlp-main.147"},{"key":"e_1_3_3_145_2","first-page":"1008","volume-title":"Proceedings of the EMNLP","author":"Yang Yiben","year":"2020","unstructured":"Yiben Yang, Chaitanya Malaviya, Jared Fernandez, Swabha Swayamdipta, Ronan Le Bras, Ji-Ping Wang, Chandra Bhagavatula, Yejin Choi, and Doug Downey. 2020. G-DAug: Generative data augmentation for commonsense reasoning. In Proceedings of the EMNLP. 1008\u20131025."},{"key":"e_1_3_3_146_2","first-page":"745","volume-title":"Proceedings of the COLING","author":"Yang Zhitong","year":"2022","unstructured":"Zhitong Yang, Bo Wang, Jinfeng Zhou, Yue Tan, Dongming Zhao, Kun Huang, Ruifang He, and Yuexian Hou. 2022. TopKG: Target-oriented dialog via global planning on knowledge graph. In Proceedings of the COLING. 745\u2013755."},{"key":"e_1_3_3_147_2","unstructured":"Jiayi Ye Yanbo Wang Yue Huang Dongping Chen Qihui Zhang Nuno Moniz Tian Gao Werner Geyer Chao Huang Pin-Yu Chen et\u00a0al. 2025. Justice or prejudice? quantifying biases in LLM-as-a-judge. In The Thirteenth International Conference on Learning Representations ICLR. OpenReview.net."},{"key":"e_1_3_3_148_2","doi-asserted-by":"crossref","unstructured":"Zihao Yi Jiarui Ouyang Yuwen Liu Tianhao Liao Zhe Xu and Ying Shen. 2026. A survey on recent advances in LLM-based multi-turn dialogue systems. ACM Comput. Surv. 58 6 (2026) 148:1\u2013148:38.","DOI":"10.1145\/3771090"},{"key":"e_1_3_3_149_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.emnlp-main.274"},{"key":"e_1_3_3_150_2","unstructured":"Yue Yu Yuchen Zhuang Jieyu Zhang Yu Meng Alexander Ratner Ranjay Krishna Jiaming Shen and Chao Zhang. 2023. Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias. In Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems (NeurIPS\u201923)."},{"key":"e_1_3_3_151_2","first-page":"27263","volume-title":"Proceedings of the Advances in Neural Information Processing Systems","author":"Yuan Weizhe","year":"2021","unstructured":"Weizhe Yuan, Graham Neubig, and Pengfei Liu. 2021. BARTScore: Evaluating generated text as text generation. In Proceedings of the Advances in Neural Information Processing Systems. 27263\u201327277."},{"key":"e_1_3_3_152_2","doi-asserted-by":"crossref","unstructured":"Hamed Zamani Johanne R. Trippas Jeff Dalton and Filip Radlinski. 2023. Conversational information seeking. Found. Trends Inf. Retr. 17 3\u20134 (2023) 244\u2013456.","DOI":"10.1561\/1500000081"},{"key":"e_1_3_3_153_2","doi-asserted-by":"crossref","unstructured":"Jiahao Zhang Haiyang Zhang Dongmei Zhang Liu Yong and Shen Huang. 2024. End-to-end beam retrieval for multi-hop question answering. In Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers). Association for Computational Linguistics 1718\u20131731.","DOI":"10.18653\/v1\/2024.naacl-long.96"},{"key":"e_1_3_3_154_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P18-1205"},{"key":"e_1_3_3_155_2","volume-title":"Proceedings of the 8th International Conference on Learning Representations","author":"Zhang Tianyi","year":"2020","unstructured":"Tianyi Zhang, Varsha Kishore, Felix Wu, Kilian Q. Weinberger, and Yoav Artzi. 2020. BERTScore: Evaluating text generation with BERT. In Proceedings of the 8th International Conference on Learning Representations."},{"key":"e_1_3_3_156_2","first-page":"1815","volume-title":"Proceedings of the Advances in Neural Information Processing Systems","author":"Zhang Yizhe","year":"2018","unstructured":"Yizhe Zhang, Michel Galley, Jianfeng Gao, Zhe Gan, Xiujun Li, Chris Brockett, and Bill Dolan. 2018. Generating informative and diverse conversational responses via adversarial information maximization. In Proceedings of the Advances in Neural Information Processing Systems. 1815\u20131825."},{"key":"e_1_3_3_157_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52733.2024.01845"},{"key":"e_1_3_3_158_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/N19-1123"},{"key":"e_1_3_3_159_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2023.findings-acl.99"},{"key":"e_1_3_3_160_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.emnlp-main.131"},{"key":"e_1_3_3_161_2","doi-asserted-by":"publisher","DOI":"10.1145\/3209978.3210080"}],"container-title":["ACM Computing Surveys"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3795686","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,10]],"date-time":"2026-04-10T20:45:20Z","timestamp":1775853920000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3795686"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2026,4,1]]},"references-count":160,"journal-issue":{"issue":"10","published-print":{"date-parts":[[2026,7,31]]}},"alternative-id":["10.1145\/3795686"],"URL":"https:\/\/doi.org\/10.1145\/3795686","relation":{},"ISSN":["0360-0300","1557-7341"],"issn-type":[{"value":"0360-0300","type":"print"},{"value":"1557-7341","type":"electronic"}],"subject":[],"published":{"date-parts":[[2026,4,1]]},"assertion":[{"value":"2024-08-23","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2026-01-22","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2026-04-01","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}