{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,7,23]],"date-time":"2026-07-23T16:03:34Z","timestamp":1784822614849,"version":"3.55.0"},"reference-count":147,"publisher":"Association for Computing Machinery (ACM)","issue":"2","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Comput. Surv."],"published-print":{"date-parts":[[2026,1,31]]},"abstract":"<jats:p>With the development of the Large Language Models (LLMs), a large range of LLM-based Text-to-SQL(Text2SQL) methods have emerged. This survey provides a comprehensive review of LLM-based Text2SQL studies. We first enumerate classic benchmarks and evaluation metrics. For the two mainstream methods, prompt engineering and finetuning, we introduce a comprehensive taxonomy and offer practical insights into each subcategory. We present an overall analysis of the above methods and various models evaluated on well-known datasets and extract some characteristics. Finally, we discuss the challenges and future directions in this field.<\/jats:p>","DOI":"10.1145\/3737873","type":"journal-article","created":{"date-parts":[[2025,6,3]],"date-time":"2025-06-03T07:29:56Z","timestamp":1748935796000},"page":"1-37","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":48,"title":["A Survey on Employing Large Language Models for Text-to-SQL Tasks"],"prefix":"10.1145","volume":"58","author":[{"ORCID":"https:\/\/orcid.org\/0009-0009-3726-0299","authenticated-orcid":false,"given":"Liang","family":"Shi","sequence":"first","affiliation":[{"name":"School of Computer Science, Peking University","place":["Beijing, China"]}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0001-9299-2662","authenticated-orcid":false,"given":"Zhengju","family":"Tang","sequence":"additional","affiliation":[{"name":"School of Computer Science, Peking University","place":["Beijing, China"]}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0006-3678-5059","authenticated-orcid":false,"given":"Nan","family":"Zhang","sequence":"additional","affiliation":[{"name":"ZettaData US","place":["Bellevue, United States"]}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0000-2725-2927","authenticated-orcid":false,"given":"Xiaotong","family":"Zhang","sequence":"additional","affiliation":[{"name":"Beijing Bytedance Technology Co Ltd","place":["Beijing, China"]}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8219-4499","authenticated-orcid":false,"given":"Zhi","family":"Yang","sequence":"additional","affiliation":[{"name":"School of Computer Science, Peking University","place":["Beijing, China"]}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"320","published-online":{"date-parts":[[2025,9,10]]},"reference":[{"key":"e_1_3_1_2_2","unstructured":"Josh Achiam Steven Adler Sandhini Agarwal Lama Ahmad Ilge Akkaya Florencia Leoni Aleman Diogo Almeida Janko Altenschmidt Sam Altman Shyamal Anadkat et\u00a0al. 2023. Gpt-4 technical report. arXiv:2303.08774. Retrieved from https:\/\/arxiv.org\/abs\/2303.08774"},{"key":"e_1_3_1_3_2","unstructured":"Rohan Anil Andrew M. Dai Orhan Firat Melvin Johnson Dmitry Lepikhin Alexandre Passos Siamak Shakeri Emanuel Taropa Paige Bailey Zhifeng Chen et\u00a0al. 2023. Palm 2 technical report. arXiv:2305.10403. Retrieved from https:\/\/arxiv.org\/abs\/2305.10403"},{"key":"e_1_3_1_4_2","unstructured":"Aseem Arora Shabbirhussain Bhaisaheb Manasi Patwardhan Lovekesh Vig and Gautam Shroff. [n. d.]. A generic prompt for an LLM that enables NL-TO-SQL across domains and compositions. ([n. d.])."},{"key":"e_1_3_1_5_2","doi-asserted-by":"crossref","unstructured":"Aseem Arora Shabbirhussain Bhaisaheb Manasi Patwardhan Lovekesh Vig and Gautam Shroff. 2023. Adapt and decompose: Efficient generalization of Text-to-SQL via domain adapted least-to-most prompting. arXiv:2308.02582. Retrieved from https:\/\/arxiv.org\/abs\/2308.02582","DOI":"10.18653\/v1\/2023.genbench-1.3"},{"key":"e_1_3_1_6_2","unstructured":"Benjamin Ascoli Ram Kandikonda and Jinho D. Choi. 2024. ESM+: Modern insights into perspective on Text-to-SQL evaluation in the age of large language models. arXiv:2407.07313. Retrieved from https:\/\/arxiv.org\/abs\/2407.07313"},{"key":"e_1_3_1_7_2","doi-asserted-by":"crossref","unstructured":"Ben Bogin Matt Gardner and Jonathan Berant. 2019. Representing schema structure with graph neural networks for text-to-SQL parsing. arXiv:1905.06241. Retrieved from https:\/\/arxiv.org\/abs\/1905.06241","DOI":"10.18653\/v1\/P19-1448"},{"key":"e_1_3_1_8_2","first-page":"1877","article-title":"Language models are few-shot learners","volume":"33","author":"Brown Tom","year":"2020","unstructured":"Tom Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared D. Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, et\u00a0al. 2020. Language models are few-shot learners. Advances in Neural Information Processing Systems 33 (2020), 1877\u20131901.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_1_9_2","unstructured":"Hasan Alp Cafero\u011flu and \u00d6zg\u00fcr Ulusoy. 2024. E-sql: Direct schema linking via question enrichment in text-to-sql. arXiv:2409.16751. Retrieved from https:\/\/arxiv.org\/abs\/2409.16751"},{"key":"e_1_3_1_10_2","unstructured":"Zheng Cai Maosong Cao Haojiong Chen Kai Chen Keyu Chen Xin Chen Xun Chen Zehui Chen Zhi Chen Pei Chu et\u00a0al. 2024. InternLM2 Technical Report. arXiv:2403.17297. Retrieved from https:\/\/arxiv.org\/abs\/2403.17297"},{"key":"e_1_3_1_11_2","unstructured":"Zhenbiao Cao Yuanlei Zheng Zhihao Fan Xiaojin Zhang Wei Chen and Xiang Bai. 2024. Rsl-sql: Robust schema linking in text-to-sql generation. arXiv:2411.00073. Retrieved from https:\/\/arxiv.org\/abs\/2411.00073"},{"key":"e_1_3_1_12_2","unstructured":"Shuaichen Chang and Eric Fosler-Lussier. 2023. How to prompt llms for text-to-sql: A study in zero-shot single-domain and cross-domain settings. arXiv:2305.11853. Retrieved from https:\/\/arxiv.org\/abs\/2305.11853"},{"key":"e_1_3_1_13_2","unstructured":"Shuaichen Chang Jun Wang Mingwen Dong Lin Pan Henghui Zhu Alexander Hanbo Li Wuwei Lan Sheng Zhang Jiarong Jiang Joseph Lilien et\u00a0al. 2023. Dr. spider: A diagnostic evaluation benchmark towards text-to-sql robustness. arXiv:2301.08881. Retrieved from https:\/\/arxiv.org\/abs\/2301.08881"},{"key":"e_1_3_1_14_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2024.126215"},{"key":"e_1_3_1_15_2","unstructured":"Mark Chen Jerry Tworek Heewoo Jun Qiming Yuan Henrique Ponde de Oliveira Pinto Jared Kaplan Harri Edwards Yuri Burda Nicholas Joseph Greg Brockman et\u00a0al. 2021. Evaluating large language models trained on code. arXiv:2107.03374. Retrieved from https:\/\/arxiv.org\/abs\/2107.03374"},{"key":"e_1_3_1_16_2","unstructured":"Wenhu Chen Hongmin Wang Jianshu Chen Yunkai Zhang Hong Wang Shiyang Li Xiyou Zhou and William Yang Wang. 2020. TabFact: A Large-scale Dataset for Table-based Fact Verification. arXiv:1909.02164. Retrieved from https:\/\/arxiv.org\/abs\/1909.02164"},{"key":"e_1_3_1_17_2","unstructured":"Xinyun Chen Maxwell Lin Nathanael Sch\u00e4rli and Denny Zhou. 2023. Teaching large language models to self-debug. arXiv:2304.05128. Retrieved from https:\/\/arxiv.org\/abs\/2304.05128"},{"key":"e_1_3_1_18_2","unstructured":"Xiaojun Chen Tianle Wang Tianhao Qiu Jianbin Qin and Min Yang. 2024. Open-SQL Framework: Enhancing Text-to-SQL on Open-source Large Language Models. arXiv:2405.06674. Retrieved from https:\/\/arxiv.org\/abs\/2405.06674"},{"key":"e_1_3_1_19_2","unstructured":"Zhoujun Cheng Tianbao Xie Peng Shi Chengzu Li Rahul Nadkarni Yushi Hu Caiming Xiong Dragomir Radev Mari Ostendorf Luke Zettlemoyer et\u00a0al. 2022. Binding language models in symbolic languages. arXiv:2210.02875. Retrieved from https:\/\/arxiv.org\/abs\/2210.02875"},{"issue":"2","key":"e_1_3_1_20_2","first-page":"309","article-title":"Ryansql: Recursively applying sketch-based slot fillings for complex text-to-sql in cross-domain databases","volume":"47","author":"Choi DongHyun","year":"2021","unstructured":"DongHyun Choi, Myeong Cheol Shin, EungGyun Kim, and Dong Ryeol Shin. 2021. Ryansql: Recursively applying sketch-based slot fillings for complex text-to-sql in cross-domain databases. Computational Linguistics 47, 2 (2021), 309\u2013332.","journal-title":"Computational Linguistics"},{"key":"e_1_3_1_21_2","unstructured":"Sumit Kumar Dam Choong Seon Hong Yu Qiao and Chaoning Zhang. 2024. A complete survey on LLM-based AI chatbots. arXiv:2406.16937. Retrieved from https:\/\/arxiv.org\/abs\/2406.16937"},{"key":"e_1_3_1_22_2","unstructured":"DeepSeek-AI Daya Guo Dejian Yang Haowei Zhang Junxiao Song Ruoyu Zhang Runxin Xu Qihao Zhu Shirong Ma Peiyi Wang et\u00a0al. 2025. DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning. arXiv:2501.12948. Retrieved from https:\/\/arxiv.org\/abs\/2501.12948"},{"key":"e_1_3_1_23_2","unstructured":"Minghang Deng Ashwin Ramachandran Canwen Xu Lanxiang Hu Zhewei Yao Anupam Datta and Hao Zhang. 2025. ReFoRCE: A Text-to-SQL agent with self-refinement format restriction and column exploration. arXiv:2502.00675. Retrieved from https:\/\/arxiv.org\/abs\/2502.00675"},{"key":"e_1_3_1_24_2","doi-asserted-by":"crossref","unstructured":"Xiang Deng Ahmed Hassan Awadallah Christopher Meek Oleksandr Polozov Huan Sun and Matthew Richardson. 2020. Structure-grounded pretraining for text-to-sql. arXiv:2010.12773. Retrieved from https:\/\/arxiv.org\/abs\/2010.12773","DOI":"10.18653\/v1\/2021.naacl-main.105"},{"key":"e_1_3_1_25_2","article-title":"Qlora: Efficient finetuning of quantized llms","volume":"36","author":"Dettmers Tim","year":"2024","unstructured":"Tim Dettmers, Artidoro Pagnoni, Ari Holtzman, and Luke Zettlemoyer. 2024. Qlora: Efficient finetuning of quantized llms. Advances in Neural Information Processing Systems 36 (2024).","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_1_26_2","unstructured":"Xuemei Dong Chao Zhang Yuhang Ge Yuren Mao Yunjun Gao Jinshu Lin Dongfang Lou et\u00a0al. 2023. C3: Zero-shot text-to-sql with chatgpt. arXiv:2307.07306. Retrieved from https:\/\/arxiv.org\/abs\/2307.07306"},{"key":"e_1_3_1_27_2","unstructured":"Abhimanyu Dubey Abhinav Jauhri Abhinav Pandey Abhishek Kadian Ahmad Al-Dahle Aiesha Letman Akhil Mathur Alan Schelten Amy Yang Angela Fan et\u00a0al. 2024. The llama 3 herd of models. arXiv:2407.21783. Retrieved from https:\/\/arxiv.org\/abs\/2407.21783"},{"key":"e_1_3_1_28_2","unstructured":"Yuankai Fan Zhenying He Tonghui Ren Can Huang Yinan Jing Kai Zhang and X. Sean Wang. 2024. Metasql: A Generate-then-Rank Framework for Natural Language to SQL Translation. arXiv:2402.17144. Retrieved from https:\/\/arxiv.org\/abs\/2402.17144"},{"key":"e_1_3_1_29_2","article-title":"NL2SQL is a solved problem... Not!","author":"Floratou Avrilia","year":"2024","unstructured":"Avrilia Floratou and Fotis Psallidas. 2024. NL2SQL is a solved problem... Not! In Proceedings of the Conference on Innovative Data Systems Research.","journal-title":"In Proceedings of the Conference on Innovative Data Systems Research."},{"key":"e_1_3_1_30_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11023-020-09548-1"},{"key":"e_1_3_1_31_2","unstructured":"Daniel Fried Armen Aghajanyan Jessy Lin Sida Wang Eric Wallace Freda Shi Ruiqi Zhong Wen-tau Yih Luke Zettlemoyer and Mike Lewis. [n. d.]. Incoder: A generative model for code infilling and synthesis. arXiv:2204.05999. Retrieved from https:\/\/arxiv.org\/abs\/2204.05999"},{"key":"e_1_3_1_32_2","unstructured":"Yichao Fu Peter Bailis Ion Stoica and Hao Zhang. 2024. Break the sequential dependency of llm inference using lookahead decoding. arXiv:2402.02057. Retrieved from https:\/\/arxiv.org\/abs\/2402.02057"},{"key":"e_1_3_1_33_2","unstructured":"Yujian Gan Xinyun Chen Qiuping Huang and Matthew Purver. 2022. Measuring and improving compositional generalization in text-to-sql via component alignment. arXiv:2205.02054. Retrieved from https:\/\/arxiv.org\/abs\/2205.02054"},{"key":"e_1_3_1_34_2","unstructured":"Yujian Gan Xinyun Chen and Matthew Purver. 2021. Exploring underexplored limitations of cross-domain text-to-SQL generalization. arXiv:2109.05157. Retrieved from https:\/\/arxiv.org\/abs\/2109.05157"},{"key":"e_1_3_1_35_2","unstructured":"Yujian Gan Xinyun Chen Jinxia Xie Matthew Purver John R. Woodward John Drake and Qiaofu Zhang. 2021. Natural SQL: Making SQL easier to infer from natural language specifications. arXiv:2109.05153. Retrieved from https:\/\/arxiv.org\/abs\/2109.05153"},{"key":"e_1_3_1_36_2","unstructured":"Dawei Gao Haibin Wang Yaliang Li Xiuyu Sun Yichen Qian Bolin Ding and Jingren Zhou. 2023. Text-to-sql empowered by large language models: A benchmark evaluation. arXiv:2308.15363. Retrieved from https:\/\/arxiv.org\/abs\/2308.15363"},{"key":"e_1_3_1_37_2","unstructured":"Yunfan Gao Yun Xiong Xinyu Gao Kangxiang Jia Jinliu Pan Yuxi Bi Yi Dai Jiawei Sun and Haofen Wang. 2023. Retrieval-augmented generation for large language models: A survey. arXiv:2312.10997. Retrieved from https:\/\/arxiv.org\/abs\/2312.10997"},{"key":"e_1_3_1_38_2","unstructured":"Chunxi Guo Zhiliang Tian Jintao Tang Shasha Li Zhihua Wen Kaixuan Wang and Ting Wang. 2023. Retrieval-augmented GPT-3.5-based Text-to-SQL Framework with Sample-aware Prompting and Dynamic Revision Chain. arXiv:2307.05074. Retrieved from https:\/\/arxiv.org\/abs\/2307.05074"},{"key":"e_1_3_1_39_2","unstructured":"Chunxi Guo Zhiliang Tian Jintao Tang Pancheng Wang Zhihua Wen Kang Yang and Ting Wang. 2023. A case-based reasoning framework for adaptive prompting in cross-domain text-to-sql. arXiv:2304.13301. Retrieved from https:\/\/arxiv.org\/abs\/2304.13301"},{"key":"e_1_3_1_40_2","first-page":"262","volume-title":"Proceedings of the Pacific Rim International Conference on Artificial Intelligence","author":"Guo Chunxi","year":"2023","unstructured":"Chunxi Guo, Zhiliang Tian, Jintao Tang, Pancheng Wang, Zhihua Wen, Kang Yang, and Ting Wang. 2023. Prompting GPT-3.5 for Text-to-SQL with de-semanticization and skeleton retrieval. In Proceedings of the Pacific Rim International Conference on Artificial Intelligence. Springer, 262\u2013274."},{"key":"e_1_3_1_41_2","unstructured":"Daya Guo Qihao Zhu Dejian Yang Zhenda Xie Kai Dong Wentao Zhang Guanting Chen Xiao Bi Y. Wu Y. K. Li Fuli Luo Yingfei Xiong and Wenfeng Liang. 2024. DeepSeek-Coder: When the large language model meets programming-the rise of code intelligence. arXiv:2401.14196. Retrieved from https:\/\/arxiv.org\/abs\/2401.14196"},{"key":"e_1_3_1_42_2","doi-asserted-by":"publisher","DOI":"10.1162\/neco.1997.9.8.1735"},{"key":"e_1_3_1_43_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2024.findings-acl.653"},{"key":"e_1_3_1_44_2","unstructured":"Zijin Hong Zheng Yuan Qinggang Zhang Hao Chen Junnan Dong Feiran Huang and Xiao Huang. 2024. Next-generation database interfaces: A survey of llm-based text-to-sql. arXiv:2406.08426. Retrieved from https:\/\/arxiv.org\/abs\/2406.08426"},{"key":"e_1_3_1_45_2","unstructured":"Xinyi Hou Yanjie Zhao Yue Liu Zhou Yang Kailong Wang Li Li Xiapu Luo David Lo John Grundy and Haoyu Wang. 2023. Large language models for software engineering: A systematic literature review. arXiv:2308.10620. Retrieved from https:\/\/arxiv.org\/abs\/2308.10620"},{"key":"e_1_3_1_46_2","unstructured":"Edward J. Hu Yelong Shen Phillip Wallis Zeyuan Allen-Zhu Yuanzhi Li Shean Wang Lu Wang and Weizhu Chen. 2021. Lora: Low-rank adaptation of large language models. arXiv:2106.09685. Retrieved from https:\/\/arxiv.org\/abs\/2106.09685"},{"key":"e_1_3_1_47_2","first-page":"14702","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Jang Joel","year":"2023","unstructured":"Joel Jang, Seungone Kim, Seonghyeon Ye, Doyoung Kim, Lajanugen Logeswaran, Moontae Lee, Kyungjae Lee, and Minjoon Seo. 2023. Exploring the benefits of training expert language models over instruction tuning. In Proceedings of the International Conference on Machine Learning. PMLR, 14702\u201314729."},{"key":"e_1_3_1_48_2","doi-asserted-by":"publisher","DOI":"10.1145\/3292500.3330703"},{"key":"e_1_3_1_49_2","doi-asserted-by":"publisher","DOI":"10.1145\/3613905.3650755"},{"key":"e_1_3_1_50_2","unstructured":"Jared Kaplan Sam McCandlish Tom Henighan Tom B Brown Benjamin Chess Rewon Child Scott Gray Alec Radford Jeffrey Wu and Dario Amodei. 2020. Scaling laws for neural language models. arXiv:2001.08361. Retrieved from https:\/\/arxiv.org\/abs\/2001.08361"},{"key":"e_1_3_1_51_2","volume-title":"Proceedings of the naacL-HLT","author":"Kenton Jacob Devlin Ming-Wei Chang","year":"2019","unstructured":"Jacob Devlin Ming-Wei Chang Kenton and Lee Kristina Toutanova. 2019. Bert: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the naacL-HLT. Minneapolis, Minnesota."},{"key":"e_1_3_1_52_2","unstructured":"Tushar Khot Harsh Trivedi Matthew Finlayson Yao Fu Kyle Richardson Peter Clark and Ashish Sabharwal. 2022. Decomposed prompting: A modular approach for solving complex tasks. arXiv:2210.02406. Retrieved from https:\/\/arxiv.org\/abs\/2210.02406"},{"key":"e_1_3_1_53_2","doi-asserted-by":"crossref","unstructured":"Mayank Kothyari Dhruva Dhingra Sunita Sarawagi and Soumen Chakrabarti. 2023. CRUSH4SQL: Collective retrieval using schema hallucination for Text2SQL. arXiv:2311.01173. Retrieved from https:\/\/arxiv.org\/abs\/2311.01173","DOI":"10.18653\/v1\/2023.emnlp-main.868"},{"key":"e_1_3_1_54_2","unstructured":"Chia-Hsuan Lee Oleksandr Polozov and Matthew Richardson. 2021. KaggleDBQA: Realistic evaluation of text-to-SQL parsers. arXiv:2106.11455. Retrieved from https:\/\/arxiv.org\/abs\/2106.11455"},{"key":"e_1_3_1_55_2","first-page":"337","volume-title":"Proceedings of the 31st International Conference on Computational Linguistics","author":"Lee Dongjun","year":"2025","unstructured":"Dongjun Lee, Choongwon Park, Jaehyuk Kim, and Heesoo Park. 2025. MCS-SQL: Leveraging multiple prompts and multiple-choice selection for Text-to-SQL generation. In Proceedings of the 31st International Conference on Computational Linguistics. 337\u2013353."},{"key":"e_1_3_1_56_2","unstructured":"Fangyu Lei Jixuan Chen Yuxiao Ye Ruisheng Cao Dongchan Shin Hongjin Su Zhaoqing Suo Hongcheng Gao Wenjing Hu Pengcheng Yin et al. 2024. Spider 2.0: Evaluating language models on real-world enterprise Text-to-SQL workflows. arXiv:2411.07763. Retrieved from https:\/\/arxiv.org\/abs\/2411.07763"},{"key":"e_1_3_1_57_2","doi-asserted-by":"crossref","unstructured":"Brian Lester Rami Al-Rfou and Noah Constant. 2021. The power of scale for parameter-efficient prompt tuning. arXiv:2104.08691. Retrieved from https:\/\/arxiv.org\/abs\/2104.08691","DOI":"10.18653\/v1\/2021.emnlp-main.243"},{"key":"e_1_3_1_58_2","doi-asserted-by":"publisher","DOI":"10.14778\/2735461.2735468"},{"key":"e_1_3_1_59_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v37i11.26535"},{"key":"e_1_3_1_60_2","unstructured":"Haoyang Li Jing Zhang Hanbing Liu Ju Fan Xiaokang Zhang Jun Zhu Renjie Wei Hongyan Pan Cuiping Li and Hong Chen. 2024. CodeS: Towards Building Open-source Language Models for Text-to-SQL. arXiv:2402.16347. Retrieved from https:\/\/arxiv.org\/abs\/2402.16347"},{"key":"e_1_3_1_61_2","article-title":"Can llm already serve as a database interface? A big bench for large-scale database grounded text-to-sqls","volume":"36","author":"Li Jinyang","year":"2024","unstructured":"Jinyang Li, Binyuan Hui, Ge Qu, Jiaxi Yang, Binhua Li, Bowen Li, Bailin Wang, Bowen Qin, Ruiying Geng, Nan Huo, et\u00a0al. 2024. Can llm already serve as a database interface? A big bench for large-scale database grounded text-to-sqls. Advances in Neural Information Processing Systems 36 (2024).","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_1_62_2","unstructured":"Zhishuai Li Xiang Wang Jingjing Zhao Sun Yang Guoqing Du Xiaoru Hu Bin Zhang Yuxiao Ye Ziyue Li Rui Zhao et\u00a0al. 2024. PET-SQL: A prompt-enhanced two-stage Text-to-SQL framework with cross-consistency. arXiv:2403.09732. Retrieved from https:\/\/arxiv.org\/abs\/2403.09732"},{"key":"e_1_3_1_63_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.findings-emnlp.438"},{"key":"e_1_3_1_64_2","doi-asserted-by":"publisher","DOI":"10.1145\/3534678.3539294"},{"key":"e_1_3_1_65_2","unstructured":"Aiwei Liu Xuming Hu Lijie Wen and Philip S. Yu. 2023. A comprehensive evaluation of ChatGPT\u2019s zero-shot Text-to-SQL capability. arXiv:2303.13547. Retrieved from https:\/\/arxiv.org\/abs\/2303.13547"},{"key":"e_1_3_1_66_2","first-page":"9793","volume-title":"Proceedings of the 31st International Conference on Computational Linguistics","author":"Liu Geling","year":"2025","unstructured":"Geling Liu, Yunzhi Tan, Ruichao Zhong, Yuanzhen Xie, Lingchen Zhao, Qian Wang, Bo Hu, and Zang Li. 2025. Solid-SQL: Enhanced schema-linking based in-context learning for robust Text-to-SQL. In Proceedings of the 31st International Conference on Computational Linguistics. 9793\u20139803."},{"key":"e_1_3_1_67_2","first-page":"1950","article-title":"Few-shot parameter-efficient fine-tuning is better and cheaper than in-context learning","volume":"35","author":"Liu Haokun","year":"2022","unstructured":"Haokun Liu, Derek Tam, Mohammed Muqeeth, Jay Mohta, Tenghao Huang, Mohit Bansal, and Colin A. Raffel. 2022. Few-shot parameter-efficient fine-tuning is better and cheaper than in-context learning. Advances in Neural Information Processing Systems 35 (2022), 1950\u20131965.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_1_68_2","doi-asserted-by":"crossref","unstructured":"Xiao Liu Kaixuan Ji Yicheng Fu Weng Lam Tam Zhengxiao Du Zhilin Yang and Jie Tang. 2021. P-tuning v2: Prompt tuning can be comparable to fine-tuning universally across scales and tasks. arXiv:2110.07602. Retrieved from https:\/\/arxiv.org\/abs\/2110.07602","DOI":"10.18653\/v1\/2022.acl-short.8"},{"key":"e_1_3_1_69_2","unstructured":"Xinyu Liu Shuyu Shen Boyan Li Peixian Ma Runzhi Jiang Yuxin Zhang Ju Fan Guoliang Li Nan Tang and Yuyu Luo. 2024. A survey of NL2SQL with large language models: Where are we and where are we going? arXiv:2408.05109. Retrieved from https:\/\/arxiv.org\/abs\/2408.05109"},{"key":"e_1_3_1_70_2","unstructured":"Xiping Liu and Zhao Tan. 2023. Divide and prompt: Chain of thought prompting for text-to-sql. arXiv:2304.11556. Retrieved from https:\/\/arxiv.org\/abs\/2304.11556"},{"key":"e_1_3_1_71_2","article-title":"GPT understands, too","author":"Liu Xiao","year":"2023","unstructured":"Xiao Liu, Yanan Zheng, Zhengxiao Du, Ming Ding, Yujie Qian, Zhilin Yang, and Jie Tang. 2023. GPT understands, too. AI Open (2023).","journal-title":"AI Open"},{"key":"e_1_3_1_72_2","unstructured":"Yinhan Liu Myle Ott Naman Goyal Jingfei Du Mandar Joshi Danqi Chen Omer Levy Mike Lewis Luke Zettlemoyer and Veselin Stoyanov. 2019. Roberta: A robustly optimized bert pretraining approach. arXiv:1907.11692. Retrieved from https:\/\/arxiv.org\/abs\/1907.11692"},{"key":"e_1_3_1_73_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2024.emnlp-main.221"},{"key":"e_1_3_1_74_2","unstructured":"Qin Lyu Kaushik Chakrabarti Shobhit Hathi Souvik Kundu Jianwen Zhang and Zheng Chen. 2020. Hybrid ranking network for text-to-sql. arXiv:2008.04759. Retrieved from https:\/\/arxiv.org\/abs\/2008.04759"},{"key":"e_1_3_1_75_2","unstructured":"Karime Maamari Fadhil Abubaker Daniel Jaroslawicz and Amine Mhedhbi. 2024. The death of schema linking? Text-to-SQL in the age of well-reasoned language models. arXiv:2408.07702. Retrieved from https:\/\/arxiv.org\/abs\/2408.07702"},{"key":"e_1_3_1_76_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-88244-0_10"},{"key":"e_1_3_1_77_2","unstructured":"Qingkai Min Yuefeng Shi and Yue Zhang. 2019. A pilot study for Chinese SQL semantic parsing. arXiv:1909.13293. Retrieved from https:\/\/arxiv.org\/abs\/1909.13293"},{"key":"e_1_3_1_78_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2023.findings-emnlp.996"},{"key":"e_1_3_1_79_2","unstructured":"Ansong Ni Srini Iyer Dragomir Radev Ves Stoyanov Wen tau Yih Sida I. Wang and Xi Victoria Lin. 2023. LEVER: Learning to Verify Language-to-Code Generation with Execution. arXiv:2302.08468. Retrieved from https:\/\/arxiv.org\/abs\/2302.08468"},{"key":"e_1_3_1_80_2","unstructured":"Erik Nijkamp Bo Pang Hiroaki Hayashi Lifu Tu Huan Wang Yingbo Zhou Silvio Savarese and Caiming Xiong. 2022. Codegen: An open large language model for code with multi-turn program synthesis. arXiv:2203.13474. Retrieved from https:\/\/arxiv.org\/abs\/2203.13474"},{"key":"e_1_3_1_81_2","unstructured":"OpenAI. 2024. Introducing OpenAI o1-Preview. Retrieved from https:\/\/openai.com\/index\/introducing-openai-o1-preview\/"},{"key":"e_1_3_1_82_2","unstructured":"Liangming Pan Michael Saxon Wenda Xu Deepak Nathani Xinyi Wang and William Yang Wang. 2023. Automatically Correcting Large Language Models: Surveying the Landscape of Diverse Self-Correction Strategies. arXiv:2308.03188. Retrieved from https:\/\/arxiv.org\/abs\/2308.03188"},{"key":"e_1_3_1_83_2","unstructured":"Wenqi Pei Hailing Xu Hengyuan Zhao Shizheng Hou Han Chen Zining Zhang Pingyi Luo and Bingsheng He. 2025. Feather-SQL: A lightweight NL2SQL framework with dual-model collaboration paradigm for small language models. arXiv:2503.17811. Retrieved from https:\/\/arxiv.org\/abs\/2503.17811"},{"key":"e_1_3_1_84_2","unstructured":"Xinyu Pi Bing Wang Yan Gao Jiaqi Guo Zhoujun Li and Jian-Guang Lou. 2022. Towards robustness of text-to-SQL models against natural and realistic adversarial table perturbation. arXiv:2212.09994. Retrieved from https:\/\/arxiv.org\/abs\/2212.09994"},{"key":"e_1_3_1_85_2","unstructured":"Mohammadreza Pourreza Hailong Li Ruoxi Sun Yeounoh Chung Shayan Talaei Gaurav Tarlok Kakkar Yu Gan Amin Saberi Fatma Ozcan and Sercan O Arik. 2024. Chase-sql: Multi-path reasoning and preference optimized candidate selection in text-to-sql. arXiv:2410.01943. Retrieved from https:\/\/arxiv.org\/abs\/2410.01943"},{"key":"e_1_3_1_86_2","article-title":"Din-sql: Decomposed in-context learning of text-to-sql with self-correction","volume":"36","author":"Pourreza Mohammadreza","year":"2024","unstructured":"Mohammadreza Pourreza and Davood Rafiei. 2024. Din-sql: Decomposed in-context learning of text-to-sql with self-correction. Advances in Neural Information Processing Systems 36 (2024).","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_1_87_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2024.findings-emnlp.481"},{"key":"e_1_3_1_88_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.emnlp-main.211"},{"key":"e_1_3_1_89_2","unstructured":"Bowen Qin Binyuan Hui Lihan Wang Min Yang Jinyang Li Binhua Li Ruiying Geng Rongyu Cao Jian Sun Luo Si et\u00a0al. 2022. A survey on text-to-sql parsing: Concepts methods and future directions. arXiv:2208.13629. Retrieved from https:\/\/arxiv.org\/abs\/2208.13629"},{"key":"e_1_3_1_90_2","unstructured":"Nitarshan Rajkumar Raymond Li and Dzmitry Bahdanau. 2022. Evaluating the Text-to-SQL capabilities of large language models. arXiv:2204.00498. Retrieved from https:\/\/arxiv.org\/abs\/2204.00498"},{"key":"e_1_3_1_91_2","unstructured":"Tonghui Ren Yuankai Fan Zhenying He Ren Huang Jiaqi Dai Can Huang Yinan Jing Kai Zhang Yifan Yang and X. Sean Wang. 2024. PURPLE: Making a Large Language Model a Better SQL Writer. arXiv:2403.20014. Retrieved from https:\/\/arxiv.org\/abs\/2403.20014"},{"key":"e_1_3_1_92_2","unstructured":"Baptiste Roziere Jonas Gehring Fabian Gloeckle Sten Sootla Itai Gat Xiaoqing Ellen Tan Yossi Adi Jingyu Liu Tal Remez J\u00e9r\u00e9my Rapin et\u00a0al. 2023. Code llama: Open foundation models for code. arXiv:2308.12950. Retrieved from https:\/\/arxiv.org\/abs\/2308.12950"},{"key":"e_1_3_1_93_2","doi-asserted-by":"crossref","unstructured":"Ohad Rubin and Jonathan Berant. 2020. SmBoP: Semi-autoregressive bottom-up semantic parsing. arXiv:2010.12412. Retrieved from https:\/\/arxiv.org\/abs\/2010.12412","DOI":"10.18653\/v1\/2021.naacl-main.29"},{"key":"e_1_3_1_94_2","doi-asserted-by":"crossref","unstructured":"Torsten Scholak Nathan Schucher and Dzmitry Bahdanau. 2021. PICARD: Parsing incrementally for constrained auto-regressive decoding from language models. arXiv:2109.05093. Retrieved from https:\/\/arxiv.org\/abs\/2109.05093","DOI":"10.18653\/v1\/2021.emnlp-main.779"},{"key":"e_1_3_1_95_2","doi-asserted-by":"publisher","DOI":"10.14778\/3407790.3407858"},{"key":"e_1_3_1_96_2","unstructured":"SenseTime. 2024. SenseChat. Retrieved from https:\/\/platform.sensenova.cn\/#\/doc?path=\/chat\/ChatCompletions\/ChatCompletions.md"},{"key":"e_1_3_1_97_2","unstructured":"Burr Settles. 2009. Active learning literature survey. (2009)."},{"key":"e_1_3_1_98_2","unstructured":"Lei Sheng Shuai-Shuai Xu and Wei Xie. 2025. BASE-SQL: A powerful open source Text-To-SQL baseline approach. arXiv:2502.10739. Retrieved from https:\/\/arxiv.org\/abs\/2502.10739"},{"key":"e_1_3_1_99_2","unstructured":"Rishabh Srivastava and Wendy Aw. 2023. Defog SQLCoder. Retrieved from https:\/\/github.com\/defog-ai\/sqlcoder"},{"key":"e_1_3_1_100_2","unstructured":"Guanghu Sui Zhishuai Li Ziyue Li Sun Yang Jingqing Ruan Hangyu Mao and Rui Zhao. 2023. Reboost large language model-based Text-to-SQL Text-to-Python and text-to-function\u2013with real applications in traffic domain. arXiv:2310.18752. Retrieved from https:\/\/arxiv.org\/abs\/2310.18752"},{"key":"e_1_3_1_101_2","unstructured":"Ruoxi Sun Sercan O Arik Hootan Nakhost Hanjun Dai Rajarishi Sinha Pengcheng Yin and Tomas Pfister. 2023. Sql-palm: Improved large language modeladaptation for text-to-sql. arXiv:2306.00739. Retrieved from https:\/\/arxiv.org\/abs\/2306.00739"},{"key":"e_1_3_1_102_2","article-title":"Sequence to sequence learning with neural networks","volume":"27","author":"Sutskever Ilya","year":"2014","unstructured":"Ilya Sutskever, Oriol Vinyals, and Quoc V. Le. 2014. Sequence to sequence learning with neural networks. Advances in Neural Information Processing Systems 27 (2014).","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_1_103_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2023.emnlp-main.327"},{"key":"e_1_3_1_104_2","unstructured":"Shayan Talaei Mohammadreza Pourreza Yu-Chen Chang Azalia Mirhoseini and Amin Saberi. 2024. CHESS: Contextual Harnessing for Efficient SQL Synthesis. arXiv:2405.16755. Retrieved from https:\/\/arxiv.org\/abs\/2405.16755"},{"key":"e_1_3_1_105_2","unstructured":"Rohan Taori Ishaan Gulrajani Tianyi Zhang Yann Dubois Xuechen Li Carlos Guestrin Percy Liang and Tatsunori B. Hashimoto. 2023. Stanford Alpaca: An Instruction-following LLaMA Model. Retrieved from https:\/\/github.com\/tatsu-lab\/stanford_alpaca"},{"key":"e_1_3_1_106_2","unstructured":"Dayton G. Thorpe Andrew J. Duberstein and Ian A. Kinsey. 2024. Dubo-SQL: Diverse Retrieval-Augmented Generation and Fine-tuning for Text-to-SQL. arXiv:2404.12560. Retrieved from https:\/\/arxiv.org\/abs\/2404.12560"},{"key":"e_1_3_1_107_2","article-title":"Attention is all you need","volume":"30","author":"Vaswani Ashish","year":"2017","unstructured":"Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, \u0141ukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. Advances in Neural Information Processing Systems 30 (2017).","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_1_108_2","unstructured":"Bing Wang Changyu Ren Jian Yang Xinnian Liang Jiaqi Bai Qian-Wen Zhang Zhao Yan and Zhoujun Li. 2023. Mac-sql: Multi-agent collaboration for text-to-sql. arXiv:2312.11242. Retrieved from https:\/\/arxiv.org\/abs\/2312.11242"},{"key":"e_1_3_1_109_2","unstructured":"Dingzirui Wang Longxu Dou Xuanliang Zhang Qingfu Zhu and Wanxiang Che. 2024. Improving Demonstration Diversity by Human-Free Fusing for Text-to-SQL. arXiv:2402.10663. Retrieved from https:\/\/arxiv.org\/abs\/2402.10663"},{"key":"e_1_3_1_110_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11704-024-40231-1"},{"key":"e_1_3_1_111_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.emnlp-main.562"},{"key":"e_1_3_1_112_2","unstructured":"Tianshu Wang Hongyu Lin Xianpei Han Le Sun Xiaoyang Chen Hao Wang and Zhenyu Zeng. 2023. DBCopilot: Scaling natural language querying to massive databases. arXiv:2312.03463. Retrieved from https:\/\/arxiv.org\/abs\/2312.03463"},{"key":"e_1_3_1_113_2","unstructured":"Xuezhi Wang Jason Wei Dale Schuurmans Quoc Le Ed Chi Sharan Narang Aakanksha Chowdhery and Denny Zhou. 2022. Self-consistency improves chain of thought reasoning in language models. arXiv:2203.11171. Retrieved from https:\/\/arxiv.org\/abs\/2203.11171"},{"key":"e_1_3_1_114_2","unstructured":"Jason Wei Maarten Bosma Vincent Y. Zhao Kelvin Guu Adams Wei Yu Brian Lester Nan Du Andrew M. Dai and Quoc V. Le. 2021. Finetuned language models are zero-shot learners. arXiv:2109.01652. Retrieved from https:\/\/arxiv.org\/abs\/2109.01652"},{"key":"e_1_3_1_115_2","unstructured":"Jason Wei Yi Tay Rishi Bommasani Colin Raffel Barret Zoph Sebastian Borgeaud Dani Yogatama Maarten Bosma Denny Zhou Donald Metzler et\u00a0al. 2022. Emergent abilities of large language models. arXiv:2206.07682. Retrieved from https:\/\/arxiv.org\/abs\/2206.07682"},{"key":"e_1_3_1_116_2","first-page":"24824","article-title":"Chain-of-thought prompting elicits reasoning in large language models","volume":"35","author":"Wei Jason","year":"2022","unstructured":"Jason Wei, Xuezhi Wang, Dale Schuurmans, Maarten Bosma, Fei Xia, Ed Chi, Quoc V. Le, Denny Zhou, et\u00a0al. 2022. Chain-of-thought prompting elicits reasoning in large language models. Advances in Neural Information Processing Systems 35 (2022), 24824\u201324837.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_1_117_2","unstructured":"Lixia Wu Peng Li Junhong Lou and Lei Fu. 2024. DataGpt-SQL-7B: An Open-Source Language Model for Text-to-SQL. arXiv:2409.15985. Retrieved from https:\/\/arxiv.org\/abs\/2409.15985"},{"key":"e_1_3_1_118_2","unstructured":"Hanchen Xia Feng Jiang Naihao Deng Cunxiang Wang Guojiang Zhao Rada Mihalcea and Yue Zhang. 2024. \\(R^3\\) : \u201cThis is My SQL Are You With Me?\u201d A Consensus-Based Multi-Agent System for Text-to-SQL Tasks. arXiv:2402.14851. Retrieved from https:\/\/arxiv.org\/abs\/2402.14851"},{"key":"e_1_3_1_119_2","unstructured":"Wenxuan Xie Gaochen Wu and Bowen Zhou. 2024. Mag-sql: Multi-agent generative approach with soft schema linking and iterative sub-sql refinement for text-to-sql. arXiv:2408.07930. Retrieved from https:\/\/arxiv.org\/abs\/2408.07930"},{"key":"e_1_3_1_120_2","unstructured":"Xiangjin Xie Guangwei Xu Lingyan Zhao and Ruijie Guo. 2025. OpenSearch-SQL: Enhancing Text-to-SQL with dynamic few-shot and consistency alignment. arXiv:2502.14913. Retrieved from https:\/\/arxiv.org\/abs\/2502.14913"},{"key":"e_1_3_1_121_2","unstructured":"Yuanzhen Xie Xinzhou Jin Tao Xie MingXiong Lin Liang Chen Chenyun Yu Lei Cheng ChengXiang Zhuo Bo Hu and Zang Li. 2024. Decomposition for Enhancing Attention: Improving LLM-based Text-to-SQL through Workflow Paradigm. arXiv:2402.10671. Retrieved from https:\/\/arxiv.org\/abs\/2402.10671"},{"key":"e_1_3_1_122_2","unstructured":"Can Xu Qingfeng Sun Kai Zheng Xiubo Geng Pu Zhao Jiazhan Feng Chongyang Tao and Daxin Jiang. 2023. Wizardlm: Empowering large language models to follow complex instructions. arXiv:2304.12244. Retrieved from https:\/\/arxiv.org\/abs\/2304.12244"},{"key":"e_1_3_1_123_2","unstructured":"Xiaojun Xu Chang Liu and Dawn Song. 2017. Sqlnet: Generating structured queries from natural language without reinforcement learning. arXiv:1711.04436. Retrieved from https:\/\/arxiv.org\/abs\/1711.04436"},{"key":"e_1_3_1_124_2","unstructured":"Siqiao Xue Caigao Jiang Wenhui Shi Fangyin Cheng Keting Chen Hongjun Yang Zhiping Zhang Jianshan He Hongyang Zhang Ganglin Wei et\u00a0al. 2023. Db-gpt: Empowering database interactions with private large language models. arXiv:2312.17449. Retrieved from https:\/\/arxiv.org\/abs\/2312.17449"},{"key":"e_1_3_1_125_2","unstructured":"An Yang Baosong Yang Beichen Zhang Binyuan Hui Bo Zheng Bowen Yu Chengyuan Li Dayiheng Liu Fei Huang Haoran Wei et\u00a0al. 2025. Qwen2.5 Technical Report. arXiv:2412.15115. Retrieved from https:\/\/arxiv.org\/abs\/2412.15115"},{"key":"e_1_3_1_126_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-031-68309-1_11"},{"key":"e_1_3_1_127_2","article-title":"Tree of thoughts: Deliberate problem solving with large language models","volume":"36","author":"Yao Shunyu","year":"2024","unstructured":"Shunyu Yao, Dian Yu, Jeffrey Zhao, Izhak Shafran, Tom Griffiths, Yuan Cao, and Karthik Narasimhan. 2024. Tree of thoughts: Deliberate problem solving with large language models. Advances in Neural Information Processing Systems 36 (2024).","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_1_128_2","unstructured":"Shunyu Yao Jeffrey Zhao Dian Yu Nan Du Izhak Shafran Karthik Narasimhan and Yuan Cao. 2022. React: Synergizing reasoning and acting in language models. arXiv:2210.03629. Retrieved from https:\/\/arxiv.org\/abs\/2210.03629"},{"key":"e_1_3_1_129_2","doi-asserted-by":"crossref","unstructured":"Tao Yu Rui Zhang He Yang Er Suyi Li Eric Xue Bo Pang Xi Victoria Lin Yi Chern Tan Tianze Shi Zihan Li et\u00a0al. 2019. Cosql: A conversational text-to-sql challenge towards cross-domain natural language interfaces to databases. arXiv:1909.05378. Retrieved from https:\/\/arxiv.org\/abs\/1909.05378","DOI":"10.18653\/v1\/D19-1204"},{"key":"e_1_3_1_130_2","doi-asserted-by":"crossref","unstructured":"Tao Yu Rui Zhang Kai Yang Michihiro Yasunaga Dongxu Wang Zifan Li James Ma Irene Li Qingning Yao Shanelle Roman et\u00a0al. 2018. Spider: A large-scale human-labeled dataset for complex and cross-domain semantic parsing and text-to-sql task. arXiv:1809.08887. Retrieved from https:\/\/arxiv.org\/abs\/1809.08887","DOI":"10.18653\/v1\/D18-1425"},{"key":"e_1_3_1_131_2","doi-asserted-by":"crossref","unstructured":"Tao Yu Rui Zhang Michihiro Yasunaga Yi Chern Tan Xi Victoria Lin Suyi Li Heyang Er Irene Li Bo Pang Tao Chen et\u00a0al. 2019. Sparc: Cross-domain semantic parsing in context. arXiv:1906.02285. Retrieved from https:\/\/arxiv.org\/abs\/1906.02285","DOI":"10.18653\/v1\/P19-1443"},{"key":"e_1_3_1_132_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v39i24.34770"},{"key":"e_1_3_1_133_2","first-page":"1050","volume-title":"Proceedings of the National Conference on Artificial Intelligence","author":"Zelle John M.","year":"1996","unstructured":"John M. Zelle and Raymond J. Mooney. 1996. Learning to parse database queries using inductive logic programming. In Proceedings of the National Conference on Artificial Intelligence. 1050\u20131055."},{"key":"e_1_3_1_134_2","unstructured":"Bin Zhang Yuxiao Ye Guoqing Du Xiaoru Hu Zhishuai Li Sun Yang Chi Harold Liu Rui Zhao Ziyue Li and Hangyu Mao. 2024. Benchmarking the Text-to-SQL capability of large language models: A comprehensive evaluation. arXiv:2403.02951. Retrieved from https:\/\/arxiv.org\/abs\/2403.02951"},{"key":"e_1_3_1_135_2","doi-asserted-by":"crossref","unstructured":"Chao Zhang Yuren Mao Yijiang Fan Yu Mi Yunjun Gao Lu Chen Dongfang Lou and Jinshu Lin. 2024. FinSQL: Model-Agnostic LLMs-based Text-to-SQL Framework for Financial Analysis. arXiv:2401.10506. Retrieved from https:\/\/arxiv.org\/abs\/2401.10506","DOI":"10.1145\/3626246.3653375"},{"key":"e_1_3_1_136_2","doi-asserted-by":"crossref","unstructured":"Hanchong Zhang Ruisheng Cao Lu Chen Hongshen Xu and Kai Yu. 2023. Act-sql: In-context learning for text-to-sql with automatically-generated chain-of-thought. arXiv:2310.17342. Retrieved from https:\/\/arxiv.org\/abs\/2310.17342","DOI":"10.18653\/v1\/2023.findings-emnlp.227"},{"key":"e_1_3_1_137_2","unstructured":"Hanchong Zhang Ruisheng Cao Hongshen Xu Lu Chen and Kai Yu. 2024. CoE-SQL: In-Context Learning for Multi-Turn Text-to-SQL with Chain-of-Editions. arXiv:2405.02712. Retrieved from https:\/\/arxiv.org\/abs\/2405.02712"},{"key":"e_1_3_1_138_2","unstructured":"Qinggang Zhang Junnan Dong Hao Chen Wentao Li Feiran Huang and Xiao Huang. 2024. Structure Guided Large Language Model for SQL Generation. arXiv:2402.13284. Retrieved from https:\/\/arxiv.org\/abs\/2402.13284"},{"key":"e_1_3_1_139_2","unstructured":"Tingkai Zhang Chaoyu Chen Cong Liao Jun Wang Xudong Zhao Hang Yu Jianchao Wang Jianguo Li and Wenhui Shi. 2024. SQLfuse: Enhancing Text-to-SQL Performance through Comprehensive LLM Synergy. arXiv:2407.14568. Retrieved from https:\/\/arxiv.org\/abs\/2407.14568"},{"key":"e_1_3_1_140_2","article-title":"Natural language interfaces for tabular data querying and visualization: A survey","author":"Zhang Weixu","year":"2024","unstructured":"Weixu Zhang, Yifei Wang, Yuanfeng Song, Victor Junqiu Wei, Yuxing Tian, Yiyan Qi, Jonathan H Chan, Raymond Chi-Wing Wong, and Haiqin Yang. 2024. Natural language interfaces for tabular data querying and visualization: A survey. IEEE Transactions on Knowledge and Data Engineering (2024).","journal-title":"IEEE Transactions on Knowledge and Data Engineering"},{"key":"e_1_3_1_141_2","doi-asserted-by":"crossref","unstructured":"Yi Zhang Jan Deriu George Katsogiannis-Meimarakis Catherine Kosten Georgia Koutrika and Kurt Stockinger. 2023. ScienceBenchmark: A Complex Real-World Benchmark for Evaluating Natural Language to SQL Systems. arXiv:2306.04743. Retrieved from https:\/\/arxiv.org\/abs\/2306.04743","DOI":"10.14778\/3636218.3636225"},{"key":"e_1_3_1_142_2","unstructured":"Wayne Xin Zhao Kun Zhou Junyi Li Tianyi Tang Xiaolei Wang Yupeng Hou Yingqian Min Beichen Zhang Junjie Zhang Zican Dong et\u00a0al. 2023. A survey of large language models. arXiv:2303.18223. Retrieved from https:\/\/arxiv.org\/abs\/2303.18223"},{"key":"e_1_3_1_143_2","unstructured":"Lianmin Zheng Wei-Lin Chiang Ying Sheng Siyuan Zhuang Zhanghao Wu Yonghao Zhuang Zi Lin Zhuohan Li Dacheng Li Eric. P. Xing Hao Zhang Joseph E. Gonzalez and Ion Stoica. 2023. Judging LLM-as-a-judge with MT-Bench and Chatbot Arena. arXiv:2306.05685. Retrieved from https:\/\/arxiv.org\/abs\/2306.05685"},{"key":"e_1_3_1_144_2","doi-asserted-by":"crossref","unstructured":"Ruiqi Zhong Tao Yu and Dan Klein. 2020. Semantic evaluation for text-to-SQL with distilled test suites. arXiv:2010.02840. Retrieved from https:\/\/arxiv.org\/abs\/2010.02840","DOI":"10.18653\/v1\/2020.emnlp-main.29"},{"key":"e_1_3_1_145_2","unstructured":"Victor Zhong Caiming Xiong and Richard Socher. 2017. Seq2sql: Generating structured queries from natural language using reinforcement learning. arXiv:1709.00103. Retrieved from https:\/\/arxiv.org\/abs\/1709.00103"},{"key":"e_1_3_1_146_2","unstructured":"Denny Zhou Nathanael Sch\u00e4rli Le Hou Jason Wei Nathan Scales Xuezhi Wang Dale Schuurmans Claire Cui Olivier Bousquet Quoc Le et\u00a0al. 2022. Least-to-most prompting enables complex reasoning in large language models. arXiv:2205.10625. Retrieved from https:\/\/arxiv.org\/abs\/2205.10625"},{"key":"e_1_3_1_147_2","unstructured":"Fan Zhou Siqiao Xue Danrui Qi Wenhui Shi Wang Zhao Ganglin Wei Hongyang Zhang Caigai Jiang Gangwei Jiang Zhixuan Chu and Faqiang Chen. 2024. DB-GPT-Hub: Towards open benchmarking Text-to-SQL empowered by large language models. arXiv:2406.11434. Retrieved from https:\/\/arxiv.org\/abs\/2406.11434"},{"key":"e_1_3_1_148_2","unstructured":"Xiaohu Zhu Qian Li Lizhen Cui and Yongkang Liu. 2024. Large language model enhanced text-to-sql generation: A survey. arXiv:2410.06011. Retrieved from https:\/\/arxiv.org\/abs\/2410.06011"}],"container-title":["ACM Computing Surveys"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3737873","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,9,10]],"date-time":"2025-09-10T13:13:32Z","timestamp":1757510012000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3737873"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,9,10]]},"references-count":147,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2026,1,31]]}},"alternative-id":["10.1145\/3737873"],"URL":"https:\/\/doi.org\/10.1145\/3737873","relation":{},"ISSN":["0360-0300","1557-7341"],"issn-type":[{"value":"0360-0300","type":"print"},{"value":"1557-7341","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,9,10]]},"assertion":[{"value":"2024-11-12","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2025-05-18","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2025-09-10","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}