{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,11]],"date-time":"2026-01-11T05:45:44Z","timestamp":1768110344466,"version":"3.49.0"},"reference-count":128,"publisher":"Association for Computing Machinery (ACM)","issue":"2","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["Proc. VLDB Endow."],"published-print":{"date-parts":[[2024,10]]},"abstract":"<jats:p>Social scientists are increasingly interested in analyzing the semantic information (e.g., emotion) of unstructured data (e.g., Tweets), where the semantic information is not natively present. Performing this analysis in a cost-efficient manner requires using machine learning (ML) models to extract the semantic information and subsequently analyze the now structured data. However, this process remains challenging for domain experts.<\/jats:p>\n          <jats:p>To demonstrate the challenges in social science analytics, we collect a dataset, QUIET-ML, of 120 real-world social science queries in natural language and their ground truth answers. Existing systems struggle with these queries since (1) they require selecting and applying ML models, and (2) more than a quarter of these queries are vague, making standard tools like natural language to SQL systems unsuited. To address these issues, we develop LEAP, an end-to-end library that answers social science queries in natural language with ML. LEAP filters vague queries to ensure that the answers are deterministic and selects from internally supported and user-defined ML functions to extend the unstructured data to structured tables with necessary annotations. LEAP further generates and executes code to respond to these natural language queries. LEAP achieves a 100% pass @ 3 and 92% pass @ 1 on QUIET-ML, with a $1.06 average end-to-end cost, of which code generation costs $0.02.<\/jats:p>","DOI":"10.14778\/3705829.3705843","type":"journal-article","created":{"date-parts":[[2025,2,28]],"date-time":"2025-02-28T23:21:06Z","timestamp":1740784866000},"page":"253-264","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["LEAP: LLM-Powered End-to-End Automatic Library for Processing Social Science Queries on Unstructured Data"],"prefix":"10.14778","volume":"18","author":[{"given":"Chuxuan","family":"Hu","sequence":"first","affiliation":[{"name":"UIUC, Urbana, IL"}]},{"given":"Austin","family":"Peters","sequence":"additional","affiliation":[{"name":"University of Chicago, Chicago, IL"}]},{"given":"Daniel","family":"Kang","sequence":"additional","affiliation":[{"name":"UIUC, Urbana, IL"}]}],"member":"320","published-online":{"date-parts":[[2025,2,28]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"crossref","unstructured":"Faisal Alatawi Paras Sheth and Huan Liu. 2023. Quantifying the Echo Chamber Effect: An Embedding Distance-Based Approach. arXiv:2307.04668 [cs]","DOI":"10.1145\/3625007.3627731"},{"key":"e_1_2_1_2_1","volume-title":"https:\/\/huggingface.co\/gaussalgo\/T5-LM-Large-text2sql-spider Accessed","author":"Algorithmic Gauss","year":"2024","unstructured":"Gauss Algorithmic. 2023. https:\/\/huggingface.co\/gaussalgo\/T5-LM-Large-text2sql-spider Accessed: February 25, 2024."},{"key":"e_1_2_1_3_1","volume-title":"How to Ask for a Favor: A Case Study on the Success of Altruistic Requests. ArXiv abs\/1405.3282","author":"Althoff Tim","year":"2014","unstructured":"Tim Althoff, Cristian Danescu-Niculescu-Mizil, and Daniel Jurafsky. 2014. How to Ask for a Favor: A Case Study on the Success of Altruistic Requests. ArXiv abs\/1405.3282 (2014). https:\/\/api.semanticscholar.org\/CorpusID:8809599"},{"key":"e_1_2_1_4_1","unstructured":"Akari Asai Sara Evensen Behzad Golshan Alon Halevy Vivian Li Andrei Lopatenko Daniela Stepanov Yoshihiko Suhara Wang-Chiew Tan and Yinzhan Xu. 2018. HappyDB: A Corpus of 100 000 Crowdsourced Happy Moments. arXiv:1801.07746 [cs.CL]"},{"key":"e_1_2_1_5_1","volume-title":"Proceedings of the Sixth Workshop on Online Abuse and Harms (WOAH). Association for Computational Linguistics","author":"Ashida Mana","year":"2022","unstructured":"Mana Ashida and Mamoru Komachi. 2022. Towards Automatic Generation of Messages Countering Online Hate Speech and Microaggressions. In Proceedings of the Sixth Workshop on Online Abuse and Harms (WOAH). Association for Computational Linguistics, Seattle, Washington (Hybrid), 11--23. https:\/\/aclanthology.org\/2022.woah-1.2"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.emnlp-main.404"},{"key":"e_1_2_1_7_1","volume-title":"Smith","author":"Bamman David","year":"2013","unstructured":"David Bamman, Brendan O'Connor, and Noah A. Smith. 2013. Learning Latent Personas of Film Characters. In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Hinrich Schuetze, Pascale Fung, and Massimo Poesio (Eds.). Association for Computational Linguistics, Sofia, Bulgaria, 352--361. https:\/\/aclanthology.org\/P13-1035"},{"key":"e_1_2_1_8_1","doi-asserted-by":"crossref","unstructured":"Adithya Bhaskar Tushar Tomar Ashutosh Sathe and Sunita Sarawagi. 2023. Benchmarking and Improving Text-to-SQL Generation under Ambiguity. arXiv:2310.13659 [cs.CL]","DOI":"10.18653\/v1\/2023.emnlp-main.436"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.jocs.2010.12.007"},{"key":"e_1_2_1_10_1","volume-title":"Garnett (Eds.)","volume":"29","author":"Bolukbasi Tolga","year":"2016","unstructured":"Tolga Bolukbasi, Kai-Wei Chang, James Y Zou, Venkatesh Saligrama, and Adam T Kalai. 2016. Man is to Computer Programmer as Woman is to Home-maker? Debiasing Word Embeddings. In Advances in Neural Information Processing Systems, D. Lee, M. Sugiyama, U. Luxburg, I. Guyon, and R. Garnett (Eds.), Vol. 29. Curran Associates, Inc. https:\/\/proceedings.neurips.cc\/paper_files\/paper\/2016\/file\/a486cd07e4ac3d270571622f4f316ec5-Paper.pdf"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D19-1176"},{"key":"e_1_2_1_12_1","volume-title":"Levinson","author":"Brown Penelope","year":"1987","unstructured":"Penelope Brown and Stephen C. Levinson. 1987. Politeness: Some universals in language usage. Vol. 4. Cambridge University Press."},{"key":"e_1_2_1_13_1","unstructured":"Tom B. Brown Benjamin Mann Nick Ryder Melanie Subbiah Jared Kaplan Prafulla Dhariwal Arvind Neelakantan Pranav Shyam Girish Sastry Amanda Askell Sandhini Agarwal Ariel Herbert-Voss Gretchen Krueger Tom Henighan Rewon Child Aditya Ramesh Daniel M. Ziegler Jeffrey Wu Clemens Winter Christopher Hesse Mark Chen Eric Sigler Mateusz Litwin Scott Gray Benjamin Chess Jack Clark Christopher Berner Sam McCandlish Alec Radford Ilya Sutskever and Dario Amodei. 2020. Language Models are Few-Shot Learners. arXiv:2005.14165 [cs.CL]"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.emnlp-main.481"},{"key":"e_1_2_1_15_1","volume-title":"https:\/\/huggingface.co\/deepset\/roberta-base-squad2 Accessed","author":"Chan Branden","year":"2024","unstructured":"Branden Chan, Timo M\u00f6ller, Malte Pietsch, and Tanay Soni. 2023. https:\/\/huggingface.co\/deepset\/roberta-base-squad2 Accessed: June 23, 2024."},{"key":"e_1_2_1_16_1","unstructured":"Mark Chen Jerry Tworek Heewoo Jun Qiming Yuan Henrique Ponde de Oliveira Pinto Jared Kaplan Harri Edwards Yuri Burda Nicholas Joseph Greg Brockman Alex Ray Raul Puri Gretchen Krueger Michael Petrov Heidy Khlaaf Girish Sastry Pamela Mishkin Brooke Chan Scott Gray Nick Ryder Mikhail Pavlov Alethea Power Lukasz Kaiser Mohammad Bavarian Clemens Winter Philippe Tillet Felipe Petroski Such Dave Cummings Matthias Plappert Fotios Chantzis Elizabeth Barnes Ariel Herbert-Voss William Hebgen Guss Alex Nichol Alex Paino Nikolas Tezak Jie Tang Igor Babuschkin Suchir Balaji Shantanu Jain William Saunders Christopher Hesse Andrew N. Carr Jan Leike Josh Achiam Vedant Misra Evan Morikawa Alec Radford Matthew Knight Miles Brundage Mira Murati Katie Mayer Peter Welinder Bob McGrew Dario Amodei Sam McCandlish Ilya Sutskever and Wojciech Zaremba. 2021. Evaluating Large Language Models Trained on Code. arXiv:2107.03374 [cs.LG]"},{"key":"e_1_2_1_17_1","doi-asserted-by":"crossref","unstructured":"Minje Choi Ceren Budak Daniel M. Romero and David Jurgens. 2021. More than Meets the Tie: Examining the Role of Interpersonal Relationships in Social Networks. arXiv:2105.06038 [cs.SI]","DOI":"10.1609\/icwsm.v15i1.18045"},{"key":"e_1_2_1_18_1","volume-title":"Proceedings of the 33rd international conference on Very large data bases. 1045--1056","author":"Chu Eric","year":"2007","unstructured":"Eric Chu, Akanksha Baid, Ting Chen, AnHai Doan, and Jeffrey Naughton. 2007. A relational approach to incrementally extracting and querying structure in unstructured data. In Proceedings of the 33rd international conference on Very large data bases. 1045--1056."},{"key":"e_1_2_1_19_1","doi-asserted-by":"crossref","unstructured":"Eric Chu Prashanth Vijayaraghavan and Deb Roy. 2018. Learning Personas from Dialogue with Attentive Memory Networks. arXiv:1810.08717 [cs.CL]","DOI":"10.18653\/v1\/D18-1284"},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-1271"},{"key":"e_1_2_1_21_1","volume-title":"Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, Online.","author":"Chung Yi-Ling","year":"2021","unstructured":"Yi-Ling Chung, Serra Sinem Tekiro\u011flu, and Marco Guerini. 2021. Towards Knowledge-Grounded Counter Narrative Generation for Hate Speech. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, Online."},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1017\/S0269888900005476"},{"key":"e_1_2_1_23_1","volume-title":"Proceedings of WWW. 699--708","author":"Danescu-Niculescu-Mizil Cristian","year":"2012","unstructured":"Cristian Danescu-Niculescu-Mizil, Lillian Lee, Bo Pang, and Jon Kleinberg. 2012. Echoes of power: Language effects and power differences in social interaction. In Proceedings of WWW. 699--708."},{"key":"e_1_2_1_24_1","volume-title":"Proceedings of ACL.","author":"Danescu-Niculescu-Mizil Cristian","year":"2013","unstructured":"Cristian Danescu-Niculescu-Mizil, Moritz Sudhof, Dan Jurafsky, Jure Leskovec, and Christopher Potts. 2013. A computational approach to politeness with application to social factors. In Proceedings of ACL."},{"key":"e_1_2_1_25_1","volume-title":"Handbook of Computational Social Science","author":"De Veaux Richard D","year":"2021","unstructured":"Richard D De Veaux and Adam Eck. 2021. Machine Learning Methods for Computational Social Science. Handbook of Computational Social Science, Volume 2: Data Science, Statistical Modelling, and Machine Learning Methods (2021)."},{"key":"e_1_2_1_26_1","doi-asserted-by":"crossref","unstructured":"Dorottya Demszky Devyani Sharma Jonathan H. Clark Vinodkumar Prabhakaran and Jacob Eisenstein. 2021. Learning to Recognize Dialect Features. arXiv:2010.12707 [cs.CL]","DOI":"10.18653\/v1\/2021.naacl-main.184"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.naacl-main.105"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1146\/annurev-soc-121919-054621"},{"key":"e_1_2_1_29_1","doi-asserted-by":"crossref","unstructured":"Julian Martin Eisenschlos Syrine Krichene and Thomas M\u00fcller. 2020. Understanding tables with intermediate pre-training. arXiv:2010.00571 [cs.CL]","DOI":"10.18653\/v1\/2020.findings-emnlp.27"},{"key":"e_1_2_1_30_1","volume-title":"Munmun De Choudhury, and Diyi Yang","author":"ElSherief Mai","year":"2021","unstructured":"Mai ElSherief, Caleb Ziems, David Muchlinski, Vaishnavi Anupindi, Jordyn Seybolt, Munmun De Choudhury, and Diyi Yang. 2021. Latent Hatred: A Benchmark for Understanding Implicit Hate Speech. arXiv:2109.05322 [cs.CL]"},{"key":"e_1_2_1_31_1","article-title":"Secrecy by Stipulation","author":"Engstrom Nora Freeman","year":"2024","unstructured":"Nora Freeman Engstrom, David Freeman Engstrom, Jonah Gelbach, Austin Peters, and Aaron Schaffer-Neitz. 2024. Secrecy by Stipulation. Duke Law Journal (May 2 2024). https:\/\/papers.ssrn.com\/sol3\/papers.cfm?abstract_id=4811151","journal-title":"Duke Law Journal"},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.acl-long.250"},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0142390"},{"key":"e_1_2_1_34_1","unstructured":"Avrilia Floratou Fotis Psallidas Fuheng Zhao and et al. [n.d.]. NL2SQL is a solved problem... Not! https:\/\/www.cidrdb.org\/cidr2024\/papers\/p74-floratou.pdf"},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.14778\/3583140.3583165"},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.acl-long.222"},{"key":"e_1_2_1_37_1","volume-title":"Misinfo Reaction Frames: Reasoning about Readers' Reactions to News Headlines. ACL","author":"Gabriel Saadia","year":"2022","unstructured":"Saadia Gabriel, Skyler Hallinan, Maarten Sap, Pemi Nguyen, Franziska Roesner, Eunsol Choi, and Yejin Choi. 2022. Misinfo Reaction Frames: Reasoning about Readers' Reactions to News Headlines. ACL (2022)."},{"key":"e_1_2_1_38_1","volume-title":"Text-to-SQL Empowered by Large Language Models: A Benchmark Evaluation. CoRR abs\/2308.15363","author":"Gao Dawei","year":"2023","unstructured":"Dawei Gao, Haibin Wang, Yaliang Li, Xiuyu Sun, Yichen Qian, Bolin Ding, and Jingren Zhou. 2023. Text-to-SQL Empowered by Large Language Models: A Benchmark Evaluation. CoRR abs\/2308.15363 (2023)."},{"key":"e_1_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1073\/pnas.1720347115"},{"key":"e_1_2_1_40_1","volume-title":"https:\/\/huggingface.co\/google\/tapas-large-finetuned-wtq Accessed","year":"2024","unstructured":"Google. 2023. https:\/\/huggingface.co\/google\/tapas-large-finetuned-wtq Accessed: February 25, 2024."},{"key":"e_1_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1146\/annurev-polisci-053119-015921"},{"key":"e_1_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1073\/pnas.2110013119"},{"key":"e_1_2_1_43_1","doi-asserted-by":"crossref","unstructured":"Jiaqi Guo Zecheng Zhan Yan Gao Yan Xiao Jian-Guang Lou Ting Liu and Dongmei Zhang. 2019. Towards Complex Text-to-SQL in Cross-Domain Database with Intermediate Representation. arXiv:1905.08205 [cs.CL]","DOI":"10.18653\/v1\/P19-1444"},{"key":"e_1_2_1_44_1","doi-asserted-by":"crossref","unstructured":"E. Hatfield J.T. Cacioppo and R.L. Rapson. 1993. Emotional Contagion. Cambridge University Press. https:\/\/books.google.com\/books?id=BbA-BAAAQBAJ","DOI":"10.1017\/CBO9781139174138"},{"key":"e_1_2_1_45_1","volume-title":"Thomas M\u00fcller, Francesco Piccinno, and Julian Martin Eisenschlos.","author":"Herzig Jonathan","year":"2020","unstructured":"Jonathan Herzig, Pawe\u0142 Krzysztof Nowak, Thomas M\u00fcller, Francesco Piccinno, and Julian Martin Eisenschlos. 2020. TAPAS: Weakly Supervised Table Parsing via Pre-training. arXiv:2004.02349 [cs.IR]"},{"key":"e_1_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/W17-5102"},{"key":"e_1_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1137\/1.9781611978032.2"},{"key":"e_1_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.findings-emnlp.402"},{"key":"e_1_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.case-1.1"},{"key":"e_1_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/P14-1105"},{"key":"e_1_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P17-1167"},{"key":"e_1_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.1080\/10926488.2018.1434943"},{"key":"e_1_2_1_53_1","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/D14-1191"},{"key":"e_1_2_1_54_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.emnlp-main.1"},{"key":"e_1_2_1_55_1","doi-asserted-by":"crossref","unstructured":"Daniel Kang John Emmons Firas Abuzaid Peter Bailis and Matei Zaharia. 2017. NoScope: Optimizing Neural Network Queries over Video at Scale. arXiv:1703.02529 [cs.DB]","DOI":"10.14778\/3137628.3137664"},{"key":"e_1_2_1_56_1","doi-asserted-by":"publisher","DOI":"10.1145\/3514221.3517897"},{"key":"e_1_2_1_57_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00778-022-00776-8"},{"key":"e_1_2_1_58_1","doi-asserted-by":"publisher","DOI":"10.1073\/pnas.1320040111"},{"key":"e_1_2_1_59_1","unstructured":"Sumith Kulal Panupong Pasupat Kartik Chandra Mina Lee Oded Padon Alex Aiken and Percy Liang. 2019. SPoC: Search-based Pseudocode to Code. arXiv:1906.04908 [cs.LG]"},{"key":"e_1_2_1_60_1","doi-asserted-by":"crossref","unstructured":"David Lazer Alex Pentland Lada Adamic Sinan Aral Albert-L\u00e5szl\u00f3 Barab\u00e5si Devon Brewer Nicholas Christakis Noshir Contractor James Fowler Myron Gutmann et al. 2009. Computational social science. Science 323 5915 (2009) 721--723.","DOI":"10.1126\/science.1167742"},{"key":"e_1_2_1_61_1","doi-asserted-by":"publisher","DOI":"10.1126\/science.aaz8170"},{"key":"e_1_2_1_62_1","doi-asserted-by":"publisher","DOI":"10.1126\/science.aao2998"},{"key":"e_1_2_1_63_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.nlp4if-1.2"},{"key":"e_1_2_1_64_1","unstructured":"Jinyang Li Binyuan Hui Ge Qu Jiaxi Yang Binhua Li Bowen Li Bailin Wang Bowen Qin Ruiying Geng Nan Huo et al. 2024. Can llm already serve as a database interface? a big bench for large-scale database grounded text-to-sqls. Advances in Neural Information Processing Systems 36 (2024)."},{"key":"e_1_2_1_65_1","doi-asserted-by":"publisher","DOI":"10.1145\/3415190"},{"key":"e_1_2_1_66_1","doi-asserted-by":"crossref","unstructured":"Sha Li Heng Ji and Jiawei Han. 2021. Document-Level Event Argument Extraction by Conditional Generation. arXiv:2104.05919 [cs.CL]","DOI":"10.18653\/v1\/2021.naacl-main.69"},{"key":"e_1_2_1_67_1","doi-asserted-by":"publisher","DOI":"10.1145\/3543507.3583474"},{"key":"e_1_2_1_68_1","volume-title":"TAPEX: Table Pre-training via Learning a Neural SQL Executor. arXiv:2107.07653 [cs.CL] https:\/\/arxiv.org\/abs\/2107.07653","author":"Liu Qian","year":"2022","unstructured":"Qian Liu, Bei Chen, Jiaqi Guo, Morteza Ziyadi, Zeqi Lin, Weizhu Chen, and Jian-Guang Lou. 2022. TAPEX: Table Pre-training via Learning a Neural SQL Executor. arXiv:2107.07653 [cs.CL] https:\/\/arxiv.org\/abs\/2107.07653"},{"key":"e_1_2_1_69_1","unstructured":"Siyang Liu Chujie Zheng Orianna Demasi Sahand Sabour Yu Li Zhou Yu Yong Jiang and Minlie Huang. 2021. Towards Emotional Support Dialog Systems. arXiv:2106.01144 [cs.CL]"},{"key":"e_1_2_1_70_1","volume-title":"RoBERTa: A Robustly Optimized BERT Pretraining Approach. CoRR abs\/1907.11692","author":"Liu Yinhan","year":"2019","unstructured":"Yinhan Liu, Myle Ott, Naman Goyal, Jingfei Du, Mandar Joshi, Danqi Chen, Omer Levy, Mike Lewis, Luke Zettlemoyer, and Veselin Stoyanov. 2019. RoBERTa: A Robustly Optimized BERT Pretraining Approach. CoRR abs\/1907.11692 (2019). arXiv:1907.11692 http:\/\/arxiv.org\/abs\/1907.11692"},{"key":"e_1_2_1_71_1","volume-title":"Semantic-Driven Instance Generation for Table Question Answering. In International Conference on Database Systems for Advanced Applications. Springer, 3--18","author":"Ma Shuai","year":"2023","unstructured":"Shuai Ma, Wenbin Jiang, Xiang Ao, Meng Tian, Xinwei Feng, Yajuan Lyu, Qiaoqiao She, and Qing He. 2023. Semantic-Driven Instance Generation for Table Question Answering. In International Conference on Database Systems for Advanced Applications. Springer, 3--18."},{"key":"e_1_2_1_72_1","doi-asserted-by":"crossref","unstructured":"Binny Mathew Anurag Illendula Punyajoy Saha Soumya Sarkar Pawan Goyal and Animesh Mukherjee. 2020. Hate begets Hate: A Temporal Study of Hate Speech. arXiv:1909.10966 [cs.SI]","DOI":"10.1145\/3415163"},{"key":"e_1_2_1_73_1","doi-asserted-by":"publisher","DOI":"10.1007\/s12108-015-9291-8"},{"key":"e_1_2_1_74_1","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1037\/emo0000703","article-title":"Emotion regulation","volume":"20","author":"McRae K.","year":"2020","unstructured":"K. McRae and J. J. Gross. 2020. Emotion regulation. Emotion 20, 1 (2020), 1--9.","journal-title":"Emotion"},{"key":"e_1_2_1_75_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2023.acl-long.845"},{"key":"e_1_2_1_76_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/S16-1003"},{"key":"e_1_2_1_77_1","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/D14-1215"},{"key":"e_1_2_1_78_1","volume-title":"https:\/\/chat.openai.com\/ Accessed","author":"AI.","year":"2024","unstructured":"OpenAI. 2023. https:\/\/chat.openai.com\/ Accessed: October 24, 2024."},{"key":"e_1_2_1_79_1","volume-title":"https:\/\/platform.openai.com\/docs\/models\/gpt-4-and-gpt-4-turbo Accessed","author":"AI.","year":"2024","unstructured":"OpenAI. 2023. https:\/\/platform.openai.com\/docs\/models\/gpt-4-and-gpt-4-turbo Accessed: October 24, 2024."},{"key":"e_1_2_1_80_1","volume-title":"https:\/\/platform.openai.com\/docs\/guides\/function-calling\/supported-models Accessed","author":"AI.","year":"2024","unstructured":"OpenAI. 2023. https:\/\/platform.openai.com\/docs\/guides\/function-calling\/supported-models Accessed: October 24, 2024."},{"key":"e_1_2_1_81_1","volume-title":"https:\/\/platform.openai.com\/examples\/default-sql-translate Accessed","author":"AI.","year":"2024","unstructured":"OpenAI. 2023. https:\/\/platform.openai.com\/examples\/default-sql-translate Accessed: October 24, 2024."},{"key":"e_1_2_1_82_1","unstructured":"OpenAI. 2024. https:\/\/openai.com\/pricing"},{"key":"e_1_2_1_83_1","volume-title":"Compositional Semantic Parsing on Semi-Structured Tables. CoRR abs\/1508.00305","author":"Pasupat Panupong","year":"2015","unstructured":"Panupong Pasupat and Percy Liang. 2015. Compositional Semantic Parsing on Semi-Structured Tables. CoRR abs\/1508.00305 (2015). arXiv:1508.00305 http:\/\/arxiv.org\/abs\/1508.00305"},{"key":"e_1_2_1_84_1","doi-asserted-by":"publisher","DOI":"10.1145\/2818346.2820758"},{"key":"e_1_2_1_85_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/N19-1128"},{"key":"e_1_2_1_86_1","volume-title":"Din-sql: Decomposed in-context learning of text-to-sql with self-correction. Advances in Neural Information Processing Systems 36","author":"Pourreza Mohammadreza","year":"2024","unstructured":"Mohammadreza Pourreza and Davood Rafiei. 2024. Din-sql: Decomposed in-context learning of text-to-sql with self-correction. Advances in Neural Information Processing Systems 36 (2024)."},{"key":"e_1_2_1_87_1","unstructured":"Marcelo O. R. Prates Pedro H. C. Avelar and Luis Lamb. 2019. Assessing Gender Bias in Machine Translation - A Case Study with Google Translate. arXiv:1809.02208 [cs.CY] https:\/\/arxiv.org\/abs\/1809.02208"},{"key":"e_1_2_1_88_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P17-1068"},{"key":"e_1_2_1_89_1","unstructured":"Bowen Qin Binyuan Hui Lihan Wang Min Yang Jinyang Li Binhua Li Ruiying Geng Rongyu Cao Jian Sun Luo Si et al. 2022. A survey on text-to-sql parsing: Concepts methods and future directions. arXiv preprint arXiv:2208.13629 (2022)."},{"key":"e_1_2_1_90_1","volume-title":"Liu","author":"Raffel Colin","year":"2023","unstructured":"Colin Raffel, Noam Shazeer, Adam Roberts, Katherine Lee, Sharan Narang, Michael Matena, Yanqi Zhou, Wei Li, and Peter J. Liu. 2023. Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer. arXiv:1910.10683 [cs.LG]"},{"key":"e_1_2_1_91_1","doi-asserted-by":"crossref","unstructured":"Pranav Rajpurkar Robin Jia and Percy Liang. 2018. Know What You Don't Know: Unanswerable Questions for SQuAD. arXiv:1806.03822 [cs.CL] https:\/\/arxiv.org\/abs\/1806.03822","DOI":"10.18653\/v1\/P18-2124"},{"key":"e_1_2_1_92_1","volume-title":"https:\/\/huggingface.co\/mrm8488\/t5-base-finetuned-wikiSQL Accessed","author":"Romero Manuel","year":"2024","unstructured":"Manuel Romero. 2023. https:\/\/huggingface.co\/mrm8488\/t5-base-finetuned-wikiSQL Accessed: February 25, 2024."},{"key":"e_1_2_1_93_1","volume-title":"Journal of Traumatic Stress 13 (10","author":"Rothbaum Barbara","year":"2012","unstructured":"Barbara Rothbaum, Elizabeth Meadows, and Patricia Resick. 2012. Cognitive-Behavioral Therapy. Journal of Traumatic Stress 13 (10 2012)."},{"key":"e_1_2_1_94_1","volume-title":"Effective treatments for PTSD: Practice guidelines from the International Society for Traumatic Stress Studies","author":"Rothbaum Barbara O","unstructured":"Barbara O Rothbaum, Elizabeth A Meadows, Patricia Resick, and David W Foy. 2000. Cognitive-behavioral therapy. In Effective treatments for PTSD: Practice guidelines from the International Society for Traumatic Stress Studies, Edna B Foa, Terence M Keane, and Matthew J Friedman (Eds.). The Guilford Press, 320--325."},{"key":"e_1_2_1_95_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-1163"},{"key":"e_1_2_1_96_1","unstructured":"Maarten Sap Saadia Gabriel Lianhui Qin Dan Jurafsky Noah A. Smith and Yejin Choi. 2020. Social Bias Frames: Reasoning about Social and Power Implications of Language. arXiv:1911.03891 [cs.CL]"},{"key":"e_1_2_1_97_1","unstructured":"Maarten Sap Saadia Gabriel Lianhui Qin Dan Jurafsky Noah A Smith and Yejin Choi. 2020. Social Bias Frames: Reasoning about Social and Power Implications of Language. In ACL."},{"key":"e_1_2_1_98_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.acl-main.178"},{"key":"e_1_2_1_99_1","doi-asserted-by":"publisher","DOI":"10.1073\/pnas.2211715119"},{"key":"e_1_2_1_100_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D18-1404"},{"key":"e_1_2_1_101_1","volume-title":"https:\/\/scale.com\/rapid Accessed","year":"2024","unstructured":"Scale. [n.d.]. https:\/\/scale.com\/rapid Accessed: October 24, 2024."},{"key":"e_1_2_1_102_1","volume-title":"https:\/\/case.law\/ Accessed","author":"School Harvard Law","year":"2024","unstructured":"Harvard Law School. 2023. https:\/\/case.law\/ Accessed: October 24, 2024."},{"key":"e_1_2_1_103_1","doi-asserted-by":"crossref","unstructured":"Ashish Sharma Adam S. Miner David C. Atkins and Tim Althoff. 2020. A Computational Approach to Understanding Empathy Expressed in Text-Based Mental Health Support. arXiv:2009.08441 [cs.CL]","DOI":"10.18653\/v1\/2020.emnlp-main.425"},{"key":"e_1_2_1_104_1","volume-title":"Compositional generalization and natural language variation: Can a semantic parsing approach handle both? arXiv preprint arXiv:2010.12725","author":"Shaw Peter","year":"2020","unstructured":"Peter Shaw, Ming-Wei Chang, Panupong Pasupat, and Kristina Toutanova. 2020. Compositional generalization and natural language variation: Can a semantic parsing approach handle both? arXiv preprint arXiv:2010.12725 (2020)."},{"key":"e_1_2_1_105_1","doi-asserted-by":"publisher","DOI":"10.1162\/coli_a_00347"},{"key":"e_1_2_1_106_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.acl-long.369"},{"key":"e_1_2_1_107_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.acl-main.742"},{"key":"e_1_2_1_108_1","doi-asserted-by":"publisher","unstructured":"Chenhao Tan Vlad Niculae Cristian Danescu-Niculescu-Mizil and Lillian Lee. 2016. Winning Arguments: Interaction Dynamics and Persuasion Strategies in Good-faith Online Discussions. In Proceedings of the 25th International Conference on World Wide Web (Montr\u00e9al Qu\u00e9bec Canada) (WWW '16). International World Wide Web Conferences Steering Committee Republic and Canton of Geneva CHE 613--624. 10.1145\/2872427.2883081","DOI":"10.1145\/2872427.2883081"},{"key":"e_1_2_1_109_1","doi-asserted-by":"publisher","DOI":"10.1145\/3398069"},{"key":"e_1_2_1_110_1","doi-asserted-by":"publisher","DOI":"10.1177\/0146167208318067"},{"key":"e_1_2_1_111_1","doi-asserted-by":"crossref","unstructured":"Bing Wang Yan Gao Zhoujun Li and Jian-Guang Lou. 2023. Know What I don't Know: Handling Ambiguous and Unanswerable Questions for Text-to-SQL. arXiv:2212.08902 [cs.CL]","DOI":"10.18653\/v1\/2023.findings-acl.352"},{"key":"e_1_2_1_112_1","doi-asserted-by":"crossref","unstructured":"Bailin Wang Richard Shin Xiaodong Liu Oleksandr Polozov and Matthew Richardson. 2021. RAT-SQL: Relation-Aware Schema Encoding and Linking for Text-to-SQL Parsers. arXiv:1911.04942 [cs.CL]","DOI":"10.18653\/v1\/2020.acl-main.677"},{"key":"e_1_2_1_113_1","doi-asserted-by":"publisher","DOI":"10.14778\/3415478.3415541"},{"key":"e_1_2_1_114_1","volume-title":"Chi, Quoc Le, and Denny Zhou","author":"Wei Jason","year":"2023","unstructured":"Jason Wei, Xuezhi Wang, Dale Schuurmans, Maarten Bosma, Brian Ichter, Fei Xia, Ed Chi, Quoc Le, and Denny Zhou. 2023. Chain-of-Thought Prompting Elicits Reasoning in Large Language Models. arXiv:2201.11903 [cs.CL]"},{"key":"e_1_2_1_115_1","doi-asserted-by":"publisher","unstructured":"Orion Weller and Kevin Seppi. 2019. Humor Detection: A Transformer Gets the Last Laugh. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP) Kentaro Inui Jing Jiang Vincent Ng and Xiaojun Wan (Eds.). Association for Computational Linguistics Hong Kong China 3621--3625. 10.18653\/v1\/D19-1372","DOI":"10.18653\/v1\/D19-1372"},{"key":"e_1_2_1_116_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/N19-1364"},{"key":"e_1_2_1_117_1","volume-title":"CS 224C: NLP for Computational Social Science","author":"Yang Diyi","unstructured":"Diyi Yang, Kaitlyn Zhou, and Wenna Qin. 2023. CS 224C: NLP for Computational Social Science. Stanford University. https:\/\/web.stanford.edu\/class\/cs224c\/"},{"key":"e_1_2_1_118_1","doi-asserted-by":"crossref","unstructured":"Tao Yu Zifan Li Zilin Zhang Rui Zhang and Dragomir Radev. 2018. TypeSQL: Knowledge-based Type-Aware Neural Text-to-SQL Generation. arXiv:1804.09769 [cs.CL]","DOI":"10.18653\/v1\/N18-2093"},{"key":"e_1_2_1_119_1","doi-asserted-by":"publisher","unstructured":"Tao Yu Rui Zhang Kai Yang Michihiro Yasunaga Dongxu Wang Zifan Li James Ma Irene Li Qingning Yao Shanelle Roman Zilin Zhang and Dragomir Radev. 2018. Spider: A Large-Scale Human-Labeled Dataset for Complex and Cross-Domain Semantic Parsing and Text-to-SQL Task. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing Ellen Riloff David Chiang Julia Hockenmaier and Jun'ichi Tsujii (Eds.). Association for Computational Linguistics Brussels Belgium 3911--3921. 10.18653\/v1\/D18-1425","DOI":"10.18653\/v1\/D18-1425"},{"key":"e_1_2_1_120_1","volume-title":"Why Do You Feel This Way? Summarizing Triggers of Emotions in Social Media Posts. ArXiv abs\/2210.12531","author":"Zhan Hongli","year":"2022","unstructured":"Hongli Zhan, Tiberiu Sosea, Cornelia Caragea, and Junyi Jessy Li. 2022. Why Do You Feel This Way? Summarizing Triggers of Emotions in Social Media Posts. ArXiv abs\/2210.12531 (2022). https:\/\/api.semanticscholar.org\/CorpusID:253098848"},{"key":"e_1_2_1_121_1","volume-title":"Proceedings of the International AAAI Conference on Web and Social Media","author":"Zhang Amy X.","year":"2017","unstructured":"Amy X. Zhang, Bryan Culbertson, and Praveen K. Paritosh. 2017. Characterizing Online Discussion Using Coarse Discourse Sequences. Proceedings of the International AAAI Conference on Web and Social Media (2017). https:\/\/api.semanticscholar.org\/CorpusID:35696952"},{"key":"e_1_2_1_122_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P18-1125"},{"key":"e_1_2_1_123_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.bdr.2020.100145"},{"key":"e_1_2_1_124_1","doi-asserted-by":"crossref","unstructured":"Yi Zhang Jan Deriu George Katsogiannis-Meimarakis Catherine Kosten Georgia Koutrika and Kurt Stockinger. 2023. ScienceBenchmark: A Complex Real-World Benchmark for Evaluating Natural Language to SQL Systems. arXiv:2306.04743 [cs.DB]","DOI":"10.14778\/3636218.3636225"},{"key":"e_1_2_1_125_1","volume-title":"Seq2SQL: Generating Structured Queries from Natural Language using Reinforcement Learning. CoRR abs\/1709.00103","author":"Zhong Victor","year":"2017","unstructured":"Victor Zhong, Caiming Xiong, and Richard Socher. 2017. Seq2SQL: Generating Structured Queries from Natural Language using Reinforcement Learning. CoRR abs\/1709.00103 (2017)."},{"key":"e_1_2_1_126_1","doi-asserted-by":"crossref","unstructured":"Caleb Ziems William Held Omar Shaikh Jiaao Chen Zhehao Zhang and Diyi Yang. 2023. Can Large Language Models Transform Computational Social Science? arXiv:2305.03514 [cs.CL]","DOI":"10.1162\/coli_a_00502"},{"key":"e_1_2_1_127_1","doi-asserted-by":"crossref","unstructured":"Caleb Ziems William Held Jingfeng Yang Jwala Dhamala Rahul Gupta and Diyi Yang. 2023. Multi-VALUE: A Framework for Cross-Dialectal English NLP. arXiv:2212.08011 [cs.CL]","DOI":"10.18653\/v1\/2023.acl-long.44"},{"key":"e_1_2_1_128_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.acl-long.257"}],"container-title":["Proceedings of the VLDB Endowment"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.14778\/3705829.3705843","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,2,28]],"date-time":"2025-02-28T23:22:18Z","timestamp":1740784938000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.14778\/3705829.3705843"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,10]]},"references-count":128,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2024,10]]}},"alternative-id":["10.14778\/3705829.3705843"],"URL":"https:\/\/doi.org\/10.14778\/3705829.3705843","relation":{},"ISSN":["2150-8097"],"issn-type":[{"value":"2150-8097","type":"print"}],"subject":[],"published":{"date-parts":[[2024,10]]},"assertion":[{"value":"2025-02-28","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}