{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,20]],"date-time":"2025-11-20T19:02:24Z","timestamp":1763665344387,"version":"3.37.3"},"reference-count":73,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2023,12,5]],"date-time":"2023-12-05T00:00:00Z","timestamp":1701734400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2023,12,5]],"date-time":"2023-12-05T00:00:00Z","timestamp":1701734400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100001659","name":"Deutsche Forschungsgemeinschaft","doi-asserted-by":"crossref","award":["MA 5030\/3-1"],"award-info":[{"award-number":["MA 5030\/3-1"]}],"id":[{"id":"10.13039\/501100001659","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/501100002347","name":"Bundesministerium f\u00fcr Bildung und Forschung","doi-asserted-by":"publisher","award":["16DHB4009"],"award-info":[{"award-number":["16DHB4009"]}],"id":[{"id":"10.13039\/501100002347","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Empir Software Eng"],"published-print":{"date-parts":[[2024,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Software analytics integrated with complex databases can deliver project intelligence into the hands of software engineering (SE) experts for satisfying their information needs. A new and promising machine learning technique known as text-to-SQL automatically extracts information for users of complex databases without the need to fully understand the database structure nor the accompanying query language. Users pose their request as so-called natural language utterance, i.e., question. 
Our goal was to evaluate the performance and applicability of text-to-SQL approaches on data derived from tools that software engineers typically use in their workflow to satisfy their information needs. We carefully selected and discussed five seminal and state-of-the-art text-to-SQL approaches and conducted a comparative assessment using the large-scale, cross-domain Spider dataset and the SE domain-specific SEOSS-Queries dataset. Furthermore, we studied via a survey how well SE professionals perform in satisfying their information needs and how they perceive text-to-SQL approaches. For the best-performing approach, we observe a high accuracy of 94% in query prediction when training specifically on SE data. This accuracy is almost independent of the query\u2019s complexity. At the same time, we observe that SE professionals have substantial deficits in satisfying their information needs directly via SQL queries. Furthermore, SE professionals are open to using text-to-SQL approaches in their daily work, considering them helpful and less time-consuming. 
We conclude that state-of-the-art text-to-SQL approaches are applicable in SE practice for day-to-day information needs.<\/jats:p>","DOI":"10.1007\/s10664-023-10374-z","type":"journal-article","created":{"date-parts":[[2023,12,5]],"date-time":"2023-12-05T12:02:06Z","timestamp":1701777726000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":3,"title":["Assessing the utility of text-to-SQL approaches for satisfying software developer information needs"],"prefix":"10.1007","volume":"29","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-1554-7239","authenticated-orcid":false,"given":"Mihaela","family":"Tomova","sequence":"first","affiliation":[]},{"given":"Martin","family":"Hofmann","sequence":"additional","affiliation":[]},{"given":"Constantin","family":"H\u00fctterer","sequence":"additional","affiliation":[]},{"given":"Patrick","family":"M\u00e4der","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2023,12,5]]},"reference":[{"issue":"3","key":"10374_CR1","doi-asserted-by":"publisher","first-page":"1834","DOI":"10.1007\/s10664-019-09788-5","volume":"25","author":"A Abdellatif","year":"2020","unstructured":"Abdellatif A, Badran K, Shihab E (2020) Msrbot: Using bots to answer questions from software repositories. Empir Softw Eng 25(3):1834\u20131863","journal-title":"Empir Softw Eng"},{"key":"10374_CR2","unstructured":"Apache Pig project. https:\/\/pig.apache.org\/. Accessed 12 January 2023"},{"key":"10374_CR3","unstructured":"Evaluation script spider (2023) https:\/\/github.com\/taoyds\/spider. Accessed 12 January 2023"},{"key":"10374_CR4","unstructured":"Assembla (2023) https:\/\/get.assembla.com\/. Accessed 12 January 2023"},{"key":"10374_CR5","unstructured":"Atlassian JIRA (2023) https:\/\/www.atlassian.com\/de\/software\/jira. 
Accessed 12 January 2023"},{"key":"10374_CR6","unstructured":"Bahdanau D, Cho K, Bengio Y (2015) Neural machine translation by jointly learning to align and translate. In: ICLR"},{"key":"10374_CR7","doi-asserted-by":"crossref","unstructured":"Begel A, Zimmermann T (2014) Analyze this! 145 questions for data scientists in software engineering. In: ICSE, pp. 12\u201323. ACM","DOI":"10.1145\/2568225.2568233"},{"key":"10374_CR8","doi-asserted-by":"crossref","unstructured":"Bertram D, Voida A, Greenberg S, Walker RJ (2010) Communication, collaboration, and bugs: the social nature of issue tracking in small, collocated teams. In: CSCW, pp. 291\u2013300. ACM","DOI":"10.1145\/1718918.1718972"},{"key":"10374_CR9","doi-asserted-by":"publisher","unstructured":"Cao R, Chen L, Chen Z, Zhao Y, Zhu S, Yu K (2021) LGESQL: Line graph enhanced textto- SQL model with mixed local and non-local relations. In: Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pp. 2541\u20132555. Association for Computational Linguistics, Online. https:\/\/doi.org\/10.18653\/v1\/2021.acllong.198. https:\/\/aclanthology.org\/2021.acl-long.198","DOI":"10.18653\/v1\/2021.acllong.198"},{"issue":"12","key":"10374_CR10","first-page":"4818","volume":"48","author":"M Ciniselli","year":"2022","unstructured":"Ciniselli M, Cooper N, Pascarella L, Mastropaolo A, Aghajani E, Poshyvanyk D, Penta MD, Bavota G (2022) An empirical study on the usage of transformer models for code completion. IEEE Trans Software Eng 48(12):4818\u20134837","journal-title":"IEEE Trans Software Eng"},{"key":"10374_CR11","unstructured":"Clark K, Luong M, Le QV, Manning CD (2020) ELECTRA: pre-training text encoders as discriminators rather than generators. 
In: ICLR OpenReview net"},{"issue":"6","key":"10374_CR12","doi-asserted-by":"publisher","first-page":"377","DOI":"10.1145\/362384.362685","volume":"13","author":"EF Codd","year":"1970","unstructured":"Codd EF (1970) A relational model of data for large shared data banks. Commun. ACM 13(6):377\u2013387","journal-title":"Commun. ACM"},{"key":"10374_CR13","unstructured":"Devlin J, Chang M, Lee K, Toutanova K (2019) BERT: pre-training of deep bidirectional transformers for language understanding. In: NAACL-HLT (1):4171\u20134186. Association for Computational Linguistics"},{"key":"10374_CR14","doi-asserted-by":"crossref","unstructured":"Fritz T, Murphy GC (2010) Using information fragments to answer the questions developers ask. In: ICSE (1):175\u2013184. ACM","DOI":"10.1145\/1806799.1806828"},{"key":"10374_CR15","unstructured":"Git (2023) https:\/\/git-scm.com\/. Accessed 12 January 2023"},{"key":"10374_CR16","unstructured":"Github (2023) https:\/\/github.com\/. Accessed 12 January 2023"},{"key":"10374_CR17","unstructured":"GitHub Copilot (2023) https:\/\/github.com\/features\/copilot. Accessed 18 Juli 2023"},{"issue":"1","key":"10374_CR18","doi-asserted-by":"publisher","first-page":"67","DOI":"10.1109\/MS.2009.10","volume":"26","author":"MW Godfrey","year":"2009","unstructured":"Godfrey MW, Hassan AE, Herbsleb JD, Murphy GC, Robillard MP, Devanbu PT, Mockus A, Perry DE, Notkin D (2009) Future of mining software archives: A roundtable. IEEE Softw 26(1):67\u201370. https:\/\/doi.org\/10.1109\/MS.2009.10","journal-title":"IEEE Softw"},{"key":"10374_CR19","doi-asserted-by":"publisher","unstructured":"Green BF, Wolf AK, Chomsky C, Laughery K (1961) Baseball: An automatic question answerer. In: Papers Presented at the May 9-11, 1961, Western Joint IRE-AIEE-ACM Computer Conference, IRE-AIEE-ACM\u201961 (Western), vol. 19, pp. 219-224. Association for Computing Machinery, New York, NY, USA. 
https:\/\/doi.org\/10.1145\/1460690.1460714","DOI":"10.1145\/1460690.1460714"},{"key":"10374_CR20","doi-asserted-by":"crossref","unstructured":"Guo J, Zhan Z, Gao Y, Xiao Y, Lou J, Liu T, Zhang D (2019) Towards complex text-tosql in cross-domain database with intermediate representation. In: ACL (1):4524\u20134535. Association for Computational Linguistics","DOI":"10.18653\/v1\/P19-1444"},{"key":"10374_CR21","doi-asserted-by":"crossref","unstructured":"Hassan AE (2006) Mining software repositories to assist developers and support managers. In: ICSM, pp. 339-342. IEEE Computer Society","DOI":"10.1109\/ICSM.2006.38"},{"key":"10374_CR22","doi-asserted-by":"publisher","unstructured":"Hassan AE (2008) The road ahead for mining software repositories. In: 2008 IEEE International Conference on Software Maintenance 48\u201357. https:\/\/doi.org\/10.1109\/FOSM.2008.4659248","DOI":"10.1109\/FOSM.2008.4659248"},{"issue":"8","key":"10374_CR23","doi-asserted-by":"publisher","first-page":"1735","DOI":"10.1162\/neco.1997.9.8.1735","volume":"9","author":"S Hochreiter","year":"1997","unstructured":"Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9(8):1735\u20131780","journal-title":"Neural Comput"},{"issue":"3","key":"10374_CR24","first-page":"848","volume":"48","author":"M Janke","year":"2022","unstructured":"Janke M, M\u00e4der P (2022) Graph based mining of code change patterns from version control commits. IEEE Trans Software Eng 48(3):848\u2013863","journal-title":"IEEE Trans Software Eng"},{"key":"10374_CR25","doi-asserted-by":"crossref","unstructured":"Kajiura T, Souma N, Sato M, Takahashi M, Kuramitsu K (2022) An additional approach to pre-trained code model with multilingual natural languages. In: APSEC 580\u2013581. 
IEEE","DOI":"10.1109\/APSEC57359.2022.00090"},{"key":"10374_CR26","doi-asserted-by":"publisher","unstructured":"Keivanloo I, Forbes C, Hmood A, Erfani M, Neal C, Peristerakis G, Rilling J (2012) A linked data platform for mining software repositories. In: 2012 9th IEEE Working Conference on Mining Software Repositories (MSR) 32\u201335. https:\/\/doi.org\/10.1109\/MSR.2012.6224296","DOI":"10.1109\/MSR.2012.6224296"},{"key":"10374_CR27","doi-asserted-by":"crossref","unstructured":"Ko AJ, DeLine R, Venolia G (2007) Information needs in collocated software development teams. In: ICSE 344\u2013353. IEEE Computer Society","DOI":"10.1109\/ICSE.2007.45"},{"key":"10374_CR28","unstructured":"Kojima T, Gu SS, Reid M, Matsuo Y, Iwasawa Y (2022) Large language models are zero-shot reasoners. In: NeurIPS. http:\/\/papers.nips.cc\/paper files\/paper\/2022\/hash\/8bb0d291acd4acf06ef112099c16f326-Abstract-Conference.html"},{"key":"10374_CR29","doi-asserted-by":"publisher","unstructured":"Kolovos D, Neubauer P, Barmpis K, Matragkas N, Paige R (2019) Crossflow: A framework for distributed mining of software repositories. In: 2019 IEEE\/ACM 16th International Conference on Mining Software Repositories (MSR) 155\u2013159. https:\/\/doi.org\/10.1109\/MSR.2019.00032","DOI":"10.1109\/MSR.2019.00032"},{"key":"10374_CR30","doi-asserted-by":"crossref","unstructured":"Kudo T, Richardson J (2018) Sentencepiece: A simple and language independent subword tokenizer and detokenizer for neural text processing. In: EMNLP (Demonstration) 66\u201371. Association for Computational Linguistics","DOI":"10.18653\/v1\/D18-2012"},{"key":"10374_CR31","unstructured":"Lee C, Gottschlich J, Roth D (2021) Toward code generation: A survey and lessons from semantic parsing. 
CoRR arXiv:2105.03317"},{"key":"10374_CR32","doi-asserted-by":"crossref","unstructured":"Lin J, Liu Y, Guo J, Cleland-Huang J, Goss W, Liu W, Lohar S, Monaikul N, Rasin A (2017) Tiqi: a natural language interface for querying software project data. In: ASE 973\u2013977. IEEE Computer Society","DOI":"10.1109\/ASE.2017.8115714"},{"key":"10374_CR33","unstructured":"Liu Y, Ott M, Goyal N, Du J, Joshi M, Chen D Levy O, Lewis M, Zettlemoyer L, Stoyanov V (2019) Roberta: A robustly optimized BERT pretraining approach. CoRR arXiv:1907.11692"},{"key":"10374_CR34","unstructured":"Lohar S, Cleland-Huang J, Rasin A, M\u00e4der P (2015) Live study proposal: Collecting natural language trace queries. In: R. Matulevicius, T. Weyer, P. Forbrig, A. Herrmann, M. Daneva, J. D\u00d6rr, A. Hoffmann, A. Kalenborn, M. Trapp, G. Herzwurm, W. Pietsch, A. Lenz, S. Schockert, M. Daun, C. Palomares, I. Morales-Ramirez, B. Tenbergen, B. Paech, R.J. Wieringa, E. Knauss, A. Perini (eds.) Joint Proceedings of REFSQ-2015 Workshops, Research Method Track, and Poster Track co-located with the 21st International Conference on Requirements Engineering: Foundation for Software Quality (REFSQ 2015), Essen, Germany, March 23, 2015, CEUR Workshop Proceedings 1342:207\u2013 210. CEUR-WS.org. http:\/\/ceur-ws.org\/Vol-1342\/preface-RMT.pdf"},{"issue":"4","key":"10374_CR35","doi-asserted-by":"publisher","first-page":"60","DOI":"10.1145\/166635.166656","volume":"22","author":"H Lu","year":"1993","unstructured":"Lu H, Chan HC, Wei KK (1993) A survey on usage of SQL. SIGMOD Rec 22(4):60\u201365","journal-title":"SIGMOD Rec"},{"issue":"3","key":"10374_CR36","doi-asserted-by":"publisher","first-page":"537","DOI":"10.1007\/s10270-012-0237-0","volume":"12","author":"P M\u00e4der","year":"2013","unstructured":"M\u00e4der P, Cleland-Huang J (2013) A visual language for modeling and executing traceability queries. Softw. Syst. Model. 12(3):537\u2013553. 
https:\/\/doi.org\/10.1007\/s10270-012-0237-0","journal-title":"Softw. Syst. Model."},{"key":"10374_CR37","doi-asserted-by":"crossref","unstructured":"Mastropaolo A, Pascarella L, Bavota G (2022) Using deep learning to generate complete log statements. In: ICSE 2279\u20132290. ACM","DOI":"10.1145\/3510003.3511561"},{"key":"10374_CR38","unstructured":"Maven (2023) https:\/\/maven.apache.org\/. Accessed 12 January 2023"},{"key":"10374_CR39","doi-asserted-by":"publisher","first-page":"35","DOI":"10.1162\/tacl_a_00446","volume":"10","author":"L Nan","year":"2022","unstructured":"Nan L, Hsieh C, Mao Z, Lin XV, Verma N, Zhang R, Kryscinski W, Schoelkopf H, Kong R, Tang X, Mutuma M, Rosand B, Trindade I, Bandaru R, Cunningham J, Xiong C, Radev DR (2022) Fetaqa: Free-form table question answering. Trans Assoc Comput Linguistics 10:35\u201349","journal-title":"Trans Assoc Comput Linguistics"},{"key":"10374_CR40","unstructured":"OpenAI ChatGPT (2023) https:\/\/openai.com\/blog\/chatgpt. Accessed 18 Juli 2023"},{"key":"10374_CR41","doi-asserted-by":"crossref","unstructured":"Pennington J, Socher R, Manning CD (2014) Glove: Global vectors for word representation. In: EMNLP 1532\u20131543. ACL","DOI":"10.3115\/v1\/D14-1162"},{"key":"10374_CR42","doi-asserted-by":"crossref","unstructured":"Portillo-Rodr\u00edguez J, Vizca\u00edno A, Ebert C, Piattini M (2010) Tools to support global software development processes: A survey. In: ICGSE 13\u201322. IEEE Computer Society","DOI":"10.1109\/ICGSE.2010.12"},{"key":"10374_CR43","unstructured":"Raffel C, Shazeer N, Roberts A, Lee K, Narang S, Matena M, Zhou Y, Li W, Liu PJ (2020) Exploring the limits of transfer learning with a unified text-totext transformer. Journal of Machine Learning Research 21(140):1\u201367. 
http:\/\/jmlr.org\/papers\/v21\/20-074.html"},{"key":"10374_CR44","doi-asserted-by":"publisher","DOI":"10.1016\/j.dib.2019.104005","volume":"25","author":"M Rath","year":"2019","unstructured":"Rath M, M\u00e4der P (2019) The seoss 33 dataset - requirements, bug reports, code history, and trace links for entire projects. Data in Brief 25:104005. https:\/\/doi.org\/10.1016\/j.dib.2019.104005","journal-title":"Data in Brief"},{"key":"10374_CR45","doi-asserted-by":"publisher","unstructured":"Rath M, Rempel P, M\u00e4der, P (2017) The ilmseven dataset. In: 2017 IEEE 25th International Requirements Engineering Conference (RE) 516\u2013519. https:\/\/doi.org\/10.1109\/RE.2017.18","DOI":"10.1109\/RE.2017.18"},{"key":"10374_CR46","doi-asserted-by":"crossref","unstructured":"Rath M, Rendall J, Guo JLC, Cleland-Huang J, M\u00e4der P (2018) Traceability in the wild: automatically augmenting incomplete trace links. In: ICSE 834\u2013845. ACM","DOI":"10.1145\/3180155.3180207"},{"key":"10374_CR47","unstructured":"Requirements management products (2023) https:\/\/www.ibm.com\/dede\/products\/requirements-management. Online; accessed 12 January 2023"},{"key":"10374_CR48","doi-asserted-by":"crossref","unstructured":"Rubin O, Berant J (2021) Smbop: Semi-autoregressive bottom-up semantic parsing. In: NAACL-HLT 311\u2013324. Association for Computational Linguistics","DOI":"10.18653\/v1\/2021.naacl-main.29"},{"key":"10374_CR49","doi-asserted-by":"crossref","unstructured":"Scholak T, Schucher N, Bahdanau D (2021) PICARD: Parsing incrementally for constrained auto-regressive decoding from language models. In: Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, pp. 9895\u20139901. Association for Computational Linguistics. 
https:\/\/aclanthology.org\/2021.emnlp-main.779","DOI":"10.18653\/v1\/2021.emnlp-main.779"},{"key":"10374_CR50","doi-asserted-by":"crossref","unstructured":"Scholak T, Schucher N, Bahdanau D (2021) PICARD: parsing incrementally for constrained auto-regressive decoding from language models. In: EMNLP (1):9895\u20139901. Association for Computational Linguistics","DOI":"10.18653\/v1\/2021.emnlp-main.779"},{"issue":"11","key":"10374_CR51","doi-asserted-by":"publisher","first-page":"2673","DOI":"10.1109\/78.650093","volume":"45","author":"M Schuster","year":"1997","unstructured":"Schuster M, Paliwal KK (1997) Bidirectional recurrent neural networks. IEEE Trans Signal Process 45(11):2673\u20132681","journal-title":"IEEE Trans Signal Process"},{"key":"10374_CR52","unstructured":"Selenium site (2023) https:\/\/www.selenium.dev\/. Accessed 12 January 2023"},{"key":"10374_CR53","unstructured":"SEOSS-Queries Repository (2023) https:\/\/figshare.com\/s\/e2190f2d32798ce1d0fd. Accessed 12 January 2023"},{"key":"10374_CR54","doi-asserted-by":"crossref","unstructured":"Shang W, Nagappan M, Hassan AE, Jiang ZM (2014) Understanding log lines using development knowledge. In: ICSME 21\u201330. IEEE Computer Society","DOI":"10.1109\/ICSME.2014.24"},{"key":"10374_CR55","doi-asserted-by":"publisher","unstructured":"Shaw P, Chang MW, Pasupat P, Toutanova K (2021) Compositional generalization and natural language variation: Can a semantic parsing approach handle both? In: Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pp. 922-938. Association for Computational Linguistics, Online. https:\/\/doi.org\/10.18653\/v1\/2021.acl-long.75","DOI":"10.18653\/v1\/2021.acl-long.75"},{"key":"10374_CR56","doi-asserted-by":"crossref","unstructured":"Shaw P, Uszkoreit J, Vaswani A (2018) Self-attention with relative position representations. 
In: NAACL-HLT (2):464\u2013468. Association for Computational Linguistics","DOI":"10.18653\/v1\/N18-2074"},{"key":"10374_CR57","unstructured":"Spider Leaderboard (2023) https:\/\/yale-lily.github.io\/spider. Accessed 12 January 2023"},{"key":"10374_CR58","unstructured":"SQLNetSpider version (2023) https:\/\/github.com\/taoyds\/spider\/tree\/master\/baselines\/sqlnet. Accessed 12 January 2023"},{"issue":"1","key":"10374_CR59","doi-asserted-by":"publisher","first-page":"3","DOI":"10.1016\/j.jvlc.2010.11.004","volume":"22","author":"H St\u00f6rrle","year":"2011","unstructured":"St\u00f6rrle H (2011) VMQL: A visual language for ad-hoc model querying. J Vis Lang Comput 22(1):3\u201329","journal-title":"J Vis Lang Comput"},{"key":"10374_CR60","unstructured":"Sutskever I, Vinyals O, Le QV (2014) Sequence to sequence learning with neural networks. In: NIPS 3104\u20133112"},{"key":"10374_CR61","doi-asserted-by":"publisher","DOI":"10.1016\/j.dib.2022.108211","volume":"42","author":"M Tomova","year":"2022","unstructured":"Tomova M, Hofmann M, M\u00e4der P (2022) Seoss-queries - a software engineering dataset for text-to-sql and question answering tasks. Data in Brief 42:108211. https:\/\/doi.org\/10.1016\/j.dib.2022.108211","journal-title":"Data in Brief"},{"key":"10374_CR62","unstructured":"Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser L, Polosukhin I (2017) Attention is all you need. In: NIPS 5998\u20136008"},{"key":"10374_CR63","unstructured":"Vinyals O, Fortunato M, Jaitly N (2015) Pointer networks. In: NIPS 2692\u20132700"},{"issue":"7","key":"10374_CR64","doi-asserted-by":"publisher","first-page":"526","DOI":"10.1145\/359545.359550","volume":"21","author":"DL Waltz","year":"1978","unstructured":"Waltz DL (1978) An english language question answering system for a large relational database. Commun. ACM 21(7):526\u2013539. https:\/\/doi.org\/10.1145\/359545.359550","journal-title":"Commun. 
ACM"},{"key":"10374_CR65","doi-asserted-by":"crossref","unstructured":"Wang B, Shin R, Liu X, Polozov O, Richardson M (2020) RAT-SQL: relation-aware schema encoding and linking for text-to-sql parsers. In: ACL, pp. 7567\u20137578. Association for Computational Linguistics","DOI":"10.18653\/v1\/2020.acl-main.677"},{"key":"10374_CR66","volume-title":"Lunar Rocks in Natural English: Explorations in Natural Language Question Answering 5:521\u2013569","author":"W Woods","year":"1977","unstructured":"Woods W (1977) Lunar Rocks in Natural English: Explorations in Natural Language Question Answering 5:521\u2013569. North-Holland"},{"key":"10374_CR67","unstructured":"Xu X, Liu C, Song D (2017) Sqlnet: Generating structured queries from natural language without reinforcement learning. CoRR arXiv:1711.04436"},{"key":"10374_CR68","doi-asserted-by":"crossref","unstructured":"Yin P, Neubig G (2017) A syntactic neural model for general-purpose code generation. In: ACL (1):440\u2013450. Association for Computational Linguistics","DOI":"10.18653\/v1\/P17-1041"},{"key":"10374_CR69","doi-asserted-by":"crossref","unstructured":"Yu T, Li Z, Zhang Z, Zhang R, Radev DR (2018) Typesql: Knowledge-based type-aware neural text-to-sql generation. In: NAACL-HLT (2):588\u2013594. Association for Computational Linguistics","DOI":"10.18653\/v1\/N18-2093"},{"key":"10374_CR70","unstructured":"Yu T, Wu C, Lin XV, Wang B, Tan YC, Yang X, Radev DR, Socher R, Xiong C (2021) Grappa: Grammar-augmented pre-training for table semantic parsing. In: ICLR. Open- Review net"},{"key":"10374_CR71","doi-asserted-by":"crossref","unstructured":"Yu T, Zhang R, Yang K, Yasunaga M, Wang D, Li Z, Ma J, Li I, Yao Q, Roman S, Zhang Z, Radev DR (2018) Spider: A large-scale human-labeled dataset for complex and cross-domain semantic parsing and text-to-sql task. In: EMNLP 3911\u20133921. 
Association for Computational Linguistics","DOI":"10.18653\/v1\/D18-1425"},{"key":"10374_CR72","doi-asserted-by":"publisher","first-page":"136409","DOI":"10.1109\/ACCESS.2020.3011747","volume":"8","author":"X Zhang","year":"2020","unstructured":"Zhang X, Yin F, Ma G, Ge B, Xiao W (2020) F-SQL: fuse table schema and table content for single-table text2sql generation. IEEE Access 8:136409\u2013136420","journal-title":"IEEE Access"},{"key":"10374_CR73","unstructured":"Zhong V, Xiong C, Socher R (2017) Seq2sql: Generating structured queries from natural language using reinforcement learning. CoRR arXiv:1709.00103"}],"container-title":["Empirical Software Engineering"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10664-023-10374-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s10664-023-10374-z\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10664-023-10374-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,3,27]],"date-time":"2024-03-27T13:27:26Z","timestamp":1711546046000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s10664-023-10374-z"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,12,5]]},"references-count":73,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2024,1]]}},"alternative-id":["10374"],"URL":"https:\/\/doi.org\/10.1007\/s10664-023-10374-z","relation":{},"ISSN":["1382-3256","1573-7616"],"issn-type":[{"type":"print","value":"1382-3256"},{"type":"electronic","value":"1573-7616"}],"subject":[],"published":{"date-parts":[[2023,12,5]]},"assertion":[{"value":"25 July 
2023","order":1,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"5 December 2023","order":2,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare that they have no conflict of interest.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflicts of interest"}}],"article-number":"15"}}