{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,27]],"date-time":"2026-02-27T15:44:35Z","timestamp":1772207075977,"version":"3.50.1"},"reference-count":185,"publisher":"Emerald","issue":"4","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2022,5,30]]},"abstract":"<jats:p>Recent advances in natural language understanding and processing have resulted in renewed interest in natural language interfaces to data, which provide an easy mechanism for non-technical users to access and query the data. While early systems evolved from keyword search and focused on simple factual queries, the complexity of both the input sentences as well as the generated SQL queries has evolved over time. More recently, there has also been a lot of focus on using conversational interfaces for data analytics, empowering a line of business owners and non-technical users with quick insights into the data. There are three main challenges in natural language querying: (1) identifying the entities involved in the user utterance, (2) connecting the different entities in a meaningful way over the underlying data source to interpret user intents, and finally (3) generating a structured query in the form of SQL or SPARQL.<\/jats:p>\n                  <jats:p>There are two main approaches in the literature for interpreting a user\u2019s natural language query. Rule-based systems make use of semantic indices, ontologies, and knowledge graphs to identify the entities in the query, understand the intended relationships between those entities, and utilize grammars to generate the target queries. With the advances in deep learning-based language models, there have been many text-to-SQL approaches that try to interpret the query holistically using deep learning models. Hybrid approaches that utilize both rule-based techniques as well as deep learning models are also emerging by combining the strengths of both approaches. Conversational interfaces are the next natural step to one-shot natural language querying by exploiting query context between multiple turns of conversation for disambiguation. In this monograph, we review the background technologies that are used in natural language interfaces, and survey the different approaches to natural language querying. We also describe conversational interfaces for data analytics and discuss several benchmarks used for natural language querying research and evaluation.<\/jats:p>","DOI":"10.1561\/1900000078","type":"journal-article","created":{"date-parts":[[2022,5,30]],"date-time":"2022-05-30T03:10:14Z","timestamp":1653880214000},"page":"319-414","source":"Crossref","is-referenced-by-count":29,"title":["Natural Language Interfaces to Data"],"prefix":"10.1561","volume":"11","author":[{"given":"Abdul","family":"Quamar","sequence":"first","affiliation":[{"name":"IBM Research AI ,","place":["USA"]}]},{"given":"Vasilis","family":"Efthymiou","sequence":"additional","affiliation":[{"name":"FORTH-ICS ,","place":["Greece"]}]},{"given":"Chuan","family":"Lei","sequence":"additional","affiliation":[{"name":"Instacart ,","place":["USA"]}]},{"given":"Chuan","family":"Lei","sequence":"additional","affiliation":[{"name":"Systems Research@Google ,","place":["USA"]}]}],"member":"140","published-online":{"date-parts":[[2022,5,30]]},"reference":[{"key":"2025120412011186800_ref001","article-title":"Towards Universal Semantic Tagging","volume-title":"CoRR","author":"Abzianidze","year":"2017"},{"key":"2025120412011186800_ref002","first-page":"1083","article-title":"BANKS: Browsing and Keyword Searching in Relational Databases","volume-title":"VLDB","author":"Aditya","year":"2002"},{"issue":"5","key":"2025120412011186800_ref003","doi-asserted-by":"crossref","first-page":"793","DOI":"10.1007\/s00778-019-00567-8","article-title":"A comparative survey of recent natural language interfaces for databases","volume":"28","author":"Affolter","year":"2019","journal-title":"The VLDB Journal"},{"key":"2025120412011186800_ref004","doi-asserted-by":"crossref","DOI":"10.18653\/v1\/2020.emnlp-main.408","article-title":"Conversational Semantic Parsing","volume-title":"CoRR","author":"Aghajanyan","year":"2020"},{"key":"2025120412011186800_ref005","first-page":"15247","article-title":"Ontology-Enriched Query Answering on Relational Databases","volume-title":"AAAI","author":"Ahmetaj","year":"2021"},{"key":"2025120412011186800_ref006","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1109\/ICEngTechnol.2017.8308186","article-title":"Understanding of a convolutional neural network","volume-title":"2017 International Conference on Engineering and Technology","author":"Albawi","year":"2017"},{"key":"2025120412011186800_ref007","unstructured":"Amazon\n          . (2018). \u201cAmazon Alexa\u201d. url: https:\/\/developer.amazon.com\/alexa."},{"key":"2025120412011186800_ref008","unstructured":"Amazon\n          . (2021). \u201cAmazon QuickSight\u201d. url: https:\/\/aws.amazon.com\/quicksight\/."},{"key":"2025120412011186800_ref009","unstructured":"Apple\n          . (2018). \u201cSiri\u201d. url: https:\/\/www.apple.com\/ios\/siri\/."},{"key":"2025120412011186800_ref010","article-title":"FinBERT: Financial Sentiment Analysis with Pretrained Language Models","volume-title":"CoRR","author":"Araci","year":"2019"},{"key":"2025120412011186800_ref011","first-page":"215","article-title":"A Quantitative Evaluation of Natural Language Question Interpretation for Question Answering Systems","volume-title":"The 8th Joint International Semantic Technology Conference","author":"Asakura","year":"2018"},{"key":"2025120412011186800_ref012","article-title":"Affective Neural Response Generation","volume-title":"CoRR","author":"Asghar","year":"2017"},{"key":"2025120412011186800_ref013","unstructured":"\u201cAsk Data | Tableau Software\u201d. (2021). url: https:\/\/www.tableau.com\/products\/new-features\/ask-data."},{"key":"2025120412011186800_ref014","first-page":"722","article-title":"DBpedia: A Nucleus for a Web of Open Data","volume-title":"ISWC","author":"Auer","year":"2007"},{"key":"2025120412011186800_ref015","doi-asserted-by":"crossref","DOI":"10.1017\/9781139025355","volume-title":"An Introduction to Description Logic","author":"Baader","year":"2017"},{"key":"2025120412011186800_ref016","article-title":"Neural Machine Translation by Jointly Learning to Align and Translate","volume-title":"CoRR","author":"Bahdanau","year":"2015"},{"key":"2025120412011186800_ref017","first-page":"2319","article-title":"Duoquest: A Dual-Specification System for Expressive SQL Queries","volume-title":"SIGMOD","author":"Baik","year":"2020"},{"key":"2025120412011186800_ref018","first-page":"193","article-title":"Towards NLG for Physiological Data Monitoring with Body Area Networks","volume-title":"Proceedings of the 14th European Workshop on Natural Language Generation","author":"Banaee","year":"2013"},{"key":"2025120412011186800_ref019","first-page":"1765","article-title":"DBPal: A Learned NL-Interface for Databases","volume-title":"SIGMOD","author":"Basik","year":"2018"},{"key":"2025120412011186800_ref020","first-page":"1431","article-title":"More Accurate Question Answering on Freebase","volume-title":"CIKM","author":"Bast","year":"2015"},{"key":"2025120412011186800_ref021","first-page":"3615","article-title":"SciBERT: A Pretrained Language Model for Scientific Text","volume-title":"EMNLP-IJCNLP","author":"Beltagy","year":"2019"},{"issue":"5","key":"2025120412011186800_ref022","doi-asserted-by":"crossref","first-page":"570","DOI":"10.1016\/j.ipm.2015.04.006","article-title":"MEANS: A medical question-answering system combining NLP techniques and semantic Web technologies","volume":"51","author":"Ben Abacha","year":"2015","journal-title":"Inf. Process. Manage"},{"key":"2025120412011186800_ref023","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.is.2015.07.005","article-title":"Combining user and database perspective for solving keyword queries over relational databases","volume":"55","author":"Bergamaschi","year":"2016","journal-title":"Inf. Syst"},{"key":"2025120412011186800_ref024","first-page":"3289","article-title":"ClearTK 2.0: Design Patterns for Machine Learning in UIMA","volume-title":"LREC","author":"Bethard","year":"2014"},{"issue":"5","key":"2025120412011186800_ref025","doi-asserted-by":"crossref","first-page":"482","DOI":"10.1016\/j.jbi.2005.12.008","article-title":"Automatic Generation of Spoken Dialogue from Medical Plans and Ontologies","volume":"39","author":"Beveridge","year":"2006","journal-title":"J. of Biomedical Informatics"},{"issue":"8","key":"2025120412011186800_ref026","doi-asserted-by":"crossref","first-page":"1142","DOI":"10.1109\/5.880077","article-title":"Automatic recognition and understanding of spoken language - a first step toward natural human-machine communication","volume":"88","author":"Juang","year":"2000","journal-title":"Proceedings of the IEEE"},{"key":"2025120412011186800_ref027","article-title":"Semantic Tagging with Deep Residual Networks","volume-title":"CoRR","author":"Bjerva","year":"2016"},{"issue":"10","key":"2025120412011186800_ref028","first-page":"932","article-title":"SODA: Generating SQL for Business Users","volume":"5","author":"Blunschi","year":"2012","journal-title":"PVLDB"},{"key":"2025120412011186800_ref029","article-title":"Language Models are Few-Shot Learners","volume-title":"CoRR","author":"Brown","year":"2020"},{"key":"2025120412011186800_ref030","article-title":"ValueNet: A Neural Text-to-SQL Architecture Incorporating Values","volume-title":"CoRR","author":"Brunner","year":"2020"},{"key":"2025120412011186800_ref031","first-page":"5016","article-title":"MultiWOZ - A Large-Scale Multi-Domain Wizard-of-Oz Dataset for Task-Oriented Dialogue Modelling","volume-title":"EMNLP","author":"Budzianowski","year":"2018"},{"key":"2025120412011186800_ref032","first-page":"1","article-title":"Linguistic realisation as machine translation: Comparing different MT models for AMR-to-text generation","volume-title":"Proceedings of the 10th International Conference on Natural Language Generation","author":"Castro Ferreira","year":"2017"},{"issue":"1","key":"2025120412011186800_ref033","doi-asserted-by":"crossref","first-page":"65","DOI":"10.1145\/248603.248616","article-title":"An Overview of Data Warehousing and OLAP Technology","volume":"26","author":"Chaudhuri","year":"1997","journal-title":"SIGMOD Rec"},{"key":"2025120412011186800_ref034","first-page":"25","article-title":"Deep Learning for Dialogue Systems","volume-title":"Proceedings of the 27th International Conference on Computational Linguistics: Tutorial Abstracts","author":"Chen","year":"2018"},{"key":"2025120412011186800_ref035","doi-asserted-by":"crossref","DOI":"10.3115\/v1\/W14-4012","article-title":"On the Properties of Neural Machine Translation: Encoder-Decoder Approaches","volume-title":"CoRR","author":"Cho","year":"2014"},{"key":"2025120412011186800_ref036","unstructured":"\u201cCognos Assistant\u201d. (2021). url: https:\/\/tinyurl.com\/u3sdaxa."},{"key":"2025120412011186800_ref037","first-page":"2978","article-title":"Transformer-XL: Attentive Language Models beyond a Fixed-Length Context","volume-title":"ACL","author":"Dai","year":"2019"},{"key":"2025120412011186800_ref038","doi-asserted-by":"crossref","DOI":"10.18653\/v1\/2020.aacl-main.46","article-title":"A Survey of the State of Explainable AI for Natural Language Processing","volume-title":"CoRR","author":"Danilevsky","year":"2020"},{"issue":"3","key":"2025120412011186800_ref039","doi-asserted-by":"crossref","first-page":"99","DOI":"10.1111\/lnc3.12067","article-title":"Context-Sensitive Natural Language Generation: From Knowledge-Driven to Data-Driven Techniques","volume":"8","author":"Dethlefs","year":"2014","journal-title":"Lang. Linguistics Compass"},{"key":"2025120412011186800_ref040","first-page":"4171","article-title":"BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding","volume-title":"NAACL","author":"Devlin","year":"2019"},{"key":"2025120412011186800_ref041","first-page":"493","article-title":"Analyza: Exploring Data with Conversation","volume-title":"IUI","author":"Dhamdhere","year":"2017"},{"key":"2025120412011186800_ref042","doi-asserted-by":"crossref","DOI":"10.18653\/v1\/P16-1004","article-title":"Language to Logical Form with Neural Attention","volume-title":"CoRR","author":"Dong","year":"2016"},{"key":"2025120412011186800_ref043","article-title":"Deep Biaffine Attention for Neural Dependency Parsing","volume-title":"CoRR","author":"Dozat","year":"2016"},{"key":"2025120412011186800_ref044","first-page":"69","article-title":"LC-QuAD 2.0: A Large Dataset for Complex Question Answering over Wikidata and DBpedia","volume":"11779","author":"Dubey","year":"2019","journal-title":"ISWC"},{"key":"2025120412011186800_ref045","first-page":"45","article-title":"Sequence-to-Sequence Generation for Spoken Dialogue via Deep Syntax Trees and Strings","volume-title":"ACL","author":"Dusek","year":"2016"},{"issue":"1","key":"2025120412011186800_ref046","doi-asserted-by":"crossref","first-page":"89","DOI":"10.1016\/j.tcs.2004.10.033","article-title":"Data exchange: semantics and query answering","volume":"336","author":"Fagin","year":"2005","journal-title":"Theor. Comput. Sci"},{"key":"2025120412011186800_ref047","first-page":"1625","article-title":"TAGME: on-the-fly annotation of short text fragments (by wikipedia entities)","volume-title":"CIKM","author":"Ferragina","year":"2010"},{"issue":"2","key":"2025120412011186800_ref048","doi-asserted-by":"crossref","DOI":"10.2196\/mental.7785","article-title":"Delivering Cognitive Behavior Therapy to Young Adults With Symptoms of Depression and Anxiety Using a Fully Automated Conversational Agent (Woebot): A Randomized Controlled Trial","volume":"4","author":"Fitzpatrick","year":"2017","journal-title":"JMIR Ment Health"},{"issue":"3","key":"2025120412011186800_ref049","doi-asserted-by":"crossref","first-page":"268","DOI":"10.1109\/PROC.1973.9030","article-title":"The viterbi algorithm","volume":"61","author":"Forney","year":"1973","journal-title":"Proceedings of the IEEE"},{"key":"2025120412011186800_ref050","first-page":"6","article-title":"Towards Conversational OLAP","volume":"2572","author":"Francia","year":"2020","journal-title":"DOLAP@EDBT\/ICDT"},{"key":"2025120412011186800_ref051","article-title":"Neural Approaches to Conversational AI","volume-title":"CoRR","author":"Gao","year":"2018"},{"key":"2025120412011186800_ref052","doi-asserted-by":"crossref","DOI":"10.18653\/v1\/W19-5932","article-title":"Dialog State Tracking: A Neural Reading Comprehension Approach","volume-title":"CoRR","author":"Gao","year":"2019"},{"issue":"Jan","key":"2025120412011186800_ref053","article-title":"Planning-Based Models of Natural Language Generation","volume":"8","author":"Garoufi","year":"2014","journal-title":"Language and Linguistics Compass"},{"issue":"1","key":"2025120412011186800_ref054","first-page":"65","article-title":"Survey of the State of the Art in Natural Language Generation: Core Tasks, Applications and Evaluation","volume":"61","author":"Gatt","year":"2018","journal-title":"J. Artif. Int. Res"},{"issue":"2","key":"2025120412011186800_ref055","doi-asserted-by":"crossref","first-page":"159","DOI":"10.1016\/j.ijmedinf.2004.04.026","article-title":"Automated spoken dialogue system for hypertensive patient home management","volume":"74","author":"Giorgino","year":"2005","journal-title":"International Journal of Medical Informatics"},{"key":"2025120412011186800_ref056","first-page":"632","article-title":"An In-Depth Benchmarking of Text-to-SQL Systems","volume-title":"SIGMOD","author":"Gkini","year":"2021"},{"key":"2025120412011186800_ref057","unstructured":"Google\n          . (2021a). \u201cGoogle Looker\u201d. url: https:\/\/www.looker.com\/google-cloud\/."},{"key":"2025120412011186800_ref058","unstructured":"Google\n          . (2021b). \u201cGoogleAssitant\u201d. url: https:\/\/assistant.google.com."},{"key":"2025120412011186800_ref059","unstructured":"Google\n          . (2021c). \u201cLamda\u201d. url: https:\/\/blog.google\/technology\/ai\/lamda\/."},{"key":"2025120412011186800_ref060","doi-asserted-by":"crossref","DOI":"10.18653\/v1\/P19-1444","article-title":"Towards Complex Text-to-SQL in Cross-Domain Database with Intermediate Representation","volume-title":"CoRR","author":"Guo","year":"2019"},{"key":"2025120412011186800_ref061","first-page":"1339","article-title":"DialSQL: Dialogue Based Structured Query Generation","volume-title":"ACL","author":"Gur","year":"2018"},{"key":"2025120412011186800_ref062","first-page":"2946","article-title":"MEDTO: Medical Data to Ontology Matching Using Hybrid Graph Neural Networks","volume-title":"SIGKDD","author":"Hao","year":"2021"},{"key":"2025120412011186800_ref063","article-title":"Establishing Strong Baselines for the New Decade: Sequence Tagging, Syntactic and Semantic Parsing with BERT","volume-title":"CoRR","author":"He","year":"2019"},{"key":"2025120412011186800_ref064","article-title":"ConveRT: Efficient and Accurate Conversational Representations from Transformers","volume-title":"CoRR","author":"Henderson","year":"2019"},{"key":"2025120412011186800_ref065","first-page":"2161","article-title":"ConveRT: Efficient and Accurate Conversational Representations from Transformers","volume-title":"EMNLP","author":"Henderson","year":"2020"},{"key":"2025120412011186800_ref066","first-page":"467","article-title":"Deep Neural Network Approach for the Dialog State Tracking Challenge","volume-title":"SIGDIAL","author":"Henderson","year":"2013"},{"key":"2025120412011186800_ref067","first-page":"292","article-title":"Word-Based Dialog State Tracking with Recurrent Neural Networks","volume-title":"SIGDIAL","author":"Henderson","year":"2014"},{"key":"2025120412011186800_ref068","doi-asserted-by":"crossref","DOI":"10.18653\/v1\/P19-1536","article-title":"Training Neural Response Selection for Task-Oriented Dialogue Systems","volume-title":"CoRR","author":"Henderson","year":"2019"},{"key":"2025120412011186800_ref069","first-page":"252","article-title":"Neural Response Generation for Customer Service based on Personality Traits","volume-title":"Proceedings of the 10th International Conference on Natural Language Generation","author":"Herzig","year":"2017"},{"issue":"8","key":"2025120412011186800_ref070","doi-asserted-by":"crossref","first-page":"1735","DOI":"10.1162\/neco.1997.9.8.1735","article-title":"Long Short-Term Memory","volume":"9","author":"Hochreiter","year":"1997","journal-title":"Neural Computation"},{"issue":"4","key":"2025120412011186800_ref071","doi-asserted-by":"crossref","DOI":"10.1002\/widm.1312","article-title":"Causability and explainability of artificial intelligence in medicine","volume":"9","author":"Holzinger","year":"2019","journal-title":"Wiley Interdiscip. Rev. Data Min. Knowl. Discov"},{"key":"2025120412011186800_ref072","first-page":"328","article-title":"Universal Language Model Finetuning for Text Classification","volume-title":"ACL","author":"Howard","year":"2018"},{"key":"2025120412011186800_ref073","article-title":"Bidirectional LSTM-CRF Models for Sequence Tagging","volume-title":"CoRR","author":"Huang","year":"2015"},{"key":"2025120412011186800_ref074","article-title":"Improving Text-to-SQL with Schema Dependency Learning","volume-title":"CoRR","author":"Hui","year":"2021"},{"key":"2025120412011186800_ref075","first-page":"946","article-title":"A Survey on Conversational Agents\/Chatbots Classification and Design Techniques","author":"Hussain","year":"2019"},{"key":"2025120412011186800_ref076","article-title":"A Comprehensive Exploration on WikiSQL with Table-Aware Word Contextualization","volume-title":"CoRR","author":"Hwang","year":"2019"},{"key":"2025120412011186800_ref077","first-page":"1821","article-title":"Search-based Neural Structured Learning for Sequential Question Answering","volume-title":"ACL","author":"Iyyer","year":"2017"},{"issue":"12","key":"2025120412011186800_ref078","first-page":"2014","article-title":"Tooling Framework for Instantiating Natural Language Querying System","volume":"11","author":"Jammi","year":"2018","journal-title":"PVLDB"},{"key":"2025120412011186800_ref079","first-page":"4163","article-title":"TinyBERT: Distilling BERT for Natural Language Understanding","volume-title":"EMNLP","author":"Jiao","year":"2020"},{"key":"2025120412011186800_ref080","article-title":"SpanBERT: Improving Pre-training by Representing and Predicting Spans","volume-title":"CoRR","author":"Joshi","year":"2019"},{"key":"2025120412011186800_ref081","first-page":"2846","article-title":"A Deep Dive into Deep Learning Approaches for Text-to-SQL Systems","volume-title":"SIG-MOD","author":"Katsogiannis-Meimarakis","year":"2021"},{"key":"2025120412011186800_ref082","first-page":"710","article-title":"Deep Learning Approaches for Text-to-SQL Systems","volume-title":"EDBT","author":"Katsogiannis-Meimarakis","year":"2021"},{"issue":"4","key":"2025120412011186800_ref083","doi-asserted-by":"crossref","first-page":"377","DOI":"10.1016\/j.websem.2010.06.001","article-title":"Evaluating the usability of natural language query languages and interfaces to Semantic Web knowledge bases","volume":"8","author":"Kaufmann","year":"2010","journal-title":"J. Web Semant"},{"issue":"3","key":"2025120412011186800_ref084","doi-asserted-by":"crossref","first-page":"443","DOI":"10.1145\/319732.319745","article-title":"On Optimizing an SQL-like Nested Query","volume":"7","author":"Kim","year":"1982","journal-title":"ACM Trans. Database Syst"},{"key":"2025120412011186800_ref085","first-page":"1746","article-title":"Convolutional Neural Networks for Sentence Classification","volume-title":"EMNLP","author":"Kim","year":"2014"},{"key":"2025120412011186800_ref086","first-page":"2741","article-title":"Character-Aware Neural Language Models","volume-title":"AAAI","author":"Kim","year":"2016"},{"key":"2025120412011186800_ref087","first-page":"69","article-title":"Pr\u00e9cis: The Essence of a Query Answer","volume-title":"ICDE","author":"Koutrika","year":"2006"},{"key":"2025120412011186800_ref088","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1109\/RCIS.2013.6577686","article-title":"QUASL: A framework for question answering and its Application to business intelligence","volume-title":"IEEE 7th International Conference on Research Challenges in Information Science (RCIS)","author":"Kuchmann-Beauger","year":"2013"},{"key":"2025120412011186800_ref089","first-page":"785","article-title":"RACE: Large-scale ReAding Comprehension Dataset From Examinations","volume-title":"EMNLP","author":"Lai","year":"2017"},{"key":"2025120412011186800_ref090","article-title":"ALBERT: A Lite BERT for Self-supervised Learning of Language Representations","volume-title":"CoRR","author":"Lan","year":"2019"},{"key":"2025120412011186800_ref091","first-page":"1203","article-title":"Neural Text Generation from Structured Data with Application to the Biography Domain","volume-title":"EMNLP","author":"Lebret","year":"2016"},{"key":"2025120412011186800_ref092","doi-asserted-by":"crossref","first-page":"259","DOI":"10.1145\/319950.320011","article-title":"SemQL: A Semantic Query Language for Multidatabase Systems","volume-title":"CIKM","author":"Lee","year":"1999"},{"issue":"4","key":"2025120412011186800_ref093","doi-asserted-by":"crossref","first-page":"1234","DOI":"10.1093\/bioinformatics\/btz682","article-title":"BioBERT: a pre-trained biomedical language representation model for biomedical text mining","volume":"36","author":"Lee","year":"2019","journal-title":"Bioinformatics"},{"key":"2025120412011186800_ref094","first-page":"567","article-title":"Expanding Query Answers on Medical Knowledge Bases","volume-title":"EDBT","author":"Lei","year":"2020"},{"issue":"3","key":"2025120412011186800_ref095","first-page":"52","article-title":"Ontology-Based Natural Language Query Interfaces for Data Exploration","volume":"41","author":"Lei","year":"2018","journal-title":"IEEE Data Eng. Bull"},{"issue":"1","key":"2025120412011186800_ref096","first-page":"73","article-title":"Constructing an Interactive Natural Language Interface for Relational Databases","volume":"8","author":"Li","year":"2014","journal-title":"PVLDB"},{"issue":"1","key":"2025120412011186800_ref097","doi-asserted-by":"crossref","first-page":"6","DOI":"10.1145\/2949741.2949744","article-title":"Understanding Natural Language Queries over Relational Databases","volume":"45","author":"Li","year":"2016","journal-title":"SIGMOD Record"},{"key":"2025120412011186800_ref098","first-page":"709","article-title":"NaLIR: an interactive natural language interface for querying relational databases","volume-title":"SIGMOD","author":"Li","year":"2014"},{"issue":"12","key":"2025120412011186800_ref099","first-page":"2549","article-title":"Deep or Simple Models for Semantic Tagging? It Depends on Your Data","volume":"13","author":"Li","year":"2020","journal-title":"PVLDB"},{"key":"2025120412011186800_ref100","first-page":"1765","article-title":"Natural Language Data Management and Interfaces: Recent Development and Open Challenges","volume-title":"SIGMOD","author":"Li","year":"2017"},{"key":"2025120412011186800_ref101","doi-asserted-by":"crossref","first-page":"900","DOI":"10.1145\/1066157.1066281","article-title":"NaLIX: An Interactive Natural Language Interface for Querying XML","volume-title":"SIGMOD","author":"Li","year":"2005"},{"key":"2025120412011186800_ref102","article-title":"RoBERTa: A Robustly Optimized BERT Pretraining Approach","volume-title":"CoRR","author":"Liu","year":"2019"},{"key":"2025120412011186800_ref103","first-page":"2129","article-title":"Making the Case for Query-by-Voice with EchoQuery","volume-title":"SIGMOD","author":"Lyons","year":"2016"},{"key":"2025120412011186800_ref104","article-title":"Hybrid Ranking Network for Text-to-SQL","volume-title":"CoRR","author":"Lyu","year":"2020"},{"issue":"4","key":"2025120412011186800_ref105","doi-asserted-by":"crossref","first-page":"763","DOI":"10.1162\/COLI_a_00199","article-title":"Stochastic Language Generation in Dialogue using Factored Language Models","volume":"40","author":"Mairesse","year":"2014","journal-title":"Computational Linguistics"},{"key":"2025120412011186800_ref106","first-page":"1","article-title":"A survey on human machine dialogue systems","volume-title":"7th International Conference on Information, Intelligence, Systems & Applications (IISA)","author":"Mallios","year":"2016"},{"key":"2025120412011186800_ref107","first-page":"449","article-title":"Generating Typed Dependency Parses from Phrase Structure Parses","volume-title":"LREC","author":"Marneffe","year":"2006"},{"issue":"1","key":"2025120412011186800_ref108","doi-asserted-by":"crossref","first-page":"90","DOI":"10.1145\/505282.505285","article-title":"Spoken Dialogue Technology: Enabling the Conversational User Interface","volume":"34","author":"McTear","year":"2002","journal-title":"ACM Comput. Surv"},{"key":"2025120412011186800_ref109","unstructured":"Microsoft\n          . (2018). \u201cMicrosoft Cortana\u201d. url: https:\/\/www.microsoft.com\/en-us\/windows\/cortana."},{"key":"2025120412011186800_ref110","unstructured":"\u201cMicroStrategy\u201d. (2021). url: https:\/\/community.microstrategy.com\/s\/article\/Natural-Language-Query-in-A-Nutshell-MicroStrategy-11-0?language=en_US."},{"key":"2025120412011186800_ref111","article-title":"Efficient Estimation of Word Representations in Vector Space","volume-title":"ICLR","author":"Mikolov","year":"2013"},{"issue":"5","key":"2025120412011186800_ref112","doi-asserted-by":"crossref","first-page":"619","DOI":"10.1001\/jamainternmed.2016.0400","article-title":"Smartphone-Based Conversational Agents and Responses to Questions About Mental Health, Interpersonal Violence, and Physical Health","volume":"176","author":"Miner","year":"2016","journal-title":"JAMA Internal Medicine"},{"key":"2025120412011186800_ref113","article-title":"Neural Belief Tracker: Data-Driven Dialogue State Tracking","volume-title":"CoRR","author":"Mrksic","year":"2016"},{"key":"2025120412011186800_ref114","first-page":"2159","article-title":"Towards the Extraction of Customer-to-Customer Suggestions from Reviews","volume-title":"EMNLP","author":"Negi","year":"2015"},{"key":"2025120412011186800_ref115","unstructured":"Net, C.\n           (2021). \u201cConcept Net a freely-available semantic network.\u201d url: https:\/\/conceptnet.io\/."},{"issue":"8","key":"2025120412011186800_ref116","first-page":"1325","article-title":"CBench: Towards Better Evaluation of Question Answering Over Knowledge Graphs","volume":"14","author":"Orogat","year":"2021","journal-title":"PVLDB"},{"key":"2025120412011186800_ref117","doi-asserted-by":"crossref","DOI":"10.1145\/3462462.3468881","article-title":"Semantic Enrichment of Data for AI Applications","volume-title":"Proceedings of the Fifth Workshop on Data Management for End-To-End Machine Learning","author":"\u00d6zcan","year":"2021"},{"key":"2025120412011186800_ref118","first-page":"2629","article-title":"State of the Art and Open Challenges in Natural Language Interfaces to Data","volume-title":"SIGMOD","author":"\u00d6zcan","year":"2020"},{"key":"2025120412011186800_ref119","first-page":"1470","article-title":"Compositional Semantic Parsing on Semi-Structured Tables","volume-title":"ACL","author":"Pasupat","year":"2015"},{"key":"2025120412011186800_ref120","first-page":"1532","article-title":"GloVe: Global Vectors for Word Representation","volume-title":"EMNLP","author":"Pennington","year":"2014"},{"key":"2025120412011186800_ref121","first-page":"2227","article-title":"Deep Contextualized Word Representations","volume-title":"NAACL","author":"Peters","year":"2018"},{"key":"2025120412011186800_ref122","unstructured":"\u201cPower BI Platform\u201d. (2021). url: https:\/\/powerbi.microsoft.com\/en-us\/."},{"issue":"2","key":"2025120412011186800_ref123","first-page":"63","article-title":"Asking \u2018Why\u2019 in AI: Explainability of intelligent systems - perspectives and challenges","volume":"25","author":"Preece","year":"2018","journal-title":"Intell. Syst. Account. Finance Manag"},{"key":"2025120412011186800_ref124","article-title":"Template-Based Question Answering over Linked Geospatial Data","volume-title":"CoRR","author":"Punjani","year":"2020"},{"issue":"10","key":"2025120412011186800_ref125","doi-asserted-by":"crossref","first-page":"1872","DOI":"10.1007\/s11431-020-1647-3","article-title":"Pretrained models for natural language processing: A survey","volume":"63","author":"Qiu","year":"2020","journal-title":"Science in China E: Technological Sciences"},{"key":"2025120412011186800_ref126","first-page":"361","article-title":"An Ontology-Based Conversation System for Knowledge Bases","volume-title":"SIGMOD","author":"Quamar","year":"2020"},{"issue":"12","key":"2025120412011186800_ref127","article-title":"Conversational BI: An Ontology-Driven Conversation System for Business Intelligence Applications","volume":"13","author":"Quamar","year":"2020","journal-title":"PVLDB"},{"key":"2025120412011186800_ref128","article-title":"Improving Language Understanding by Generative Pre-Training","author":"Radford","year":"2018"},{"issue":"8","key":"2025120412011186800_ref129","article-title":"Language Models are Unsupervised Multitask Learners","volume":"1","author":"Radford","year":"2019","journal-title":"OpenAI blog"},{"key":"2025120412011186800_ref130","article-title":"Evaluating Quality of Chatbots and Intelligent Conversational Agents","volume-title":"CoRR","author":"Radziwill","year":"2017"},{"key":"2025120412011186800_ref131","doi-asserted-by":"crossref","DOI":"10.1109\/ASRU.2017.8268986","article-title":"Scalable Multi-Domain Dialogue State Tracking","volume-title":"CoRR","author":"Rastogi","year":"2017"},{"key":"2025120412011186800_ref132","doi-asserted-by":"crossref","first-page":"249","DOI":"10.1162\/tacl_a_00266","article-title":"CoQA: A Conversational Question Answering Challenge","volume":"7","author":"Reddy","year":"2019","journal-title":"Transactions of the Association for Computational Linguistics"},{"issue":"1","key":"2025120412011186800_ref133","doi-asserted-by":"crossref","first-page":"57","DOI":"10.1017\/S1351324997001502","article-title":"Building Applied Natural Language Generation Systems","volume":"3","author":"Reiter","year":"1997","journal-title":"Nat. Lang. Eng"},{"key":"2025120412011186800_ref134","doi-asserted-by":"crossref","DOI":"10.1017\/CBO9780511519857","volume-title":"Building Natural Language Generation Systems","author":"Reiter","year":"2000"},{"key":"2025120412011186800_ref135","unstructured":"Richardson, J., K.Schlegel, R.Sallam, A.Kronz, and J.Sun. (2021). \u201cTop Trends in Data and Analytics for 2021: The Rise of the Augmented Consumer\u201d. url: https:\/\/www.gartner.com\/doc\/reprints?id=1-25H0EUUY&ct=210317&st=sb."},{"key":"2025120412011186800_ref136","article-title":"Natural Language Generation as Planning under Uncertainty Using Reinforcement Learning","volume-title":"CoRR","author":"Rieser","year":"2016"},{"key":"2025120412011186800_ref137","article-title":"SmBoP: Semi-autoregressive Bottom-up Semantic Parsing","volume-title":"CoRR","author":"Rubin","year":"2020"},{"issue":"12","key":"2025120412011186800_ref138","first-page":"1209","article-title":"ATHENA: An Ontology-Driven System for Natural Language Querying over Relational Data Stores","volume":"9","author":"Saha","year":"2016","journal-title":"PVLDB"},{"key":"2025120412011186800_ref139","article-title":"DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter","volume-title":"CoRR","author":"Sanh","year":"2019"},{"key":"2025120412011186800_ref140","article-title":"A Survey of Natural Language Generation Techniques with a Focus on Dialogue Systems - Past, Present and Future Directions","volume-title":"CoRR","author":"Santhanam","year":"2019"},{"key":"2025120412011186800_ref141","doi-asserted-by":"crossref","DOI":"10.18653\/v1\/W15-3904","article-title":"Boosting Named Entity Recognition with Neural Character Embeddings","volume-title":"CoRR","author":"Santos","year":"2015"},{"key":"2025120412011186800_ref142","first-page":"1073","article-title":"Get To The Point: Summarization with Pointer-Generator Networks","volume-title":"ACL","author":"See","year":"2017"},{"key":"2025120412011186800_ref143","article-title":"Definition, Dictionaries and Tagger for Extended Named Entity Hierarchy","volume-title":"LREC","author":"Sekine","year":"2004"},{"key":"2025120412011186800_ref144","unstructured":"Semarchy\n          . (2021). \u201cThe SemQL Language\u201d. url: https:\/\/www.semarchy.com\/doc\/semarchy-xdm\/xdm\/5.3\/SemQL\/overview.html."},{"issue":"11","key":"2025120412011186800_ref145","doi-asserted-by":"crossref","first-page":"2747","DOI":"10.14778\/3407790.3407858","article-title":"ATHENA++: Natural Language Querying for Complex Nested SQL Queries","volume":"13","author":"Sen","year":"2020","journal-title":"Proc. VLDB Endow"},{"key":"2025120412011186800_ref146","first-page":"1997","article-title":"Natural Language Querying of Complex Business Intelligence Queries","volume-title":"SIGMOD","author":"Sen","year":"2019"},{"key":"2025120412011186800_ref147","article-title":"Multilingual Named Entity Recognition using Hybrid Neural Networks","volume-title":"The Sixth Swedish Language Technology Conference (SLTC)","author":"Shao","year":"2016"},{"issue":"Mar","key":"2025120412011186800_ref148","doi-asserted-by":"crossref","first-page":"132306","DOI":"10.1016\/j.physd.2019.132306","article-title":"Fundamentals of Recurrent Neural Network (RNN) and Long Short-Term Memory (LSTM) network","volume":"404","author":"Sherstinsky","year":"2020","journal-title":"Physica D: Nonlinear Phenomena"},{"key":"2025120412011186800_ref149","article-title":"IncSQL: Training Incremental Text-to-SQL Parsers with Non-Deterministic Oracles","volume-title":"CoRR","author":"Shi","year":"2018"},{"issue":"1","key":"2025120412011186800_ref150","doi-asserted-by":"crossref","first-page":"117","DOI":"10.1007\/s00778-007-0075-9","article-title":"Pr\u00e9cis: from unstructured keywords as queries to structured databases as answers","volume":"17","author":"Simitsis","year":"2008","journal-title":"VLDB J"},{"key":"2025120412011186800_ref151","first-page":"21","article-title":"TR Discover: A Natural Language Interface for Querying and Analyzing Interlinked Datasets","volume-title":"ISWC","author":"Song","year":"2015"},{"key":"2025120412011186800_ref152","first-page":"718","article-title":"Data Responsibly: Fairness, Neutrality and Transparency in Data Analysis","volume-title":"EDBT","author":"Stoyanovich","year":"2016"},{"issue":"12","key":"2025120412011186800_ref153","first-page":"3474","article-title":"Responsible Data Management","volume":"13","author":"Stoyanovich","year":"2020","journal-title":"PVLDB"},{"key":"2025120412011186800_ref154","first-page":"889","article-title":"SQAK: Doing More with Keywords","volume-title":"SIGMOD","author":"Tata","year":"2008"},{"key":"2025120412011186800_ref155","doi-asserted-by":"crossref","DOI":"10.1177\/107769905303000401","article-title":"Cloze Procedure: A New Tool for Measuring Readability","volume-title":"Journalism Quarterly","author":"Taylor","year":"1953"},{"issue":"4","key":"2025120412011186800_ref156","doi-asserted-by":"crossref","first-page":"562","DOI":"10.1016\/j.csl.2009.07.003","article-title":"Bayesian update of dialogue state: A POMDP framework for spoken dialogue systems","volume":"24","author":"Thomson","year":"2010","journal-title":"Computer Speech and Language"},{"key":"2025120412011186800_ref157","first-page":"639","article-title":"Template-based question answering over RDF data","volume-title":"WWW","author":"Unger","year":"2012"},{"issue":"5","key":"2025120412011186800_ref158","first-page":"813","article-title":"DBTagger: Multi-Task Learning for Keyword Mapping in NLIDBs Using Bi-Directional Recurrent Neural Networks","volume":"14","author":"Usta","year":"2021","journal-title":"PVLDB"},{"key":"2025120412011186800_ref159","article-title":"Attention Is All You Need","volume-title":"CoRR","author":"Vaswani","year":"2017"},{"key":"2025120412011186800_ref160","article-title":"Pointer Networks","volume":"28","author":"Vinyals","year":"2015","journal-title":"Advances in Neural Information Processing Systems"},{"key":"2025120412011186800_ref161","first-page":"362","article-title":"Evaluation of a Layered Approach to Question Answering over Linked Data","volume-title":"ISWC","author":"Walter","year":"2012"},{"key":"2025120412011186800_ref162","first-page":"1471","article-title":"USI Answers: Natural Language Question Answering Over (Semi-) Structured Industry Data","volume-title":"IAAI","author":"Waltinger","year":"2013"},{"key":"2025120412011186800_ref163","article-title":"RAT-SQL: Relation-Aware Schema Encoding and Linking for Text-to-SQL Parsers","volume-title":"CoRR","author":"Wang","year":"2019"},{"key":"2025120412011186800_ref164","article-title":"Execution-Guided Neural Program Decoding","volume-title":"CoRR","author":"Wang","year":"2018"},{"key":"2025120412011186800_ref165","article-title":"A Transfer-Learnable Natural Language Interface for Databases","volume-title":"CoRR","author":"Wang","year":"2018"},{"key":"2025120412011186800_ref166","first-page":"1862","article-title":"Bootstrapping an End-to-End Natural Language Interface for Databases","volume-title":"SIGMOD","author":"Weir","year":"2019"},{"key":"2025120412011186800_ref167","first-page":"1711","article-title":"Semantically Conditioned LSTM-based Natural Language Generation for Spoken Dialogue Systems","volume-title":"EMNLP","author":"Wen","year":"2015"},{"issue":"2","key":"2025120412011186800_ref168","doi-asserted-by":"crossref","first-page":"393","DOI":"10.1016\/j.csl.2006.06.008","article-title":"Partially observable Markov decision processes for spoken dialog systems","volume":"21","author":"Williams","year":"2007","journal-title":"Computer Speech and Language"},{"key":"2025120412011186800_ref169","doi-asserted-by":"crossref","first-page":"4283","DOI":"10.18653\/v1\/2021.emnlp-main.352","article-title":"Data-to-text Generation by Splicing Together Nearest Neighbors","volume-title":"Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing","author":"Wiseman","year":"2021"},{"key":"2025120412011186800_ref170","doi-asserted-by":"crossref","DOI":"10.18653\/v1\/2020.emnlp-main.66","article-title":"ToD-BERT: Pre-trained Natural Language Understanding for Task-Oriented Dialogues","volume-title":"CoRR","author":"Wu","year":"2020"},{"key":"2025120412011186800_ref171","first-page":"133","article-title":"Verb Semantics and Lexical Selection","volume-title":"ACL","author":"Wu","year":"1994"},{"key":"2025120412011186800_ref172","article-title":"SQLNet: Generating Structured Queries From Natural Language Without Reinforcement Learning","volume-title":"CoRR","author":"Xu","year":"2017"},{"key":"2025120412011186800_ref173","article-title":"XLNet: Generalized Autoregressive Pretraining for Language Understanding","volume-title":"CoRR","author":"Yang","year":"2019"},{"issue":"5","key":"2025120412011186800_ref174","doi-asserted-by":"crossref","first-page":"1160","DOI":"10.1109\/JPROC.2012.2225812","article-title":"POMDP-Based Statistical Spoken Dialog Systems: A Review","volume":"101","author":"Young","year":"2013","journal-title":"Proceedings of the IEEE"},{"issue":"3","key":"2025120412011186800_ref175","doi-asserted-by":"crossref","first-page":"55","DOI":"10.1109\/MCI.2018.2840738","article-title":"Recent trends in deep learning based natural language processing","volume":"13","author":"Young","year":"2018","journal-title":"IEEE Computational Intelligence Magazine"},{"key":"2025120412011186800_ref176","first-page":"588","article-title":"TypeSQL: Knowledge-Based Type-Aware Neural Text-to-SQL Generation","volume-title":"NAACL-HLT","author":"Yu","year":"2018"},{"key":"2025120412011186800_ref177","doi-asserted-by":"crossref","DOI":"10.18653\/v1\/D19-1204","article-title":"CoSQL: A Conversational Text-to-SQL Challenge Towards Cross-Domain Natural Language Interfaces to Databases","volume-title":"CoRR","author":"Yu","year":"2019"},{"key":"2025120412011186800_ref178","article-title":"SCoRe: Pre-Training for Context Representation in Conversational Semantic Parsing","volume-title":"ICLR","author":"Yu","year":"2021"},{"key":"2025120412011186800_ref179","first-page":"3911","article-title":"Spider: A Large-Scale Human-Labeled Dataset for Complex and Cross-Domain Semantic Parsing and Text-to-SQL Task","volume-title":"EMNLP","author":"Yu","year":"2018"},{"key":"2025120412011186800_ref180","doi-asserted-by":"crossref","DOI":"10.18653\/v1\/P19-1443","article-title":"SParC: Cross-Domain Semantic Parsing in Context","volume-title":"CoRR","author":"Yu","year":"2019"},{"issue":"3","key":"2025120412011186800_ref181","doi-asserted-by":"crossref","first-page":"166","DOI":"10.1016\/j.websem.2009.07.005","article-title":"From keywords to semantic queries - Incremental query construction on the semantic web","volume":"7","author":"Zenz","year":"2009","journal-title":"J. Web Semant"},{"key":"2025120412011186800_ref182","first-page":"5337","article-title":"Editing-Based SQL Query Generation for Cross-Domain Context-Dependent Questions","volume-title":"EMNLP-IJCNLP","author":"Zhang","year":"2019"},{"key":"2025120412011186800_ref183","first-page":"270","article-title":"DIALOGPT : Large-Scale Generative Pre-training for Conversational Response Generation","volume-title":"ACL","author":"Zhang","year":"2020"},{"key":"2025120412011186800_ref184","first-page":"9628","article-title":"Semantics-Aware BERT for Language Understanding","volume-title":"AAAI","author":"Zhang","year":"2020"},{"key":"2025120412011186800_ref185","article-title":"Seq2SQL: Generating Structured Queries from Natural Language using Reinforcement Learning","volume-title":"CoRR","author":"Zhong","year":"2017"}],"container-title":["Foundations and Trends in Databases"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.emerald.com\/ftdbs\/article-pdf\/11\/4\/319\/10901268\/1900000078en.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/www.emerald.com\/ftdbs\/article-pdf\/11\/4\/319\/10901268\/1900000078en.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,12,4]],"date-time":"2025-12-04T17:01:46Z","timestamp":1764867706000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.emerald.com\/ftdbs\/article\/11\/4\/319\/1320822\/Natural-Language-Interfaces-to-Data"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,5,30]]},"references-count":185,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2022,5,30]]}},"URL":"https:\/\/doi.org\/10.1561\/1900000078","relation":{},"ISSN":["1931-7883","1931-7891"],"issn-type":[{"value":"1931-7883","type":"print"},{"value":"1931-7891","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,5,30]]}}}