{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,8]],"date-time":"2026-01-08T07:44:46Z","timestamp":1767858286736,"version":"3.49.0"},"reference-count":15,"publisher":"Association for Computing Machinery (ACM)","issue":"12","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["Proc. VLDB Endow."],"published-print":{"date-parts":[[2023,8]]},"abstract":"<jats:p>GPT-DB generates code for SQL processing in general-purpose programming languages such as Python. Generated code can be freely customized using user-provided natural language instructions. This enables users, for instance, to try out specific libraries for SQL processing or to generate non-standard output while processing.<\/jats:p>\n          <jats:p>GPT-DB is based on OpenAI's GPT model series, neural networks capable of translating natural language instructions into code. By default, GPT-DB exploits the most recently released GPT-4 model whereas visitors may also select prior versions for comparison. GPT-DB automatically generates query-specific prompts, instructing GPT on code generation. These prompts include a description of the target database, as well as logical query plans described as natural language text, and instructions for customization. GPT-DB automatically verifies, and possibly re-generates, code using a reference database system for result comparisons. It enables users to select code samples for training, thereby increasing accuracy for future queries. The proposed demonstration showcases code generation for various queries and with varying instructions for code customization.<\/jats:p>","DOI":"10.14778\/3611540.3611630","type":"journal-article","created":{"date-parts":[[2023,9,15]],"date-time":"2023-09-15T11:32:37Z","timestamp":1694777557000},"page":"4098-4101","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":11,"title":["Demonstrating GPT-DB: Generating Query-Specific and Customizable Code for SQL Processing with GPT-4"],"prefix":"10.14778","volume":"16","author":[{"given":"Immanuel","family":"Trummer","sequence":"first","affiliation":[{"name":"Cornell University, Ithaca, NY, USA"}]}],"member":"320","published-online":{"date-parts":[[2023,8]]},"reference":[{"key":"e_1_2_1_1_1","unstructured":"S\u00e9bastien Bubeck Varun Chandrasekaran Ronen Eldan Johannes Gehrke Eric Horvitz Ece Kamar Peter Lee Yin Tat Lee Yuanzhi Li Scott Lundberg Harsha Nori Hamid Palangi Marco Tulio Ribeiro and Yi Zhang. 2023. Sparks of Artificial General Intelligence: Experiments with an early version of GPT-4. Technical Report. arXiv:arXiv:2303.12712v3"},{"key":"e_1_2_1_2_1","volume-title":"Kenton Lee, and Kristina Toutanova.","author":"Devlin Jacob","year":"2019","unstructured":"Jacob Devlin, Ming Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of deep bidirectional transformers for language understanding. In NAACL. 4171--4186. arXiv:1810.04805"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11023-020-09548-1"},{"key":"e_1_2_1_4_1","volume-title":"Introducing GitHub Copilot: your AI pair programmer. https:\/\/github.blog\/2021-06-29-introducing-github-copilot-ai-pair-programmer\/","author":"Friedman Nat","year":"2021","unstructured":"Nat Friedman. 2021. Introducing GitHub Copilot: your AI pair programmer. https:\/\/github.blog\/2021-06-29-introducing-github-copilot-ai-pair-programmer\/ (2021)."},{"key":"e_1_2_1_5_1","first-page":"2508","article-title":"Scrutinizer: A mixed-initiative approach to large-scale, data-driven claim verification","volume":"13","author":"Karagiannis Georgios","year":"2020","unstructured":"Georgios Karagiannis, Mohammed Saeed, Paolo Papotti, and Immanuel Trummer. 2020. Scrutinizer: A mixed-initiative approach to large-scale, data-driven claim verification. VLDB 13, 12 (2020), 2508--2521.","journal-title":"VLDB"},{"key":"e_1_2_1_6_1","doi-asserted-by":"crossref","unstructured":"Fei Li and HV Jagadish. 2014. NaLIR: an interactive natural language interface for querying relational databases. In SIGMOD. 709--712.","DOI":"10.1145\/2588555.2594519"},{"key":"e_1_2_1_7_1","unstructured":"OpenAI. 2021. https:\/\/openai.com\/blog\/openai-codex\/."},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.14778\/3494124.3494149"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.14778\/3457390.3457391"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.14778\/3551793.3551841"},{"key":"e_1_2_1_11_1","doi-asserted-by":"crossref","unstructured":"Immanuel Trummer. 2022. DB-BERT: a database tuning tool that \"reads the manual\". In SIGMOD. 190--203.","DOI":"10.1145\/3514221.3517843"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.14778\/3554821.3554896"},{"key":"e_1_2_1_13_1","unstructured":"Ashish Vaswani Noam Shazeer Niki Parmar Jakob Uszkoreit Llion Jones Aidan N. Gomez \u0141ukasz Kaiser and Illia Polosukhin. 2017. Attention is all you need. In Advances in Neural Information Processing Systems. 5999--6009. arXiv:1706.03762"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/d18-1425"},{"key":"e_1_2_1_15_1","volume-title":"Seq2SQL: Generating Structured Queries from Natural Language using Reinforcement Learning. CoRR abs\/1709.0, 1","author":"Zhong Victor","year":"2017","unstructured":"Victor Zhong, Caiming Xiong, and Richard Socher. 2017. Seq2SQL: Generating Structured Queries from Natural Language using Reinforcement Learning. CoRR abs\/1709.0, 1 (2017), 1--12. arXiv:1709.00103 http:\/\/arxiv.org\/abs\/1709.00103"}],"container-title":["Proceedings of the VLDB Endowment"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.14778\/3611540.3611630","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,9,10]],"date-time":"2025-09-10T22:36:53Z","timestamp":1757543813000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.14778\/3611540.3611630"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,8]]},"references-count":15,"journal-issue":{"issue":"12","published-print":{"date-parts":[[2023,8]]}},"alternative-id":["10.14778\/3611540.3611630"],"URL":"https:\/\/doi.org\/10.14778\/3611540.3611630","relation":{},"ISSN":["2150-8097"],"issn-type":[{"value":"2150-8097","type":"print"}],"subject":[],"published":{"date-parts":[[2023,8]]},"assertion":[{"value":"2023-08-01","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}