{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,3]],"date-time":"2026-06-03T22:12:20Z","timestamp":1780524740744,"version":"3.54.1"},"reference-count":67,"publisher":"Association for Computing Machinery (ACM)","issue":"8","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["Proc. VLDB Endow."],"published-print":{"date-parts":[[2024,4]]},"abstract":"<jats:p>We apply foundation models to data discovery and exploration tasks. Foundation models are large language models (LLMS) that show promising performance on a range of diverse tasks unrelated to their training. We show that these models are highly applicable to the data discovery and data exploration domain. When carefully used, they have superior capability on three representative tasks: table-class detection, column-type annotation and join-column prediction. On all three tasks, we show that a foundation-model-based approach outperforms the task-specific models and so the state of the art. Further, our approach often surpasses human-expert task performance. We investigate the fundamental characteristics of this approach including generalizability to several foundation models and the impact of non-determinism on the outputs. All in all, this suggests a future direction in which disparate data management tasks can be unified under foundation models.<\/jats:p>","DOI":"10.14778\/3659437.3659461","type":"journal-article","created":{"date-parts":[[2024,5,31]],"date-time":"2024-05-31T16:22:27Z","timestamp":1717172547000},"page":"2104-2114","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":30,"title":["Chorus: Foundation Models for Unified Data Discovery and Exploration"],"prefix":"10.14778","volume":"17","author":[{"given":"Moe","family":"Kayali","sequence":"first","affiliation":[{"name":"University of Washington"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Anton","family":"Lykov","sequence":"additional","affiliation":[{"name":"University of Washington"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Ilias","family":"Fountalis","sequence":"additional","affiliation":[{"name":"RelationalAI"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Nikolaos","family":"Vasiloglou","sequence":"additional","affiliation":[{"name":"RelationalAI"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Dan","family":"Olteanu","sequence":"additional","affiliation":[{"name":"University of Zurich"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Dan","family":"Suciu","sequence":"additional","affiliation":[{"name":"University of Washington"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"320","published-online":{"date-parts":[[2024,5,31]]},"reference":[{"key":"e_1_2_1_1_1","unstructured":"Nora Abdelmageed Jiaoyan Chen Vincenzo Cutrona Vasilis Efthymiou Oktie Hassanzadeh Madelon Hulsebos Ernesto Jim\u00e9nez-Ruiz Juan Sequeda and Kavitha Srinivas. Results of semtab 2022. In Vasilis Efthymiou Ernesto Jim\u00e9nez-Ruiz Jiaoyan Chen Vincenzo Cutrona Oktie Hassanzadeh Juan Sequeda Kavitha Srinivas Nora Abdelmageed and Madelon Hulsebos editors Proceedings of the Semantic Web Challenge on Tabular Data to Knowledge Graph Matching SemTab 2021 co-located with the 21st International Semantic Web Conference ISWC 2022 Virtual conference October 23--27 2022 volume 3320 of CEUR Workshop Proceedings pages 1--13. CEUR-WS.org 2022. URL https:\/\/ceur-ws.org\/Vol-3320\/paper0.pdf."},{"key":"e_1_2_1_2_1","volume-title":"July","author":"Inc.","year":"2021","unstructured":"Inc.Anaconda. State of data science. https:\/\/www.anaconda.com\/resources\/whitepapers\/state-of-data-science-2021, July 2021."},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.findings-emnlp.423"},{"key":"e_1_2_1_4_1","volume-title":"Language models enable simple systems for generating structured views of heterogeneous data lakes","author":"Arora Simran","year":"2023","unstructured":"Simran Arora, Brandon Yang, Sabri Eyuboglu, Avanika Narayan, Andrew Hojel, Immanuel Trummer, and Christopher R\u00e9. Language models enable simple systems for generating structured views of heterogeneous data lakes, 2023."},{"key":"e_1_2_1_5_1","unstructured":"Rishi Bommasani Drew A. Hudson Ehsan Adeli Russ B. Altman Simran Arora Sydney von Arx Michael S. Bernstein Jeannette Bohg Antoine Bosselut Emma Brunskill Erik Brynjolfsson Shyamal Buch Dallas Card Rodrigo Castellon Niladri S. Chatterji Annie S. Chen Kathleen Creel Jared Quincy Davis Dorottya Demszky Chris Donahue Moussa Doumbouya Esin Durmus Stefano Ermon John Etchemendy Kawin Ethayarajh Li Fei-Fei Chelsea Finn Trevor Gale Lauren Gillespie Karan Goel Noah D. Goodman Shelby Grossman Neel Guha Tatsunori Hashimoto Peter Henderson John Hewitt Daniel E. Ho Jenny Hong Kyle Hsu Jing Huang Thomas Icard Saahil Jain Dan Jurafsky Pratyusha Kalluri Siddharth Karamcheti Geoff Keeling Fereshte Khani Omar Khattab Pang Wei Koh Mark S. Krass Ranjay Krishna Rohith Kuditipudi and et al. On the opportunities and risks of foundation models. CoRR abs\/2108.07258 2021. URL https:\/\/arxiv.org\/abs\/2108.07258."},{"key":"e_1_2_1_6_1","volume-title":"Yuanzhi Li, Scott Lundberg, Harsha Nori, Hamid Palangi, Marco Tulio Ribeiro, and Yi Zhang. Sparks of artificial general intelligence: Early experiments with gpt-4","author":"Bubeck S\u00e9bastien","year":"2023","unstructured":"S\u00e9bastien Bubeck, Varun Chandrasekaran, Ronen Eldan, Johannes Gehrke, Eric Horvitz, Ece Kamar, Peter Lee, Yin Tat Lee, Yuanzhi Li, Scott Lundberg, Harsha Nori, Hamid Palangi, Marco Tulio Ribeiro, and Yi Zhang. Sparks of artificial general intelligence: Early experiments with gpt-4, 2023."},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.14778\/1453856.1453916"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.14778\/3476311.3476346"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00778-019-00564-x"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.14778\/2733004.2733014"},{"key":"e_1_2_1_11_1","volume-title":"Mata v. avianca, inc. (1:22-cv-01461). Southern District of New York","author":"District Court.","year":"2023","unstructured":"District Court. Mata v. avianca, inc. (1:22-cv-01461). Southern District of New York, New York, June 2023."},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/564691.564719"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.5555\/3430915.3442430"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.emnlp-main.98"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2212.07588"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/3555041.3589409"},{"key":"e_1_2_1_17_1","volume-title":"Common crawl","author":"Foundation Common Crawl","year":"2011","unstructured":"Common Crawl Foundation. Common crawl, 2011. URL https:\/\/commoncrawl.org."},{"key":"e_1_2_1_18_1","volume-title":"The pile: An 800gb dataset of diverse text for language modeling. CoRR, abs\/2101.00027","author":"Gao Leo","year":"2021","unstructured":"Leo Gao, Stella Biderman, Sid Black, Laurence Golding, Travis Hoppe, Charles Foster, Jason Phang, Horace He, Anish Thite, Noa Nabeshima, Shawn Presser, and Connor Leahy. The pile: An 800gb dataset of diverse text for language modeling. CoRR, abs\/2101.00027, 2021. URL https:\/\/arxiv.org\/abs\/2101.00027."},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.findings-emnlp.301"},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/1265530.1265535"},{"key":"e_1_2_1_21_1","first-page":"Q1","article-title":"The Forrester Wave","author":"Gualtieri Mike","year":"2016","unstructured":"Mike Gualtieri and Noel Yuhanna. The Forrester Wave: Big Data Hadoop Distributions, Q1 2016. Forrester Research, Inc., January 2016.","journal-title":"Big Data Hadoop Distributions"},{"key":"e_1_2_1_22_1","article-title":"latest large language model survived only three days online","author":"Heaven Will Douglas","year":"2022","unstructured":"Will Douglas Heaven. Why meta's latest large language model survived only three days online. MIT Technology Review, November 2022. URL https:\/\/www.technologyreview.com\/2022\/11\/18\/1063487\/meta-large-language-model-ai-only-survived-three-days-gpt-3-science\/.","journal-title":"MIT Technology Review"},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1353\/lib.0.0036"},{"key":"e_1_2_1_24_1","volume-title":"8th International Conference on Learning Representations, ICLR 2020","author":"Holtzman Ari","year":"2020","unstructured":"Ari Holtzman, Jan Buys, Li Du, Maxwell Forbes, and Yejin Choi. The curious case of neural text degeneration. In 8th International Conference on Learning Representations, ICLR 2020, Addis Ababa, Ethiopia, April 26--30, 2020. OpenReview.net, 2020. URL https:\/\/openreview.net\/forum?id=rygGQyrFvH."},{"key":"e_1_2_1_25_1","volume-title":"Proceedings of the 2019 Conference on Human Factors in Computing Systems (CHI). ACM","author":"Hu Kevin","year":"2019","unstructured":"Kevin Hu, Neil Gaikwad, Michiel Bakker, Madelon Hulsebos, Emanuel Zgraggen, C\u00e9sar Hidalgo, Tim Kraska, Guoliang Li, Arvind Satyanarayan, and \u00c7a\u011fatay Demiralp. Viznet: Towards a large-scale visualization learning and benchmarking repository. In Proceedings of the 2019 Conference on Human Factors in Computing Systems (CHI). ACM, 2019."},{"key":"e_1_2_1_26_1","volume-title":"Language models as zero-shot planners: Extracting actionable knowledge for embodied agents. arXiv preprint arXiv:2201.07207","author":"Huang Wenlong","year":"2022","unstructured":"Wenlong Huang, Pieter Abbeel, Deepak Pathak, and Igor Mordatch. Language models as zero-shot planners: Extracting actionable knowledge for embodied agents. arXiv preprint arXiv:2201.07207, 2022."},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/3292500.3330993"},{"key":"e_1_2_1_28_1","volume-title":"Gittables: A large-scale corpus of relational tables. CoRR, abs\/2106.07258","author":"Hulsebos Madelon","year":"2021","unstructured":"Madelon Hulsebos, \u00c7agatay Demiralp, and Paul Groth. Gittables: A large-scale corpus of relational tables. CoRR, abs\/2106.07258, 2021. URL https:\/\/arxiv.org\/abs\/2106.07258."},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1"},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/1978942.1979444"},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2306.09610"},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1145\/3588689"},{"key":"e_1_2_1_33_1","volume-title":"Generating table vector representations. CoRR, abs\/2110.15132","author":"Koleva Aneta","year":"2021","unstructured":"Aneta Koleva, Martin Ringsquandl, Mitchell Joblin, and Volker Tresp. Generating table vector representations. CoRR, abs\/2110.15132, 2021. URL https:\/\/arxiv.org\/abs\/2110.15132."},{"key":"e_1_2_1_34_1","first-page":"729","volume-title":"Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence, IJCAI 97","author":"Kushmerick Nicholas","year":"1997","unstructured":"Nicholas Kushmerick, Daniel S. Weld, and Robert B. Doorenbos. Wrapper induction for information extraction. In Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence, IJCAI 97, Nagoya, Japan, August 23--29, 1997, 2 Volumes, pages 729--737. Morgan Kaufmann, 1997."},{"key":"e_1_2_1_35_1","volume-title":"The mindlessness of ostensibly thoughtful action: The role of\" placebic\" information in interpersonal interaction. Journal of personality and social psychology, 36(6):635","author":"Langer Ellen J","year":"1978","unstructured":"Ellen J Langer, Arthur Blank, and Benzion Chanowitz. The mindlessness of ostensibly thoughtful action: The role of\" placebic\" information in interpersonal interaction. Journal of personality and social psychology, 36(6):635, 1978."},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1038\/44964"},{"key":"e_1_2_1_37_1","first-page":"707","volume-title":"Soviet Physics Doklady","volume":"10","author":"Levenshtein Vladimir","year":"1966","unstructured":"Vladimir Levenshtein. Binary codes capable of correcting deletions, insertions and reversals. In Soviet Physics Doklady, volume 10, page 707, 1966."},{"key":"e_1_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.acl-long.229"},{"key":"e_1_2_1_39_1","volume-title":"When not to trust language models: Investigating effectiveness and limitations of parametric and non-parametric memories","author":"Mallen Alex","year":"2022","unstructured":"Alex Mallen, Akari Asai, Victor Zhong, Rajarshi Das, Hannaneh Hajishirzi, and Daniel Khashabi. When not to trust language models: Investigating effectiveness and limitations of parametric and non-parametric memories, 2022."},{"key":"e_1_2_1_40_1","first-page":"1813","volume-title":"Proceedings of the Eighth International Conference on Language Resources and Evaluation, LREC 2012","author":"Mendes Pablo N.","year":"2012","unstructured":"Pablo N. Mendes, Max Jakob, and Christian Bizer. Dbpedia: A multilingual cross-domain knowledge base. In Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Mehmet Ugur Dogan, Bente Maegaard, Joseph Mariani, Jan Odijk, and Stelios Piperidis, editors, Proceedings of the Eighth International Conference on Language Resources and Evaluation, LREC 2012, Istanbul, Turkey, May 23--25, 2012, pages 1813--1817. European Language Resources Association (ELRA), 2012. URL http:\/\/www.lrec-conf.org\/proceedings\/lrec2012\/summaries\/570.html."},{"key":"e_1_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.14778\/3574245.3574258"},{"key":"e_1_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.14778\/3352063.3352116"},{"key":"e_1_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2021.3091101"},{"key":"e_1_2_1_44_1","volume-title":"Electric vehicle population data electric vehicle population data, 04","author":"Washington State Department of Licensing.","year":"2023","unstructured":"Washington State Department of Licensing. Electric vehicle population data electric vehicle population data, 04 2023. URL https:\/\/catalog.data.gov\/dataset\/electric-vehicle-population-data."},{"key":"e_1_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2203.02155"},{"key":"e_1_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1002\/pra2.457"},{"key":"e_1_2_1_47_1","unstructured":"Jack W. Rae Sebastian Borgeaud Trevor Cai Katie Millican Jordan Hoffmann H. Francis Song John Aslanides Sarah Henderson Roman Ring Susannah Young Eliza Rutherford Tom Hennigan Jacob Menick Albin Cassirer Richard Powell George van den Driessche Lisa Anne Hendricks Maribeth Rauh Po-Sen Huang Amelia Glaese Johannes Welbl Sumanth Dathathri Saffron Huang Jonathan Uesato John Mellor Irina Higgins Antonia Creswell Nat McAleese Amy Wu Erich Elsen Siddhant M. Jayakumar Elena Buchatskaya David Budden Esme Sutherland Karen Simonyan Michela Paganini Laurent Sifre Lena Martens Xiang Lorraine Li Adhiguna Kuncoro Aida Nematzadeh Elena Gribovskaya Domenic Donato Angeliki Lazaridou Arthur Mensch Jean-Baptiste Lespiau Maria Tsimpoukelli Nikolai Grigorev Doug Fritz Thibault Sottiaux Mantas Pajarskas Toby Pohlen Zhitao Gong Daniel Toyama Cyprien de Masson d'Autume Yujia Li Tayfun Terzi Vladimir Mikulik Igor Babuschkin Aidan Clark Diego de Las Casas Aurelia Guy Chris Jones James Bradbury Matthew J. Johnson Blake A. Hechtman Laura Weidinger Iason Gabriel William Isaac Edward Lockhart Simon Osindero Laura Rimell Chris Dyer Oriol Vinyals Kareem Ayoub Jeff Stanway Lorrayne Bennett Demis Hassabis Koray Kavukcuoglu and Geoffrey Irving. Scaling language models: Methods analysis & insights from training gopher. CoRR abs\/2112.11446 2021. URL https:\/\/arxiv.org\/abs\/2112.11446."},{"key":"e_1_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.5441\/002\/edbt.2017.20"},{"key":"e_1_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE53745.2022.00264"},{"key":"e_1_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2302.00093"},{"key":"e_1_2_1_51_1","doi-asserted-by":"publisher","unstructured":"Karan Singhal Shekoofeh Azizi Tao Tu S. Sara Mahdavi Jason Wei Hyung Won Chung Nathan Scales Ajay Kumar Tanwani Heather Cole-Lewis Stephen Pfohl Perry Payne Martin Seneviratne Paul Gamble Chris Kelly Nathaneal Sch\u00e4rli Aakanksha Chowdhery Philip Andrew Mansfield Blaise Ag\u00fcera y Arcas Dale R. Webster Gregory S. Corrado Yossi Matias Katherine Chou Juraj Gottweis Nenad Tomasev Yun Liu Alvin Rajkomar Joelle K. Barral Christopher Semturs Alan Karthikesalingam and Vivek Natarajan. Large language models encode clinical knowledge. CoRR abs\/2212.13138 2022. 10.48550\/arXiv.2212.13138","DOI":"10.48550\/arXiv.2212.13138"},{"key":"e_1_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.1145\/3514221.3517906"},{"key":"e_1_2_1_53_1","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2211.09085"},{"key":"e_1_2_1_54_1","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2302.13971"},{"key":"e_1_2_1_55_1","doi-asserted-by":"publisher","unstructured":"Hugo Touvron Louis Martin Kevin Stone Peter Albert Amjad Almahairi Yasmine Babaei Nikolay Bashlykov Soumya Batra Prajjwal Bhargava Shruti Bhosale Dan Bikel Lukas Blecher Cristian Canton-Ferrer Moya Chen Guillem Cucurull David Esiobu Jude Fernandes Jeremy Fu Wenyin Fu Brian Fuller Cynthia Gao Vedanuj Goswami Naman Goyal Anthony Hartshorn Saghar Hosseini Rui Hou Hakan Inan Marcin Kardas Viktor Kerkez Madian Khabsa Isabel Kloumann Artem Korenev Punit Singh Koura Marie-Anne Lachaux Thibaut Lavril Jenya Lee Diana Liskovich Yinghai Lu Yuning Mao Xavier Martinet Todor Mihaylov Pushkar Mishra Igor Molybog Yixin Nie Andrew Poulton Jeremy Reizenstein Rashi Rungta Kalyan Saladi Alan Schelten Ruan Silva Eric Michael Smith Ranjan Subramanian Xiaoqing Ellen Tan Binh Tang Ross Taylor Adina Williams Jian Xiang Kuan Puxin Xu Zheng Yan Iliyan Zarov Yuchen Zhang Angela Fan Melanie Kambadur Sharan Narang Aur\u00e9lien Rodriguez Robert Stojnic Sergey Edunov and Thomas Scialom. Llama 2: Open foundation and fine-tuned chat models. CoRR abs\/2307.09288 2023. 10.48550\/arXiv.2307.09288","DOI":"10.48550\/arXiv.2307.09288"},{"key":"e_1_2_1_56_1","volume-title":"Trifacta wrangler. https:\/\/cloud.trifacta.com","year":"2023","unstructured":"Trifacta. Trifacta wrangler. https:\/\/cloud.trifacta.com, 2023. Accessed: 2023-04-10."},{"key":"e_1_2_1_57_1","volume-title":"Can deep neural networks predict data correlations from column names? CoRR, abs\/2107.04553","author":"Trummer Immanuel","year":"2021","unstructured":"Immanuel Trummer. Can deep neural networks predict data correlations from column names? CoRR, abs\/2107.04553, 2021. URL https:\/\/arxiv.org\/abs\/2107.04553."},{"key":"e_1_2_1_58_1","volume-title":"12th Conference on Innovative Data Systems Research, CIDR 2022","author":"Trummer Immanuel","year":"2022","unstructured":"Immanuel Trummer. Towards nlp-enhanced data profiling tools. In 12th Conference on Innovative Data Systems Research, CIDR 2022, Chaminade, CA, USA, January 9--12, 2022. www.cidrdb.org, 2022. URL https:\/\/www.cidrdb.org\/cidr2022\/papers\/a55-trummer.pdf."},{"key":"e_1_2_1_60_1","volume-title":"Tatsunori Hashimoto, Oriol Vinyals, Percy Liang, Jeff Dean, and William Fedus. Emergent abilities of large language models","author":"Wei Jason","year":"2022","unstructured":"Jason Wei, Yi Tay, Rishi Bommasani, Colin Raffel, Barret Zoph, Sebastian Borgeaud, Dani Yogatama, Maarten Bosma, Denny Zhou, Donald Metzler, Ed H. Chi, Tatsunori Hashimoto, Oriol Vinyals, Percy Liang, Jeff Dean, and William Fedus. Emergent abilities of large language models, 2022."},{"key":"e_1_2_1_61_1","doi-asserted-by":"publisher","DOI":"10.1145\/3318464.3389738"},{"key":"e_1_2_1_62_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.acl-main.745"},{"key":"e_1_2_1_63_1","doi-asserted-by":"publisher","DOI":"10.1145\/2882903.2904442"},{"key":"e_1_2_1_64_1","doi-asserted-by":"publisher","DOI":"10.14778\/3407790.3407793"},{"key":"e_1_2_1_65_1","volume-title":"How language model hallucinations can snowball","author":"Zhang Muru","year":"2023","unstructured":"Muru Zhang, Ofir Press, William Merrill, Alisa Liu, and Noah A. Smith. How language model hallucinations can snowball, 2023."},{"key":"e_1_2_1_66_1","series-title":"Proceedings of Machine Learning Research","first-page":"12697","volume-title":"Proceedings of the 38th International Conference on Machine Learning, ICML","author":"Zhao Zihao","year":"2021","unstructured":"Zihao Zhao, Eric Wallace, Shi Feng, Dan Klein, and Sameer Singh. Calibrate before use: Improving few-shot performance of language models. In Marina Meila and Tong Zhang, editors, Proceedings of the 38th International Conference on Machine Learning, ICML 2021, 18--24 July 2021, Virtual Event, volume 139 of Proceedings of Machine Learning Research, pages 12697--12706. PMLR, 2021. URL http:\/\/proceedings.mlr.press\/v139\/zhao21c.html."},{"key":"e_1_2_1_67_1","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2306.05685"},{"key":"e_1_2_1_68_1","doi-asserted-by":"publisher","DOI":"10.1145\/3299869.3300065"}],"container-title":["Proceedings of the VLDB Endowment"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.14778\/3659437.3659461","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,5,31]],"date-time":"2024-05-31T16:28:04Z","timestamp":1717172884000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.14778\/3659437.3659461"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,4]]},"references-count":67,"journal-issue":{"issue":"8","published-print":{"date-parts":[[2024,4]]}},"alternative-id":["10.14778\/3659437.3659461"],"URL":"https:\/\/doi.org\/10.14778\/3659437.3659461","relation":{},"ISSN":["2150-8097"],"issn-type":[{"value":"2150-8097","type":"print"}],"subject":[],"published":{"date-parts":[[2024,4]]},"assertion":[{"value":"2024-05-31","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}