{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,24]],"date-time":"2026-04-24T07:50:14Z","timestamp":1777017014473,"version":"3.51.4"},"publisher-location":"New York, NY, USA","reference-count":43,"publisher":"ACM","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2025,12,17]]},"DOI":"10.1145\/3799830.3799884","type":"proceedings-article","created":{"date-parts":[[2026,4,24]],"date-time":"2026-04-24T06:45:08Z","timestamp":1777013108000},"page":"366-375","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["VayuBench and VayuChat: Executable Benchmarking and Deployment of LLMs for Multi-Dataset Air Quality Analytics"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0009-0003-6240-4457","authenticated-orcid":false,"given":"Vedant","family":"Acharya","sequence":"first","affiliation":[{"name":"CSE, IIT Gandhinagar, Gandhinagar, Gujarat, India"}]},{"ORCID":"https:\/\/orcid.org\/0009-0003-7504-3339","authenticated-orcid":false,"given":"Abhay","family":"Pisharodi","sequence":"additional","affiliation":[{"name":"CSE, IIT Gandhinagar, Gandhinagar, Gujarat, India"}]},{"ORCID":"https:\/\/orcid.org\/0009-0000-0668-9898","authenticated-orcid":false,"given":"Ratnesh","family":"Pasi","sequence":"additional","affiliation":[{"name":"CSE, IIIT Surat, Surat, Gujarat, India"}]},{"ORCID":"https:\/\/orcid.org\/0009-0002-5271-514X","authenticated-orcid":false,"given":"Rishabh","family":"Mondal","sequence":"additional","affiliation":[{"name":"CSE, IIT Gandhinagar, Gandhinagar, Gujarat, India"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0736-7169","authenticated-orcid":false,"given":"Nipun","family":"Batra","sequence":"additional","affiliation":[{"name":"CSE, IIT Gandhinagar, Gandhinagar, Gujarat, India"}]}],"member":"320","published-online":{"date-parts":[[2026,4,23]]},"reference":[{"key":"e_1_3_3_2_2_2","unstructured":"Anyscale (Ray project). 2025. Ray: Scale Machine Learning & AI Computing. https:\/\/www.ray.io\/. Accessed: 2025-08-18."},{"key":"e_1_3_3_2_3_2","volume-title":"ICLR","author":"Austin Jacob","year":"2021","unstructured":"Jacob Austin, Augustus Odena, Maxwell Nye, Maarten Bosma, Henryk Michalewski, David Dohan, Ellen Jiang, Carrie Cai, Michael Terry, Quoc Le, et\u00a0al. 2021. Program Synthesis with Large Language Models. In ICLR."},{"key":"e_1_3_3_2_4_2","unstructured":"Jacob Austin Augustus Odena Maxwell Nye Maarten Bosma Henryk Michalewski David Dohan Ellen Jiang Carrie Cai Michael Terry Quoc Le and Charles Sutton. 2021. Program synthesis with large language models. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2108.07732 (2021)."},{"key":"e_1_3_3_2_5_2","unstructured":"Tom\u00a0B. Brown Benjamin Mann Nick Ryder Melanie Subbiah Jared Kaplan Prafulla Dhariwal Arvind Neelakantan Pranav Shyam Girish Sastry Amanda Askell Sandhini Agarwal Ariel Herbert-Voss Gretchen Krueger Tom Henighan Rewon Child Aditya Ramesh Daniel\u00a0M. Ziegler Jeffrey Wu Clemens Winter Christopher Hesse Mark Chen Eric Sigler Mateusz Litwin Scott Gray Benjamin Chess Jack Clark Christopher Berner Sam McCandlish Alec Radford Ilya Sutskever and Dario Amodei. 2020. Language Models are Few-Shot Learners. arXiv:https:\/\/arXiv.org\/abs\/2005.14165\u00a0[cs.CL] https:\/\/arxiv.org\/abs\/2005.14165"},{"key":"e_1_3_3_2_6_2","doi-asserted-by":"crossref","unstructured":"David\u00a0C. Carslaw and Karl Ropkins. 2012. openair \u2014 An R package for air quality data analysis. Environmental Modelling & Software 27\u201328 0 (2012) 52\u201361. doi:10.1016\/j.envsoft.2011.09.008","DOI":"10.1016\/j.envsoft.2011.09.008"},{"key":"e_1_3_3_2_7_2","unstructured":"Central Pollution Control Board. 2025. CPCB - Overview and Functions. https:\/\/cpcb.nic.in\/."},{"key":"e_1_3_3_2_8_2","unstructured":"Centre for Science and Environment. 2025. Air Quality and Public Health. https:\/\/www.cseindia.org\/page\/air-quality-and-public-health. Accessed: 2025-08-18."},{"key":"e_1_3_3_2_9_2","unstructured":"Mark Chen Jerry Tworek Heewoo Jun Qiming Yuan Henrique\u00a0Ponde de Oliveira\u00a0Pinto Jared Kaplan Harri Edwards Yuri Burda Nicholas Joseph Greg Brockman Alex Ray Raul Puri Gretchen Krueger Michael Petrov Heidy Khlaaf Girish Sastry Pamela Mishkin Brooke Chan Scott Gray Nick Ryder Mikhail Pavlov Alethea Power Lukasz Kaiser Mohammad Bavarian Clemens Winter Philippe Tillet Felipe\u00a0Petroski Such Dave Cummings Matthias Plappert Fotios Chantzis Elizabeth Barnes Ariel Herbert-Voss William\u00a0Hebgen Guss Alex Nichol Alex Paino Nikolas Tezak Jie Tang Igor Babuschkin Suchir Balaji Shantanu Jain William Saunders Christopher Hesse Andrew\u00a0N. Carr Jan Leike Josh Achiam Vedant Misra Evan Morikawa Alec Radford Matthew Knight Miles Brundage Mira Murati Katie Mayer Peter Welinder Bob McGrew Dario Amodei Sam McCandlish Ilya Sutskever and Wojciech Zaremba. 2021. Evaluating Large Language Models Trained on Code. arXiv:https:\/\/arXiv.org\/abs\/2107.03374\u00a0[cs.LG] https:\/\/arxiv.org\/abs\/2107.03374"},{"key":"e_1_3_3_2_10_2","unstructured":"Mark Chen Jerry Tworek Heewoo Jun Qiming Yuan Henrique\u00a0Ponde de Oliveira\u00a0Pinto Jared Kaplan Harri Edwards Yuri Burda Nicholas Joseph Greg Brockman Alex Ray Raul Puri Gretchen Krueger Michael Petrov Heidy Khlaaf Girish Sastry Pamela Mishkin Brooke Chan Scott Gray Nick Ryder Mikhail Pavlov Alethea Power Lukasz Kaiser Mohammad Bavarian Clemens Winter Philippe Tillet Felipe\u00a0Petroski Such Dave Cummings Matthias Plappert Fotios Chantzis Elizabeth Barnes Ariel Herbert-Voss William\u00a0Hebgen Guss Alex Nichol Alex Paino Nikolas Tezak Jie Tang Igor Babuschkin Suchir Balaji Shantanu Jain William Saunders Christopher Hesse Andrew\u00a0N. Carr Jan Leike Josh Achiam Vedant Misra Evan Morikawa Alec Radford Matthew Knight Miles Brundage Mira Murati Katie Mayer Peter Welinder Bob McGrew Dario Amodei Sam McCandlish Ilya Sutskever and Wojciech Zaremba. 2021. Evaluating Large Language Models Trained on Code. arXiv:https:\/\/arXiv.org\/abs\/2107.03374\u00a0[cs.LG] https:\/\/arxiv.org\/abs\/2107.03374"},{"key":"e_1_3_3_2_11_2","unstructured":"Wenhu Chen Jianshu Chen Yu Su Zhiyu Chen and William\u00a0Yang Wang. 2020. Logical Natural Language Generation from Open-Domain Tables. arXiv:https:\/\/arXiv.org\/abs\/2004.10404\u00a0[cs.CL] https:\/\/arxiv.org\/abs\/2004.10404"},{"key":"e_1_3_3_2_12_2","unstructured":"Clarity Movement. 2022. How Open Access Air Pollution Data is Paving the Way for Greater Air Quality Awareness. https:\/\/www.clarity.io\/blog\/how-open-access-air-pollution-data-is-paving-the-way-for-greater-air-quality-awareness Accessed: 2025-05-15."},{"key":"e_1_3_3_2_13_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2025.findings-acl.814"},{"key":"e_1_3_3_2_14_2","unstructured":"CPCB. 2025. National air quality index Real-time data. https:\/\/airquality.cpcb.gov.in\/AQI_India\/ Accessed: 2025-05-15."},{"key":"e_1_3_3_2_15_2","unstructured":"Yahia Dalbah Marcel Worring and Yen-Chia Hsu. 2025. Veli: Unsupervised Method and Unified Benchmark for Low-Cost Air Quality Sensor Correction. arXiv:https:\/\/arXiv.org\/abs\/2508.02724\u00a0[eess.SP] https:\/\/arxiv.org\/abs\/2508.02724"},{"key":"e_1_3_3_2_16_2","unstructured":"Aleksandra Eliseeva Alexander Kovrigin Ilia Kholkin Egor Bogomolov and Yaroslav Zharov. 2025. EnvBench: A Benchmark for Automated Environment Setup. arXiv:https:\/\/arXiv.org\/abs\/2503.14443\u00a0[cs.LG] https:\/\/arxiv.org\/abs\/2503.14443"},{"key":"e_1_3_3_2_17_2","unstructured":"Energy Policy Institute at the University of Chicago (EPIC). 2023. AQLI India Fact Sheet. EPIC Report August 2023 https:\/\/aqli.epic.uchicago.edu\/wp-content\/uploads\/2023\/08\/India-FactSheet-2023_Final.pdf. Accessed: 2025-08-13."},{"key":"e_1_3_3_2_18_2","unstructured":"Dan Hendrycks Collin Burns Steven Basart Andy Zou Mantas Mazeika Dawn Song and Jacob Steinhardt. 2021. Measuring Coding Challenge Competence with APPS. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2105.09938 (2021)."},{"key":"e_1_3_3_2_19_2","doi-asserted-by":"crossref","unstructured":"Jonathan Herzig Pawe\u0142\u00a0Krzysztof Nowak Thomas M\u00fcller Francesco Piccinno and Julian\u00a0Martin Eisenschlos. 2020. TaPas: Weakly supervised table parsing via pre-training. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2004.02349 (2020).","DOI":"10.18653\/v1\/2020.acl-main.398"},{"key":"e_1_3_3_2_20_2","unstructured":"Kethmi\u00a0Hirushini Hettige Jiahao Ji Shili Xiang Cheng Long Gao Cong and Jingyuan Wang. 2024. AirPhyNet: Harnessing Physics-Guided Neural Networks for Air Quality Prediction. arXiv:https:\/\/arXiv.org\/abs\/2402.03784\u00a0[cs.LG] https:\/\/arxiv.org\/abs\/2402.03784 Accepted by ICLR 2024."},{"key":"e_1_3_3_2_21_2","unstructured":"Institute for Health Metrics and Evaluation (IHME). 2025. Global Burden of Disease (GBD). https:\/\/www.healthdata.org\/research-analysis\/gbd. Accessed: 2025-08-18."},{"key":"e_1_3_3_2_22_2","unstructured":"Jinxiang Lai Jie Zhang Jun Liu Jian Li Xiaocheng Lu and Song Guo. 2025. Spider: Any-to-Many Multimodal LLM. arXiv:https:\/\/arXiv.org\/abs\/2411.09439\u00a0[cs.CV] https:\/\/arxiv.org\/abs\/2411.09439"},{"key":"e_1_3_3_2_23_2","unstructured":"Yuhang Lai Chengxi Li Yiming Wang Tianyi Zhang Ruiqi Zhong Luke Zettlemoyer Scott\u00a0Wen tau Yih Daniel Fried Sida Wang and Tao Yu. 2022. DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation. arXiv:https:\/\/arXiv.org\/abs\/2211.11501\u00a0[cs.SE] https:\/\/arxiv.org\/abs\/2211.11501"},{"key":"e_1_3_3_2_24_2","unstructured":"Gyubok Lee Hyeonji Hwang Seongsu Bae Yeonsu Kwon Woncheol Shin Seongjun Yang Minjoon Seo Jong-Yeup Kim and Edward Choi. 2023. EHRSQL: A Practical Text-to-SQL Benchmark for Electronic Health Records. arXiv:https:\/\/arXiv.org\/abs\/2301.07695\u00a0[cs.CL] https:\/\/arxiv.org\/abs\/2301.07695"},{"key":"e_1_3_3_2_25_2","unstructured":"Jinyang Li Binyuan Hui Ge Qu Jiaxi Yang Binhua Li Bowen Li Bailin Wang Bowen Qin Rongyu Cao Ruiying Geng Nan Huo Xuanhe Zhou Chenhao Ma Guoliang Li Kevin C.\u00a0C. Chang Fei Huang Reynold Cheng and Yongbin Li. 2023. Can LLM Already Serve as A Database Interface? A BIg Bench for Large-Scale Database Grounded Text-to-SQLs. arXiv:https:\/\/arXiv.org\/abs\/2305.03111\u00a0[cs.CL] https:\/\/arxiv.org\/abs\/2305.03111"},{"key":"e_1_3_3_2_26_2","doi-asserted-by":"crossref","unstructured":"Yujia Li David Choi Matthew Gerstenberg Usman Humayoun Adria\u00a0Recasens Jose Pushmeet Kohli June Koo Young Lee Lin Li Wei Li et\u00a0al. 2022. Competition-Level Code Generation with AlphaCode. Science 378 6624 (2022) 1092\u20131097.","DOI":"10.1126\/science.abq1158"},{"key":"e_1_3_3_2_27_2","unstructured":"Shuai Lu Daya Guo Shuo Ren Junjie Huang Alexey Svyatkovskiy Ambrosio Blanco Colin Clement Dawn Drain Daxin Jiang Duyu Tang Ge Li Lidong Zhou Linjun Shou Long Zhou Michele Tufano Ming Gong Ming Zhou Nan Duan Neel Sundaresan Shao\u00a0Kun Deng Shengyu Fu and Shujie Liu. 2021. CodeXGLUE: A Machine Learning Benchmark Dataset for Code Understanding and Generation. arXiv:https:\/\/arXiv.org\/abs\/2102.04664\u00a0[cs.SE] https:\/\/arxiv.org\/abs\/2102.04664"},{"key":"e_1_3_3_2_28_2","first-page":"4376","volume-title":"ACL","author":"Moosavi Nafise\u00a0Sadat","year":"2021","unstructured":"Nafise\u00a0Sadat Moosavi, Lisa Fichtel, Sebastian Pado, and Iryna Gurevych. 2021. SciGen: A Dataset for Scientific Table-to-Text Generation. In ACL. 4376\u20134389."},{"key":"e_1_3_3_2_29_2","doi-asserted-by":"publisher","DOI":"10.1162\/tacl_a_00446"},{"key":"e_1_3_3_2_30_2","unstructured":"OpenAQ Project. 2024. OpenAQ: Open Air Quality Data Platform. https:\/\/openaq.org\/."},{"key":"e_1_3_3_2_31_2","unstructured":"Long Ouyang Jeff Wu Xu Jiang Diogo Almeida Carroll\u00a0L. Wainwright Pamela Mishkin Chong Zhang Sandhini Agarwal Katarina Slama Alex Ray John Schulman Jacob Hilton Fraser Kelton Luke Miller Maddie Simens Amanda Askell Peter Welinder Paul Christiano Jan Leike and Ryan Lowe. 2022. Training language models to follow instructions with human feedback. arXiv:https:\/\/arXiv.org\/abs\/2203.02155\u00a0[cs.CL] https:\/\/arxiv.org\/abs\/2203.02155"},{"key":"e_1_3_3_2_32_2","unstructured":"Anamika Pandey Michael Brauer Maureen\u00a0L Cropper Kalpana Balakrishnan Prashant Mathur Sagnik Dey Burak Turkgulu G\u00a0Anil Kumar Mukesh Khare Gufran Beig et\u00a0al. 2021. Health and economic impact of air pollution in the states of India: the Global Burden of Disease Study 2019. The Lancet Planetary Health 5 1 (2021) e25\u2013e38."},{"key":"e_1_3_3_2_33_2","unstructured":"Ankur\u00a0P. Parikh Xuezhi Wang Sebastian Gehrmann Manaal Faruqui Bhuwan Dhingra Diyi Yang and Dipanjan Das. 2020. ToTTo: A Controlled Table-To-Text Generation Dataset. arXiv:https:\/\/arXiv.org\/abs\/2004.14373\u00a0[cs.CL] https:\/\/arxiv.org\/abs\/2004.14373"},{"key":"e_1_3_3_2_34_2","unstructured":"Particle. 2025. Particle: An Integrated IoT Platform-as-a-Service. https:\/\/www.particle.io\/. Accessed: 2025-08-18."},{"key":"e_1_3_3_2_35_2","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/P15-1142"},{"key":"e_1_3_3_2_36_2","unstructured":"Ruchir Puri David Kung et\u00a0al. 2021. CodeNet: A Large-Scale AI for Code Dataset for Learning a Diversity of Coding Tasks. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2105.12655 (2021)."},{"key":"e_1_3_3_2_37_2","unstructured":"Timo Schick Jane Dwivedi-Yu Roberto Dess\u00ec Roberta Raileanu Maria Lomeli Shruti Bhosale Xian\u00a0Li Wang Leon Derczynski Mike Chrzanowski Thomas Scialom et\u00a0al. 2023. Toolformer: Language Models Can Teach Themselves to Use Tools. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2302.04761 (2023)."},{"key":"e_1_3_3_2_38_2","doi-asserted-by":"crossref","unstructured":"S.\u00a0De Vito E. Esposito M. Salvato O. Popoola F. Formisano R. Jones and G.\u00a0Di Francia. 2018. Calibrating chemical multisensory devices for real world applications: An in-depth comparison of quantitative Machine Learning approaches. Sensors and Actuators B: Chemical 255 (2018) 1191\u20131210. doi:10.1016\/j.snb.2017.07.155","DOI":"10.1016\/j.snb.2017.07.155"},{"key":"e_1_3_3_2_39_2","doi-asserted-by":"publisher","DOI":"10.1145\/3366423.3380120"},{"key":"e_1_3_3_2_40_2","unstructured":"Xiang Wei Xingyu Cui Ning Cheng Xiaobin Wang Xin Zhang Shen Huang Pengjun Xie Jinan Xu Yufeng Chen Meishan Zhang Yong Jiang and Wenjuan Han. 2023. Zero-Shot Information Extraction via Chatting with ChatGPT. arXiv:https:\/\/arXiv.org\/abs\/2302.10205\u00a0[cs.CL] https:\/\/arxiv.org\/abs\/2302.10205"},{"key":"e_1_3_3_2_41_2","unstructured":"Shunyu Yao Jeffrey Zhao Dian Yu Nan Du Izhak Shafran Karthik Narasimhan and Yuan Cao. 2023. ReAct: Synergizing Reasoning and Acting in Language Models. arXiv:https:\/\/arXiv.org\/abs\/2210.03629\u00a0[cs.CL] https:\/\/arxiv.org\/abs\/2210.03629"},{"key":"e_1_3_3_2_42_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-1443"},{"key":"e_1_3_3_2_43_2","first-page":"231","volume-title":"Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)","author":"Zhong Victor","year":"2017","unstructured":"Victor Zhong, Caiming Xiong, and Richard Socher. 2017. Seq2SQL: Generating Structured Queries from Natural Language using Reinforcement Learning. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 231\u2013240. doi:10.18653\/v1\/P17-1021"},{"key":"e_1_3_3_2_44_2","unstructured":"Jingwei Zuo Wenbin Li Michele Baldo and Hakim Hacid. 2023. Unleashing Realistic Air Quality Forecasting: Introducing the Ready-to-Use PurpleAirSF Dataset. arXiv:https:\/\/arXiv.org\/abs\/2306.13948\u00a0[cs.LG] https:\/\/arxiv.org\/abs\/2306.13948 Accepted by ACM SIGSPATIAL 2023."}],"event":{"name":"CODS 2025: 13th ACM IKDD International Conference on Data Science","location":"Pune India","acronym":"CODS 2025"},"container-title":["Proceedings of the 13th ACM IKDD International Conference on Data Science"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3799830.3799884","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,24]],"date-time":"2026-04-24T07:21:03Z","timestamp":1777015263000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3799830.3799884"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,12,17]]},"references-count":43,"alternative-id":["10.1145\/3799830.3799884","10.1145\/3799830"],"URL":"https:\/\/doi.org\/10.1145\/3799830.3799884","relation":{},"subject":[],"published":{"date-parts":[[2025,12,17]]},"assertion":[{"value":"2026-04-23","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}