{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,25]],"date-time":"2026-06-25T22:04:01Z","timestamp":1782425041979,"version":"3.54.5"},"reference-count":54,"publisher":"Association for Computing Machinery (ACM)","issue":"OOPSLA1","license":[{"start":{"date-parts":[[2024,4,29]],"date-time":"2024-04-29T00:00:00Z","timestamp":1714348800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"National Science Foundation","award":["1953933, 1652954"],"award-info":[{"award-number":["1953933, 1652954"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["Proc. ACM Program. Lang."],"published-print":{"date-parts":[[2024,4,29]]},"abstract":"<jats:p>While static analysis is instrumental in uncovering software bugs, its precision in analyzing large and intricate codebases remains challenging. The emerging prowess of Large Language Models (LLMs) offers a promising avenue to address these complexities. In this paper, we present LLift, a pioneering framework that synergizes static analysis and LLMs, with a spotlight on identifying use-before-initialization (UBI) bugs within the Linux kernel. Drawing from our insights into variable usage conventions in Linux, we enhance path analysis using post-constraint guidance. This approach, combined with our methodically crafted procedures, empowers LLift to adeptly handle the challenges of bug-specific modeling, extensive codebases, and the unpredictable nature of LLMs. Our real-world evaluations identified four previously undiscovered UBI bugs in the mainstream Linux kernel, which the Linux community has acknowledged. This study reaffirms the potential of marrying static analysis with LLMs, setting a compelling direction for future research in this area.<\/jats:p>","DOI":"10.1145\/3649828","type":"journal-article","created":{"date-parts":[[2024,4,29]],"date-time":"2024-04-29T17:53:50Z","timestamp":1714413230000},"page":"474-499","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":116,"title":["Enhancing Static Analysis for Practical Bug Detection: An LLM-Integrated Approach"],"prefix":"10.1145","volume":"8","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-0357-0888","authenticated-orcid":false,"given":"Haonan","family":"Li","sequence":"first","affiliation":[{"name":"University of California, Riverside, Riverside, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3944-3162","authenticated-orcid":false,"given":"Yu","family":"Hao","sequence":"additional","affiliation":[{"name":"University of California, Riverside, Riverside, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0005-8243-3495","authenticated-orcid":false,"given":"Yizhuo","family":"Zhai","sequence":"additional","affiliation":[{"name":"University of California, Riverside, Riverside, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1506-2522","authenticated-orcid":false,"given":"Zhiyun","family":"Qian","sequence":"additional","affiliation":[{"name":"University of California, Riverside, Riverside, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"320","published-online":{"date-parts":[[2024,4,29]]},"reference":[{"key":"e_1_2_1_1_1","volume-title":"2024 IEEE\/ACM 45th International Conference on Software Engineering (ICSE).","author":"Ahmed Toufique","unstructured":"Toufique Ahmed, Kunal Suresh Pai, Premkumar Devanbu, and Earl T. Barr. 2024. Automatic Semantic Augmentation of Language Model Prompts (for Code Summarization). In 2024 IEEE\/ACM 45th International Conference on Software Engineering (ICSE)."},{"key":"e_1_2_1_2_1","unstructured":"Anthropic (2023). 2023. Claude 2. https:\/\/www.anthropic.com\/index\/claude-2"},{"key":"e_1_2_1_3_1","unstructured":"Jiuhai Chen Lichang Chen Heng Huang and Tianyi Zhou. 2023. When do you need Chain-of-Thought Prompting for ChatGPT? arxiv:2304.03262 arXiv:2304.03262 [cs]"},{"key":"e_1_2_1_4_1","volume-title":"Jared Kaplan, Harri Edwards, Yuri Burda, Nicholas Joseph, and Greg Brockman.","author":"Chen Mark","year":"2021","unstructured":"Mark Chen, Jerry Tworek, Heewoo Jun, Qiming Yuan, Henrique Ponde de Oliveira Pinto, Jared Kaplan, Harri Edwards, Yuri Burda, Nicholas Joseph, and Greg Brockman. 2021. Evaluating large language models trained on code. arXiv preprint arXiv:2107.03374."},{"key":"e_1_2_1_5_1","unstructured":"Xinyun Chen Maxwell Lin Nathanael Sch\u00e4rli and Denny Zhou. 2023. Teaching Large Language Models to Self-Debug. arxiv:2304.05128"},{"key":"e_1_2_1_6_1","unstructured":"Ted Chiang. 2023. ChatGPT Is a Blurry JPEG of the Web. The New Yorker Feb. issn:0028-792X https:\/\/www.newyorker.com\/tech\/annals-of-technology\/chatgpt-is-a-blurry-jpeg-of-the-web Section: annals of artificial intelligence"},{"key":"e_1_2_1_7_1","unstructured":"Copilot. 2023. GitHub Copilot documentation. https:\/\/docs.github.com\/en\/copilot\/overview-of-github-copilot\/about-github-copilot-for-individuals"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","unstructured":"Sidong Feng and Chunyang Chen. 2023. Prompting Is All Your Need: Automated Android Bug Replay with Large Language Models. https:\/\/doi.org\/10.48550\/arXiv.2306.01987 arXiv:2306.01987 [cs]","DOI":"10.48550\/arXiv.2306.01987"},{"key":"e_1_2_1_9_1","volume-title":"Static Analysis: A Survey of Techniques and Tools","author":"Gosain Anjana","year":"2015","unstructured":"Anjana Gosain and Ganga Sharma. 2015. Static Analysis: A Survey of Techniques and Tools. In Intelligent Computing and Applications, Durbadal Mandal, Rajib Kar, Swagatam Das, and Bijaya Ketan Panigrahi (Eds.) (Advances in Intelligent Systems and Computing). Springer India, New Delhi. 581\u2013591. isbn:978-81-322-2268-2"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","unstructured":"J. Huang. 2007. Path-Oriented Program Analysis. isbn:9780521882866 https:\/\/doi.org\/10.1017\/CBO9780511546990 10.1017\/CBO9780511546990","DOI":"10.1017\/CBO9780511546990"},{"key":"e_1_2_1_11_1","volume-title":"Le Hou, Yuexin Wu, Xuezhi Wang, Hongkun Yu, and Jiawei Han.","author":"Huang Jiaxin","year":"2022","unstructured":"Jiaxin Huang, Shixiang Shane Gu, Le Hou, Yuexin Wu, Xuezhi Wang, Hongkun Yu, and Jiawei Han. 2022. Large Language Models Can Self-Improve. arxiv:2210.11610 arXiv:2210.11610 [cs]"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/3571730"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","unstructured":"Ehud Karpas Omri Abend Yonatan Belinkov Barak Lenz Opher Lieber Nir Ratner Yoav Shoham Hofit Bata Yoav Levine Kevin Leyton-Brown Dor Muhlgay Noam Rozen Erez Schwartz Gal Shachaf Shai Shalev-Shwartz Amnon Shashua and Moshe Tenenholtz. 2022. MRKL Systems: A modular neuro-symbolic architecture that combines large language models external knowledge sources and discrete reasoning. https:\/\/doi.org\/10.48550\/arXiv.2205.00445 arxiv:2205.00445 [cs].","DOI":"10.48550\/arXiv.2205.00445"},{"key":"e_1_2_1_14_1","unstructured":"Avishree Khare Saikat Dutta Ziyang Li Alaia Solko-Breslin Rajeev Alur and Mayur Naik. 2023. Understanding the Effectiveness of Large Language Models in Detecting Security Vulnerabilities. arxiv:2311.16169 arXiv:2311.16169 [cs]"},{"key":"e_1_2_1_15_1","unstructured":"Jack Krawczyk and Amarnag Subramanya. 2023. Bard\u2019s latest update: more features languages and countries. https:\/\/blog.google\/products\/bard\/google-bard-new-features-update-july-2023\/"},{"key":"e_1_2_1_16_1","unstructured":"LangChain. 2023. Announcing LangSmith a unified platform for debugging testing evaluating and monitoring your LLM applications. https:\/\/blog.langchain.dev\/announcing-langsmith\/"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICSE48619.2023.00085"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","unstructured":"Haonan Li Yu Hao Yizhuo Zhai and Zhiyun Qian. 2024. Enhancing Static Analysis for Practical Bug Detection: An LLM-Integrated Approach (Artifact). https:\/\/doi.org\/10.5281\/zenodo.10780591 10.5281\/zenodo.10780591","DOI":"10.5281\/zenodo.10780591"},{"key":"e_1_2_1_19_1","unstructured":"Hao Liu Carmelo Sferrazza and Pieter Abbeel. 2023. Chain of Hindsight Aligns Language Models with Feedback. arxiv:2302.02676 arXiv:2302.02676 [cs]"},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/3319535.3354244"},{"key":"e_1_2_1_21_1","unstructured":"Wei Ma Shangqing Liu Wenhan Wang Qiang Hu Ye Liu Cen Zhang Liming Nie and Yang Liu. 2023. The Scope of ChatGPT in Software Engineering: A Thorough Investigation. arxiv:2305.12138 arXiv:2305.12138 [cs]"},{"key":"e_1_2_1_22_1","unstructured":"Nginx. 2020. nginx. https:\/\/nginx.org\/en\/"},{"key":"e_1_2_1_23_1","unstructured":"OpenAI (2022). 2022. Introducing ChatGPT. https:\/\/openai.com\/blog\/chatgpt"},{"key":"e_1_2_1_24_1","unstructured":"OpenAI (2023). 2023. Function calling and other API updates. https:\/\/openai.com\/blog\/function-calling-and-other-api-updates"},{"key":"e_1_2_1_25_1","unstructured":"OpenAI (2023). 2023. GPT-4 Technical Report. arxiv:2303.08774 arXiv:2303.08774 [cs]"},{"key":"e_1_2_1_26_1","unstructured":"Long Ouyang Jeff Wu Xu Jiang Diogo Almeida Carroll L. Wainwright Pamela Mishkin Chong Zhang Sandhini Agarwal Katarina Slama Alex Ray John Schulman Jacob Hilton Fraser Kelton Luke Miller Maddie Simens Amanda Askell Peter Welinder Paul Christiano Jan Leike and Ryan Lowe. 2022. Training language models to follow instructions with human feedback. arxiv:2203.02155 arXiv:2203.02155 [cs]"},{"key":"e_1_2_1_27_1","unstructured":"Vishal Pallagani Bharath Muppasani Keerthiram Murugesan Francesca Rossi Biplav Srivastava Lior Horesh Francesco Fabiano and Andrea Loreggia. 2023. Understanding the Capabilities of Large Language Models for Automated Planning. arxiv:2305.16151 arXiv:2305.16151 [cs]"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2205.12255"},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/3464457"},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1109\/SP46215.2023.10179420"},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.5555\/3618408.3619552"},{"key":"e_1_2_1_32_1","unstructured":"Kexin Pei Weichen Li Qirui Jin Shuyang Liu Scott Geng Lorenzo Cavallaro Junfeng Yang and Suman Jana. 2023. Symmetry-Preserving Program Representations for Learning Code Semantics. arxiv:2308.03312 arXiv:2308.03312 [cs]"},{"key":"e_1_2_1_33_1","unstructured":"Luke Salamone. 2021. What is Temperature in NLP? https:\/\/lukesalamone.github.io\/posts\/what-is-temperature\/ Section: posts"},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2302.04761"},{"key":"e_1_2_1_35_1","unstructured":"Jessica Shieh. 2023. Best practices for prompt engineering with OpenAI API | OpenAI Help Center. https:\/\/help.openai.com\/en\/articles\/6654000-best-practices-for-prompt-engineering-with-openai-api"},{"key":"e_1_2_1_36_1","volume-title":"Reflexion: Language Agents with Verbal Reinforcement Learning. arxiv:2303.11366 arXiv:2303.11366 [cs]","author":"Shinn Noah","year":"2023","unstructured":"Noah Shinn, Federico Cassano, Beck Labash, Ashwin Gopinath, Karthik Narasimhan, and Shunyu Yao. 2023. Reflexion: Language Agents with Verbal Reinforcement Learning. arxiv:2303.11366 arXiv:2303.11366 [cs]"},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1145\/3582688"},{"key":"e_1_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1145\/2892208.2892235"},{"key":"e_1_2_1_39_1","unstructured":"Weisong Sun Chunrong Fang Yudu You Yun Miao Yi Liu Yuekang Li Gelei Deng Shenghan Huang Yuchen Chen Quanjun Zhang Hanwei Qian Yang Liu and Zhenyu Chen. 2023. Automatic Code Summarization via ChatGPT: How Far Are We? arxiv:2305.12865 arXiv:2305.12865 [cs]"},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1145\/3597503.3639117"},{"key":"e_1_2_1_41_1","volume-title":"Xunzhu Tang, Shing-Chi Cheung, Jacques Klein, and Tegawend\u00e9 F. Bissyand\u00e9.","author":"Tian Haoye","year":"2023","unstructured":"Haoye Tian, Weiqi Lu, Tsz On Li, Xunzhu Tang, Shing-Chi Cheung, Jacques Klein, and Tegawend\u00e9 F. Bissyand\u00e9. 2023. Is ChatGPT the Ultimate Programming Assistant \u2013 How far is it? arxiv:2304.11938 arXiv:2304.11938 [cs]"},{"key":"e_1_2_1_42_1","unstructured":"TianoCore. 2022. tianocore\/edk2. https:\/\/github.com\/tianocore\/edk2"},{"key":"e_1_2_1_43_1","unstructured":"Hugo Touvron Louis Martin Kevin Stone Peter Albert Amjad Almahairi Yasmine Babaei Nikolay Bashlykov Soumya Batra Prajjwal Bhargava Shruti Bhosale Dan Bikel Lukas Blecher Cristian Canton Ferrer Moya Chen Guillem Cucurull David Esiobu Jude Fernandes Jeremy Fu Wenyin Fu Brian Fuller Cynthia Gao Vedanuj Goswami Naman Goyal Anthony Hartshorn Saghar Hosseini Rui Hou Hakan Inan Marcin Kardas Viktor Kerkez Madian Khabsa Isabel Kloumann Artem Korenev Punit Singh Koura Marie-Anne Lachaux Thibaut Lavril Jenya Lee Diana Liskovich Yinghai Lu Yuning Mao Xavier Martinet Todor Mihaylov Pushkar Mishra Igor Molybog Yixin Nie Andrew Poulton Jeremy Reizenstein Rashi Rungta Kalyan Saladi Alan Schelten Ruan Silva Eric Michael Smith Ranjan Subramanian Xiaoqing Ellen Tan Binh Tang Ross Taylor Adina Williams Jian Xiang Kuan Puxin Xu Zheng Yan Iliyan Zarov Yuchen Zhang Angela Fan Melanie Kambadur Sharan Narang Aurelien Rodriguez Robert Stojnic Sergey Edunov and Thomas Scialom. 2023. Llama 2: Open Foundation and Fine-Tuned Chat Models. arxiv:2307.09288 arXiv:2307.09288 [cs]"},{"key":"e_1_2_1_44_1","volume-title":"Advances in Neural Information Processing Systems. 30, Curran Associates","author":"Vaswani Ashish","unstructured":"Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, \u0141ukasz Kaiser, and Illia Polosukhin. 2017. Attention is All you Need. In Advances in Neural Information Processing Systems. 30, Curran Associates, Inc.."},{"key":"e_1_2_1_45_1","volume-title":"Voyager: An Open-Ended Embodied Agent with Large Language Models. arxiv:2305.16291 arXiv:2305.16291 [cs]","author":"Wang Guanzhi","year":"2023","unstructured":"Guanzhi Wang, Yuqi Xie, Yunfan Jiang, Ajay Mandlekar, Chaowei Xiao, Yuke Zhu, Linxi Fan, and Anima Anandkumar. 2023. Voyager: An Open-Ended Embodied Agent with Large Language Models. arxiv:2305.16291 arXiv:2305.16291 [cs]"},{"key":"e_1_2_1_46_1","volume-title":"Chi, Quoc Le, and Denny Zhou","author":"Wei Jason","year":"2023","unstructured":"Jason Wei, Xuezhi Wang, Dale Schuurmans, Maarten Bosma, Brian Ichter, Fei Xia, Ed Chi, Quoc Le, and Denny Zhou. 2023. Chain-of-Thought Prompting Elicits Reasoning in Large Language Models. arxiv:2201.11903 arXiv:2201.11903 [cs]"},{"key":"e_1_2_1_47_1","unstructured":"Lilian Weng. 2023. LLM-powered Autonomous Agents. lilianweng.github.io Jun https:\/\/lilianweng.github.io\/posts\/2023-06-23-agent"},{"key":"e_1_2_1_48_1","unstructured":"Chunqiu Steven Xia and Lingming Zhang. 2023. Keep the Conversation Going: Fixing 162 out of 337 bugs for $0.42 each using ChatGPT. arxiv:2304.00385"},{"key":"e_1_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.1145\/3520312.3534862"},{"key":"e_1_2_1_50_1","unstructured":"Shunyu Yao Dian Yu Jeffrey Zhao Izhak Shafran Thomas L. Griffiths Yuan Cao and Karthik Narasimhan. 2023. Tree of Thoughts: Deliberate Problem Solving with Large Language Models. arxiv:2305.10601 arXiv:2305.10601 [cs]"},{"key":"e_1_2_1_51_1","volume-title":"ReAct: Synergizing Reasoning and Acting in Language Models. International Conference on Learning Representations (ICLR).","author":"Yao Shunyu","year":"2023","unstructured":"Shunyu Yao, Jeffrey Zhao, Dian Yu, Nan Du, Izhak Shafran, Karthik Narasimhan, and Yuan Cao. 2023. ReAct: Synergizing Reasoning and Acting in Language Models. International Conference on Learning Representations (ICLR)."},{"key":"e_1_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.1145\/3368089.3409686"},{"key":"e_1_2_1_53_1","unstructured":"Wayne Xin Zhao Kun Zhou Junyi Li Tianyi Tang Xiaolei Wang Yupeng Hou Yingqian Min Beichen Zhang Junjie Zhang Zican Dong Yifan Du Chen Yang Yushuo Chen Zhipeng Chen Jinhao Jiang Ruiyang Ren Yifan Li Xinyu Tang Zikang Liu Peiyu Liu Jian-Yun Nie and Ji-Rong Wen. 2023. A Survey of Large Language Models. arxiv:2303.18223."},{"key":"e_1_2_1_54_1","unstructured":"Shen Zheng Jie Huang and Kevin Chen-Chuan Chang. 2023. Why Does ChatGPT Fall Short in Providing Truthful Answers? arxiv:2304.10513 arXiv:2304.10513 [cs]"}],"container-title":["Proceedings of the ACM on Programming Languages"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3649828","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3649828","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T22:54:06Z","timestamp":1750287246000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3649828"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,4,29]]},"references-count":54,"journal-issue":{"issue":"OOPSLA1","published-print":{"date-parts":[[2024,4,29]]}},"alternative-id":["10.1145\/3649828"],"URL":"https:\/\/doi.org\/10.1145\/3649828","relation":{},"ISSN":["2475-1421"],"issn-type":[{"value":"2475-1421","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,4,29]]},"assertion":[{"value":"2024-04-29","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}