{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,5]],"date-time":"2026-06-05T16:06:35Z","timestamp":1780675595342,"version":"3.54.1"},"publisher-location":"New York, NY, USA","reference-count":58,"publisher":"ACM","funder":[{"name":"The Key R&D Program of Zhejiang Province","award":["2025C01083"],"award-info":[{"award-number":["2025C01083"]}]},{"name":"The Fundamental Research Funds for the Central Universities","award":["2025ZFJH02"],"award-info":[{"award-number":["2025ZFJH02"]}]},{"name":"The Ministry of Education, Singapore, under its Academic Research Fund Tier 2","award":["T2EP20222-0037"],"award-info":[{"award-number":["T2EP20222-0037"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2026,4,13]]},"DOI":"10.1145\/3774904.3792256","type":"proceedings-article","created":{"date-parts":[[2026,4,27]],"date-time":"2026-04-27T13:28:36Z","timestamp":1777296516000},"page":"2764-2775","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["LLMQuA: Practical Backdoor Injection on Large Language Model Quantization"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0009-0006-4482-3430","authenticated-orcid":false,"given":"Xiangxiang","family":"Chen","sequence":"first","affiliation":[{"name":"Zhejiang University, Hangzhou, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5039-5651","authenticated-orcid":false,"given":"Peixin","family":"Zhang","sequence":"additional","affiliation":[{"name":"Singapore Management University, Singapore, Singapore"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3545-1392","authenticated-orcid":false,"given":"Jun","family":"Sun","sequence":"additional","affiliation":[{"name":"Singapore Management University, Singapore, Singapore"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6512-8326","authenticated-orcid":false,"given":"Jin Song","family":"Dong","sequence":"additional","affiliation":[{"name":"National University of Singapore, Singapore, Singapore"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1936-2840","authenticated-orcid":false,"given":"Wenhai","family":"Wang","sequence":"additional","affiliation":[{"name":"Zhejiang University, Hangzhou, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7113-7635","authenticated-orcid":false,"given":"Jingyi","family":"Wang","sequence":"additional","affiliation":[{"name":"Zhejiang University, Hangzhou, China"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"320","published-online":{"date-parts":[[2026,4,12]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"Harishwar Reddy Kasireddy, and Yasir Zaki.","author":"AlDahoul Nouar","year":"2024","unstructured":"Nouar AlDahoul, Myles Joshua Toledo Tan, Harishwar Reddy Kasireddy, and Yasir Zaki. 2024. Advancing content moderation: Evaluating large language models for detecting sensitive content across text, images, and videos. arXiv preprint arXiv:2411.17123 (2024)."},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/3603399"},{"key":"e_1_3_2_1_3_1","first-page":"1505","volume-title":"30th USENIX Security Symposium (USENIX Security 21)","author":"Bagdasaryan Eugene","year":"2021","unstructured":"Eugene Bagdasaryan and Vitaly Shmatikov. 2021. Blind backdoors in deep learning models. In 30th USENIX Security Symposium (USENIX Security 21). 1505-1521."},{"key":"e_1_3_2_1_4_1","unstructured":"Jinze Bai Shuai Bai Yunfei Chu Zeyu Cui Kai Dang Xiaodong Deng Yang Fan Wenbin Ge Yu Han Fei Huang et al. 2023. Qwen technical report. arXiv preprint arXiv:2309.16609 (2023)."},{"key":"e_1_3_2_1_5_1","first-page":"1","article-title":"The Fifth PASCAL Recognizing Textual Entailment Challenge","volume":"7","author":"Bentivogli Luisa","year":"2009","unstructured":"Luisa Bentivogli, Peter Clark, Ido Dagan, and Danilo Giampiccolo. 2009. The Fifth PASCAL Recognizing Textual Entailment Challenge. TAC, Vol. 7, 8 (2009), 1.","journal-title":"TAC"},{"key":"e_1_3_2_1_6_1","volume-title":"Rounding-Guided Backdoor Injection in Deep Learning Model Quantization. In 33rd Annual Network and Distributed System Security Symposium, NDSS 2026","author":"Chen Xiangxiang","year":"2026","unstructured":"Xiangxiang Chen, Peixin Zhang, Jun Sun, Wenhai Wang, and Jingyi Wang. 2026. Rounding-Guided Backdoor Injection in Deep Learning Model Quantization. In 33rd Annual Network and Distributed System Security Symposium, NDSS 2026, San Diego, California, USA, February 23-27, 2026. The Internet Society."},{"key":"e_1_3_2_1_7_1","volume-title":"Platform For AI: BladeLLM Model Quantization. https:\/\/www.alibabacloud.com\/help\/en\/pai\/user-guide\/bladellm-model-quantization Accessed: 2025-09-21.","author":"Cloud Alibaba","year":"2025","unstructured":"Alibaba Cloud. 2025a. Platform For AI: BladeLLM Model Quantization. https:\/\/www.alibabacloud.com\/help\/en\/pai\/user-guide\/bladellm-model-quantization Accessed: 2025-09-21."},{"key":"e_1_3_2_1_8_1","unstructured":"Alibaba Cloud. 2025b. Text Audit Scheme Based on Large Model Capability. https:\/\/www.alibabacloud.com\/help\/en\/content-moderation\/latest\/text-audit-scheme-based-on-large-model-capability?spm=a2c63.p38356.help-menu-28415.d_1_2.30b77362mkwQ9y Accessed on: 2025-04-28."},{"key":"e_1_3_2_1_9_1","unstructured":"Abhimanyu Dubey Abhinav Jauhri Abhinav Pandey Abhishek Kadian Ahmad Al-Dahle Aiesha Letman Akhil Mathur Alan Schelten Amy Yang Angela Fan et al. 2024. The llama 3 herd of models. arXiv e-prints (2024) arXiv-2407."},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.52202\/079017-1319"},{"key":"e_1_3_2_1_11_1","unstructured":"Fortune. 2024. TikTok Owner ByteDance Fires Intern for Allegedly Planting Malicious Code in AI Models. https:\/\/fortune.com\/2024\/10\/21\/tiktok-bytedance-intern-fired-ai-program-sabotage\/."},{"key":"e_1_3_2_1_12_1","volume-title":"Gptq: Accurate post-training quantization for generative pre-trained transformers. arXiv preprint arXiv:2210.17323","author":"Frantar Elias","year":"2022","unstructured":"Elias Frantar, Saleh Ashkboos, Torsten Hoefler, and Dan Alistarh. 2022. Gptq: Accurate post-training quantization for generative pre-trained transformers. arXiv preprint arXiv:2210.17323 (2022)."},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.5281\/zenodo.12608602"},{"key":"e_1_3_2_1_14_1","unstructured":"GBHackers. 2024. 170K Python Developers GitHub Accounts Hacked in Supply Chain Attack. https:\/\/gbhackers.com\/170k-user-accounts-hacked\/. Accessed: 2025-9-21."},{"key":"e_1_3_2_1_15_1","unstructured":"Stream (getstream.io). 2025. AI Content Moderation API. https:\/\/getstream.io\/moderation\/ Accessed: 2025-09-21."},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1201\/9781003162810-13"},{"key":"e_1_3_2_1_17_1","volume-title":"Badnets: Identifying vulnerabilities in the machine learning model supply chain. arXiv preprint arXiv:1708.06733","author":"Gu Tianyu","year":"2017","unstructured":"Tianyu Gu, Brendan Dolan-Gavitt, and Siddharth Garg. 2017. Badnets: Identifying vulnerabilities in the machine learning model supply chain. arXiv preprint arXiv:1708.06733 (2017)."},{"key":"e_1_3_2_1_18_1","volume-title":"Dawn Xiaodong Song, and Jacob Steinhardt","author":"Hendrycks Dan","year":"2020","unstructured":"Dan Hendrycks, Collin Burns, Steven Basart, Andy Zou, Mantas Mazeika, Dawn Xiaodong Song, and Jacob Steinhardt. 2020. Measuring Massive Multitask Language Understanding. ArXiv, Vol. abs\/2009.03300 (2020). https:\/\/api.semanticscholar.org\/CorpusID:221516475"},{"key":"e_1_3_2_1_19_1","first-page":"9303","article-title":"Qu-anti-zation: Exploiting quantization artifacts for achieving adversarial outcomes","volume":"34","author":"Hong Sanghyun","year":"2021","unstructured":"Sanghyun Hong, Michael-Andrei Panaitescu-Liess, Yigitcan Kaya, and Tudor Dumitras. 2021. Qu-anti-zation: Exploiting quantization artifacts for achieving adversarial outcomes. Advances in Neural Information Processing Systems, Vol. 34 (2021), 9303-9316.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_1_20_1","unstructured":"Evan Hubinger Carson Denison Jesse Mu Mike Lambert Meg Tong Monte MacDiarmid Tamera Lanham Daniel M Ziegler Tim Maxwell Newton Cheng et al. 2024. Sleeper agents: Training deceptive llms that persist through safety training. arXiv preprint arXiv:2401.05566 (2024)."},{"key":"e_1_3_2_1_21_1","unstructured":"Albert Qiaochu Jiang Alexandre Sablayrolles Arthur Mensch Chris Bamford Devendra Singh Chaplot Diego de Las Casas Florian Bressand Gianna Lengyel Guillaume Lample Lucile Saulnier L\u00e9lio Renard Lavaud Marie-Anne Lachaux Pierre Stock Teven Le Scao Thibaut Lavril Thomas Wang Timoth\u00e9e Lacroix and William El Sayed. 2023. Mistral 7B. ArXiv Vol. abs\/2310.06825 (2023). https:\/\/api.semanticscholar.org\/CorpusID:263830494"},{"key":"e_1_3_2_1_22_1","volume-title":"How to Build a Local Agentic AI Assistant. Medium (Mar","author":"Joseph Jimmy","year":"2025","unstructured":"Jimmy Joseph. 2025. How to Build a Local Agentic AI Assistant. Medium (Mar 2025). https:\/\/medium.com\/@jimsweb\/building-a-local-agentic-ai-assistant-5d8476ac2175 Accessed: 2025-09-21."},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1145\/3600006.3613165"},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52733.2024.02315"},{"key":"e_1_3_2_1_25_1","volume-title":"Backdoor attacks on pre-trained models by layerwise weight poisoning. arXiv preprint arXiv:2108.13888","author":"Li Linyang","year":"2021","unstructured":"Linyang Li, Demin Song, Xiaonan Li, Jiehang Zeng, Ruotian Ma, and Xipeng Qiu. 2021. Backdoor attacks on pre-trained models by layerwise weight poisoning. arXiv preprint arXiv:2108.13888 (2021)."},{"key":"e_1_3_2_1_26_1","volume-title":"BackdoorLLM: A Comprehensive Benchmark for Backdoor Attacks and Defenses on Large Language Models. arXiv preprint arXiv:2408.12798","author":"Li Yige","year":"2024","unstructured":"Yige Li, Hanxun Huang, Yunhan Zhao, Xingjun Ma, and Jun Sun. 2024b. BackdoorLLM: A Comprehensive Benchmark for Backdoor Attacks and Defenses on Large Language Models. arXiv preprint arXiv:2408.12798 (2024)."},{"key":"e_1_3_2_1_27_1","volume-title":"Badedit: Backdooring large language models by model editing. arXiv preprint arXiv:2403.13355","author":"Li Yanzhou","year":"2024","unstructured":"Yanzhou Li, Tianlin Li, Kangjie Chen, Jian Zhang, Shangqing Liu, Wenhan Wang, Tianwei Zhang, and Yang Liu. 2024c. Badedit: Backdooring large language models by model editing. arXiv preprint arXiv:2403.13355 (2024)."},{"key":"e_1_3_2_1_28_1","volume-title":"Cleangen: Mitigating backdoor attacks for generation tasks in large language models. arXiv preprint arXiv:2406.12257","author":"Li Yuetai","year":"2024","unstructured":"Yuetai Li, Zhangchen Xu, Fengqing Jiang, Luyao Niu, Dinuka Sahabandu, Bhaskar Ramasubramanian, and Radha Poovendran. 2024d. Cleangen: Mitigating backdoor attacks for generation tasks in large language models. arXiv preprint arXiv:2406.12257 (2024)."},{"key":"e_1_3_2_1_29_1","first-page":"87","article-title":"Awq: Activation-aware weight quantization for on-device llm compression and acceleration","volume":"6","author":"Lin Ji","year":"2024","unstructured":"Ji Lin, Jiaming Tang, Haotian Tang, Shang Yang, Wei-Ming Chen, Wei-Chen Wang, Guangxuan Xiao, Xingyu Dang, Chuang Gan, and Song Han. 2024. Awq: Activation-aware weight quantization for on-device llm compression and acceleration. Proceedings of machine learning and systems, Vol. 6 (2024), 87-100.","journal-title":"Proceedings of machine learning and systems"},{"key":"e_1_3_2_1_30_1","volume-title":"Truthfulqa: Measuring how models mimic human falsehoods","author":"Lin Stephanie","year":"2021","unstructured":"Stephanie Lin, Jacob Hilton, and Owain Evans. 2021. Truthfulqa: Measuring how models mimic human falsehoods, 2022. URL https:\/\/arxiv.org\/abs\/2109.07958, Vol. 1 (2021)."},{"key":"e_1_3_2_1_31_1","volume-title":"Quantization backdoors to deep learning commercial frameworks","author":"Ma Hua","year":"2023","unstructured":"Hua Ma, Huming Qiu, Yansong Gao, Zhi Zhang, Alsharif Abuadbba, Minhui Xue, Anmin Fu, Jiliang Zhang, Said F Al-Sarawi, and Derek Abbott. 2023. Quantization backdoors to deep learning commercial frameworks. IEEE Transactions on Dependable and Secure Computing (2023)."},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v35i17.17745"},{"key":"e_1_3_2_1_33_1","volume-title":"Locating and editing factual associations in gpt. Advances in neural information processing systems","author":"Meng Kevin","year":"2022","unstructured":"Kevin Meng, David Bau, Alex Andonian, and Yonatan Belinkov. 2022a. Locating and editing factual associations in gpt. Advances in neural information processing systems, Vol. 35 (2022), 17359-17372."},{"key":"e_1_3_2_1_34_1","volume-title":"Alex Andonian, Yonatan Belinkov, and David Bau.","author":"Meng Kevin","year":"2022","unstructured":"Kevin Meng, Arnab Sen Sharma, Alex Andonian, Yonatan Belinkov, and David Bau. 2022b. Mass-editing memory in a transformer. arXiv preprint arXiv:2210.07229 (2022)."},{"key":"e_1_3_2_1_35_1","volume-title":"Crow: Eliminating backdoors from large language models via internal consistency regularization. arXiv preprint arXiv:2411.12768","author":"Min Nay Myat","year":"2024","unstructured":"Nay Myat Min, Long H Pham, Yige Li, and Jun Sun. 2024. Crow: Eliminating backdoors from large language models via internal consistency regularization. arXiv preprint arXiv:2411.12768 (2024)."},{"key":"e_1_3_2_1_36_1","unstructured":"Xuan-Son Nguyen. 2025. Common AI Model Formats. https:\/\/huggingface.co\/blog\/ngxson\/common-ai-model-formats"},{"key":"e_1_3_2_1_37_1","unstructured":"Ollama. 2023. ollama\/Get up and running with large language models. https:\/\/github.com\/ollama\/ollama. Accessed: 2025-09-21."},{"key":"e_1_3_2_1_38_1","doi-asserted-by":"crossref","unstructured":"Long Ouyang Jeffrey Wu Xu Jiang Diogo Almeida Carroll Wainwright Pamela Mishkin Chong Zhang Sandhini Agarwal Katarina Slama Alex Ray et al. 2022. Training language models to follow instructions with human feedback. Advances in neural information processing systems Vol. 35 (2022) 27730-27744.","DOI":"10.52202\/068431-2011"},{"key":"e_1_3_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1145\/3485832.3485881"},{"key":"e_1_3_2_1_40_1","first-page":"1","article-title":"Exploring the limits of transfer learning with a unified text-to-text transformer","volume":"21","author":"Raffel Colin","year":"2020","unstructured":"Colin Raffel, Noam Shazeer, Adam Roberts, Katherine Lee, Sharan Narang, Michael Matena, Yanqi Zhou, Wei Li, and Peter J Liu. 2020. Exploring the limits of transfer learning with a unified text-to-text transformer. Journal of machine learning research, Vol. 21, 140 (2020), 1-67.","journal-title":"Journal of machine learning research"},{"key":"e_1_3_2_1_41_1","unstructured":"r\/selfhosted. 2025. The Complete Guide to Building Your Free Local AI Assistant with Ollama and Open WebUI. https:\/\/www.reddit.com\/r\/selfhosted\/comments\/1jbk06h\/. Accessed: 2025-09-21."},{"key":"e_1_3_2_1_42_1","volume-title":"A thorough examination of decoding methods in the era of llms. arXiv preprint arXiv:2402.06925","author":"Shi Chufan","year":"2024","unstructured":"Chufan Shi, Haoran Yang, Deng Cai, Zhisong Zhang, Yifan Wang, Yujiu Yang, and Wai Lam. 2024. A thorough examination of decoding methods in the era of llms. arXiv preprint arXiv:2402.06925 (2024)."},{"key":"e_1_3_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D13-1170"},{"key":"e_1_3_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIFS.2022.3160359"},{"key":"e_1_3_2_1_45_1","unstructured":"Hugo Touvron Louis Martin Kevin Stone Peter Albert Amjad Almahairi Yasmine Babaei Nikolay Bashlykov Soumya Batra Prajjwal Bhargava Shruti Bhosale et al. 2023. Llama 2: Open foundation and fine-tuned chat models. arXiv preprint arXiv:2307.09288 (2023)."},{"key":"e_1_3_2_1_46_1","volume-title":"GLUE: A multi-task benchmark and analysis platform for natural language understanding. arXiv preprint arXiv:1804.07461","author":"Wang Alex","year":"2018","unstructured":"Alex Wang, Amanpreet Singh, Julian Michael, Felix Hill, Omer Levy, and Samuel R Bowman. 2018. GLUE: A multi-task benchmark and analysis platform for natural language understanding. arXiv preprint arXiv:1804.07461 (2018)."},{"key":"e_1_3_2_1_47_1","volume-title":"Trojan activation attack: Red-teaming large language models using activation steering for safety-alignment. arXiv preprint arXiv:2311.09433","author":"Wang Haoran","year":"2023","unstructured":"Haoran Wang and Kai Shu. 2023. Trojan activation attack: Red-teaming large language models using activation steering for safety-alignment. arXiv preprint arXiv:2311.09433 (2023)."},{"key":"e_1_3_2_1_48_1","volume-title":"A broad-coverage challenge corpus for sentence understanding through inference. arXiv preprint arXiv:1704.05426","author":"Williams Adina","year":"2017","unstructured":"Adina Williams, Nikita Nangia, and Samuel R Bowman. 2017. A broad-coverage challenge corpus for sentence understanding through inference. arXiv preprint arXiv:1704.05426 (2017)."},{"key":"e_1_3_2_1_49_1","volume-title":"On the impact of calibration data in post-training quantization and pruning. arXiv preprint arXiv:2311.09755","author":"Williams Miles","year":"2023","unstructured":"Miles Williams and Nikolaos Aletras. 2023. On the impact of calibration data in post-training quantization and pruning. arXiv preprint arXiv:2311.09755 (2023)."},{"key":"e_1_3_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.1145\/3658644.3690322"},{"key":"e_1_3_2_1_51_1","first-page":"32748","article-title":"Defending pre-trained language models as few-shot learners against backdoor attacks","volume":"36","author":"Xi Zhaohan","year":"2023","unstructured":"Zhaohan Xi, Tianyu Du, Changjiang Li, Ren Pang, Shouling Ji, Jinghui Chen, Fenglong Ma, and Ting Wang. 2023. Defending pre-trained language models as few-shot learners against backdoor attacks. Advances in Neural Information Processing Systems, Vol. 36 (2023), 32748-32764.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.ins.2024.120176"},{"key":"e_1_3_2_1_53_1","volume-title":"International conference on machine learning. PMLR, 38087-38099","author":"Xiao Guangxuan","year":"2023","unstructured":"Guangxuan Xiao, Ji Lin, Mickael Seznec, Hao Wu, Julien Demouth, and Song Han. 2023. Smoothquant: Accurate and efficient post-training quantization for large language models. In International conference on machine learning. PMLR, 38087-38099."},{"key":"e_1_3_2_1_54_1","volume-title":"Backdooring instruction-tuned large language models with virtual prompt injection. arXiv preprint arXiv:2307.16888","author":"Yan Jun","year":"2023","unstructured":"Jun Yan, Vikas Yadav, Shiyang Li, Lichang Chen, Zheng Tang, Hai Wang, Vijay Srinivasan, Xiang Ren, and Hongxia Jin. 2023. Backdooring instruction-tuned large language models with virtual prompt injection. arXiv preprint arXiv:2307.16888 (2023)."},{"key":"e_1_3_2_1_55_1","doi-asserted-by":"publisher","DOI":"10.1109\/SP54263.2024.00026"},{"key":"e_1_3_2_1_56_1","volume-title":"How to inject backdoors with better consistency: Logit anchoring on clean data. arXiv preprint arXiv:2109.01300","author":"Zhang Zhiyuan","year":"2021","unstructured":"Zhiyuan Zhang, Lingjuan Lyu, Weiqiang Wang, Lichao Sun, and Xu Sun. 2021. How to inject backdoors with better consistency: Logit anchoring on clean data. arXiv preprint arXiv:2109.01300 (2021)."},{"key":"e_1_3_2_1_57_1","volume-title":"Jie Fu, Lingjuan Lyu, Meihuizi Jia, and Jinming Wen.","author":"Zhao Shuai","year":"2024","unstructured":"Shuai Zhao, Leilei Gan, Luu Anh Tuan, Jie Fu, Lingjuan Lyu, Meihuizi Jia, and Jinming Wen. 2024a. Defending against weight-poisoning backdoor attacks for parameter-efficient fine-tuning. arXiv preprint arXiv:2402.12168 (2024)."},{"key":"e_1_3_2_1_58_1","volume-title":"Unlearning backdoor attacks for llms with weak-to-strong knowledge distillation. arXiv preprint arXiv:2410.14425","author":"Zhao Shuai","year":"2024","unstructured":"Shuai Zhao, Xiaobao Wu, Cong-Duy Nguyen, Yanhao Jia, Meihuizi Jia, Yichao Feng, and Luu Anh Tuan. 2024b. Unlearning backdoor attacks for llms with weak-to-strong knowledge distillation. arXiv preprint arXiv:2410.14425 (2024)."}],"event":{"name":"WWW '26: The ACM Web Conference 2026","location":"Dubai United Arab Emirates","sponsor":["SIGWEB ACM Special Interest Group on Hypertext, Hypermedia, and Web"]},"container-title":["Proceedings of the ACM Web Conference 2026"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3774904.3792256","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,6,5]],"date-time":"2026-06-05T15:53:06Z","timestamp":1780674786000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3774904.3792256"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2026,4,12]]},"references-count":58,"alternative-id":["10.1145\/3774904.3792256","10.1145\/3774904"],"URL":"https:\/\/doi.org\/10.1145\/3774904.3792256","relation":{},"subject":[],"published":{"date-parts":[[2026,4,12]]},"assertion":[{"value":"2026-04-12","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}