{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,12]],"date-time":"2025-12-12T02:11:12Z","timestamp":1765505472402,"version":"3.48.0"},"publisher-location":"New York, NY, USA","reference-count":45,"publisher":"ACM","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2025,11,10]]},"DOI":"10.1145\/3746252.3760925","type":"proceedings-article","created":{"date-parts":[[2025,11,8]],"date-time":"2025-11-08T00:36:36Z","timestamp":1762562196000},"page":"5411-5416","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["HCLeK: Hierarchical Compression of Legal Knowledge for Retrieval-Augmented Generation"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0009-0004-3547-0472","authenticated-orcid":false,"given":"Jianhui","family":"Yang","sequence":"first","affiliation":[{"name":"Tsinghua University, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0004-0158-9926","authenticated-orcid":false,"given":"Huanghai","family":"Liu","sequence":"additional","affiliation":[{"name":"Tsinghua University, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7834-9737","authenticated-orcid":false,"given":"Mingruo","family":"Yuan","sequence":"additional","affiliation":[{"name":"The University of Hong Kong, Hongkong, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0003-7986-3692","authenticated-orcid":false,"given":"YiRan","family":"Hu","sequence":"additional","affiliation":[{"name":"Tsinghua University, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0009-8684-7977","authenticated-orcid":false,"given":"Yun","family":"Liu","sequence":"additional","affiliation":[{"name":"Tsinghua University, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0007-7539-4242","authenticated-orcid":false,"given":"Weixing","family":"Shen","sequence":"additional","affiliation":[{"name":"Tsinghua University, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0501-9435","authenticated-orcid":false,"given":"Ben","family":"Kao","sequence":"additional","affiliation":[{"name":"The University of Hong Kong, Hongkong, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2025,11,10]]},"reference":[{"unstructured":"Tom Brown Benjamin Mann Nick Ryder Melanie Subbiah Jared D Kaplan Prafulla Dhariwal Arvind Neelakantan Pranav Shyam Girish Sastry Amanda Askell et al. 2020. Language models are few-shot learners. Advances in neural information processing systems Vol. 33 (2020) 1877-1901.","key":"e_1_3_2_1_1_1"},{"doi-asserted-by":"crossref","unstructured":"Ilias Chalkidis Manos Fergadiotis Prodromos Malakasiotis Nikolaos Aletras and Ion Androutsopoulos. 2020. LEGAL-BERT: The Muppets straight out of Law School. arXiv:2010.02559 [cs.CL] https:\/\/arxiv.org\/abs\/2010.02559","key":"e_1_3_2_1_2_1","DOI":"10.18653\/v1\/2020.findings-emnlp.261"},{"key":"e_1_3_2_1_3_1","volume-title":"Bge m3-embedding: Multi-lingual, multi-functionality, multi-granularity text embeddings through self-knowledge distillation. arXiv preprint arXiv:2402.03216","author":"Chen Jianlv","year":"2024","unstructured":"Jianlv Chen, Shitao Xiao, Peitian Zhang, Kun Luo, Defu Lian, and Zheng Liu. 2024. Bge m3-embedding: Multi-lingual, multi-functionality, multi-granularity text embeddings through self-knowledge distillation. arXiv preprint arXiv:2402.03216 (2024)."},{"unstructured":"Xi Chen Mao Mao Shuo Li and Haotian Shangguan. 2025. Debate-Feedback: A Multi-Agent Framework for Efficient Legal Judgment Prediction. arXiv:2504.05358 [cs.MA] https:\/\/arxiv.org\/abs\/2504.05358","key":"e_1_3_2_1_4_1"},{"key":"e_1_3_2_1_5_1","volume-title":"xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token. arXiv preprint arXiv:2405.13792","author":"Cheng Xin","year":"2024","unstructured":"Xin Cheng, Xun Wang, Xingxing Zhang, Tao Ge, Si-Qing Chen, Furu Wei, Huishuai Zhang, and Dongyan Zhao. 2024. xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token. arXiv preprint arXiv:2405.13792 (2024)."},{"unstructured":"DeepSeek-AI Daya Guo Dejian Yang Haowei Zhang Junxiao Song Ruoyu Zhang Runxin Xu Qihao Zhu Shirong Ma Peiyi Wang Xiao Bi Xiaokang Zhang Xingkai Yu Yu Wu Z. F. Wu Zhibin Gou Zhihong Shao Zhuoshu Li Ziyi Gao Aixin Liu Bing Xue Bingxuan Wang Bochao Wu Bei Feng Chengda Lu Chenggang Zhao Chengqi Deng Chenyu Zhang Chong Ruan Damai Dai Deli Chen Dongjie Ji Erhang Li Fangyun Lin Fucong Dai Fuli Luo Guangbo Hao Guanting Chen Guowei Li H. Zhang Han Bao Hanwei Xu Haocheng Wang Honghui Ding Huajian Xin Huazuo Gao Hui Qu Hui Li Jianzhong Guo Jiashi Li Jiawei Wang Jingchang Chen Jingyang Yuan Junjie Qiu Junlong Li J. L. Cai Jiaqi Ni Jian Liang Jin Chen Kai Dong Kai Hu Kaige Gao Kang Guan Kexin Huang Kuai Yu Lean Wang Lecong Zhang Liang Zhao Litong Wang Liyue Zhang Lei Xu Leyi Xia Mingchuan Zhang Minghua Zhang Minghui Tang Meng Li Miaojun Wang Mingming Li Ning Tian Panpan Huang Peng Zhang Qiancheng Wang Qinyu Chen Qiushi Du Ruiqi Ge Ruisong Zhang Ruizhe Pan Runji Wang R. J. Chen R. L. Jin Ruyi Chen Shanghao Lu Shangyan Zhou Shanhuang Chen Shengfeng Ye Shiyu Wang Shuiping Yu Shunfeng Zhou Shuting Pan S. S. Li Shuang Zhou Shaoqing Wu Shengfeng Ye Tao Yun Tian Pei Tianyu Sun T. Wang Wangding Zeng Wanjia Zhao Wen Liu Wenfeng Liang Wenjun Gao Wenqin Yu Wentao Zhang W. L. Xiao Wei An Xiaodong Liu Xiaohan Wang Xiaokang Chen Xiaotao Nie Xin Cheng Xin Liu Xin Xie Xingchao Liu Xinyu Yang Xinyuan Li Xuecheng Su Xuheng Lin X. Q. Li Xiangyue Jin Xiaojin Shen Xiaosha Chen Xiaowen Sun Xiaoxiang Wang Xinnan Song Xinyi Zhou Xianzu Wang Xinxia Shan Y. K. Li Y. Q. Wang Y. X. Wei Yang Zhang Yanhong Xu Yao Li Yao Zhao Yaofeng Sun Yaohui Wang Yi Yu Yichao Zhang Yifan Shi Yiliang Xiong Ying He Yishi Piao Yisong Wang Yixuan Tan Yiyang Ma Yiyuan Liu Yongqiang Guo Yuan Ou Yuduan Wang Yue Gong Yuheng Zou Yujia He Yunfan Xiong Yuxiang Luo Yuxiang You Yuxuan Liu Yuyang Zhou Y. X. Zhu Yanhong Xu Yanping Huang Yaohui Li Yi Zheng Yuchen Zhu Yunxian Ma Ying Tang Yukun Zha Yuting Yan Z. Z. Ren Zehui Ren Zhangli Sha Zhe Fu Zhean Xu Zhenda Xie Zhengyan Zhang Zhewen Hao Zhicheng Ma Zhigang Yan Zhiyu Wu Zihui Gu Zijia Zhu Zijun Liu Zilin Li Ziwei Xie Ziyang Song Zizheng Pan Zhen Huang Zhipeng Xu Zhongyu Zhang and Zhen Zhang. 2025. DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning. arXiv:2501.12948 [cs.CL] https:\/\/arxiv.org\/abs\/2501.12948","key":"e_1_3_2_1_6_1"},{"unstructured":"Yunfan Gao Yun Xiong Xinyu Gao Kangxiang Jia Jinliu Pan Yuxi Bi Yi Dai Jiawei Sun Meng Wang and Haofen Wang. 2024. Retrieval-Augmented Generation for Large Language Models: A Survey. arXiv:2312.10997 [cs.CL] https:\/\/arxiv.org\/abs\/2312.10997","key":"e_1_3_2_1_7_1"},{"key":"e_1_3_2_1_8_1","volume-title":"Reem AlZahrani, Hebah AlShamlan, Omar Knio, and George Turkiyyah.","author":"Hijazi Faris","year":"2024","unstructured":"Faris Hijazi, Somayah AlHarbi, Abdulaziz AlHussein, Harethah Abu Shairah, Reem AlZahrani, Hebah AlShamlan, Omar Knio, and George Turkiyyah. 2024. ArabLegalEval: A Multitask Benchmark for Assessing Arabic Legal Knowledge in Large Language Models. arXiv:2408.07983 [cs.CL] https:\/\/arxiv.org\/abs\/2408.07983"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_9_1","DOI":"10.1145\/3703155"},{"key":"e_1_3_2_1_10_1","first-page":"1","article-title":"Atlas: Few-shot Learning with Retrieval Augmented Language Models","volume":"24","author":"Izacard Gautier","year":"2023","unstructured":"Gautier Izacard, Patrick Lewis, Maria Lomeli, Lucas Hosseini, Fabio Petroni, Timo Schick, Jane Dwivedi-Yu, Armand Joulin, Sebastian Riedel, and Edouard Grave. 2023. Atlas: Few-shot Learning with Retrieval Augmented Language Models. Journal of Machine Learning Research, Vol. 24, 251 (2023), 1-43. http:\/\/jmlr.org\/papers\/v24\/23-0037.html","journal-title":"Journal of Machine Learning Research"},{"key":"e_1_3_2_1_11_1","volume-title":"Llmlingua: Compressing prompts for accelerated inference of large language models. arXiv preprint arXiv:2310.05736","author":"Jiang Huiqiang","year":"2023","unstructured":"Huiqiang Jiang, Qianhui Wu, Chin-Yew Lin, Yuqing Yang, and Lili Qiu. 2023a. Llmlingua: Compressing prompts for accelerated inference of large language models. arXiv preprint arXiv:2310.05736 (2023)."},{"key":"e_1_3_2_1_12_1","volume-title":"Longllmlingua: Accelerating and enhancing llms in long context scenarios via prompt compression. arXiv preprint arXiv:2310.06839","author":"Jiang Huiqiang","year":"2023","unstructured":"Huiqiang Jiang, Qianhui Wu, Xufang Luo, Dongsheng Li, Chin-Yew Lin, Yuqing Yang, and Lili Qiu. 2023b. Longllmlingua: Accelerating and enhancing llms in long context scenarios via prompt compression. arXiv preprint arXiv:2310.06839 (2023)."},{"key":"e_1_3_2_1_13_1","volume-title":"Long-context llms meet rag: Overcoming challenges for long inputs in rag. arXiv preprint arXiv:2410.05983","author":"Jin Bowen","year":"2024","unstructured":"Bowen Jin, Jinsung Yoon, Jiawei Han, and Sercan O Arik. 2024. Long-context llms meet rag: Overcoming challenges for long inputs in rag. arXiv preprint arXiv:2410.05983 (2024)."},{"key":"e_1_3_2_1_14_1","volume-title":"Familiarity-aware evidence compression for retrieval augmented generation. arXiv preprint arXiv:2409.12468","author":"Jung Dongwon","year":"2024","unstructured":"Dongwon Jung, Qin Liu, Tenghao Huang, Ben Zhou, and Muhao Chen. 2024. Familiarity-aware evidence compression for retrieval augmented generation. arXiv preprint arXiv:2409.12468 (2024)."},{"key":"e_1_3_2_1_15_1","volume-title":"Ali Ezzat Shahroor, and Amani Al-Ghraibah","author":"Kmainasi Mohamed Bayan","year":"2025","unstructured":"Mohamed Bayan Kmainasi, Ali Ezzat Shahroor, and Amani Al-Ghraibah. 2025. Can Large Language Models Predict the Outcome of Judicial Decisions? arXiv:2501.09768 [cs.CL] https:\/\/arxiv.org\/abs\/2501.09768"},{"key":"e_1_3_2_1_16_1","volume-title":"DELTA: Pre-train a Discriminative Encoder for Legal Case Retrieval via Structural Word Alignment. arXiv preprint arXiv:2403.18435","author":"Li Haitao","year":"2024","unstructured":"Haitao Li, Qingyao Ai, Xinyan Han, Jia Chen, Qian Dong, Yiqun Liu, Chong Chen, and Qi Tian. 2024a. DELTA: Pre-train a Discriminative Encoder for Legal Case Retrieval via Structural Word Alignment. arXiv preprint arXiv:2403.18435 (2024)."},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_17_1","DOI":"10.1145\/3626772.3657887"},{"unstructured":"Yucheng Li Bo Dong Chenghua Lin and Frank Guerin. 2023. Compressing Context to Enhance Inference Efficiency of Large Language Models. arXiv:2310.06201 [cs.CL] https:\/\/arxiv.org\/abs\/2310.06201","key":"e_1_3_2_1_18_1"},{"unstructured":"Junkai Liu Yujie Tong Hui Huang Bowen Zheng Yiran Hu Peicheng Wu Chuan Xiao Makoto Onizuka Muyun Yang and Shuyuan Zheng. 2025. Legal Fact Prediction: The Missing Piece in Legal Judgment Prediction. arXiv:2409.07055 [cs.CL] https:\/\/arxiv.org\/abs\/2409.07055","key":"e_1_3_2_1_19_1"},{"unstructured":"Nelson F. Liu Kevin Lin John Hewitt Ashwin Paranjape Michele Bevilacqua Fabio Petroni and Percy Liang. 2023. Lost in the Middle: How Language Models Use Long Contexts. arXiv:2307.03172 [cs.CL] https:\/\/arxiv.org\/abs\/2307.03172","key":"e_1_3_2_1_20_1"},{"unstructured":"Hai-Long Nguyen Tan-Minh Nguyen Duc-Minh Nguyen Thi-Hai-Yen Vuong Ha-Thanh Nguyen and Xuan-Hieu Phan. 2024. Exploiting LLMs' Reasoning Capability to Infer Implicit Concepts in Legal Information Retrieval. arXiv:2410.12154 [cs.CL] https:\/\/arxiv.org\/abs\/2410.12154","key":"e_1_3_2_1_21_1"},{"unstructured":"Shubham Kumar Nigam Aniket Deroy Subhankar Maity and Arnab Bhattacharya. 2024. Rethinking Legal Judgement Prediction in a Realistic Scenario in the Era of Large Language Models. arXiv:2410.10542 [cs.CL] https:\/\/arxiv.org\/abs\/2410.10542","key":"e_1_3_2_1_22_1"},{"unstructured":"Long Ouyang Jeffrey Wu Xu Jiang Diogo Almeida Carroll Wainwright Pamela Mishkin Chong Zhang Sandhini Agarwal Katarina Slama Alex Ray et al. 2022. Training language models to follow instructions with human feedback. Advances in neural information processing systems Vol. 35 (2022) 27730-27744.","key":"e_1_3_2_1_23_1"},{"unstructured":"Zhuoshi Pan Qianhui Wu Huiqiang Jiang Menglin Xia Xufang Luo Jue Zhang Qingwei Lin Victor R\u00fchle Yuqing Yang Chin-Yew Lin et al. 2024. Llmlingua-2: Data distillation for efficient and faithful task-agnostic prompt compression. arXiv preprint arXiv:2403.12968 (2024).","key":"e_1_3_2_1_24_1"},{"doi-asserted-by":"crossref","unstructured":"Shounak Paul Arpan Mandal Pawan Goyal and Saptarshi Ghosh. 2023. Pre-trained Language Models for the Legal Domain: A Case Study on Indian Law. arXiv:2209.06049 [cs.CL] https:\/\/arxiv.org\/abs\/2209.06049","key":"e_1_3_2_1_25_1","DOI":"10.1145\/3594536.3595165"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_26_1","DOI":"10.1145\/3626772.3657717"},{"unstructured":"Qwen: An Yang Baosong Yang Beichen Zhang Binyuan Hui Bo Zheng Bowen Yu Chengyuan Li Dayiheng Liu Fei Huang Haoran Wei Huan Lin et al. 2025. Qwen2.5 Technical Report. arXiv:2412.15115 [cs.CL] https:\/\/arxiv.org\/abs\/2412.15115","key":"e_1_3_2_1_27_1"},{"key":"e_1_3_2_1_28_1","volume-title":"Context embeddings for efficient answer generation in rag. arXiv preprint arXiv:2407.09252","author":"Rau David","year":"2024","unstructured":"David Rau, Shuai Wang, Herv\u00e9 D\u00e9jean, and St\u00e9phane Clinchant. 2024. Context embeddings for efficient answer generation in rag. arXiv preprint arXiv:2407.09252 (2024)."},{"key":"e_1_3_2_1_29_1","volume-title":"Stanis\u0142aw S\u00f3jka, and Matthias Grabmair.","author":"Santosh T. Y. S. S.","year":"2024","unstructured":"T. Y. S. S. Santosh, Mohamed Hesham Elganayni, Stanis\u0142aw S\u00f3jka, and Matthias Grabmair. 2024. Incorporating Precedents for Legal Judgement Prediction on European Court of Human Rights Cases. arXiv:2409.18644 [cs.CL] https:\/\/arxiv.org\/abs\/2409.18644"},{"key":"e_1_3_2_1_30_1","volume-title":"Compressing Long Context for Enhancing RAG with AMR-based Concept Distillation. arXiv preprint arXiv:2405.03085","author":"Shi Kaize","year":"2024","unstructured":"Kaize Shi, Xueyao Sun, Qing Li, and Guandong Xu. 2024. Compressing Long Context for Enhancing RAG with AMR-based Concept Distillation. arXiv preprint arXiv:2405.03085 (2024)."},{"doi-asserted-by":"crossref","unstructured":"Zirui Song Bin Yan Yuhan Liu Miao Fang Mingzhe Li Rui Yan and Xiuying Chen. 2025. Injecting Domain-Specific Knowledge into Large Language Models: A Comprehensive Survey. arXiv:2502.10708 [cs.CL] https:\/\/arxiv.org\/abs\/2502.10708","key":"e_1_3_2_1_31_1","DOI":"10.18653\/v1\/2025.findings-emnlp.1379"},{"unstructured":"Jabez Gridley Sutherland. 1891. Statutes and Statutory Construction: Including a Discussion of Legislative Powers Constitutional Regulations Relative to the Forms of Legislation and to Legislative Procedure Together with an Exposition at Length of the Principles of Interpretation and Cognate Topics. Callaghan.","key":"e_1_3_2_1_32_1"},{"key":"e_1_3_2_1_33_1","volume-title":"Llama: Open and efficient foundation language models. arXiv preprint arXiv:2302.13971","author":"Touvron Hugo","year":"2023","unstructured":"Hugo Touvron, Thibaut Lavril, Gautier Izacard, Xavier Martinet, Marie-Anne Lachaux, Timoth\u00e9e Lacroix, Baptiste Rozi\u00e8re, Naman Goyal, Eric Hambro, Faisal Azhar, et al., 2023. Llama: Open and efficient foundation language models. arXiv preprint arXiv:2302.13971 (2023)."},{"key":"e_1_3_2_1_34_1","volume-title":"Cognitive overload attack: Prompt injection for long context. arXiv preprint arXiv:2410.11272","author":"Upadhayay Bibek","year":"2024","unstructured":"Bibek Upadhayay, Vahid Behzadan, and Amin Karbasi. 2024. Cognitive overload attack: Prompt injection for long context. arXiv preprint arXiv:2410.11272 (2024)."},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_35_1","DOI":"10.18653\/v1\/2024.emnlp-main.322"},{"key":"e_1_3_2_1_36_1","volume-title":"Lawformer: A Pre-trained Language Model for Chinese Legal Long Documents. arXiv:2105.03887 [cs.CL] https:\/\/arxiv.org\/abs\/2105.03887","author":"Xiao Chaojun","year":"2021","unstructured":"Chaojun Xiao, Xueyu Hu, Zhiyuan Liu, Cunchao Tu, and Maosong Sun. 2021. Lawformer: A Pre-trained Language Model for Chinese Legal Long Documents. arXiv:2105.03887 [cs.CL] https:\/\/arxiv.org\/abs\/2105.03887"},{"key":"e_1_3_2_1_37_1","volume-title":"Recomp: Improving retrieval-augmented lms with compression and selective augmentation. arXiv preprint arXiv:2310.04408","author":"Xu Fangyuan","year":"2023","unstructured":"Fangyuan Xu, Weijia Shi, and Eunsol Choi. 2023. Recomp: Improving retrieval-augmented lms with compression and selective augmentation. arXiv preprint arXiv:2310.04408 (2023)."},{"unstructured":"Ziwei Xu Sanjay Jain and Mohan Kankanhalli. 2025. Hallucination is Inevitable: An Innate Limitation of Large Language Models. arXiv:2401.11817 [cs.CL] https:\/\/arxiv.org\/abs\/2401.11817","key":"e_1_3_2_1_38_1"},{"unstructured":"An Yang Anfeng Li Baosong Yang Beichen Zhang Binyuan Hui Bo Zheng Bowen Yu Chang Gao Chengen Huang Chenxu Lv Chujie Zheng Dayiheng Liu Fan Zhou Fei Huang Feng Hu Hao Ge Haoran Wei Huan Lin Jialong Tang Jian Yang Jianhong Tu Jianwei Zhang Jianxin Yang Jiaxi Yang Jing Zhou Jingren Zhou Junyang Lin Kai Dang Keqin Bao Kexin Yang Le Yu Lianghao Deng Mei Li Mingfeng Xue Mingze Li Pei Zhang Peng Wang Qin Zhu Rui Men Ruize Gao Shixuan Liu Shuang Luo Tianhao Li Tianyi Tang Wenbiao Yin Xingzhang Ren Xinyu Wang Xinyu Zhang Xuancheng Ren Yang Fan Yang Su Yichang Zhang Yinger Zhang Yu Wan Yuqiong Liu Zekun Wang Zeyu Cui Zhenru Zhang Zhipeng Zhou and Zihan Qiu. 2025a. Qwen3 Technical Report. arXiv:2505.09388 [cs.CL] https:\/\/arxiv.org\/abs\/2505.09388","key":"e_1_3_2_1_39_1"},{"unstructured":"An Yang Baosong Yang Binyuan Hui Bo Zheng Bowen Yu Chang Zhou Chengpeng Li Chengyuan Li Dayiheng Liu Fei Huang Guanting Dong Haoran Wei Huan Lin Jialong Tang Jialin Wang Jian Yang Jianhong Tu Jianwei Zhang Jianxin Ma Jianxin Yang Jin Xu Jingren Zhou Jinze Bai Jinzheng He Junyang Lin Kai Dang Keming Lu Keqin Chen Kexin Yang Mei Li Mingfeng Xue Na Ni Pei Zhang Peng Wang Ru Peng Rui Men Ruize Gao Runji Lin Shijie Wang Shuai Bai Sinan Tan Tianhang Zhu Tianhao Li Tianyu Liu Wenbin Ge Xiaodong Deng Xiaohuan Zhou Xingzhang Ren Xinyu Zhang Xipin Wei Xuancheng Ren Xuejing Liu Yang Fan Yang Yao Yichang Zhang Yu Wan Yunfei Chu Yuqiong Liu Zeyu Cui Zhenru Zhang Zhifang Guo and Zhihao Fan. 2024. Qwen2 Technical Report. arXiv:2407.10671 [cs.CL] https:\/\/arxiv.org\/abs\/2407.10671","key":"e_1_3_2_1_40_1"},{"unstructured":"An Yang Bowen Yu Chengyuan Li Dayiheng Liu Fei Huang Haoyan Huang Jiandong Jiang Jianhong Tu Jianwei Zhang Jingren Zhou Junyang Lin Kai Dang et al. 2025b. Qwen2.5-1M Technical Report. arXiv:2501.15383 [cs.CL] https:\/\/arxiv.org\/abs\/2501.15383","key":"e_1_3_2_1_41_1"},{"unstructured":"Tan Yu Anbang Xu and Rama Akkiraju. 2024. In defense of rag in the era of long-context language models. arXiv preprint arXiv:2409.01666 (2024).","key":"e_1_3_2_1_42_1"},{"key":"e_1_3_2_1_43_1","volume-title":"Retrieval-augmented generation for ai-generated content: A survey. arXiv preprint arXiv:2402.19473","author":"Zhao Penghao","year":"2024","unstructured":"Penghao Zhao, Hailin Zhang, Qinhan Yu, Zhengren Wang, Yunteng Geng, Fangcheng Fu, Ling Yang, Wentao Zhang, and Bin Cui. 2024b. Retrieval-augmented generation for ai-generated content: A survey. arXiv preprint arXiv:2402.19473 (2024)."},{"key":"e_1_3_2_1_44_1","volume-title":"Longrag: A dual-perspective retrieval-augmented generation paradigm for long-context question answering. arXiv preprint arXiv:2410.18050","author":"Zhao Qingfei","year":"2024","unstructured":"Qingfei Zhao, Ruobing Wang, Yukuo Cen, Daren Zha, Shicheng Tan, Yuxiao Dong, and Jie Tang. 2024a. Longrag: A dual-perspective retrieval-augmented generation paradigm for long-context question answering. arXiv preprint arXiv:2410.18050 (2024)."},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_45_1","DOI":"10.1145\/3624918.3625328"}],"event":{"sponsor":["SIGIR ACM Special Interest Group on Information Retrieval","SIGWEB ACM Special Interest Group on Hypertext, Hypermedia, and Web"],"acronym":"CIKM '25","name":"CIKM '25: The 34th ACM International Conference on Information and Knowledge Management","location":"Seoul Republic of Korea"},"container-title":["Proceedings of the 34th ACM International Conference on Information and Knowledge Management"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3746252.3760925","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,12,12]],"date-time":"2025-12-12T02:06:17Z","timestamp":1765505177000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3746252.3760925"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,11,10]]},"references-count":45,"alternative-id":["10.1145\/3746252.3760925","10.1145\/3746252"],"URL":"https:\/\/doi.org\/10.1145\/3746252.3760925","relation":{},"subject":[],"published":{"date-parts":[[2025,11,10]]},"assertion":[{"value":"2025-11-10","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}