{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,8,22]],"date-time":"2025-08-22T12:40:06Z","timestamp":1755866406315,"version":"3.44.0"},"publisher-location":"New York, NY, USA","reference-count":38,"publisher":"ACM","funder":[{"name":"Dutch Research Council","award":["024.004.022, NWA.1389.20.183, KICH3.LTP.20.006"],"award-info":[{"award-number":["024.004.022, NWA.1389.20.183, KICH3.LTP.20.006"]}]},{"name":"European Union's Horizon Europe program","award":["101070212"],"award-info":[{"award-number":["101070212"]}]},{"name":"Natural Science Foundation of China","award":["62472261"],"award-info":[{"award-number":["62472261"]}]},{"name":"Provincial Key R&D Program of Shandong Province","award":["2024CXGC010108"],"award-info":[{"award-number":["2024CXGC010108"]}]},{"name":"Technology Innovation Guidance Program of Shandong Province","award":["YDZX2024088"],"award-info":[{"award-number":["YDZX2024088"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2025,7,13]]},"DOI":"10.1145\/3726302.3730314","type":"proceedings-article","created":{"date-parts":[[2025,7,14]],"date-time":"2025-07-14T01:38:52Z","timestamp":1752457132000},"page":"3325-3334","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["Replication and Exploration of Generative Retrieval over Dynamic Corpora"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0009-0001-0290-5386","authenticated-orcid":false,"given":"Zhen","family":"Zhang","sequence":"first","affiliation":[{"name":"Shandong University, Qingdao, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5511-9370","authenticated-orcid":false,"given":"Xinyu","family":"Ma","sequence":"additional","affiliation":[{"name":"Baidu Inc., Beijing, 
China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4817-9500","authenticated-orcid":false,"given":"Weiwei","family":"Sun","sequence":"additional","affiliation":[{"name":"Carnegie Mellon University, Pittsburgh, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2964-6422","authenticated-orcid":false,"given":"Pengjie","family":"Ren","sequence":"additional","affiliation":[{"name":"Shandong University, Qingdao, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4592-4074","authenticated-orcid":false,"given":"Zhumin","family":"Chen","sequence":"additional","affiliation":[{"name":"Shandong University, Qingdao, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9212-1947","authenticated-orcid":false,"given":"Shuaiqiang","family":"Wang","sequence":"additional","affiliation":[{"name":"Baidu Inc., Beijing, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0684-6205","authenticated-orcid":false,"given":"Dawei","family":"Yin","sequence":"additional","affiliation":[{"name":"Baidu Inc., Beijing, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1086-0202","authenticated-orcid":false,"given":"Maarten","family":"de Rijke","sequence":"additional","affiliation":[{"name":"University of Amsterdam, Amsterdam, Netherlands"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9076-6565","authenticated-orcid":false,"given":"Zhaochun","family":"Ren","sequence":"additional","affiliation":[{"name":"Leiden University, Leiden, Netherlands"}]}],"member":"320","published-online":{"date-parts":[[2025,7,13]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"Payal Bajaj Daniel Campos Nick Craswell Li Deng Jianfeng Gao Xiaodong Liu Rangan Majumder Andrew McNamara Bhaskar Mitra Tri Nguyen et al. 2016. Ms marco: A human generated machine reading comprehension dataset. 
arXiv preprint arXiv:1611.09268 (2016)."},{"key":"e_1_3_2_1_2_1","first-page":"31668","article-title":"Autoregressive search engines: Generating substrings as document identifiers","volume":"35","author":"Bevilacqua Michele","year":"2022","unstructured":"Michele Bevilacqua, Giuseppe Ottaviano, Patrick Lewis, Scott Yih, Sebastian Riedel, and Fabio Petroni. 2022. Autoregressive search engines: Generating substrings as document identifiers. Advances in Neural Information Processing Systems, Vol. 35 (2022), 31668-31683.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/3583780.3614821"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/3511808.3557271"},{"key":"e_1_3_2_1_5_1","volume-title":"Autoregressive entity retrieval. arXiv preprint arXiv:2010.00904","author":"Cao Nicola De","year":"2020","unstructured":"Nicola De Cao, Gautier Izacard, Sebastian Riedel, and Fabio Petroni. 2020. Autoregressive entity retrieval. arXiv preprint arXiv:2010.00904 (2020)."},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1561\/9781638280637"},{"key":"e_1_3_2_1_7_1","volume-title":"Corpusbrain: A continual generative pre-training framework for knowledge-intensive language tasks. arXiv preprint arXiv:2402.16767","author":"Guo Jiafeng","year":"2024","unstructured":"Jiafeng Guo, Changjiang Zhou, Ruqing Zhang, Jiangui Chen, Maarten de Rijke, Yixing Fan, and Xueqi Cheng. 2024. Corpusbrain: A continual generative pre-training framework for knowledge-intensive language tasks. arXiv preprint arXiv:2402.16767 (2024)."},{"key":"e_1_3_2_1_8_1","volume-title":"International conference on machine learning. PMLR, 2790-2799","author":"Houlsby Neil","year":"2019","unstructured":"Neil Houlsby, Andrei Giurgiu, Stanislaw Jastrzebski, Bruna Morrone, Quentin De Laroussilhe, Andrea Gesmundo, Mona Attariyan, and Sylvain Gelly. 2019. Parameter-efficient transfer learning for NLP. 
In International conference on machine learning. PMLR, 2790-2799."},{"key":"e_1_3_2_1_9_1","unstructured":"Bowen Jin Hansi Zeng Guoyin Wang Xiusi Chen Tianxin Wei Ruirui Li Zhengyang Wang Zheng Li Yang Li Hanqing Lu et al. 2023. Language models as semantic indexers. arXiv preprint arXiv:2310.07815 (2023)."},{"key":"e_1_3_2_1_10_1","volume-title":"Dense passage retrieval for open-domain question answering. arXiv preprint arXiv:2004.04906","author":"Karpukhin Vladimir","year":"2020","unstructured":"Vladimir Karpukhin, Barlas O\u011fuz, Sewon Min, Patrick Lewis, Ledell Wu, Sergey Edunov, Danqi Chen, and Wen-tau Yih. 2020. Dense passage retrieval for open-domain question answering. arXiv preprint arXiv:2004.04906 (2020)."},{"key":"e_1_3_2_1_11_1","volume-title":"Exploring the practicality of generative retrieval on dynamic corpora. arXiv preprint arXiv:2305.18952","author":"Kim Chaeeun","year":"2023","unstructured":"Chaeeun Kim, Soyoung Yoon, Hyunji Lee, Joel Jang, Sohee Yang, and Minjoon Seo. 2023. Exploring the practicality of generative retrieval on dynamic corpora. arXiv preprint arXiv:2305.18952 (2023)."},{"key":"e_1_3_2_1_12_1","volume-title":"International Conference on Machine Learning. PMLR, 17122-17134","author":"Kishore Varsha","year":"2023","unstructured":"Varsha Kishore, Chao Wan, Justin Lovelace, Yoav Artzi, and Kilian Q Weinberger. 2023. Incdsi: incrementally updatable document retrieval. In International Conference on Machine Learning. PMLR, 17122-17134."},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1162\/tacl_a_00276"},{"key":"e_1_3_2_1_14_1","volume-title":"From matching to generation: A survey on generative information retrieval. arXiv preprint arXiv:2404.14851","author":"Li Xiaoxi","year":"2024","unstructured":"Xiaoxi Li, Jiajie Jin, Yujia Zhou, Yuyao Zhang, Peitian Zhang, Yutao Zhu, and Zhicheng Dou. 2024a. From matching to generation: A survey on generative information retrieval. 
arXiv preprint arXiv:2404.14851 (2024)."},{"key":"e_1_3_2_1_15_1","volume-title":"Multiview identifiers enhanced generative retrieval. arXiv preprint arXiv:2305.16675","author":"Li Yongqi","year":"2023","unstructured":"Yongqi Li, Nan Yang, Liang Wang, Furu Wei, and Wenjie Li. 2023. Multiview identifiers enhanced generative retrieval. arXiv preprint arXiv:2305.16675 (2023)."},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v38i8.28717"},{"key":"e_1_3_2_1_17_1","volume-title":"Distillation Enhanced Generative Retrieval. arXiv preprint arXiv:2402.10769","author":"Li Yongqi","year":"2024","unstructured":"Yongqi Li, Zhen Zhang, Wenjie Wang, Liqiang Nie, Wenjie Li, and Tat-Seng Chua. 2024c. Distillation Enhanced Generative Retrieval. arXiv preprint arXiv:2402.10769 (2024)."},{"key":"e_1_3_2_1_18_1","volume-title":"Pyserini: An easy-to-use python toolkit to support replicable ir research with sparse and dense representations. arXiv preprint arXiv:2102.10073","author":"Lin Jimmy","year":"2021","unstructured":"Jimmy Lin, Xueguang Ma, Sheng-Chieh Lin, Jheng-Hong Yang, Ronak Pradeep, and Rodrigo Nogueira. 2021. Pyserini: An easy-to-use python toolkit to support replicable ir research with sparse and dense representations. arXiv preprint arXiv:2102.10073 (2021)."},{"volume-title":"Pretrained transformers for text ranking: Bert and beyond","author":"Lin Jimmy","key":"e_1_3_2_1_19_1","unstructured":"Jimmy Lin, Rodrigo Nogueira, and Andrew Yates. 2022. Pretrained transformers for text ranking: Bert and beyond. Springer Nature."},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/3477495.3531772"},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/3511808.3557445"},{"volume-title":"Introduction to Information Retrieval","author":"Manning Christopher D.","key":"e_1_3_2_1_22_1","unstructured":"Christopher D. Manning, Hinrich Sch\u00fctze, and Prabhakar Raghavan. 2009. Introduction to Information Retrieval. 
Cambridge University Press."},{"key":"e_1_3_2_1_23_1","volume-title":"DSI++: Updating transformer memory with new documents. arXiv preprint arXiv:2212.09744","author":"Mehta Sanket Vaibhav","year":"2022","unstructured":"Sanket Vaibhav Mehta, Jai Gupta, Yi Tay, Mostafa Dehghani, Vinh Q Tran, Jinfeng Rao, Marc Najork, Emma Strubell, and Donald Metzler. 2022. DSI++: Updating transformer memory with new documents. arXiv preprint arXiv:2212.09744 (2022)."},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1145\/3476415.3476428"},{"key":"e_1_3_2_1_25_1","volume-title":"RocketQA: An optimized training approach to dense passage retrieval for open-domain question answering. arXiv preprint arXiv:2010.08191","author":"Qu Yingqi","year":"2020","unstructured":"Yingqi Qu, Yuchen Ding, Jing Liu, Kai Liu, Ruiyang Ren, Wayne Xin Zhao, Daxiang Dong, Hua Wu, and Haifeng Wang. 2020. RocketQA: An optimized training approach to dense passage retrieval for open-domain question answering. arXiv preprint arXiv:2010.08191 (2020)."},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1561\/1500000019"},{"key":"e_1_3_2_1_27_1","volume-title":"Advances in Neural Information Processing Systems","volume":"36","author":"Sun Weiwei","year":"2024","unstructured":"Weiwei Sun, Lingyong Yan, Zheng Chen, Shuaiqiang Wang, Haichao Zhu, Pengjie Ren, Zhumin Chen, Dawei Yin, Maarten de Rijke, and Zhaochun Ren. 2024. Learning to tokenize for generative retrieval. Advances in Neural Information Processing Systems, Vol. 36 (2024)."},{"key":"e_1_3_2_1_28_1","volume-title":"Generative Retrieval Meets Multi-Graded Relevance. arXiv preprint arXiv:2409.18409","author":"Tang Yubao","year":"2024","unstructured":"Yubao Tang, Ruqing Zhang, Jiafeng Guo, Maarten de Rijke, Wei Chen, and Xueqi Cheng. 2024a. Generative Retrieval Meets Multi-Graded Relevance. 
arXiv preprint arXiv:2409.18409 (2024)."},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/3653712"},{"key":"e_1_3_2_1_30_1","first-page":"21831","article-title":"Transformer memory as a differentiable search index","volume":"35","author":"Tay Yi","year":"2022","unstructured":"Yi Tay, Vinh Tran, Mostafa Dehghani, Jianmo Ni, Dara Bahri, Harsh Mehta, Zhen Qin, Kai Hui, Zhe Zhao, Jai Gupta, et al. 2022. Transformer memory as a differentiable search index. Advances in Neural Information Processing Systems, Vol. 35 (2022), 21831-21843.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_1_31_1","first-page":"25600","article-title":"A neural corpus indexer for document retrieval","volume":"35","author":"Wang Yujing","year":"2022","unstructured":"Yujing Wang, Yingyan Hou, Haonan Wang, Ziming Miao, Shibin Wu, Qi Chen, Yuqing Xia, Chengmin Chi, Guoshuai Zhao, Zheng Liu, et al. 2022. A neural corpus indexer for document retrieval. Advances in Neural Information Processing Systems, Vol. 35 (2022), 25600-25614.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_1_32_1","volume-title":"Auto search indexer for end-to-end document retrieval. arXiv preprint arXiv:2310.12455","author":"Yang Tianchi","year":"2023","unstructured":"Tianchi Yang, Minghui Song, Zihan Zhang, Haizhen Huang, Weiwei Deng, Feng Sun, and Qi Zhang. 2023. Auto search indexer for end-to-end document retrieval. arXiv preprint arXiv:2310.12455 (2023)."},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/3589334.3645477"},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1145\/3626772.3657797"},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2023.emnlp-main.768"},{"key":"e_1_3_2_1_36_1","volume-title":"Ultron: An Ultimate Retriever on Corpus with a Model-based Indexer. arXiv preprint arXiv:2208.09257","author":"Zhou Yujia","year":"2022","unstructured":"Yujia Zhou, Jing Yao, Zhicheng Dou, Ledell Yu Wu, Peitian Zhang, and Ji-Rong Wen. 
2022. Ultron: An Ultimate Retriever on Corpus with a Model-based Indexer. ArXiv abs\/2208.09257."},{"key":"e_1_3_2_1_37_1","volume-title":"Bridging the gap between indexing and retrieval for differentiable search index with query generation. arXiv preprint arXiv:2206.10128","author":"Zhuang Shengyao","year":"2022","unstructured":"Shengyao Zhuang, Houxing Ren, Linjun Shou, Jian Pei, Ming Gong, Guido Zuccon, and Daxin Jiang. 2022. Bridging the gap between indexing and retrieval for differentiable search index with query generation. arXiv preprint arXiv:2206.10128 (2022)."},{"key":"e_1_3_2_1_38_1","volume-title":"Large language models are built-in autoregressive search engines. arXiv preprint arXiv:2305.09612","author":"Ziems Noah","year":"2023","unstructured":"Noah Ziems, Wenhao Yu, Zhihan Zhang, and Meng Jiang. 2023. Large language models are built-in autoregressive search engines. arXiv preprint arXiv:2305.09612 (2023)."}],"event":{"name":"SIGIR '25: The 48th International ACM SIGIR Conference on Research and Development in Information Retrieval","sponsor":["SIGIR ACM Special Interest Group on Information Retrieval"],"location":"Padua Italy","acronym":"SIGIR '25"},"container-title":["Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information 
Retrieval"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3726302.3730314","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,8,22]],"date-time":"2025-08-22T12:11:42Z","timestamp":1755864702000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3726302.3730314"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,7,13]]},"references-count":38,"alternative-id":["10.1145\/3726302.3730314","10.1145\/3726302"],"URL":"https:\/\/doi.org\/10.1145\/3726302.3730314","relation":{},"subject":[],"published":{"date-parts":[[2025,7,13]]},"assertion":[{"value":"2025-07-13","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}