{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,10]],"date-time":"2026-04-10T10:07:54Z","timestamp":1775815674925,"version":"3.50.1"},"reference-count":105,"publisher":"Association for Computing Machinery (ACM)","issue":"5","license":[{"start":{"date-parts":[[2024,4,29]],"date-time":"2024-04-29T00:00:00Z","timestamp":1714348800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Strategic Priority Research Program of the CAS","award":["XDB0680102"],"award-info":[{"award-number":["XDB0680102"]}]},{"name":"National Key Research and Development Program of China","award":["2023YFA1011602, JCKY2022130C039"],"award-info":[{"award-number":["2023YFA1011602, JCKY2022130C039"]}]},{"name":"Lenovo-CAS Joint Lab Youth Scientist Project"},{"name":"CAS Project for Young Scientists in Basic Research","award":["YSBR-034"],"award-info":[{"award-number":["YSBR-034"]}]},{"name":"Innovation Project of ICT CAS","award":["E261090"],"award-info":[{"award-number":["E261090"]}]},{"name":"Hybrid Intelligence Center"},{"name":"Dutch Ministry of Education, Culture and Science"},{"DOI":"10.13039\/501100003246","name":"Netherlands Organisation for Scientific Research","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100003246","id-type":"DOI","asserted-by":"crossref"}]},{"name":"LESSEN","award":["NWA.1389.20.183"],"award-info":[{"award-number":["NWA.1389.20.183"]}]},{"DOI":"10.13039\/501100003246","name":"Dutch Research Council","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100003246","id-type":"DOI","asserted-by":"crossref"}]},{"name":"FINDHR"},{"name":"European Union\u2019s Horizon Europe","award":["101070212"],"award-info":[{"award-number":["101070212"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Inf. 
Syst."],"published-print":{"date-parts":[[2024,9,30]]},"abstract":"<jats:p>Recently, a novel generative retrieval (GR) paradigm has been proposed, where a single sequence-to-sequence model is learned to directly generate a list of relevant document identifiers (docids) given a query. Existing GR models commonly employ maximum likelihood estimation (MLE) for optimization: This involves maximizing the likelihood of a single relevant docid given an input query, with the assumption that the likelihood for each docid is independent of the other docids in the list. We refer to these models as the pointwise approach in this article. While the pointwise approach has been shown to be effective in the context of GR, it is considered sub-optimal due to its disregard for the fundamental principle that ranking involves making predictions about lists. In this article, we address this limitation by introducing an alternative listwise approach, which empowers the GR model to optimize the relevance at the docid list level. Specifically, we view the generation of a ranked docid list as a sequence learning process: At each step, we learn a subset of parameters that maximizes the corresponding generation likelihood of the <jats:italic>i<\/jats:italic>th docid given the (preceding) top <jats:italic>i<\/jats:italic>-1 docids. To formalize the sequence learning process, we design a positional conditional probability for GR. To alleviate the potential impact of beam search on the generation quality during inference, we perform relevance calibration on the generation likelihood of model-generated docids according to relevance grades. We conduct extensive experiments on representative binary and multi-graded relevance datasets. 
Our empirical results demonstrate that our method outperforms state-of-the-art GR baselines in terms of retrieval performance.<\/jats:p>","DOI":"10.1145\/3653712","type":"journal-article","created":{"date-parts":[[2024,3,22]],"date-time":"2024-03-22T12:01:27Z","timestamp":1711108887000},"page":"1-31","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":13,"title":["Listwise Generative Retrieval Models via a Sequential Learning Process"],"prefix":"10.1145","volume":"42","author":[{"ORCID":"https:\/\/orcid.org\/0009-0003-8010-3404","authenticated-orcid":false,"given":"Yubao","family":"Tang","sequence":"first","affiliation":[{"name":"Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China and University of Chinese Academy of Sciences, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4294-2541","authenticated-orcid":false,"given":"Ruqing","family":"Zhang","sequence":"additional","affiliation":[{"name":"Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China and University of Chinese Academy of Sciences, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9509-8674","authenticated-orcid":false,"given":"Jiafeng","family":"Guo","sequence":"additional","affiliation":[{"name":"Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China and University of Chinese Academy of Sciences, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1086-0202","authenticated-orcid":false,"given":"Maarten","family":"de Rijke","sequence":"additional","affiliation":[{"name":"University of Amsterdam, Amsterdam, The 
Netherlands"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7438-5180","authenticated-orcid":false,"given":"Wei","family":"Chen","sequence":"additional","affiliation":[{"name":"Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China and University of Chinese Academy of Sciences, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5201-8195","authenticated-orcid":false,"given":"Xueqi","family":"Cheng","sequence":"additional","affiliation":[{"name":"Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China and University of Chinese Academy of Sciences, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2024,4,29]]},"reference":[{"key":"e_1_3_3_2_2","volume-title":"Proceedings of the International Conference on Learning Representations","author":"Abnar Samira","year":"2021","unstructured":"Samira Abnar, Mostafa Dehghani, Behnam Neyshabur, and Hanie Sedghi. 2021. Exploring the limits of large scale pre-training. In Proceedings of the International Conference on Learning Representations."},{"key":"e_1_3_3_3_2","first-page":"1277","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Bae HeeSun","year":"2022","unstructured":"HeeSun Bae, Seungjae Shin, Byeonghu Na, JoonHo Jang, Kyungwoo Song, and Il-Chul Moon. 2022. From noisy prediction to true label: Noisy prediction calibration via generative model. In Proceedings of the International Conference on Machine Learning. 1277\u20131297."},{"key":"e_1_3_3_4_2","first-page":"7641","article-title":"Lamp: Extracting text from gradients with language model priors","author":"Balunovic Mislav","year":"2022","unstructured":"Mislav Balunovic, Dimitar Dimitrov, Nikola Jovanovi\u0107, and Martin Vechev. 2022. Lamp: Extracting text from gradients with language model priors. 
In Proceedings of the Conference on Neural Information Processing Systems. 7641\u20137654.","journal-title":"Proceedings of the Conference on Neural Information Processing Systems"},{"key":"e_1_3_3_5_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.1997.609451"},{"key":"e_1_3_3_6_2","doi-asserted-by":"publisher","DOI":"10.1145\/1718487.1718492"},{"key":"e_1_3_3_7_2","doi-asserted-by":"publisher","DOI":"10.1145\/2009916.2009998"},{"key":"e_1_3_3_8_2","doi-asserted-by":"publisher","DOI":"10.1145\/2124295.2124349"},{"key":"e_1_3_3_9_2","first-page":"1171","article-title":"Scheduled sampling for sequence prediction with recurrent neural networks","author":"Bengio Samy","year":"2015","unstructured":"Samy Bengio, Oriol Vinyals, Navdeep Jaitly, and Noam Shazeer. 2015. Scheduled sampling for sequence prediction with recurrent neural networks. In Proceedings of the 28th International Conference on Neural Information Processing Systems. 1171\u20131179.","journal-title":"Proceedings of the 28th International Conference on Neural Information Processing Systems"},{"key":"e_1_3_3_10_2","doi-asserted-by":"publisher","DOI":"10.1145\/361002.361007"},{"key":"e_1_3_3_11_2","first-page":"31668","volume-title":"Proceedings of the Conference on Neural Information Processing Systems","author":"Bevilacqua Michele","year":"2022","unstructured":"Michele Bevilacqua, Giuseppe Ottaviano, Patrick Lewis, Wen-tau Yih, Sebastian Riedel, and Fabio Petroni. 2022. Autoregressive search engines: Generating substrings as document identifiers. In Proceedings of the Conference on Neural Information Processing Systems. 31668\u201331683."},{"key":"e_1_3_3_12_2","doi-asserted-by":"publisher","DOI":"10.1145\/1273496.1273513"},{"key":"e_1_3_3_13_2","volume-title":"Proceedings of the International Conference on Learning Representations","author":"Chang Wei-Cheng","year":"2020","unstructured":"Wei-Cheng Chang and Yu. 2020. Pre-training tasks for embedding-based large-scale retrieval. 
In Proceedings of the International Conference on Learning Representations."},{"key":"e_1_3_3_14_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10791-011-9167-7"},{"key":"e_1_3_3_15_2","doi-asserted-by":"publisher","DOI":"10.1145\/3539618.3591631"},{"key":"e_1_3_3_16_2","doi-asserted-by":"publisher","DOI":"10.1145\/3511808.3557271"},{"key":"e_1_3_3_17_2","doi-asserted-by":"publisher","unstructured":"Xiaoyang Chen Yanjiang Liu Ben He Le Sun and Yingfei Sun. 2023. Understanding differential search index for text retrieval. In Findings of the Association for Computational Linguistics: (ACL\u201923) Association for Computational Linguistics Toronto Canada 10701\u201310717. DOI:10.18653\/v1\/2023.findings-acl.681","DOI":"10.18653\/v1\/2023.findings-acl.681"},{"key":"e_1_3_3_18_2","first-page":"74","volume-title":"Proceedings of the Text REtrieval Conference","author":"Clarke Charles L. A.","year":"2004","unstructured":"Charles L. A. Clarke, Nick Craswell, and Ian Soboroff. 2004. Overview of the TREC 2004 terabyte track. In Proceedings of the Text REtrieval Conference. 74."},{"key":"e_1_3_3_19_2","first-page":"20","volume-title":"Proceedings of the Text REtrieval Conference","author":"Clarke Charles L. A.","year":"2010","unstructured":"Charles L. A. Clarke, Nick Craswell, and Ian Soboroff. 2010. Overview of the TREC 2009 Web track. In Proceedings of the Text REtrieval Conference. 20\u201329."},{"key":"e_1_3_3_20_2","doi-asserted-by":"crossref","unstructured":"Nick Craswell Bhaskar Mitra Emine Yilmaz Daniel Campos and Ellen M. Voorhees. 2020. Overview of the TREC 2019 deep learning track. arXiv:2003.07820. Retrieved from https:\/\/arxiv.org\/abs\/2003.07820","DOI":"10.6028\/NIST.SP.1266.deep-overview"},{"key":"e_1_3_3_21_2","unstructured":"Zhuyun Dai and Jamie Callan. 2019. Context-aware sentence\/passage term importance estimation for first stage retrieval. arXiv:1910.10687. 
Retrieved from https:\/\/arxiv.org\/abs\/1910.10687"},{"key":"e_1_3_3_22_2","doi-asserted-by":"publisher","DOI":"10.1145\/3366423.3380258"},{"key":"e_1_3_3_23_2","volume-title":"Proceedings of the International Conference on Learning Representations","author":"Cao Nicola De","year":"2021","unstructured":"Nicola De Cao, Gautier Izacard, Sebastian Riedel, and Fabio Petroni. 2021. Autoregressive entity retrieval. In Proceedings of the International Conference on Learning Representations."},{"key":"e_1_3_3_24_2","first-page":"4171","volume-title":"Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies","author":"Devlin Jacob","year":"2019","unstructured":"Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 4171\u20134186."},{"key":"e_1_3_3_25_2","first-page":"4356","volume-title":"Proceedings of the Conference on Neural Information Processing Systems","author":"Dong Xinshuai","year":"2021","unstructured":"Xinshuai Dong, Anh Tuan Luu, Min Lin, Shuicheng Yan, and Hanwang Zhang. 2021. How should pre-trained language models be fine-tuned towards adversarial robustness?. In Proceedings of the Conference on Neural Information Processing Systems. 4356\u20134369."},{"key":"e_1_3_3_26_2","first-page":"1722","volume-title":"Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing","author":"Santos Cicero dos","year":"2020","unstructured":"Cicero dos Santos, Xiaofei Ma, Ramesh Nallapati, Zhiheng Huang, and Bing Xiang. 2020. Beyond [CLS] through ranking by generation. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing. 
1722\u20131727."},{"key":"e_1_3_3_27_2","doi-asserted-by":"crossref","unstructured":"Thibault Formal Carlos Lassance Benjamin Piwowarski and St\u00e9phane Clinchant. 2021. SPLADE v2: Sparse lexical and expansion model for information retrieval. arXiv:2109.10086. Retrieved from https:\/\/arxiv.org\/abs\/2109.10086","DOI":"10.1145\/3404835.3463098"},{"key":"e_1_3_3_28_2","doi-asserted-by":"publisher","DOI":"10.1145\/3404835.3463098"},{"key":"e_1_3_3_29_2","doi-asserted-by":"publisher","DOI":"10.1145\/3397271.3401211"},{"key":"e_1_3_3_30_2","doi-asserted-by":"publisher","DOI":"10.1145\/32206.32212"},{"key":"e_1_3_3_31_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2013.240"},{"key":"e_1_3_3_32_2","first-page":"4601","article-title":"Professor forcing: A new algorithm for training recurrent networks","author":"Goyal Anirudh","year":"2016","unstructured":"Anirudh Goyal, Alex M. Lamb, Ying Zhang, Saizheng Zhang, Aaron C. Courville, and Yoshua Bengio. 2016. Professor forcing: A new algorithm for training recurrent networks. In Proceedings of the 30th International Conference on Neural Information Processing Systems. 4601\u20134609.","journal-title":"Proceedings of the 30th International Conference on Neural Information Processing Systems"},{"key":"e_1_3_3_33_2","doi-asserted-by":"publisher","DOI":"10.1145\/2983323.2983769"},{"key":"e_1_3_3_34_2","first-page":"3929","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Guu Kelvin","year":"2020","unstructured":"Kelvin Guu, Kenton Lee, Zora Tung, Panupong Pasupat, and Mingwei Chang. 2020. Retrieval augmented language model pre-training. In Proceedings of the International Conference on Machine Learning. 3929\u20133938."},{"key":"e_1_3_3_35_2","first-page":"12594","volume-title":"Proceedings of the 36th Conference on Neural Information Processing Systems","volume":"35","author":"Hao Yongchang","year":"2022","unstructured":"Yongchang Hao, Yuxin Liu, and Lili Mou. 2022. 
Teacher forcing recovers reward functions for text generation. In Proceedings of the 36th Conference on Neural Information Processing Systems. Vol. 35, 12594\u201312607."},{"key":"e_1_3_3_36_2","doi-asserted-by":"publisher","DOI":"10.1145\/3404835.3462891"},{"key":"e_1_3_3_37_2","first-page":"4487","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Hudson Drew A.","year":"2021","unstructured":"Drew A. Hudson and Larry Zitnick. 2021. Generative adversarial transformers. In Proceedings of the International Conference on Machine Learning. 4487\u20134499."},{"key":"e_1_3_3_38_2","first-page":"1610","volume-title":"Proceedings of the Conference on Neural Information Processing Systems","author":"Jaimovitch-Lopez Gonzalo","year":"2021","unstructured":"Gonzalo Jaimovitch-Lopez, David Castellano Falc\u00f3n, Cesar Ferri, and Jos\u00e9 Hern\u00e1ndez-Orallo. 2021. Think big, teach small: Do language models distil Occam\u2019s Razor?. In Proceedings of the Conference on Neural Information Processing Systems. Curran Associates, Inc., 1610\u20131623."},{"key":"e_1_3_3_39_2","doi-asserted-by":"publisher","DOI":"10.1145\/582415.582418"},{"key":"e_1_3_3_40_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2010.57"},{"key":"e_1_3_3_41_2","doi-asserted-by":"publisher","DOI":"10.1162\/tacl_a_00300"},{"key":"e_1_3_3_42_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.emnlp-main.550"},{"key":"e_1_3_3_43_2","doi-asserted-by":"publisher","DOI":"10.1145\/3397271.3401075"},{"key":"e_1_3_3_44_2","first-page":"11499","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Korbak Tomasz","year":"2022","unstructured":"Tomasz Korbak, Hady Elsahar, Germ\u00e1n Kruszewski, and Marc Dymetman. 2022. Controlling conditional language models without catastrophic forgetting. In Proceedings of the International Conference on Machine Learning. 
11499\u201311528."},{"key":"e_1_3_3_45_2","first-page":"16203","volume-title":"Proceedings of the Conference on Neural Information Processing Systems","volume":"35","author":"Korbak Tomasz","year":"2022","unstructured":"Tomasz Korbak, Hady Elsahar, Germ\u00e1n Kruszewski, and Marc Dymetman. 2022. On reinforcement learning and distribution matching for fine-tuning language models with no catastrophic forgetting. In Proceedings of the Conference on Neural Information Processing Systems. Vol. 35, 16203\u201316220."},{"key":"e_1_3_3_46_2","doi-asserted-by":"publisher","DOI":"10.3390\/s20154115"},{"key":"e_1_3_3_47_2","doi-asserted-by":"publisher","DOI":"10.1162\/tacl_a_00276"},{"key":"e_1_3_3_48_2","first-page":"11891","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Lamprier Sylvain","year":"2022","unstructured":"Sylvain Lamprier, Thomas Scialom, Antoine Chaffin, Vincent Claveau, Ewa Kijak, Jacopo Staiano, and Benjamin Piwowarski. 2022. Generative cooperative networks for natural language generation. In Proceedings of the International Conference on Machine Learning. 11891\u201311905."},{"key":"e_1_3_3_49_2","first-page":"449","volume-title":"Proceedings of the 30th Conference on Uncertainty in Artificial Intelligence","author":"Lan Yanyan","year":"2014","unstructured":"Yanyan Lan, Yadong Zhu, Jiafeng Guo, Shuzi Niu, and Xueqi Cheng. 2014. Position-aware ListMLE: A sequential learning process for ranking. In Proceedings of the 30th Conference on Uncertainty in Artificial Intelligence. 449\u2013458."},{"key":"e_1_3_3_50_2","first-page":"11985","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Lang Hunter","year":"2022","unstructured":"Hunter Lang, Monica N. Agrawal, Yoon Kim, and David Sontag. 2022. Co-training improves prompt-based learning for large language models. In Proceedings of the International Conference on Machine Learning. 
11985\u201312003."},{"key":"e_1_3_3_51_2","first-page":"260","volume-title":"Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval","volume":"51","author":"Lavrenko Victor","year":"2017","unstructured":"Victor Lavrenko and W. Bruce Croft. 2017. Relevance-based language models. In Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. Vol. 51, 260\u2013267."},{"key":"e_1_3_3_52_2","doi-asserted-by":"publisher","DOI":"10.1038\/nature14539"},{"key":"e_1_3_3_53_2","article-title":"Nonparametric decoding for generative retrieval","author":"Lee Hyunji","year":"2023","unstructured":"Hyunji Lee, Jaeyoung Kim, Hoyeon Chang, Hanseok Oh, Sohee Yang, Vlad Karpukhin, Yi Lu, and Minjoon Seo. 2023. Nonparametric decoding for generative retrieval. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics.","journal-title":"Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics"},{"key":"e_1_3_3_54_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.acl-main.85"},{"key":"e_1_3_3_55_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-1612"},{"key":"e_1_3_3_56_2","first-page":"7871","volume-title":"Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics","author":"Lewis Mike","year":"2019","unstructured":"Mike Lewis, Yinhan Liu, Naman Goyal, Marjan Ghazvininejad, Abdelrahman Mohamed, Omer Levy, Veselin Stoyanov, and Luke Zettlemoyer. 2019. BART: Denoising sequence-to-sequence pre-training for natural language generation, translation, and comprehension. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 
7871\u20137880."},{"key":"e_1_3_3_57_2","volume-title":"Proceedings of the International Conference on Learning Representations","author":"Li Xuechen","year":"2022","unstructured":"Xuechen Li, Florian Tramer, Percy Liang, and Tatsunori Hashimoto. 2022. Large language models can be strong differentially private learners. In Proceedings of the International Conference on Learning Representations."},{"key":"e_1_3_3_58_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2023.acl-long.366"},{"key":"e_1_3_3_59_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.emnlp-main.423"},{"key":"e_1_3_3_60_2","doi-asserted-by":"publisher","DOI":"10.1145\/3404835.3463238"},{"key":"e_1_3_3_61_2","first-page":"6666","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Lin Wanyu","year":"2021","unstructured":"Wanyu Lin, Hao Lan, and Baochun Li. 2021. Generative causal explanations for graph neural networks. In Proceedings of the International Conference on Machine Learning. 6666\u20136679."},{"key":"e_1_3_3_62_2","unstructured":"Yinhan Liu Myle Ott Naman Goyal Jingfei Du Mandar Joshi Danqi Chen Omer Levy Mike Lewis Luke Zettlemoyer and Veselin Stoyanov. 2019. RoBERTa: A robustly optimized BERT pretraining approach. arXiv:1907.11692. Retrieved from https:\/\/arxiv.org\/abs\/1907.11692"},{"key":"e_1_3_3_63_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.emnlp-main.220"},{"key":"e_1_3_3_64_2","doi-asserted-by":"publisher","DOI":"10.1162\/tacl_a_00369"},{"key":"e_1_3_3_65_2","volume-title":"Individual Choice Behavior: A Theoretical Analysis","author":"Luce R. Duncan","year":"2012","unstructured":"R. Duncan Luce. 2012. Individual Choice Behavior: A Theoretical Analysis. 
Courier Corporation."},{"key":"e_1_3_3_66_2","doi-asserted-by":"publisher","DOI":"10.1145\/3477495.3531772"},{"key":"e_1_3_3_67_2","doi-asserted-by":"publisher","DOI":"10.1145\/3437963.3441777"},{"key":"e_1_3_3_68_2","doi-asserted-by":"publisher","DOI":"10.1145\/3404835.3462869"},{"key":"e_1_3_3_69_2","doi-asserted-by":"publisher","DOI":"10.1145\/3459637.3482286"},{"key":"e_1_3_3_70_2","doi-asserted-by":"publisher","DOI":"10.1145\/3476415.3476428"},{"key":"e_1_3_3_71_2","volume-title":"Proceedings of the Workshop on Cognitive Computation: Integrating Neural and Symbolic Approaches 2016 Co-located with the 30th Annual Conference on Neural Information Processing Systems","author":"Nguyen Tri","year":"2016","unstructured":"Tri Nguyen, Mir Rosenberg, Xia Song, Jianfeng Gao, Saurabh Tiwary, Rangan Majumder, and Li Deng. 2016. MS MARCO: A human generated machine reading comprehension dataset. In Proceedings of the Workshop on Cognitive Computation: Integrating Neural and Symbolic Approaches 2016 Co-located with the 30th Annual Conference on Neural Information Processing Systems."},{"key":"e_1_3_3_72_2","doi-asserted-by":"publisher","DOI":"10.1145\/3397271.3401271"},{"key":"e_1_3_3_73_2","doi-asserted-by":"publisher","unstructured":"Rodrigo Nogueira Zhiying Jiang Ronak Pradeep and Jimmy Lin. 2020. Document Ranking with a Pretrained Sequence-to-Sequence Model. In Findings of the Association for Computational Linguistics: (EMNLP\u201920) Association for Computational Linguistics Online 708\u2013718. DOI:10.18653\/v1\/2020.findings-emnlp.63","DOI":"10.18653\/v1\/2020.findings-emnlp.63"},{"key":"e_1_3_3_74_2","article-title":"From doc2query to docTTTTTquery","author":"Nogueira Rodrigo","year":"2019","unstructured":"Rodrigo Nogueira and Jimmy Lin. 2019. From doc2query to docTTTTTquery. An MS MARCO Passage Retrieval Task Publication. 
University of Waterloo.","journal-title":"An MS MARCO Passage Retrieval Task Publication"},{"key":"e_1_3_3_75_2","first-page":"27730","article-title":"Training language models to follow instructions with human feedback","author":"Ouyang Long","year":"2022","unstructured":"Long Ouyang, Jeff Wu, Xu Jiang, Diogo Almeida, Carroll L. Wainwright, and Mishkin. 2022. Training language models to follow instructions with human feedback. In Proceedings of the Conference on Neural Information Processing Systems. 27730\u201327744.","journal-title":"Proceedings of the Conference on Neural Information Processing Systems"},{"key":"e_1_3_3_76_2","doi-asserted-by":"publisher","DOI":"10.2307\/2346567"},{"key":"e_1_3_3_77_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2023.emnlp-main.83"},{"key":"e_1_3_3_78_2","doi-asserted-by":"publisher","DOI":"10.5555\/3455716.3455856"},{"key":"e_1_3_3_79_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2023.acl-long.336"},{"key":"e_1_3_3_80_2","first-page":"109","volume-title":"Proceedings of the 3rd Text REtrieval Conference, TREC 1994","author":"Robertson Stephen E.","year":"1995","unstructured":"Stephen E. Robertson, Steve Walker, Susan Jones, Micheline M. Hancock-Beaulieu, and Mike Gatford. 1995. Okapi at TREC-3. In Proceedings of the 3rd Text REtrieval Conference, TREC 1994. NIST, 109\u2013126."},{"key":"e_1_3_3_81_2","doi-asserted-by":"publisher","DOI":"10.1145\/1935826.1935864"},{"key":"e_1_3_3_82_2","doi-asserted-by":"publisher","DOI":"10.1145\/3498366.3505816"},{"key":"e_1_3_3_83_2","volume-title":"Proceedings of the Conference on Neural Information Processing Systems","author":"Sun Weiwei","year":"2023","unstructured":"Weiwei Sun, Lingyong Yan, Zheng Chen, Shuaiqiang Wang, Haichao Zhu, Pengjie Ren, Zhumin Chen, Dawei Yin, Maarten de Rijke, and Zhaochun Ren. 2023. Learning to tokenize for generative retrieval. 
In Proceedings of the Conference on Neural Information Processing Systems."},{"key":"e_1_3_3_84_2","doi-asserted-by":"publisher","DOI":"10.1145\/3580305.3599903"},{"key":"e_1_3_3_85_2","first-page":"21831","volume-title":"Proceedings of the Conference on Neural Information Processing Systems","volume":"35","author":"Tay Yi","year":"2022","unstructured":"Yi Tay, Vinh Q. Tran, Mostafa Dehghani, Jianmo Ni, and Dara Bahri. 2022. Transformer memory as a differentiable search index. In Proceedings of the Conference on Neural Information Processing Systems. Vol. 35, 21831\u201321843."},{"issue":"11","key":"e_1_3_3_86_2","article-title":"Visualizing data using t-SNE.","volume":"9","author":"Maaten Laurens van der","year":"2008","unstructured":"Laurens van der Maaten and Geoffrey Hinton. 2008. Visualizing data using t-SNE. Journal of Machine Learning Research 9, 11 (2008), 2579\u20132605.","journal-title":"Journal of Machine Learning Research"},{"key":"e_1_3_3_87_2","first-page":"69","volume-title":"Proceedings of the Text REtrieval Conference","author":"Voorhees Ellen M.","year":"2004","unstructured":"Ellen M. Voorhees. 2004. Overview of the TREC 2004 robust retrieval track. In Proceedings of the Text REtrieval Conference. 69\u201377."},{"key":"e_1_3_3_88_2","first-page":"25600","volume-title":"Proceedings of the Conference on Neural Information Processing Systems","volume":"35","author":"Wang Yujing","year":"2022","unstructured":"Yujing Wang, Yingyan Hou, Haonan Wang, Ziming Miao, Shibin Wu, Hao Sun, Qi Chen, Yuqing Xia, Chengmin Chi, Guoshuai Zhao, Zheng Liu, Xing Xie, Hao Sun, Weiwei Deng, Qi Zhang, and Mao Yang. 2022. A neural corpus indexer for document retrieval. In Proceedings of the Conference on Neural Information Processing Systems. Vol. 
35, 25600\u201325614."},{"key":"e_1_3_3_89_2","first-page":"16158","volume-title":"Proceedings of the Conference on Neural Information Processing Systems","volume":"34","author":"Wei Colin","year":"2021","unstructured":"Colin Wei, Sang Michael Xie, and Tengyu Ma. 2021. Why do pretrained language models help in downstream tasks? an analysis of head and prompt tuning. In Proceedings of the Conference on Neural Information Processing Systems. Vol. 34, 16158\u201316170."},{"key":"e_1_3_3_90_2","volume-title":"Proceedings of the International Conference on Learning Representations","author":"Wei Jason","year":"2022","unstructured":"Jason Wei, Maarten Bosma, Vincent Zhao, Kelvin Guu, Adams Wei Yu, Brian Lester, Nan Du, Andrew M. Dai, and Quoc V. Le. 2022. Finetuned language models are zero-shot learners. In Proceedings of the International Conference on Learning Representations."},{"key":"e_1_3_3_91_2","doi-asserted-by":"publisher","unstructured":"Ronald J. Williams and David Zipser. 1989. A learning algorithm for continually running fully recurrent neural networks. Neural Comput. 1 2 (1989) 270\u2013280. DOI:10.1162\/NECO.1989.1.2.270","DOI":"10.1162\/NECO.1989.1.2.270"},{"key":"e_1_3_3_92_2","doi-asserted-by":"publisher","DOI":"10.1145\/1390156.1390306"},{"key":"e_1_3_3_93_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.emnlp-main.35"},{"key":"e_1_3_3_94_2","unstructured":"Lee Xiong Chenyan Xiong Ye Li Kwok-Fung Tang Jialin Liu Paul N Bennett Junaid Ahmed and Arnold Overwijk. 2020. Approximate nearest neighbor negative contrastive learning for dense text retrieval. 
In International Conference on Learning Representations."},{"key":"e_1_3_3_95_2","doi-asserted-by":"publisher","DOI":"10.1145\/3366424.3386195"},{"key":"e_1_3_3_96_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D18-1342"},{"key":"e_1_3_3_97_2","volume-title":"Proceedings of the Web Conference","author":"Zeng Hansi","year":"2024","unstructured":"Hansi Zeng, Chen Luo, Bowen Jin, Sheikh Muhammad Sarwar, Tianxin Wei, and Hamed Zamani. 2024. Scalable and effective generative information retrieval. In Proceedings of the Web Conference."},{"key":"e_1_3_3_98_2","doi-asserted-by":"publisher","DOI":"10.1145\/3477495.3531791"},{"key":"e_1_3_3_99_2","first-page":"1503","volume-title":"Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval","author":"Zhan Jingtao","year":"2020","unstructured":"Jingtao Zhan, Jiaxin Mao, Yiqun Liu, Jiafeng Guo, Min Zhang, and Shaoping Ma. 2020. Optimizing dense retrieval model training with hard negatives. In Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval. 1503\u20131512."},{"key":"e_1_3_3_100_2","unstructured":"Jingtao Zhan Jiaxin Mao Yiqun Liu Min Zhang and Shaoping Ma. 2020. RepBERT: Contextualized text embeddings for first-stage retrieval. arXiv:2006.15498. Retrieved from https:\/\/arxiv.org\/abs\/2006.15498"},{"key":"e_1_3_3_101_2","first-page":"11328","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Zhang Jingqing","year":"2020","unstructured":"Jingqing Zhang, Yao Zhao, Mohammad Saleh, and Peter Liu. 2020. Pegasus: Pre-training with extracted gap-sentences for abstractive summarization. In Proceedings of the International Conference on Machine Learning. 
11328\u201311339."},{"key":"e_1_3_3_102_2","series-title":"Proceedings of Machine Learning Research","first-page":"12427","volume-title":"Proceedings of the 38th International Conference on Machine Learning","volume":"139","author":"Zhang Lily","year":"2021","unstructured":"Lily Zhang, Mark Goldstein, and Rajesh Ranganath. 2021. Understanding failures in out-of-distribution detection with deep generative models. In Proceedings of the 38th International Conference on Machine Learning (Proceedings of Machine Learning Research, Vol. 139). 12427\u201312436."},{"key":"e_1_3_3_103_2","doi-asserted-by":"publisher","DOI":"10.1145\/1871437.1871474"},{"key":"e_1_3_3_104_2","volume-title":"Proceedings of the 11th International Conference on Learning Representations","author":"Zhao Yao","year":"2023","unstructured":"Yao Zhao, Misha Khalman, Rishabh Joshi, Shashi Narayan, Mohammad Saleh, and Peter J. Liu. 2023. Calibrating sequence likelihood improves conditional language generation. In Proceedings of the 11th International Conference on Learning Representations."},{"key":"e_1_3_3_105_2","doi-asserted-by":"publisher","DOI":"10.1145\/2766462.2767700"},{"key":"e_1_3_3_106_2","unstructured":"Shengyao Zhuang Houxing Ren Linjun Shou Jian Pei Ming Gong Guido Zuccon and Daxin Jiang. 2023. Bridging the gap between indexing and retrieval for differentiable search index with query generation. 
In Gen-IR@SIGIR 2023: The First Workshop on Generative Information Retrieval."}],"container-title":["ACM Transactions on Information Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3653712","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3653712","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T00:03:58Z","timestamp":1750291438000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3653712"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,4,29]]},"references-count":105,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2024,9,30]]}},"alternative-id":["10.1145\/3653712"],"URL":"https:\/\/doi.org\/10.1145\/3653712","relation":{},"ISSN":["1046-8188","1558-2868"],"issn-type":[{"value":"1046-8188","type":"print"},{"value":"1558-2868","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,4,29]]},"assertion":[{"value":"2023-07-04","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2024-03-14","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2024-04-29","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}