{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,6]],"date-time":"2025-10-06T18:45:56Z","timestamp":1759776356453,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":22,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,7,6]],"date-time":"2022-07-06T00:00:00Z","timestamp":1657065600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,7,6]]},"DOI":"10.1145\/3477495.3531835","type":"proceedings-article","created":{"date-parts":[[2022,7,7]],"date-time":"2022-07-07T15:12:13Z","timestamp":1657206733000},"page":"2232-2236","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":8,"title":["Learned Token Pruning in Contextualized Late Interaction over BERT (ColBERT)"],"prefix":"10.1145","author":[{"given":"Carlos","family":"Lassance","sequence":"first","affiliation":[{"name":"Naver Labs Europe, Meylan, France"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Maroua","family":"Maachou","sequence":"additional","affiliation":[{"name":"Naver Labs Europe, Meylan, France"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Joohee","family":"Park","sequence":"additional","affiliation":[{"name":"Naver, Seoul, Republic of Korea"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"St\u00e9phane","family":"Clinchant","sequence":"additional","affiliation":[{"name":"Naver Labs Europe, Meylan, France"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2022,7,7]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"MS MARCO: A Human Generated MAchine Reading COmprehension Dataset. arxiv: 1611.09268 [cs.CL]","author":"Bajaj Payal","year":"2018","unstructured":"Payal Bajaj , Daniel Campos , Nick Craswell , Li Deng , Jianfeng Gao , Xiaodong Liu , Rangan Majumder , Andrew McNamara , Bhaskar Mitra , Tri Nguyen , Mir Rosenberg , Xia Song , Alina Stoica , Saurabh Tiwary , and Tong Wang . 2018 . MS MARCO: A Human Generated MAchine Reading COmprehension Dataset. arxiv: 1611.09268 [cs.CL] Payal Bajaj, Daniel Campos, Nick Craswell, Li Deng, Jianfeng Gao, Xiaodong Liu, Rangan Majumder, Andrew McNamara, Bhaskar Mitra, Tri Nguyen, Mir Rosenberg, Xia Song, Alina Stoica, Saurabh Tiwary, and Tong Wang. 2018. MS MARCO: A Human Generated MAchine Reading COmprehension Dataset. arxiv: 1611.09268 [cs.CL]"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/383952.383958"},{"key":"e_1_3_2_1_3_1","volume-title":"BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. CoRR","author":"Devlin Jacob","year":"2018","unstructured":"Jacob Devlin , Ming-Wei Chang , Kenton Lee , and Kristina Toutanova . 2018 . BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. CoRR , Vol. abs\/ 1810 .04805 (2018). arxiv: 1810.04805 http:\/\/arxiv.org\/abs\/1810.04805 Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2018. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. CoRR , Vol. abs\/1810.04805 (2018). arxiv: 1810.04805 http:\/\/arxiv.org\/abs\/1810.04805"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"crossref","unstructured":"Thibault Formal Carlos Lassance Benjamin Piwowarski and St\u00e9phane Clinchant. 2021. SPLADE v2: Sparse Lexical and Expansion Model for Information Retrieval. arxiv: 2109.10086 [cs.IR]  Thibault Formal Carlos Lassance Benjamin Piwowarski and St\u00e9phane Clinchant. 2021. SPLADE v2: Sparse Lexical and Expansion Model for Information Retrieval. arxiv: 2109.10086 [cs.IR]","DOI":"10.1145\/3404835.3463098"},{"key":"e_1_3_2_1_5_1","unstructured":"Luyu Gao and Jamie Callan. 2021. Unsupervised Corpus Aware Language Model Pre-training for Dense Passage Retrieval. arxiv: 2108.05540 [cs.IR]  Luyu Gao and Jamie Callan. 2021. Unsupervised Corpus Aware Language Model Pre-training for Dense Passage Retrieval. arxiv: 2108.05540 [cs.IR]"},{"key":"e_1_3_2_1_6_1","volume-title":"Proceedings of the 37th International Conference on Machine Learning (Proceedings of Machine Learning Research","author":"Goyal Saurabh","year":"2020","unstructured":"Saurabh Goyal , Anamitra Roy Choudhury , Saurabh Raje , Venkatesan Chakaravarthy , Yogish Sabharwal , and Ashish Verma . 2020 . PoWER-BERT: Accelerating BERT Inference via Progressive Word-vector Elimination . In Proceedings of the 37th International Conference on Machine Learning (Proceedings of Machine Learning Research , Vol. 119), , Hal Daum\u00e9 III and Aarti Singh (Eds.). PMLR, 3690--3699. https:\/\/proceedings.mlr.press\/v119\/goyal20a.html Saurabh Goyal, Anamitra Roy Choudhury, Saurabh Raje, Venkatesan Chakaravarthy, Yogish Sabharwal, and Ashish Verma. 2020. PoWER-BERT: Accelerating BERT Inference via Progressive Word-vector Elimination. In Proceedings of the 37th International Conference on Machine Learning (Proceedings of Machine Learning Research, Vol. 119), , Hal Daum\u00e9 III and Aarti Singh (Eds.). PMLR, 3690--3699. https:\/\/proceedings.mlr.press\/v119\/goyal20a.html"},{"key":"e_1_3_2_1_7_1","volume-title":"Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval","author":"Sebastian","year":"2021","unstructured":"Sebastian Hofst\"atter, Sheng-Chieh Lin , Jheng-Hong Yang , Jimmy Lin , and Allan Hanbury . 2021 . Efficiently Teaching an Effective Dense Retriever with Balanced Topic Aware Sampling . In Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval ( Virtual Event, Canada) (SIGIR '21). Association for Computing Machinery, New York, NY, USA, 113--122. https:\/\/doi.org\/10.1145\/3404835.3462891 10.1145\/3404835.3462891 Sebastian Hofst\"atter, Sheng-Chieh Lin, Jheng-Hong Yang, Jimmy Lin, and Allan Hanbury. 2021. Efficiently Teaching an Effective Dense Retriever with Balanced Topic Aware Sampling. In Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval (Virtual Event, Canada) (SIGIR '21). Association for Computing Machinery, New York, NY, USA, 113--122. https:\/\/doi.org\/10.1145\/3404835.3462891"},{"key":"e_1_3_2_1_8_1","volume-title":"Mitigating the Position Bias of Transformer Models in Passage Re-Ranking. arXiv preprint arXiv:2101.06980","author":"Sebastian","year":"2021","unstructured":"Sebastian Hofst\"atter, Aldo Lipani , Sophia Althammer , Markus Zlabinger , and Allan Hanbury . 2021. Mitigating the Position Bias of Transformer Models in Passage Re-Ranking. arXiv preprint arXiv:2101.06980 ( 2021 ). Sebastian Hofst\"atter, Aldo Lipani, Sophia Althammer, Markus Zlabinger, and Allan Hanbury. 2021. Mitigating the Position Bias of Transformer Models in Passage Re-Ranking. arXiv preprint arXiv:2101.06980 (2021)."},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/3397271.3401075"},{"key":"#cr-split#-e_1_3_2_1_10_1.1","doi-asserted-by":"crossref","unstructured":"Gyuwan Kim and Kyunghyun Cho. 2021. Length-Adaptive Transformer: Train Once with Length Drop Use Anytime with Search. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). Association for Computational Linguistics Online 6501--6511. https:\/\/doi.org\/10.18653\/v1\/2021.acl-long.508 10.18653\/v1","DOI":"10.18653\/v1\/2021.acl-long.508"},{"key":"#cr-split#-e_1_3_2_1_10_1.2","doi-asserted-by":"crossref","unstructured":"Gyuwan Kim and Kyunghyun Cho. 2021. Length-Adaptive Transformer: Train Once with Length Drop Use Anytime with Search. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). Association for Computational Linguistics Online 6501--6511. https:\/\/doi.org\/10.18653\/v1\/2021.acl-long.508","DOI":"10.18653\/v1\/2021.acl-long.508"},{"key":"e_1_3_2_1_11_1","volume-title":"Pretrained Transformers for Text Ranking: BERT and Beyond . arXiv:2010.06467 [cs] (Oct","author":"Lin Jimmy","year":"2020","unstructured":"Jimmy Lin , Rodrigo Nogueira , and Andrew Yates . 2020. Pretrained Transformers for Text Ranking: BERT and Beyond . arXiv:2010.06467 [cs] (Oct . 2020 ). http:\/\/arxiv.org\/abs\/2010.06467 ZSCC : NoCitationData [s0] arXiv: 2010.06467. Jimmy Lin, Rodrigo Nogueira, and Andrew Yates. 2020. Pretrained Transformers for Text Ranking: BERT and Beyond . arXiv:2010.06467 [cs] (Oct. 2020). http:\/\/arxiv.org\/abs\/2010.06467 ZSCC: NoCitationData[s0] arXiv: 2010.06467."},{"key":"e_1_3_2_1_12_1","volume-title":"Passage Re-ranking with BERT. arxiv","author":"Nogueira Rodrigo","year":"1901","unstructured":"Rodrigo Nogueira and Kyunghyun Cho . 2019. Passage Re-ranking with BERT. arxiv : 1901 .04085 [cs.IR] Rodrigo Nogueira and Kyunghyun Cho. 2019. Passage Re-ranking with BERT. arxiv: 1901.04085 [cs.IR]"},{"key":"e_1_3_2_1_13_1","volume-title":"Sentence-bert: Sentence embeddings using siamese bert-networks. arXiv preprint arXiv:1908.10084","author":"Reimers Nils","year":"2019","unstructured":"Nils Reimers and Iryna Gurevych . 2019 . Sentence-bert: Sentence embeddings using siamese bert-networks. arXiv preprint arXiv:1908.10084 (2019). Nils Reimers and Iryna Gurevych. 2019. Sentence-bert: Sentence embeddings using siamese bert-networks. arXiv preprint arXiv:1908.10084 (2019)."},{"key":"e_1_3_2_1_14_1","volume-title":"a distilled version of BERT: smaller, faster, cheaper and lighter. arXiv preprint arXiv:1910.01108","author":"Sanh Victor","year":"2019","unstructured":"Victor Sanh , Lysandre Debut , Julien Chaumond , and Thomas Wolf . 2019. DistilBERT , a distilled version of BERT: smaller, faster, cheaper and lighter. arXiv preprint arXiv:1910.01108 ( 2019 ). Victor Sanh, Lysandre Debut, Julien Chaumond, and Thomas Wolf. 2019. DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter. arXiv preprint arXiv:1910.01108 (2019)."},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"crossref","unstructured":"Keshav Santhanam Omar Khattab Jon Saad-Falcon Christopher Potts and Matei Zaharia. 2021. ColBERTv2: Effective and Efficient Retrieval via Lightweight Late Interaction. arxiv: 2112.01488 [cs.IR]  Keshav Santhanam Omar Khattab Jon Saad-Falcon Christopher Potts and Matei Zaharia. 2021. ColBERTv2: Effective and Efficient Retrieval via Lightweight Late Interaction. arxiv: 2112.01488 [cs.IR]","DOI":"10.18653\/v1\/2022.naacl-main.272"},{"key":"e_1_3_2_1_16_1","unstructured":"Nandan Thakur Nils Reimers Andreas R\u00fcckl\u00e9 Abhishek Srivastava and Iryna Gurevych. 2021. BEIR: A Heterogeneous Benchmark for Zero-shot Evaluation of Information Retrieval Models. In Thirty-fifth Conference on Neural Information Processing Systems Datasets and Benchmarks Track (Round 2) . https:\/\/openreview.net\/forum?id=wCu6T5xFjeJ  Nandan Thakur Nils Reimers Andreas R\u00fcckl\u00e9 Abhishek Srivastava and Iryna Gurevych. 2021. BEIR: A Heterogeneous Benchmark for Zero-shot Evaluation of Information Retrieval Models. In Thirty-fifth Conference on Neural Information Processing Systems Datasets and Benchmarks Track (Round 2) . https:\/\/openreview.net\/forum?id=wCu6T5xFjeJ"},{"key":"e_1_3_2_1_17_1","volume-title":"Query Embedding Pruning for Dense Retrieval. CoRR","author":"Tonellotto Nicola","year":"2021","unstructured":"Nicola Tonellotto and Craig Macdonald . 2021. Query Embedding Pruning for Dense Retrieval. CoRR , Vol. abs\/ 2108 .10341 ( 2021 ). showeprint[arXiv]2108.10341 https:\/\/arxiv.org\/abs\/2108.10341 Nicola Tonellotto and Craig Macdonald. 2021. Query Embedding Pruning for Dense Retrieval. CoRR , Vol. abs\/2108.10341 (2021). showeprint[arXiv]2108.10341 https:\/\/arxiv.org\/abs\/2108.10341"},{"key":"e_1_3_2_1_18_1","volume-title":"MiniLM: Deep Self-Attention Distillation for Task-Agnostic Compression of Pre-Trained Transformers. arxiv","author":"Wang Wenhui","year":"2002","unstructured":"Wenhui Wang , Furu Wei , Li Dong , Hangbo Bao , Nan Yang , and Ming Zhou . 2020. MiniLM: Deep Self-Attention Distillation for Task-Agnostic Compression of Pre-Trained Transformers. arxiv : 2002 .10957 [cs.CL] Wenhui Wang, Furu Wei, Li Dong, Hangbo Bao, Nan Yang, and Ming Zhou. 2020. MiniLM: Deep Self-Attention Distillation for Task-Agnostic Compression of Pre-Trained Transformers. arxiv: 2002.10957 [cs.CL]"},{"key":"e_1_3_2_1_19_1","volume-title":"Pseudo-Relevance Feedback for Multiple Representation Dense Retrieval. In ICTIR '21 , , Faegheh Hasibi, Yi Fang, and Akiko Aizawa (Eds.). ACM, 297--306","author":"Wang Xiao","year":"2021","unstructured":"Xiao Wang , Craig Macdonald , Nicola Tonellotto , and Iadh Ounis . 2021 . Pseudo-Relevance Feedback for Multiple Representation Dense Retrieval. In ICTIR '21 , , Faegheh Hasibi, Yi Fang, and Akiko Aizawa (Eds.). ACM, 297--306 . https:\/\/doi.org\/10.1145\/3471158.3472250 10.1145\/3471158.3472250 Xiao Wang, Craig Macdonald, Nicola Tonellotto, and Iadh Ounis. 2021. Pseudo-Relevance Feedback for Multiple Representation Dense Retrieval. In ICTIR '21 , , Faegheh Hasibi, Yi Fang, and Akiko Aizawa (Eds.). ACM, 297--306. https:\/\/doi.org\/10.1145\/3471158.3472250"},{"key":"e_1_3_2_1_20_1","unstructured":"Ikuya Yamada Akari Asai and Hannaneh Hajishirzi. 2021. Efficient Passage Retrieval with Hashing for Open-domain Question Answering. arxiv: 2106.00882 [cs.CL]  Ikuya Yamada Akari Asai and Hannaneh Hajishirzi. 2021. Efficient Passage Retrieval with Hashing for Open-domain Question Answering. arxiv: 2106.00882 [cs.CL]"},{"key":"e_1_3_2_1_21_1","volume-title":"Inverted files for text search engines. ACM computing surveys (CSUR)","author":"Zobel Justin","year":"2006","unstructured":"Justin Zobel and Alistair Moffat . 2006. Inverted files for text search engines. ACM computing surveys (CSUR) , Vol. 38 , 2 ( 2006 ), 6--es. Justin Zobel and Alistair Moffat. 2006. Inverted files for text search engines. ACM computing surveys (CSUR) , Vol. 38, 2 (2006), 6--es."}],"event":{"name":"SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval","sponsor":["SIGIR ACM Special Interest Group on Information Retrieval"],"location":"Madrid Spain","acronym":"SIGIR '22"},"container-title":["Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3477495.3531835","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3477495.3531835","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T18:10:26Z","timestamp":1750183826000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3477495.3531835"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,7,6]]},"references-count":22,"alternative-id":["10.1145\/3477495.3531835","10.1145\/3477495"],"URL":"https:\/\/doi.org\/10.1145\/3477495.3531835","relation":{},"subject":[],"published":{"date-parts":[[2022,7,6]]},"assertion":[{"value":"2022-07-07","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}