{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,10]],"date-time":"2025-11-10T21:16:01Z","timestamp":1762809361086,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":52,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,10,17]],"date-time":"2022-10-17T00:00:00Z","timestamp":1665964800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,10,17]]},"DOI":"10.1145\/3511808.3557588","type":"proceedings-article","created":{"date-parts":[[2022,10,16]],"date-time":"2022-10-16T01:29:57Z","timestamp":1665883797000},"page":"4464-4469","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":7,"title":["Early Stage Sparse Retrieval with Entity Linking"],"prefix":"10.1145","author":[{"given":"Dahlia","family":"Shehata","sequence":"first","affiliation":[{"name":"University of Waterloo, Waterloo, ON, Canada"}]},{"given":"Negar","family":"Arabzadeh","sequence":"additional","affiliation":[{"name":"University of Waterloo, Waterloo, ON, Canada"}]},{"given":"Charles L. A.","family":"Clarke","sequence":"additional","affiliation":[{"name":"University of Waterloo, Waterloo, ON, Canada"}]}],"member":"320","published-online":{"date-parts":[[2022,10,17]]},"reference":[{"key":"e_1_3_2_2_1_1","volume-title":"MS MARCO Chameleons: Challenging the MS MARCO Leaderboard with Extremely Obstinate Queries","author":"Arabzadeh Negar","year":"2011","unstructured":"Negar Arabzadeh , Bhaskar Mitra , and Ebrahim Bagheri . 2021. MS MARCO Chameleons: Challenging the MS MARCO Leaderboard with Extremely Obstinate Queries . Association for Computing Machinery , New York, NY, USA , 4426--4435. https:\/\/doi.org\/10.1145\/3459637.348 2011 10.1145\/3459637.3482011 Negar Arabzadeh, Bhaskar Mitra, and Ebrahim Bagheri. 2021. MS MARCO Chameleons: Challenging the MS MARCO Leaderboard with Extremely Obstinate Queries. Association for Computing Machinery, New York, NY, USA, 4426--4435. https:\/\/doi.org\/10.1145\/3459637.3482011"},{"key":"e_1_3_2_2_2_1","volume-title":"Clarke","author":"Arabzadeh Negar","year":"2021","unstructured":"Negar Arabzadeh , Xinyi Yan , and Charles L. A . Clarke . 2021 . Predicting Efficiency\/Effectiveness Trade-offs for Dense vs. Sparse Retrieval Strategy Selection . (2021). arXiv:cs.IR\/2109.10739 Negar Arabzadeh, Xinyi Yan, and Charles L. A. Clarke. 2021. Predicting Efficiency\/Effectiveness Trade-offs for Dense vs. Sparse Retrieval Strategy Selection. (2021). arXiv:cs.IR\/2109.10739"},{"key":"e_1_3_2_2_3_1","volume-title":"MS MARCO: A Human Generated MAchine Reading COmprehension Dataset.","author":"Bajaj Payal","year":"2018","unstructured":"Payal Bajaj , Daniel Campos , Nick Craswell , Li Deng , Jianfeng Gao , Xiaodong Liu , Rangan Majumder , Andrew McNamara , Bhaskar Mitra , Tri Nguyen , Mir Rosenberg , Xia Song , Alina Stoica , Saurabh Tiwary , and Tong Wang . 2018 . MS MARCO: A Human Generated MAchine Reading COmprehension Dataset. (2018). arXiv:cs.CL\/1611.09268 Payal Bajaj, Daniel Campos, Nick Craswell, Li Deng, Jianfeng Gao, Xiaodong Liu, Rangan Majumder, Andrew McNamara, Bhaskar Mitra, Tri Nguyen, Mir Rosenberg, Xia Song, Alina Stoica, Saurabh Tiwary, and Tong Wang. 2018. MS MARCO: A Human Generated MAchine Reading COmprehension Dataset. (2018). arXiv:cs.CL\/1611.09268"},{"key":"e_1_3_2_2_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/3130348.3130371"},{"key":"e_1_3_2_2_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/956863.956944"},{"key":"e_1_3_2_2_6_1","volume-title":"Semantic Models for the First-stage Retrieval: A Comprehensive Review. ArXiv abs\/2103.04831","author":"Cai Yinqiong","year":"2021","unstructured":"Yinqiong Cai , Yixing Fan , Jiafeng Guo , Fei Sun , Ruqing Zhang , and Xueqi Cheng . 2021. Semantic Models for the First-stage Retrieval: A Comprehensive Review. ArXiv abs\/2103.04831 ( 2021 ). Yinqiong Cai, Yixing Fan, Jiafeng Guo, Fei Sun, Ruqing Zhang, and Xueqi Cheng. 2021. Semantic Models for the First-stage Retrieval: A Comprehensive Review. ArXiv abs\/2103.04831 (2021)."},{"key":"e_1_3_2_2_7_1","unstructured":"Qiwei Chen Huan Zhao Wei Li Pipei Huang and Wenwu Ou. 2019. Behavior Sequence Transformer for E-commerce Recommendation in Alibaba. (2019). arXiv:cs.IR\/1905.06874 Qiwei Chen Huan Zhao Wei Li Pipei Huang and Wenwu Ou. 2019. Behavior Sequence Transformer for E-commerce Recommendation in Alibaba. (2019). arXiv:cs.IR\/1905.06874"},{"key":"e_1_3_2_2_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/1571941.1572114"},{"key":"e_1_3_2_2_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/1961209.1961211"},{"key":"e_1_3_2_2_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/1961209.1961211"},{"key":"e_1_3_2_2_11_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2019.05.005"},{"key":"e_1_3_2_2_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/3018661.3018692"},{"key":"e_1_3_2_2_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/3132847.3133138"},{"key":"e_1_3_2_2_14_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10115-018-1190-1"},{"key":"e_1_3_2_2_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/1871437.1871689"},{"key":"e_1_3_2_2_16_1","unstructured":"Gopichand G K. Sola C.B. Sai Reddy M.V. Rakesh Kumar and P. Harsha Vardhan. 2020. Vocabulary mismatch avoidance techniques. (2020) 2585 - 2594 pages. Gopichand G K. Sola C.B. Sai Reddy M.V. Rakesh Kumar and P. Harsha Vardhan. 2020. Vocabulary mismatch avoidance techniques. (2020) 2585 - 2594 pages."},{"volume-title":"Proceedings of the Twelfth ACM International Conference onWeb Search and Data Mining (WSDM '19)","author":"Gallagher Luke","key":"e_1_3_2_2_17_1","unstructured":"Luke Gallagher , Ruey-Cheng Chen , Roi Blanco , and J. Shane Culpepper . 2019. Joint Optimization of Cascade Ranking Models . In Proceedings of the Twelfth ACM International Conference onWeb Search and Data Mining (WSDM '19) . Association for Computing Machinery, New York, NY, USA, 15--23. https:\/\/doi.org\/10.1145\/3289600.3290986 10.1145\/3289600.3290986 Luke Gallagher, Ruey-Cheng Chen, Roi Blanco, and J. Shane Culpepper. 2019. Joint Optimization of Cascade Ranking Models. In Proceedings of the Twelfth ACM International Conference onWeb Search and Data Mining (WSDM '19). Association for Computing Machinery, New York, NY, USA, 15--23. https:\/\/doi.org\/10.1145\/3289600.3290986"},{"key":"e_1_3_2_2_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/1008992.1009024"},{"key":"e_1_3_2_2_19_1","volume-title":"Benjamin Van Durme, and Jamie Callan","author":"Gao Luyu","year":"2021","unstructured":"Luyu Gao , Zhuyun Dai , Tongfei Chen , Zhen Fan , Benjamin Van Durme, and Jamie Callan . 2021 . Complementing Lexical Retrieval with Semantic Residual Embedding . (2021). arXiv:cs.IR\/2004.13969 Luyu Gao, Zhuyun Dai, Tongfei Chen, Zhen Fan, Benjamin Van Durme, and Jamie Callan. 2021. Complementing Lexical Retrieval with Semantic Residual Embedding. (2021). arXiv:cs.IR\/2004.13969"},{"key":"e_1_3_2_2_20_1","doi-asserted-by":"crossref","unstructured":"Gustavo Gon\u00e7alves Jo\u00e3o Magalh\u00e3es Chenyan Xiong and Jamie Callan. 2018. Improving Ad Hoc Retrieval With Bag Of Entities. In TREC. Gustavo Gon\u00e7alves Jo\u00e3o Magalh\u00e3es Chenyan Xiong and Jamie Callan. 2018. Improving Ad Hoc Retrieval With Bag Of Entities. In TREC.","DOI":"10.6028\/NIST.SP.500-331.core-NOVASearch"},{"key":"e_1_3_2_2_21_1","doi-asserted-by":"crossref","unstructured":"Sebastian Hofst\u00e4tter Hamed Zamani Bhaskar Mitra Nick Craswell and Allan Hanbury. 2020. Local Self-Attention over Long Text for Efficient Document Retrieval. (2020). arXiv:cs.IR\/2005.04908 Sebastian Hofst\u00e4tter Hamed Zamani Bhaskar Mitra Nick Craswell and Allan Hanbury. 2020. Local Self-Attention over Long Text for Efficient Document Retrieval. (2020). arXiv:cs.IR\/2005.04908","DOI":"10.1145\/3397271.3401224"},{"key":"e_1_3_2_2_22_1","volume-title":"arXiv:cs.IR\/2002.01854","author":"Hofst\u00e4tter Sebastian","year":"2020","unstructured":"Sebastian Hofst\u00e4tter , Markus Zlabinger , and Allan Hanbury . 2020. Interpretable & Time-Budget-Constrained Contextualization for Re-Ranking . ( 2020 ). arXiv:cs.IR\/2002.01854 Sebastian Hofst\u00e4tter, Markus Zlabinger, and Allan Hanbury. 2020. Interpretable & Time-Budget-Constrained Contextualization for Re-Ranking. (2020). arXiv:cs.IR\/2002.01854"},{"key":"e_1_3_2_2_23_1","doi-asserted-by":"publisher","DOI":"10.1145\/2009916.2010018"},{"key":"e_1_3_2_2_24_1","doi-asserted-by":"crossref","unstructured":"Omar Khattab and Matei Zaharia. 2020. ColBERT: Efficient and Effective Passage Search via Contextualized Late Interaction over BERT. (2020). arXiv:cs.IR\/2004.12832 Omar Khattab and Matei Zaharia. 2020. ColBERT: Efficient and Effective Passage Search via Contextualized Late Interaction over BERT. (2020). arXiv:cs.IR\/2004.12832","DOI":"10.1145\/3397271.3401075"},{"key":"e_1_3_2_2_25_1","volume-title":"Leveraging Semantic and Lexical Matching to Improve the Recall of Document Retrieval Systems: A Hybrid Approach. ArXiv abs\/2010.01195","author":"Kuzi Saar","year":"2020","unstructured":"Saar Kuzi , Mingyang Zhang , Cheng Li , Michael Bendersky , and Marc Najork . 2020. Leveraging Semantic and Lexical Matching to Improve the Recall of Document Retrieval Systems: A Hybrid Approach. ArXiv abs\/2010.01195 ( 2020 ). Saar Kuzi, Mingyang Zhang, Cheng Li, Michael Bendersky, and Marc Najork. 2020. Leveraging Semantic and Lexical Matching to Improve the Recall of Document Retrieval Systems: A Hybrid Approach. ArXiv abs\/2010.01195 (2020)."},{"key":"e_1_3_2_2_26_1","volume-title":"Le and Tomas Mikolov","author":"Quoc","year":"2014","unstructured":"Quoc V. Le and Tomas Mikolov . 2014 . Distributed Representations of Sentences and Documents . (2014). arXiv:cs.CL\/1405.4053 Quoc V. Le and Tomas Mikolov. 2014. Distributed Representations of Sentences and Documents. (2014). arXiv:cs.CL\/1405.4053"},{"key":"e_1_3_2_2_27_1","doi-asserted-by":"crossref","unstructured":"Belinda Z. Li Sewon Min Srinivasan Iyer Yashar Mehdad and Wen tau Yih. 2020. Efficient One-Pass End-to-End Entity Linking for Questions. (2020). arXiv:cs.CL\/2010.02413 Belinda Z. Li Sewon Min Srinivasan Iyer Yashar Mehdad and Wen tau Yih. 2020. Efficient One-Pass End-to-End Entity Linking for Questions. (2020). arXiv:cs.CL\/2010.02413","DOI":"10.18653\/v1\/2020.emnlp-main.522"},{"key":"e_1_3_2_2_28_1","doi-asserted-by":"publisher","DOI":"10.1561\/1500000035"},{"key":"e_1_3_2_2_29_1","unstructured":"Sheng-Chieh Lin Jheng-Hong Yang and Jimmy Lin. 2020. Distilling Dense Representations for Ranking using Tightly-Coupled Teachers. (2020). arXiv:cs.IR\/2010.11386 Sheng-Chieh Lin Jheng-Hong Yang and Jimmy Lin. 2020. Distilling Dense Representations for Ranking using Tightly-Coupled Teachers. (2020). arXiv:cs.IR\/2010.11386"},{"key":"e_1_3_2_2_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/3097983.3098011"},{"key":"e_1_3_2_2_31_1","unstructured":"Tie-Yan Liu. 2009. Learning to Rank for Information Retrieval. Tie-Yan Liu. 2009. Learning to Rank for Information Retrieval."},{"key":"e_1_3_2_2_32_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10791-015-9267-x"},{"key":"e_1_3_2_2_33_1","volume-title":"arXiv:cs.CL\/2005.00181","author":"Luan Yi","year":"2021","unstructured":"Yi Luan , Jacob Eisenstein , Kristina Toutanova , and Michael Collins . 2021. Sparse, Dense, and Attentional Representations for Text Retrieval . ( 2021 ). arXiv:cs.CL\/2005.00181 Yi Luan, Jacob Eisenstein, Kristina Toutanova, and Michael Collins. 2021. Sparse, Dense, and Attentional Representations for Text Retrieval. (2021). arXiv:cs.CL\/2005.00181"},{"key":"e_1_3_2_2_34_1","unstructured":"Rodrigo Nogueira. 2019. From doc2query to docTTTTTquery. Rodrigo Nogueira. 2019. From doc2query to docTTTTTquery."},{"key":"e_1_3_2_2_35_1","unstructured":"Rodrigo Nogueira Wei Yang Kyunghyun Cho and Jimmy Lin. 2019. Multi-Stage Document Ranking with BERT. (2019). arXiv:cs.IR\/1910.14424 Rodrigo Nogueira Wei Yang Kyunghyun Cho and Jimmy Lin. 2019. Multi-Stage Document Ranking with BERT. (2019). arXiv:cs.IR\/1910.14424"},{"key":"e_1_3_2_2_36_1","unstructured":"Rodrigo Nogueira Wei Yang Jimmy Lin and Kyunghyun Cho. 2019. Document Expansion by Query Prediction. (2019). Rodrigo Nogueira Wei Yang Jimmy Lin and Kyunghyun Cho. 2019. Document Expansion by Query Prediction. (2019)."},{"key":"e_1_3_2_2_37_1","doi-asserted-by":"publisher","DOI":"10.1145\/2911451.2911508"},{"key":"e_1_3_2_2_38_1","doi-asserted-by":"crossref","unstructured":"Christopher Sciavolino Zexuan Zhong Jinhyuk Lee and Danqi Chen. 2021. Simple Entity-Centric Questions Challenge Dense Retrievers. (2021). arXiv:cs.CL\/2109.08535 Christopher Sciavolino Zexuan Zhong Jinhyuk Lee and Danqi Chen. 2021. Simple Entity-Centric Questions Challenge Dense Retrievers. (2021). arXiv:cs.CL\/2109.08535","DOI":"10.18653\/v1\/2021.emnlp-main.496"},{"key":"e_1_3_2_2_39_1","unstructured":"Wei Shen Yuhan Li Yinan Liu Jiawei Han Jianyong Wang and Xiaojie Yuan. 2021. Entity Linking Meets Deep Learning: Techniques and Solutions. (2021). arXiv:cs.CL\/2109.12520 Wei Shen Yuhan Li Yinan Liu Jiawei Han Jianyong Wang and Xiaojie Yuan. 2021. Entity Linking Meets Deep Learning: Techniques and Solutions. (2021). arXiv:cs.CL\/2109.12520"},{"key":"e_1_3_2_2_40_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.datak.2012.02.003"},{"key":"e_1_3_2_2_41_1","doi-asserted-by":"publisher","DOI":"10.1145\/2009916.2009934"},{"key":"e_1_3_2_2_42_1","volume-title":"COLD: Towards the Next Generation of Pre-Ranking System.","author":"Wang Zhe","year":"2020","unstructured":"Zhe Wang , Liqin Zhao , Biye Jiang , Guorui Zhou , Xiaoqiang Zhu , and Kun Gai . 2020 . COLD: Towards the Next Generation of Pre-Ranking System. (2020). arXiv:cs.IR\/2007.16122 Zhe Wang, Liqin Zhao, Biye Jiang, Guorui Zhou, Xiaoqiang Zhu, and Kun Gai. 2020. COLD: Towards the Next Generation of Pre-Ranking System. (2020). arXiv:cs.IR\/2007.16122"},{"key":"e_1_3_2_2_43_1","doi-asserted-by":"publisher","DOI":"10.2753\/MIS0742-1222240309"},{"volume-title":"Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '06)","author":"Wei Xing","key":"e_1_3_2_2_44_1","unstructured":"Xing Wei and W. Bruce Croft . 2006. LDA-Based Document Models for Ad-Hoc Retrieval . In Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '06) . Association for Computing Machinery, New York, NY, USA, 178--185. https:\/\/doi.org\/10.1145\/1148170.1148204 10.1145\/1148170.1148204 Xing Wei and W. Bruce Croft. 2006. LDA-Based Document Models for Ad-Hoc Retrieval. In Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '06). Association for Computing Machinery, New York, NY, USA, 178--185. https:\/\/doi.org\/10.1145\/1148170.1148204"},{"key":"e_1_3_2_2_45_1","unstructured":"LedellWu Fabio Petroni Martin Josifoski Sebastian Riedel and Luke Zettlemoyer. 2020. Scalable Zero-shot Entity Linking with Dense Entity Retrieval. (2020). arXiv:cs.CL\/1911.03814 LedellWu Fabio Petroni Martin Josifoski Sebastian Riedel and Luke Zettlemoyer. 2020. Scalable Zero-shot Entity Linking with Dense Entity Retrieval. (2020). arXiv:cs.CL\/1911.03814"},{"key":"e_1_3_2_2_46_1","doi-asserted-by":"publisher","DOI":"10.1145\/2806416.2806456"},{"key":"e_1_3_2_2_47_1","doi-asserted-by":"publisher","DOI":"10.1145\/2970398.2970423"},{"volume-title":"Word-Entity Duet Representations for Document Ranking (SIGIR '17)","author":"Xiong Chenyan","key":"e_1_3_2_2_48_1","unstructured":"Chenyan Xiong , Jamie Callan , and Tie-Yan Liu . 2017. Word-Entity Duet Representations for Document Ranking (SIGIR '17) . Association for Computing Machinery , New York, NY, USA , 763--772. https:\/\/doi.org\/10.1145\/3077136.3080768 10.1145\/3077136.3080768 Chenyan Xiong, Jamie Callan, and Tie-Yan Liu. 2017. Word-Entity Duet Representations for Document Ranking (SIGIR '17). Association for Computing Machinery, New York, NY, USA, 763--772. https:\/\/doi.org\/10.1145\/3077136.3080768"},{"key":"e_1_3_2_2_49_1","unstructured":"Lee Xiong Chenyan Xiong Ye Li Kwok-Fung Tang Jialin Liu Paul Bennett Junaid Ahmed and Arnold Overwijk. 2020. Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text Retrieval. (2020). arXiv:cs.IR\/2007.00808 Lee Xiong Chenyan Xiong Ye Li Kwok-Fung Tang Jialin Liu Paul Bennett Junaid Ahmed and Arnold Overwijk. 2020. Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text Retrieval. (2020). arXiv:cs.IR\/2007.00808"},{"volume-title":"Proceedings of the 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '96)","author":"Xu Jinxi","key":"e_1_3_2_2_50_1","unstructured":"Jinxi Xu and W. Bruce Croft . 1996. Query Expansion Using Local and Global Document Analysis . In Proceedings of the 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '96) . Association for Computing Machinery, New York, NY, USA, 4--11. https:\/\/doi.org\/10.1145\/243199.243202 10.1145\/243199.243202 Jinxi Xu and W. Bruce Croft. 1996. Query Expansion Using Local and Global Document Analysis. In Proceedings of the 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '96). Association for Computing Machinery, New York, NY, USA, 4--11. https:\/\/doi.org\/10.1145\/243199.243202"},{"key":"e_1_3_2_2_51_1","doi-asserted-by":"publisher","DOI":"10.1145\/3077136.3080721"},{"key":"e_1_3_2_2_52_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.sustainlp-1.8"}],"event":{"name":"CIKM '22: The 31st ACM International Conference on Information and Knowledge Management","sponsor":["SIGWEB ACM Special Interest Group on Hypertext, Hypermedia, and Web","SIGIR ACM Special Interest Group on Information Retrieval"],"location":"Atlanta GA USA","acronym":"CIKM '22"},"container-title":["Proceedings of the 31st ACM International Conference on Information &amp; Knowledge Management"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3511808.3557588","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3511808.3557588","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T17:51:09Z","timestamp":1750182669000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3511808.3557588"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,10,17]]},"references-count":52,"alternative-id":["10.1145\/3511808.3557588","10.1145\/3511808"],"URL":"https:\/\/doi.org\/10.1145\/3511808.3557588","relation":{},"subject":[],"published":{"date-parts":[[2022,10,17]]},"assertion":[{"value":"2022-10-17","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}