{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,3]],"date-time":"2025-12-03T18:02:11Z","timestamp":1764784931364,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":41,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,7,6]],"date-time":"2022-07-06T00:00:00Z","timestamp":1657065600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/100006785","name":"Google","doi-asserted-by":"publisher","award":["2019 Research Grant"],"award-info":[{"award-number":["2019 Research Grant"]}],"id":[{"id":"10.13039\/100006785","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100004703","name":"Bloomberg L.P.","doi-asserted-by":"publisher","award":["2019 Data Science Research Grant"],"award-info":[{"award-number":["2019 Data Science Research Grant"]}],"id":[{"id":"10.13039\/100004703","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100000266","name":"Engineering and Physical Sciences Research Council","doi-asserted-by":"publisher","award":["EP\/V025708\/1"],"award-info":[{"award-number":["EP\/V025708\/1"]}],"id":[{"id":"10.13039\/501100000266","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,7,6]]},"DOI":"10.1145\/3477495.3531712","type":"proceedings-article","created":{"date-parts":[[2022,7,7]],"date-time":"2022-07-07T15:12:13Z","timestamp":1657206733000},"page":"3067-3077","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":5,"title":["CODEC: Complex Document and Entity Collection"],"prefix":"10.1145","author":[{"given":"Iain","family":"Mackie","sequence":"first","affiliation":[{"name":"University of Glasgow, Glasgow, United Kingdom"}]},{"given":"Paul","family":"Owoicho","sequence":"additional","affiliation":[{"name":"University of Glasgow, Glasgow, United Kingdom"}]},{"given":"Carlos","family":"Gemmell","sequence":"additional","affiliation":[{"name":"University of Glasgow, Glasgow, United Kingdom"}]},{"given":"Sophie","family":"Fischer","sequence":"additional","affiliation":[{"name":"University of Glasgow, Glasgow, United Kingdom"}]},{"given":"Sean","family":"MacAvaney","sequence":"additional","affiliation":[{"name":"University of Glasgow, Glasgow, United Kingdom"}]},{"given":"Jeffrey","family":"Dalton","sequence":"additional","affiliation":[{"name":"University of Glasgow, Glasgow, United Kingdom"}]}],"member":"320","published-online":{"date-parts":[[2022,7,7]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.21236\/ADA460118"},{"key":"e_1_3_2_1_2_1","volume-title":"Proceedings of the 27th international conference on computational linguistics. 1638--1649","author":"Akbik Alan","year":"2018","unstructured":"Alan Akbik , Duncan Blythe , and Roland Vollgraf . 2018 . Contextual string embeddings for sequence labeling . In Proceedings of the 27th international conference on computational linguistics. 1638--1649 . Alan Akbik, Duncan Blythe, and Roland Vollgraf. 2018. Contextual string embeddings for sequence labeling. In Proceedings of the 27th international conference on computational linguistics. 1638--1649."},{"key":"e_1_3_2_1_3_1","volume-title":"Proceedings of the Twenty-Sixth Text REtrieval Conference (TREC","author":"Allan James","year":"2017","unstructured":"James Allan , Donna Harman , Evangelos Kanoulas , Dan Li , Christophe Van Gysel , and Ellen Voorhees . 2017 . TREC 2017 Common Core Track Overview . In Proceedings of the Twenty-Sixth Text REtrieval Conference (TREC 2017). Gaithersburg, Maryland. James Allan, Donna Harman, Evangelos Kanoulas, Dan Li, Christophe Van Gysel, and Ellen Voorhees. 2017. TREC 2017 Common Core Track Overview. In Proceedings of the Twenty-Sixth Text REtrieval Conference (TREC 2017). Gaithersburg, Maryland."},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/2484028.2484165"},{"key":"e_1_3_2_1_5_1","volume-title":"Autoregressive Entity Retrieval. In International Conference on Learning Representations . https:\/\/openreview.net\/forum?id=5k8F6UU39V","author":"Cao Nicola De","year":"2021","unstructured":"Nicola De Cao , Gautier Izacard , Sebastian Riedel , and Fabio Petroni . 2021 . Autoregressive Entity Retrieval. In International Conference on Learning Representations . https:\/\/openreview.net\/forum?id=5k8F6UU39V Nicola De Cao, Gautier Izacard, Sebastian Riedel, and Fabio Petroni. 2021. Autoregressive Entity Retrieval. In International Conference on Learning Representations . https:\/\/openreview.net\/forum?id=5k8F6UU39V"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/3404835.3463035"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/3340531.3412779"},{"key":"e_1_3_2_1_8_1","volume-title":"Overview of the TREC 2020 deep learning track. In Text REtrieval Conference (TREC) . TREC.","author":"Craswell Nick","year":"2021","unstructured":"Nick Craswell , Bhaskar Mitra , Emine Yilmaz , and Daniel Campos . 2021 . Overview of the TREC 2020 deep learning track. In Text REtrieval Conference (TREC) . TREC. Nick Craswell, Bhaskar Mitra, Emine Yilmaz, and Daniel Campos. 2021. Overview of the TREC 2020 deep learning track. In Text REtrieval Conference (TREC) . TREC."},{"key":"e_1_3_2_1_9_1","unstructured":"J Shane Culpepper Guglielmo Faggioli Nicola Ferro and Oren Kurland. 2021. Do hard topics exist? A statistical analysis. In IIR . J Shane Culpepper Guglielmo Faggioli Nicola Ferro and Oren Kurland. 2021. Do hard topics exist? A statistical analysis. In IIR ."},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/2600428.2609628"},{"key":"e_1_3_2_1_11_1","volume-title":"Overview of the INEX 2009 entity ranking track. In International Workshop of the Initiative for the Evaluation of XML Retrieval. Springer, 254--264","author":"Demartini Gianluca","year":"2009","unstructured":"Gianluca Demartini , Tereza Iofciu , and Arjen P de Vries . 2009 . Overview of the INEX 2009 entity ranking track. In International Workshop of the Initiative for the Evaluation of XML Retrieval. Springer, 254--264 . Gianluca Demartini, Tereza Iofciu, and Arjen P de Vries. 2009. Overview of the INEX 2009 entity ranking track. In International Workshop of the Initiative for the Evaluation of XML Retrieval. Springer, 254--264."},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/3331184.3331257"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"crossref","unstructured":"Laura Dietz Manisha Verma Filip Radlinski and Nick Craswell. 2017. TREC Complex Answer Retrieval Overview.. In TREC . Laura Dietz Manisha Verma Filip Radlinski and Nick Craswell. 2017. TREC Complex Answer Retrieval Overview.. In TREC .","DOI":"10.6028\/NIST.SP.500-324.car-overview"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/1571941.1571989"},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/3077136.3080751"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/1835449.1835499"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/3397271.3401075"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/1772690.1772748"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"crossref","unstructured":"Dawn Lawrie James Mayfield Douglas W. Oard and Eugene Yang. 2022. HC4: A New Suite of Test Collections for Ad Hoc CLIR. https:\/\/arxiv.org\/abs\/2201.09992 Dawn Lawrie James Mayfield Douglas W. Oard and Eugene Yang. 2022. HC4: A New Suite of Test Collections for Ad Hoc CLIR. https:\/\/arxiv.org\/abs\/2201.09992","DOI":"10.1007\/978-3-030-99736-6_24"},{"key":"e_1_3_2_1_20_1","volume-title":"PARADE: Passage Representation Aggregation for Document Reranking. arXiv:2008.09093","author":"Li Canjia","year":"2020","unstructured":"Canjia Li , Andrew Yates , Sean MacAvaney , Ben He , and Yingfei Sun . 2020 . PARADE: Passage Representation Aggregation for Document Reranking. arXiv:2008.09093 (2020). Canjia Li, Andrew Yates, Sean MacAvaney, Ben He, and Yingfei Sun. 2020. PARADE: Passage Representation Aggregation for Document Reranking. arXiv:2008.09093 (2020)."},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/3404835.3463238"},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/3341981.3344223"},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"crossref","unstructured":"Sean MacAvaney Craig Macdonald and Iadh Ounis. 2022. Streamlining Evaluation with ir-measures. In ECIR . https:\/\/arxiv.org\/abs\/2111.13466 Sean MacAvaney Craig Macdonald and Iadh Ounis. 2022. Streamlining Evaluation with ir-measures. In ECIR . https:\/\/arxiv.org\/abs\/2111.13466","DOI":"10.1007\/978-3-030-99739-7_38"},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1145\/3331184.3331317"},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"crossref","unstructured":"Sean MacAvaney Andrew Yates Sergey Feldman Doug Downey Arman Cohan and Nazli Goharian. 2021. Simplified Data Wrangling with ir_datasets. In SIGIR . Sean MacAvaney Andrew Yates Sergey Feldman Doug Downey Arman Cohan and Nazli Goharian. 2021. Simplified Data Wrangling with ir_datasets. In SIGIR .","DOI":"10.1145\/3404835.3463254"},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/3404835.3463262"},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/3197026.3197047"},{"key":"e_1_3_2_1_28_1","volume-title":"MS MARCO: A Human Generated MAchine Reading COmprehension Dataset . arXiv:1611.09268v1","author":"Nguyen Tri","year":"2016","unstructured":"Tri Nguyen , Mir Rosenberg , Xia Song , Jianfeng Gao , Saurabh Tiwary , Rangan Majumder , and Li Deng . 2016 . MS MARCO: A Human Generated MAchine Reading COmprehension Dataset . arXiv:1611.09268v1 (2016). Tri Nguyen, Mir Rosenberg, Xia Song, Jianfeng Gao, Saurabh Tiwary, Rangan Majumder, and Li Deng. 2016. MS MARCO: A Human Generated MAchine Reading COmprehension Dataset . arXiv:1611.09268v1 (2016)."},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.findings-emnlp.63"},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.naacl-main.200"},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/3340531.3412875"},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4471-2099-5_24"},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2018.07.003"},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.emnlp-main.496"},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1145\/1321440.1321528"},{"volume-title":"Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '20)","author":"van Hulst Johannes M.","key":"e_1_3_2_1_36_1","unstructured":"Johannes M. van Hulst , Faegheh Hasibi , Koen Dercksen , Krisztian Balog , and Arjen P . de Vries. 2020. REL: An Entity Linker Standing on the Shoulders of Giants . In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '20) . ACM. Johannes M. van Hulst, Faegheh Hasibi, Koen Dercksen, Krisztian Balog, and Arjen P. de Vries. 2020. REL: An Entity Linker Standing on the Shoulders of Giants. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '20). ACM."},{"key":"e_1_3_2_1_37_1","volume-title":"Proceedings of the Thirteenth Text REtrieval Conference (TREC","author":"Voorhees Ellen M.","year":"2004","unstructured":"Ellen M. Voorhees . 2004 . Overview of the TREC 2004 Robust Track . In Proceedings of the Thirteenth Text REtrieval Conference (TREC 2004). Gaithersburg, Maryland, 52--69. Ellen M. Voorhees. 2004. Overview of the TREC 2004 Robust Track. In Proceedings of the Thirteenth Text REtrieval Conference (TREC 2004). Gaithersburg, Maryland, 52--69."},{"key":"e_1_3_2_1_38_1","volume-title":"2020 a. Zero-shot Entity Linking with Dense Entity Retrieval. CoRR abs\/1911.03814","author":"Wu Ledell","year":"2019","unstructured":"Ledell Wu , Fabio Petroni , Martin Josifoski , Sebastian Riedel , and Luke Zettlemoyer . 2020 a. Zero-shot Entity Linking with Dense Entity Retrieval. CoRR abs\/1911.03814 ( 2019 ). (2020). Ledell Wu, Fabio Petroni, Martin Josifoski, Sebastian Riedel, and Luke Zettlemoyer. 2020 a. Zero-shot Entity Linking with Dense Entity Retrieval. CoRR abs\/1911.03814 (2019). (2020)."},{"key":"e_1_3_2_1_39_1","unstructured":"Ledell Yu Wu F. Petroni Martin Josifoski Sebastian Riedel and Luke Zettlemoyer. 2020 b. Zero-shot Entity Linking with Dense Entity Retrieval. In EMNLP . Ledell Yu Wu F. Petroni Martin Josifoski Sebastian Riedel and Luke Zettlemoyer. 2020 b. Zero-shot Entity Linking with Dense Entity Retrieval. In EMNLP ."},{"key":"e_1_3_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1145\/3077136.3080768"},{"key":"e_1_3_2_1_41_1","volume-title":"Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text Retrieval. In International Conference on Learning Representations .","author":"Xiong Lee","year":"2020","unstructured":"Lee Xiong , Chenyan Xiong , Ye Li , Kwok-Fung Tang , Jialin Liu , Paul N Bennett , Junaid Ahmed , and Arnold Overwijk . 2020 . Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text Retrieval. In International Conference on Learning Representations . Lee Xiong, Chenyan Xiong, Ye Li, Kwok-Fung Tang, Jialin Liu, Paul N Bennett, Junaid Ahmed, and Arnold Overwijk. 2020. Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text Retrieval. In International Conference on Learning Representations ."}],"event":{"name":"SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval","sponsor":["SIGIR ACM Special Interest Group on Information Retrieval"],"location":"Madrid Spain","acronym":"SIGIR '22"},"container-title":["Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3477495.3531712","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3477495.3531712","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T19:02:07Z","timestamp":1750186927000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3477495.3531712"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,7,6]]},"references-count":41,"alternative-id":["10.1145\/3477495.3531712","10.1145\/3477495"],"URL":"https:\/\/doi.org\/10.1145\/3477495.3531712","relation":{},"subject":[],"published":{"date-parts":[[2022,7,6]]},"assertion":[{"value":"2022-07-07","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}