{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,23]],"date-time":"2026-03-23T23:28:09Z","timestamp":1774308489667,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":54,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,7,6]],"date-time":"2022-07-06T00:00:00Z","timestamp":1657065600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["U1936104, 62192784, 62022077, 61976198"],"award-info":[{"award-number":["U1936104, 62192784, 62022077, 61976198"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,7,6]]},"DOI":"10.1145\/3477495.3531799","type":"proceedings-article","created":{"date-parts":[[2022,7,7]],"date-time":"2022-07-07T15:12:13Z","timestamp":1657206733000},"page":"1513-1523","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":28,"title":["Distill-VQ"],"prefix":"10.1145","author":[{"given":"Shitao","family":"Xiao","sequence":"first","affiliation":[{"name":"Beijing University of Posts and Telecommunications, Beijing, China"}]},{"given":"Zheng","family":"Liu","sequence":"additional","affiliation":[{"name":"Microsoft Research Asia, Beijing, China"}]},{"given":"Weihao","family":"Han","sequence":"additional","affiliation":[{"name":"Microsoft Search Technology Center, Beijing, China"}]},{"given":"Jianjin","family":"Zhang","sequence":"additional","affiliation":[{"name":"Microsoft Search Technology Center, Beijing, China"}]},{"given":"Defu","family":"Lian","sequence":"additional","affiliation":[{"name":"University of Science and Technology of China, Hefei, China"}]},{"given":"Yeyun","family":"Gong","sequence":"additional","affiliation":[{"name":"Microsoft Research Asia, Beijing, China"}]},{"given":"Qi","family":"Chen","sequence":"additional","affiliation":[{"name":"Microsoft Research Asia, Beijing, China"}]},{"given":"Fan","family":"Yang","sequence":"additional","affiliation":[{"name":"Microsoft Research Asia, Beijing, China"}]},{"given":"Hao","family":"Sun","sequence":"additional","affiliation":[{"name":"Microsoft Search Technology Center, Beijing, China"}]},{"given":"Yingxia","family":"Shao","sequence":"additional","affiliation":[{"name":"Beijing University of Posts and Telecommunications, Beijing, China"}]},{"given":"Xing","family":"Xie","sequence":"additional","affiliation":[{"name":"Microsoft Research Asia, Beijing, China"}]}],"member":"320","published-online":{"date-parts":[[2022,7,7]]},"reference":[{"key":"e_1_3_2_2_1_1","volume-title":"The inverted multi-index","author":"Babenko Artem","year":"2014","unstructured":"Artem Babenko and Victor Lempitsky . 2014. The inverted multi-index . IEEE transactions on pattern analysis and machine intelligence 37, 6 ( 2014 ), 1247--1260. Artem Babenko and Victor Lempitsky. 2014. The inverted multi-index. IEEE transactions on pattern analysis and machine intelligence 37, 6 (2014), 1247--1260."},{"key":"e_1_3_2_2_2_1","doi-asserted-by":"crossref","unstructured":"Dmitry Baranchuk Artem Babenko and Yury Malkov. 2018. Revisiting the inverted indices for billion-scale approximate nearest neighbors. In ECCV. 202-- 216.  Dmitry Baranchuk Artem Babenko and Yury Malkov. 2018. Revisiting the inverted indices for billion-scale approximate nearest neighbors. In ECCV. 202-- 216.","DOI":"10.1007\/978-3-030-01258-8_13"},{"key":"e_1_3_2_2_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.104"},{"key":"e_1_3_2_2_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/1273496.1273513"},{"key":"e_1_3_2_2_5_1","volume-title":"Pre-training tasks for embedding-based large-scale retrieval. arXiv preprint arXiv:2002.03932","author":"Chang Wei-Cheng","year":"2020","unstructured":"Wei-Cheng Chang , Felix X Yu , Yin-Wen Chang , Yiming Yang , and Sanjiv Kumar . 2020. Pre-training tasks for embedding-based large-scale retrieval. arXiv preprint arXiv:2002.03932 ( 2020 ). Wei-Cheng Chang, Felix X Yu, Yin-Wen Chang, Yiming Yang, and Sanjiv Kumar. 2020. Pre-training tasks for embedding-based large-scale retrieval. arXiv preprint arXiv:2002.03932 (2020)."},{"key":"e_1_3_2_2_6_1","volume-title":"SPANN: Highly-efficient Billion-scale Approximate Nearest Neighbor Search. arXiv preprint arXiv:2111.08566","author":"Chen Qi","year":"2021","unstructured":"Qi Chen , Bing Zhao , Haidong Wang , Mingqin Li , Chuanjie Liu , Zengzhong Li , Mao Yang , and Jingdong Wang . 2021 . SPANN: Highly-efficient Billion-scale Approximate Nearest Neighbor Search. arXiv preprint arXiv:2111.08566 (2021). Qi Chen, Bing Zhao, Haidong Wang, Mingqin Li, Chuanjie Liu, Zengzhong Li, Mao Yang, and Jingdong Wang. 2021. SPANN: Highly-efficient Billion-scale Approximate Nearest Neighbor Search. arXiv preprint arXiv:2111.08566 (2021)."},{"key":"e_1_3_2_2_7_1","volume-title":"Big self-supervised models are strong semi-supervised learners. arXiv preprint arXiv:2006.10029","author":"Chen Ting","year":"2020","unstructured":"Ting Chen , Simon Kornblith , Kevin Swersky , Mohammad Norouzi , and Geoffrey Hinton . 2020. Big self-supervised models are strong semi-supervised learners. arXiv preprint arXiv:2006.10029 ( 2020 ). Ting Chen, Simon Kornblith, Kevin Swersky, Mohammad Norouzi, and Geoffrey Hinton. 2020. Big self-supervised models are strong semi-supervised learners. arXiv preprint arXiv:2006.10029 (2020)."},{"key":"e_1_3_2_2_8_1","volume-title":"International Conference on Machine Learning. PMLR, 1617--1626","author":"Chen Ting","year":"2020","unstructured":"Ting Chen , Lala Li , and Yizhou Sun . 2020 . Differentiable product quantization for end-to-end embedding compression . In International Conference on Machine Learning. PMLR, 1617--1626 . Ting Chen, Lala Li, and Yizhou Sun. 2020. Differentiable product quantization for end-to-end embedding compression. In International Conference on Machine Learning. PMLR, 1617--1626."},{"key":"e_1_3_2_2_9_1","volume-title":"Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805","author":"Devlin Jacob","year":"2018","unstructured":"Jacob Devlin , Ming-Wei Chang , Kenton Lee , and Kristina Toutanova . 2018 . Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018). Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2018. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018)."},{"key":"e_1_3_2_2_10_1","volume-title":"Unsupervised corpus aware language model pre-training for dense passage retrieval. arXiv preprint arXiv:2108.05540","author":"Gao Luyu","year":"2021","unstructured":"Luyu Gao and Jamie Callan . 2021. Unsupervised corpus aware language model pre-training for dense passage retrieval. arXiv preprint arXiv:2108.05540 ( 2021 ). Luyu Gao and Jamie Callan. 2021. Unsupervised corpus aware language model pre-training for dense passage retrieval. arXiv preprint arXiv:2108.05540 (2021)."},{"key":"e_1_3_2_2_11_1","volume-title":"Optimized product quantization","author":"Ge Tiezheng","year":"2013","unstructured":"Tiezheng Ge , Kaiming He , Qifa Ke , and Jian Sun . 2013. Optimized product quantization . IEEE transactions on pattern analysis and machine intelligence 36, 4 ( 2013 ), 744--755. Tiezheng Ge, Kaiming He, Qifa Ke, and Jian Sun. 2013. Optimized product quantization. IEEE transactions on pattern analysis and machine intelligence 36, 4 (2013), 744--755."},{"key":"e_1_3_2_2_12_1","volume-title":"Learning dense representations for entity retrieval. arXiv preprint arXiv:1909.10506","author":"Gillick Daniel","year":"2019","unstructured":"Daniel Gillick , Sayali Kulkarni , Larry Lansing , Alessandro Presta , Jason Baldridge , Eugene Ie , and Diego Garcia-Olano . 2019. Learning dense representations for entity retrieval. arXiv preprint arXiv:1909.10506 ( 2019 ). Daniel Gillick, Sayali Kulkarni, Larry Lansing, Alessandro Presta, Jason Baldridge, Eugene Ie, and Diego Garcia-Olano. 2019. Learning dense representations for entity retrieval. arXiv preprint arXiv:1909.10506 (2019)."},{"key":"e_1_3_2_2_13_1","volume-title":"International Conference on Machine Learning. PMLR, 3887--3896","author":"Guo Ruiqi","year":"2020","unstructured":"Ruiqi Guo , Philip Sun , Erik Lindgren , Quan Geng , David Simcha , Felix Chern , and Sanjiv Kumar . 2020 . Accelerating large-scale inference with anisotropic vector quantization . In International Conference on Machine Learning. PMLR, 3887--3896 . Ruiqi Guo, Philip Sun, Erik Lindgren, Quan Geng, David Simcha, Felix Chern, and Sanjiv Kumar. 2020. Accelerating large-scale inference with anisotropic vector quantization. In International Conference on Machine Learning. PMLR, 3887--3896."},{"key":"e_1_3_2_2_14_1","volume-title":"Distilling the knowledge in a neural network. arXiv preprint arXiv:1503.02531","author":"Hinton Geoffrey","year":"2015","unstructured":"Geoffrey Hinton , Oriol Vinyals , and Jeff Dean . 2015. Distilling the knowledge in a neural network. arXiv preprint arXiv:1503.02531 ( 2015 ). Geoffrey Hinton, Oriol Vinyals, and Jeff Dean. 2015. Distilling the knowledge in a neural network. arXiv preprint arXiv:1503.02531 (2015)."},{"key":"e_1_3_2_2_15_1","volume-title":"Improving efficient neural ranking models with crossarchitecture knowledge distillation. arXiv preprint arXiv:2010.02666","author":"Hofst\u00e4tter Sebastian","year":"2020","unstructured":"Sebastian Hofst\u00e4tter , Sophia Althammer , Michael Schr\u00f6der , Mete Sertkan , and Allan Hanbury . 2020. Improving efficient neural ranking models with crossarchitecture knowledge distillation. arXiv preprint arXiv:2010.02666 ( 2020 ). Sebastian Hofst\u00e4tter, Sophia Althammer, Michael Schr\u00f6der, Mete Sertkan, and Allan Hanbury. 2020. Improving efficient neural ranking models with crossarchitecture knowledge distillation. arXiv preprint arXiv:2010.02666 (2020)."},{"key":"e_1_3_2_2_16_1","volume-title":"Efficiently Teaching an Effective Dense Retriever with Balanced Topic Aware Sampling. arXiv preprint arXiv:2104.06967","author":"Hofst\u00e4tter Sebastian","year":"2021","unstructured":"Sebastian Hofst\u00e4tter , Sheng-Chieh Lin , Jheng-Hong Yang , Jimmy Lin , and Allan Hanbury . 2021. Efficiently Teaching an Effective Dense Retriever with Balanced Topic Aware Sampling. arXiv preprint arXiv:2104.06967 ( 2021 ). Sebastian Hofst\u00e4tter, Sheng-Chieh Lin, Jheng-Hong Yang, Jimmy Lin, and Allan Hanbury. 2021. Efficiently Teaching an Effective Dense Retriever with Balanced Topic Aware Sampling. arXiv preprint arXiv:2104.06967 (2021)."},{"key":"e_1_3_2_2_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/3394486.3403305"},{"key":"e_1_3_2_2_18_1","volume-title":"Product quantization for nearest neighbor search","author":"Jegou Herve","year":"2010","unstructured":"Herve Jegou , Matthijs Douze , and Cordelia Schmid . 2010. Product quantization for nearest neighbor search . IEEE transactions on pattern analysis and machine intelligence 33, 1 ( 2010 ), 117--128. Herve Jegou, Matthijs Douze, and Cordelia Schmid. 2010. Product quantization for nearest neighbor search. IEEE transactions on pattern analysis and machine intelligence 33, 1 (2010), 117--128."},{"key":"e_1_3_2_2_19_1","volume-title":"Searching in one billion vectors: re-rank with source coding","author":"J\u00e9gou Herv\u00e9","unstructured":"Herv\u00e9 J\u00e9gou , Romain Tavenard , Matthijs Douze , and Laurent Amsaleg . 2011. Searching in one billion vectors: re-rank with source coding . In ICASSP. IEEE , 861--864. Herv\u00e9 J\u00e9gou, Romain Tavenard, Matthijs Douze, and Laurent Amsaleg. 2011. Searching in one billion vectors: re-rank with source coding. In ICASSP. IEEE, 861--864."},{"key":"e_1_3_2_2_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/3404835.3462941"},{"key":"e_1_3_2_2_21_1","volume-title":"Tinybert: Distilling bert for natural language understanding. arXiv preprint arXiv:1909.10351","author":"Jiao Xiaoqi","year":"2019","unstructured":"Xiaoqi Jiao , Yichun Yin , Lifeng Shang , Xin Jiang , Xiao Chen , Linlin Li , Fang Wang , and Qun Liu . 2019 . Tinybert: Distilling bert for natural language understanding. arXiv preprint arXiv:1909.10351 (2019). Xiaoqi Jiao, Yichun Yin, Lifeng Shang, Xin Jiang, Xiao Chen, Linlin Li, Fang Wang, and Qun Liu. 2019. Tinybert: Distilling bert for natural language understanding. arXiv preprint arXiv:1909.10351 (2019)."},{"key":"e_1_3_2_2_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/TBDATA.2019.2921572"},{"key":"e_1_3_2_2_23_1","doi-asserted-by":"crossref","unstructured":"Vladimir Karpukhin Barlas Oguz Sewon Min Patrick Lewis Ledell Wu Sergey Edunov Danqi Chen and Wen-tau Yih. 2020. Dense Passage Retrieval for OpenDomain Question Answering. In EMNLP. 6769--6781.  Vladimir Karpukhin Barlas Oguz Sewon Min Patrick Lewis Ledell Wu Sergey Edunov Danqi Chen and Wen-tau Yih. 2020. Dense Passage Retrieval for OpenDomain Question Answering. In EMNLP. 6769--6781.","DOI":"10.18653\/v1\/2020.emnlp-main.550"},{"key":"e_1_3_2_2_24_1","volume-title":"Convolutional Neural Networks for Sentence Classification","author":"Kim Yoon","unstructured":"Yoon Kim . 2014. Convolutional Neural Networks for Sentence Classification . In EMNLP. Association for Computational Linguistics , Doha, Qatar , 1746--1751. https:\/\/doi.org\/10.3115\/v1\/D14--1181 10.3115\/v1 Yoon Kim. 2014. Convolutional Neural Networks for Sentence Classification. In EMNLP. Association for Computational Linguistics, Doha, Qatar, 1746--1751. https:\/\/doi.org\/10.3115\/v1\/D14--1181"},{"key":"e_1_3_2_2_25_1","doi-asserted-by":"publisher","DOI":"10.1162\/tacl_a_00276"},{"key":"e_1_3_2_2_26_1","volume-title":"AdsGNN: BehaviorGraph Augmented Relevance Modeling in Sponsored Search. arXiv preprint arXiv:2104.12080","author":"Li Chaozhuo","year":"2021","unstructured":"Chaozhuo Li , Bochen Pang , Yuming Liu , Hao Sun , Zheng Liu , Xing Xie , Tianqi Yang , Yanling Cui , Liangjie Zhang , and Qi Zhang . 2021. AdsGNN: BehaviorGraph Augmented Relevance Modeling in Sponsored Search. arXiv preprint arXiv:2104.12080 ( 2021 ). Chaozhuo Li, Bochen Pang, Yuming Liu, Hao Sun, Zheng Liu, Xing Xie, Tianqi Yang, Yanling Cui, Liangjie Zhang, and Qi Zhang. 2021. AdsGNN: BehaviorGraph Augmented Relevance Modeling in Sponsored Search. arXiv preprint arXiv:2104.12080 (2021)."},{"key":"e_1_3_2_2_27_1","volume-title":"Embedding-based Product Retrieval in Taobao Search. arXiv preprint arXiv:2106.09297","author":"Li Sen","year":"2021","unstructured":"Sen Li , Fuyu Lv , Taiwei Jin , Guli Lin , Keping Yang , Xiaoyi Zeng , Xiao-Ming Wu , and Qianli Ma. 2021. Embedding-based Product Retrieval in Taobao Search. arXiv preprint arXiv:2106.09297 ( 2021 ). Sen Li, Fuyu Lv, Taiwei Jin, Guli Lin, Keping Yang, Xiaoyi Zeng, Xiao-Ming Wu, and Qianli Ma. 2021. Embedding-based Product Retrieval in Taobao Search. arXiv preprint arXiv:2106.09297 (2021)."},{"key":"e_1_3_2_2_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2019.2909204"},{"key":"e_1_3_2_2_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/3366423.3380151"},{"key":"e_1_3_2_2_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/3447548.3467149"},{"key":"e_1_3_2_2_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/3340531.3412747"},{"key":"e_1_3_2_2_32_1","volume-title":"MS MARCO: A human generated machine reading comprehension dataset. In CoCo@ NIPS.","author":"Nguyen Tri","year":"2016","unstructured":"Tri Nguyen , Mir Rosenberg , Xia Song , Jianfeng Gao , Saurabh Tiwary , Rangan Majumder , and Li Deng . 2016 . MS MARCO: A human generated machine reading comprehension dataset. In CoCo@ NIPS. Tri Nguyen, Mir Rosenberg, Xia Song, Jianfeng Gao, Saurabh Tiwary, Rangan Majumder, and Li Deng. 2016. MS MARCO: A human generated machine reading comprehension dataset. In CoCo@ NIPS."},{"key":"e_1_3_2_2_33_1","volume-title":"Shujun Wang, Zhiyong Feng, and Guohui Xiao.","author":"Qin Xiaoyu","year":"2021","unstructured":"Xiaoyu Qin , Xiaowang Zhang , Muhammad Qasim Yasin , Shujun Wang, Zhiyong Feng, and Guohui Xiao. 2021 . SUMA : A Partial Materialization-Based Scalable Query Answering in OWL 2 DL. Data Science and Engineering ( 2021), 229--245. Xiaoyu Qin, Xiaowang Zhang, Muhammad Qasim Yasin, Shujun Wang, Zhiyong Feng, and Guohui Xiao. 2021. SUMA: A Partial Materialization-Based Scalable Query Answering in OWL 2 DL. Data Science and Engineering (2021), 229--245."},{"key":"e_1_3_2_2_34_1","volume-title":"Daxiang Dong, Hua Wu, and Haifeng Wang.","author":"Qu Yingqi","year":"2021","unstructured":"Yingqi Qu , Yuchen Ding , Jing Liu , Kai Liu , Ruiyang Ren , Wayne Xin Zhao , Daxiang Dong, Hua Wu, and Haifeng Wang. 2021 . RocketQA: An optimized training approach to dense passage retrieval for open-domain question answering. In NAACL. 5835--5847. Yingqi Qu, Yuchen Ding, Jing Liu, Kai Liu, Ruiyang Ren, Wayne Xin Zhao, Daxiang Dong, Hua Wu, and Haifeng Wang. 2021. RocketQA: An optimized training approach to dense passage retrieval for open-domain question answering. In NAACL. 5835--5847."},{"key":"e_1_3_2_2_35_1","volume-title":"Sentence-bert: Sentence embeddings using siamese bert-networks. arXiv preprint arXiv:1908.10084","author":"Reimers Nils","year":"2019","unstructured":"Nils Reimers and Iryna Gurevych . 2019 . Sentence-bert: Sentence embeddings using siamese bert-networks. arXiv preprint arXiv:1908.10084 (2019). Nils Reimers and Iryna Gurevych. 2019. Sentence-bert: Sentence embeddings using siamese bert-networks. arXiv preprint arXiv:1908.10084 (2019)."},{"key":"e_1_3_2_2_36_1","volume-title":"Qiaoqiao She, Hua Wu, Haifeng Wang, and Ji-Rong Wen.","author":"Ren Ruiyang","year":"2021","unstructured":"Ruiyang Ren , Yingqi Qu , Jing Liu , Wayne Xin Zhao , Qiaoqiao She, Hua Wu, Haifeng Wang, and Ji-Rong Wen. 2021 . RocketQAv2: A Joint Training Method for Dense Passage Retrieval and Passage Re-ranking . arXiv preprint arXiv:2110.07367 (2021). Ruiyang Ren, Yingqi Qu, Jing Liu, Wayne Xin Zhao, Qiaoqiao She, Hua Wu, Haifeng Wang, and Ji-Rong Wen. 2021. RocketQAv2: A Joint Training Method for Dense Passage Retrieval and Passage Re-ranking. arXiv preprint arXiv:2110.07367 (2021)."},{"key":"e_1_3_2_2_37_1","volume-title":"BPR: Bayesian personalized ranking from implicit feedback. arXiv preprint arXiv:1205.2618","author":"Rendle Steffen","year":"2012","unstructured":"Steffen Rendle , Christoph Freudenthaler , Zeno Gantner , and Lars Schmidt-Thieme . 2012 . BPR: Bayesian personalized ranking from implicit feedback. arXiv preprint arXiv:1205.2618 (2012). Steffen Rendle, Christoph Freudenthaler, Zeno Gantner, and Lars Schmidt-Thieme. 2012. BPR: Bayesian personalized ranking from implicit feedback. arXiv preprint arXiv:1205.2618 (2012)."},{"key":"e_1_3_2_2_38_1","volume-title":"a distilled version of BERT: smaller, faster, cheaper and lighter. arXiv preprint arXiv:1910.01108","author":"Sanh Victor","year":"2019","unstructured":"Victor Sanh , Lysandre Debut , Julien Chaumond , and Thomas Wolf . 2019. DistilBERT , a distilled version of BERT: smaller, faster, cheaper and lighter. arXiv preprint arXiv:1910.01108 ( 2019 ). Victor Sanh, Lysandre Debut, Julien Chaumond, and Thomas Wolf. 2019. DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter. arXiv preprint arXiv:1910.01108 (2019)."},{"key":"e_1_3_2_2_39_1","doi-asserted-by":"crossref","unstructured":"Yelong Shen Xiaodong He Jianfeng Gao Li Deng and Gr\u00e9goire Mesnil. 2014. Learning semantic representations using convolutional neural networks for web search. In WWW. 373--374.  Yelong Shen Xiaodong He Jianfeng Gao Li Deng and Gr\u00e9goire Mesnil. 2014. Learning semantic representations using convolutional neural networks for web search. In WWW. 373--374.","DOI":"10.1145\/2567948.2577348"},{"key":"e_1_3_2_2_40_1","volume-title":"Proceedings of the 33rd International Conference on Neural Information Processing Systems. 13766--13776","author":"Subramanya Suhas Jayaram","year":"2019","unstructured":"Suhas Jayaram Subramanya , Rohan Kadekodi , Ravishankar Krishaswamy , and Harsha Vardhan Simhadri . 2019 . Diskann: Fast accurate billion-point nearest neighbor search on a single node . In Proceedings of the 33rd International Conference on Neural Information Processing Systems. 13766--13776 . Suhas Jayaram Subramanya, Rohan Kadekodi, Ravishankar Krishaswamy, and Harsha Vardhan Simhadri. 2019. Diskann: Fast accurate billion-point nearest neighbor search on a single node. In Proceedings of the 33rd International Conference on Neural Information Processing Systems. 13766--13776."},{"key":"e_1_3_2_2_41_1","volume-title":"Patient knowledge distillation for bert model compression. arXiv preprint arXiv:1908.09355","author":"Sun Siqi","year":"2019","unstructured":"Siqi Sun , Yu Cheng , Zhe Gan , and Jingjing Liu . 2019. Patient knowledge distillation for bert model compression. arXiv preprint arXiv:1908.09355 ( 2019 ). Siqi Sun, Yu Cheng, Zhe Gan, and Jingjing Liu. 2019. Patient knowledge distillation for bert model compression. arXiv preprint arXiv:1908.09355 (2019)."},{"key":"e_1_3_2_2_42_1","doi-asserted-by":"publisher","DOI":"10.1145\/3442381.3449946"},{"key":"e_1_3_2_2_43_1","doi-asserted-by":"crossref","unstructured":"Shitao Xiao Zheng Liu Weihao Han Jianjin Zhang Chaozhuo Li Yingxia Shao Defu Lian Xing Xie Hao Sun Denvy Deng etal 2022. Progressively Optimized Bi-Granular Document Representation for Scalable Embedding Based Retrieval. In WWW.  Shitao Xiao Zheng Liu Weihao Han Jianjin Zhang Chaozhuo Li Yingxia Shao Defu Lian Xing Xie Hao Sun Denvy Deng et al. 2022. Progressively Optimized Bi-Granular Document Representation for Scalable Embedding Based Retrieval. In WWW.","DOI":"10.1145\/3485447.3511957"},{"key":"e_1_3_2_2_44_1","volume-title":"Training Large-Scale News Recommenders with Pretrained Language Models in the Loop. arXiv e-prints","author":"Xiao Shitao","year":"2021","unstructured":"Shitao Xiao , Zheng Liu , Yingxia Shao , Tao Di , and Xing Xie . 2021. Training Large-Scale News Recommenders with Pretrained Language Models in the Loop. arXiv e-prints ( 2021 ), arXiv--2102. Shitao Xiao, Zheng Liu, Yingxia Shao, Tao Di, and Xing Xie. 2021. Training Large-Scale News Recommenders with Pretrained Language Models in the Loop. arXiv e-prints (2021), arXiv--2102."},{"key":"e_1_3_2_2_45_1","doi-asserted-by":"crossref","unstructured":"Shitao Xiao Zheng Liu Yingxia Shao Defu Lian and Xing Xie. 2021. Matchingoriented Embedding Quantization For Ad-hoc Retrieval. In EMNLP. 8119--8129.  Shitao Xiao Zheng Liu Yingxia Shao Defu Lian and Xing Xie. 2021. Matchingoriented Embedding Quantization For Ad-hoc Retrieval. In EMNLP. 8119--8129.","DOI":"10.18653\/v1\/2021.emnlp-main.640"},{"key":"e_1_3_2_2_46_1","doi-asserted-by":"crossref","unstructured":"Qizhe Xie Minh-Thang Luong Eduard Hovy and Quoc V Le. 2020. Self-training with noisy student improves imagenet classification. In ECCV. 10687--10698.  Qizhe Xie Minh-Thang Luong Eduard Hovy and Quoc V Le. 2020. Self-training with noisy student improves imagenet classification. In ECCV. 10687--10698.","DOI":"10.1109\/CVPR42600.2020.01070"},{"key":"e_1_3_2_2_47_1","volume-title":"Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text Retrieval. In International Conference on Learning Representations. https:\/\/openreview.net\/forum?id=zeFrfgyZln","author":"Xiong Lee","year":"2021","unstructured":"Lee Xiong , Chenyan Xiong , Ye Li , Kwok-Fung Tang , Jialin Liu , Paul N. Bennett , Junaid Ahmed , and Arnold Overwijk . 2021 . Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text Retrieval. In International Conference on Learning Representations. https:\/\/openreview.net\/forum?id=zeFrfgyZln Lee Xiong, Chenyan Xiong, Ye Li, Kwok-Fung Tang, Jialin Liu, Paul N. Bennett, Junaid Ahmed, and Arnold Overwijk. 2021. Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text Retrieval. In International Conference on Learning Representations. https:\/\/openreview.net\/forum?id=zeFrfgyZln"},{"key":"e_1_3_2_2_48_1","volume-title":"Guilin Li, Tao Wang, and Jiashi Feng.","author":"Yuan Li","year":"2020","unstructured":"Li Yuan , Francis EH Tay , Guilin Li, Tao Wang, and Jiashi Feng. 2020 . Revisiting knowledge distillation via label smoothing regularization. In ECCV. 3903--3911. Li Yuan, Francis EH Tay, Guilin Li, Tao Wang, and Jiashi Feng. 2020. Revisiting knowledge distillation via label smoothing regularization. In ECCV. 3903--3911."},{"key":"e_1_3_2_2_49_1","unstructured":"C Yue M Long J Wang Z Han and Q Wen. 2016. Deep quantization network for efficient image retrieval. In AAAI. 3457--3463.  C Yue M Long J Wang Z Han and Q Wen. 2016. Deep quantization network for efficient image retrieval. In AAAI. 3457--3463."},{"key":"e_1_3_2_2_50_1","doi-asserted-by":"crossref","unstructured":"Jingtao Zhan Jiaxin Mao Yiqun Liu Jiafeng Guo Min Zhang and Shaoping Ma. 2021. Jointly optimizing query encoder and product quantization to improve retrieval performance. In CIKM. 2487--2496.  Jingtao Zhan Jiaxin Mao Yiqun Liu Jiafeng Guo Min Zhang and Shaoping Ma. 2021. Jointly optimizing query encoder and product quantization to improve retrieval performance. In CIKM. 2487--2496.","DOI":"10.1145\/3459637.3482358"},{"key":"e_1_3_2_2_51_1","doi-asserted-by":"crossref","unstructured":"Jingtao Zhan Jiaxin Mao Yiqun Liu Jiafeng Guo Min Zhang and Shaoping Ma. 2022. Learning Discrete Representations via Constrained Clustering for Effective and Efficient Dense Retrieval. In WSDM. 1328--1336.  Jingtao Zhan Jiaxin Mao Yiqun Liu Jiafeng Guo Min Zhang and Shaoping Ma. 2022. Learning Discrete Representations via Constrained Clustering for Effective and Efficient Dense Retrieval. In WSDM. 1328--1336.","DOI":"10.24963\/ijcai.2022\/754"},{"key":"e_1_3_2_2_52_1","volume-title":"Adversarial Retriever-Ranker for dense text retrieval. arXiv preprint arXiv:2110.03611","author":"Zhang Hang","year":"2021","unstructured":"Hang Zhang , Yeyun Gong , Yelong Shen , Jiancheng Lv , Nan Duan , and Weizhu Chen . 2021. Adversarial Retriever-Ranker for dense text retrieval. arXiv preprint arXiv:2110.03611 ( 2021 ). Hang Zhang, Yeyun Gong, Yelong Shen, Jiancheng Lv, Nan Duan, and Weizhu Chen. 2021. Adversarial Retriever-Ranker for dense text retrieval. arXiv preprint arXiv:2110.03611 (2021)."},{"key":"e_1_3_2_2_53_1","volume-title":"Joint Learning of Deep Retrieval Model and Product Quantization based Embedding Index. arXiv preprint arXiv:2105.03933","author":"Zhang Han","year":"2021","unstructured":"Han Zhang , Hongwei Shen , Yiming Qiu , Yunjiang Jiang , Songlin Wang , Sulong Xu , Yun Xiao , Bo Long , and Wen-Yun Yang . 2021. Joint Learning of Deep Retrieval Model and Product Quantization based Embedding Index. arXiv preprint arXiv:2105.03933 ( 2021 ). Han Zhang, Hongwei Shen, Yiming Qiu, Yunjiang Jiang, Songlin Wang, Sulong Xu, Yun Xiao, Bo Long, and Wen-Yun Yang. 2021. Joint Learning of Deep Retrieval Model and Product Quantization based Embedding Index. arXiv preprint arXiv:2105.03933 (2021)."},{"key":"e_1_3_2_2_54_1","volume-title":"Scalable Multi-grained Cross-modal Similarity Query with Interpretability. Data Science and Engineering","author":"Zhu Mingdong","year":"2021","unstructured":"Mingdong Zhu , Derong Shen , Lixin Xu , and Xianfang Wang . 2021. Scalable Multi-grained Cross-modal Similarity Query with Interpretability. Data Science and Engineering ( 2021 ), 280--293. Mingdong Zhu, Derong Shen, Lixin Xu, and Xianfang Wang. 2021. Scalable Multi-grained Cross-modal Similarity Query with Interpretability. Data Science and Engineering (2021), 280--293."}],"event":{"name":"SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval","location":"Madrid Spain","acronym":"SIGIR '22","sponsor":["SIGIR ACM Special Interest Group on Information Retrieval"]},"container-title":["Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3477495.3531799","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3477495.3531799","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T18:10:25Z","timestamp":1750183825000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3477495.3531799"}},"subtitle":["Learning Retrieval Oriented Vector Quantization By Distilling Knowledge from Dense Embeddings"],"short-title":[],"issued":{"date-parts":[[2022,7,6]]},"references-count":54,"alternative-id":["10.1145\/3477495.3531799","10.1145\/3477495"],"URL":"https:\/\/doi.org\/10.1145\/3477495.3531799","relation":{},"subject":[],"published":{"date-parts":[[2022,7,6]]},"assertion":[{"value":"2022-07-07","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}