{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,15]],"date-time":"2025-10-15T10:33:38Z","timestamp":1760524418277,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":45,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,10,17]],"date-time":"2022-10-17T00:00:00Z","timestamp":1665964800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,10,17]]},"DOI":"10.1145\/3511808.3557139","type":"proceedings-article","created":{"date-parts":[[2022,10,16]],"date-time":"2022-10-16T01:22:22Z","timestamp":1665883342000},"page":"3654-3663","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":2,"title":["SwiftPruner: Reinforced Evolutionary Pruning for Efficient Ad Relevance"],"prefix":"10.1145","author":[{"given":"Li Lyna","family":"Zhang","sequence":"first","affiliation":[{"name":"Microsoft Research, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Youkow","family":"Homma","sequence":"additional","affiliation":[{"name":"Microsoft, Redmond, WA, WA, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yujing","family":"Wang","sequence":"additional","affiliation":[{"name":"Microsoft, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Min","family":"Wu","sequence":"additional","affiliation":[{"name":"Microsoft, Redmond, WA, WA, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Mao","family":"Yang","sequence":"additional","affiliation":[{"name":"Microsoft Research, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ruofei","family":"Zhang","sequence":"additional","affiliation":[{"name":"Microsoft, 
Redmond, WA, WA, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ting","family":"Cao","sequence":"additional","affiliation":[{"name":"Microsoft Research, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Wei","family":"Shen","sequence":"additional","affiliation":[{"name":"Microsoft, Redmond, WA, WA, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2022,10,17]]},"reference":[{"key":"e_1_3_2_2_1_1","volume-title":"Algorithms for hyper-parameter optimization. Advances in Neural Information Processing Systems","author":"Bergstra James","year":"2011","unstructured":"James Bergstra , R\u00e9mi Bardenet , Yoshua Bengio , and Bal\u00e1zs K\u00e9gl . 2011. Algorithms for hyper-parameter optimization. Advances in Neural Information Processing Systems ( 2011 ), 2546--2554. James Bergstra, R\u00e9mi Bardenet, Yoshua Bengio, and Bal\u00e1zs K\u00e9gl. 2011. Algorithms for hyper-parameter optimization. Advances in Neural Information Processing Systems (2011), 2546--2554."},{"key":"e_1_3_2_2_2_1","unstructured":"Microsoft blog. 2021a. Microsoft open sources breakthrough optimizations for transformer inference on GPU and CPU. https:\/\/cloudblogs.microsoft.com\/opensource\/2020\/01\/21\/microsoft-onnx-open-source-optimizations-transformer-inference-gpu-cpu\/  Microsoft blog. 2021a. Microsoft open sources breakthrough optimizations for transformer inference on GPU and CPU. https:\/\/cloudblogs.microsoft.com\/opensource\/2020\/01\/21\/microsoft-onnx-open-source-optimizations-transformer-inference-gpu-cpu\/"},{"key":"e_1_3_2_2_3_1","unstructured":"Microsoft blog. 2021b. Optimizing BERT model for Intel CPU Cores using ONNX runtime default execution provider. https:\/\/cloudblogs.microsoft.com\/opensource\/2021\/03\/01\/optimizing-bert-model-for-intel-cpu-cores-using-onnx-runtime-default-execution-provider\/  Microsoft blog. 2021b. 
Optimizing BERT model for Intel CPU Cores using ONNX runtime default execution provider. https:\/\/cloudblogs.microsoft.com\/opensource\/2021\/03\/01\/optimizing-bert-model-for-intel-cpu-cores-using-onnx-runtime-default-execution-provider\/"},{"key":"e_1_3_2_2_4_1","unstructured":"Han Cai Chuang Gan Tianzhe Wang Zhekai Zhang and Song Han. 2020. Once-for-All: Train One Network and Specialize it for Efficient Deployment. In ICLR.  Han Cai Chuang Gan Tianzhe Wang Zhekai Zhang and Song Han. 2020. Once-for-All: Train One Network and Specialize it for Efficient Deployment. In ICLR."},{"key":"e_1_3_2_2_5_1","unstructured":"Wentao Chen Hailong Qiu Jian Zhuang Chutong Zhang Yu Hu Qing Lu Tianchen Wang Yiyu Shi Meiping Huang and Xiaowe Xu. 2021. Quantization of Deep Neural Networks for Accurate Edge Computing. arxiv: 2104.12046 [cs.CV]  Wentao Chen Hailong Qiu Jian Zhuang Chutong Zhang Yu Hu Qing Lu Tianchen Wang Yiyu Shi Meiping Huang and Xiaowe Xu. 2021. Quantization of Deep Neural Networks for Accurate Edge Computing. arxiv: 2104.12046 [cs.CV]"},{"key":"e_1_3_2_2_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00492"},{"key":"e_1_3_2_2_7_1","unstructured":"Yiren Chen Yaming Yang Hong Sun Yujing Wang Yu Xu Wei Shen Rong Zhou Yunhai Tong Jing Bai and Ruofei Zhang. 2020. AutoADR: Automatic Model Design for Ad Relevance. In CIKM.  Yiren Chen Yaming Yang Hong Sun Yujing Wang Yu Xu Wei Shen Rong Zhou Yunhai Tong Jing Bai and Ruofei Zhang. 2020. AutoADR: Automatic Model Design for Ad Relevance. In CIKM."},{"key":"e_1_3_2_2_8_1","volume-title":"BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In NAACL-HLT.","author":"Devlin Jacob","year":"2019","unstructured":"Jacob Devlin , Ming-Wei Chang , Kenton Lee , and Kristina Toutanova . 2019 . BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In NAACL-HLT. Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. 
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In NAACL-HLT."},{"key":"e_1_3_2_2_9_1","volume-title":"Pablo Samuel Castro, and Erich Elsen","author":"Evci Utku","year":"2021","unstructured":"Utku Evci , Trevor Gale , Jacob Menick , Pablo Samuel Castro, and Erich Elsen . 2021 . Rigging the Lottery : Making All Tickets Winners . Utku Evci, Trevor Gale, Jacob Menick, Pablo Samuel Castro, and Erich Elsen. 2021. Rigging the Lottery: Making All Tickets Winners."},{"key":"e_1_3_2_2_10_1","volume-title":"Compressing BERT: Studying the Effects of Weight Pruning on Transfer Learning. arxiv","author":"Gordon Mitchell A.","year":"2002","unstructured":"Mitchell A. Gordon , Kevin Duh , and Nicholas Andrews . 2020. Compressing BERT: Studying the Effects of Weight Pruning on Transfer Learning. arxiv : 2002 .08307 [cs.CL] Mitchell A. Gordon, Kevin Duh, and Nicholas Andrews. 2020. Compressing BERT: Studying the Effects of Weight Pruning on Transfer Learning. arxiv: 2002.08307 [cs.CL]"},{"key":"e_1_3_2_2_11_1","unstructured":"Zichao Guo Xiangyu Zhang Haoyuan Mu Wen Heng Zechun Liu Yichen Wei and Jian Sun. 2020. Single Path One-Shot Neural Architecture Search with Uniform Sampling.  Zichao Guo Xiangyu Zhang Haoyuan Mu Wen Heng Zechun Liu Yichen Wei and Jian Sun. 2020. Single Path One-Shot Neural Architecture Search with Uniform Sampling."},{"key":"e_1_3_2_2_12_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01234-2_48"},{"key":"e_1_3_2_2_13_1","doi-asserted-by":"crossref","unstructured":"J. Weston Hughes Keng-hao Chang and Ruofei Zhang. 2019. Generating Better Search Engine Text Advertisements with Deep Reinforcement Learning. In KDD.  J. Weston Hughes Keng-hao Chang and Ruofei Zhang. 2019. Generating Better Search Engine Text Advertisements with Deep Reinforcement Learning. 
In KDD.","DOI":"10.1145\/3292500.3330754"},{"key":"e_1_3_2_2_14_1","doi-asserted-by":"crossref","unstructured":"Xiaoqi Jiao Yichun Yin Lifeng Shang Xin Jiang Xiao Chen Linlin Li Fang Wang and Qun Liu. 2020. TinyBERT: Distilling BERT for Natural Language Understanding.  Xiaoqi Jiao Yichun Yin Lifeng Shang Xin Jiang Xiao Chen Linlin Li Fang Wang and Qun Liu. 2020. TinyBERT: Distilling BERT for Natural Language Understanding.","DOI":"10.18653\/v1\/2020.findings-emnlp.372"},{"key":"e_1_3_2_2_15_1","volume-title":"I-bert: Integer-only bert quantization. arXiv preprint arXiv:2101.01321","author":"Kim Sehoon","year":"2021","unstructured":"Sehoon Kim , Amir Gholami , Zhewei Yao , Michael W Mahoney , and Kurt Keutzer . 2021 . I-bert: Integer-only bert quantization. arXiv preprint arXiv:2101.01321 (2021). Sehoon Kim, Amir Gholami, Zhewei Yao, Michael W Mahoney, and Kurt Keutzer. 2021. I-bert: Integer-only bert quantization. arXiv preprint arXiv:2101.01321 (2021)."},{"key":"e_1_3_2_2_16_1","volume-title":"Fastformers: Highly efficient transformer models for natural language understanding. arXiv preprint arXiv:2010.13382","author":"Kim Young Jin","year":"2020","unstructured":"Young Jin Kim and Hany Hassan Awadalla . 2020 . Fastformers: Highly efficient transformer models for natural language understanding. arXiv preprint arXiv:2010.13382 (2020). Young Jin Kim and Hany Hassan Awadalla. 2020. Fastformers: Highly efficient transformer models for natural language understanding. arXiv preprint arXiv:2010.13382 (2020)."},{"key":"e_1_3_2_2_17_1","volume-title":"Rush","author":"Lagunas Francois","year":"2021","unstructured":"Francois Lagunas , Ella Charlaix , Victor Sanh , and Alexander M . Rush . 2021 . Block Pruning For Faster Transformers. In EMNLP. Francois Lagunas, Ella Charlaix, Victor Sanh, and Alexander M. Rush. 2021. Block Pruning For Faster Transformers. In EMNLP."},{"key":"e_1_3_2_2_18_1","volume-title":"Layer-adaptive Sparsity for the Magnitude-based Pruning. 
In International Conference on Learning Representations.","author":"Lee Jaeho","year":"2021","unstructured":"Jaeho Lee , Sejun Park , Sangwoo Mo , Sungsoo Ahn , and Jinwoo Shin . 2021 . Layer-adaptive Sparsity for the Magnitude-based Pruning. In International Conference on Learning Representations. Jaeho Lee, Sejun Park, Sangwoo Mo, Sungsoo Ahn, and Jinwoo Shin. 2021. Layer-adaptive Sparsity for the Magnitude-based Pruning. In International Conference on Learning Representations."},{"key":"e_1_3_2_2_19_1","doi-asserted-by":"crossref","unstructured":"Breiman Leo. 2001. Random Forests. In Machine Learning. 5--32.  Breiman Leo. 2001. Random Forests. In Machine Learning. 5--32.","DOI":"10.1023\/A:1010933404324"},{"key":"e_1_3_2_2_20_1","volume-title":"Tim Kwang-Ting Cheng, and Jian Sun","author":"Liu Zechun","year":"2019","unstructured":"Zechun Liu , Haoyuan Mu , Xiangyu Zhang , Zichao Guo , Xin Yang , Tim Kwang-Ting Cheng, and Jian Sun . 2019 . MetaPruning: Meta Learning for Automatic Neural Network Channel Pruning. In ICCV. Zechun Liu, Haoyuan Mu, Xiangyu Zhang, Zichao Guo, Xin Yang, Tim Kwang-Ting Cheng, and Jian Sun. 2019. MetaPruning: Meta Learning for Automatic Neural Network Channel Pruning. In ICCV."},{"key":"e_1_3_2_2_21_1","unstructured":"Wenhao Lu Jian Jiao and Ruofei Zhang. 2020. TwinBERT: Distilling Knowledge to Twin-Structured Compressed BERT Models for Large-Scale Retrieval. In CIKM.  Wenhao Lu Jian Jiao and Ruofei Zhang. 2020. TwinBERT: Distilling Knowledge to Twin-Structured Compressed BERT Models for Large-Scale Retrieval. In CIKM."},{"key":"e_1_3_2_2_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/3446640"},{"key":"e_1_3_2_2_23_1","volume-title":"Structured pruning of a bert-based question answering model. arXiv preprint arXiv:1910.06360","author":"McCarley JS","year":"2019","unstructured":"JS McCarley , Rishav Chakravarti , and Avirup Sil . 2019. Structured pruning of a bert-based question answering model. 
arXiv preprint arXiv:1910.06360 ( 2019 ). JS McCarley, Rishav Chakravarti, and Avirup Sil. 2019. Structured pruning of a bert-based question answering model. arXiv preprint arXiv:1910.06360 (2019)."},{"key":"e_1_3_2_2_24_1","unstructured":"Paul Michel Omer Levy and Graham Neubig. 2019. Are sixteen heads really better than one?. In NeurIPS.  Paul Michel Omer Levy and Graham Neubig. 2019. Are sixteen heads really better than one?. In NeurIPS."},{"key":"e_1_3_2_2_25_1","doi-asserted-by":"crossref","unstructured":"Bert Moons Parham Noorzad Andrii Skliar Giovanni Mariani Dushyant Mehta Chris Lott and Tijmen Blankevoort. 2021. Distilling Optimal Neural Networks: Rapid Search in Diverse Spaces. In ICCV.  Bert Moons Parham Noorzad Andrii Skliar Giovanni Mariani Dushyant Mehta Chris Lott and Tijmen Blankevoort. 2021. Distilling Optimal Neural Networks: Rapid Search in Diverse Spaces. In ICCV.","DOI":"10.1109\/ICCV48922.2021.01201"},{"key":"e_1_3_2_2_26_1","unstructured":"Hieu Pham Melody Y. Guan Barret Zoph Quoc V. Le and Jeff Dean. 2018. Efficient Neural Architecture Search via Parameter Sharing.  Hieu Pham Melody Y. Guan Barret Zoph Quoc V. Le and Jeff Dean. 2018. Efficient Neural Architecture Search via Parameter Sharing."},{"key":"e_1_3_2_2_27_1","doi-asserted-by":"crossref","unstructured":"Esteban Real Alok Aggarwal Yanping Huang and Quoc V Le. 2019. Regularized Evolution for Image Classifier Architecture Search. In AAAI.  Esteban Real Alok Aggarwal Yanping Huang and Quoc V Le. 2019. Regularized Evolution for Image Classifier Architecture Search. In AAAI.","DOI":"10.1609\/aaai.v33i01.33014780"},{"key":"e_1_3_2_2_28_1","volume-title":"a distilled version of BERT: smaller, faster, cheaper and lighter. arxiv","author":"Sanh Victor","year":"1910","unstructured":"Victor Sanh , Lysandre Debut , Julien Chaumond , and Thomas Wolf . 2020a. DistilBERT , a distilled version of BERT: smaller, faster, cheaper and lighter. 
arxiv : 1910 .01108 [cs.CL] Victor Sanh, Lysandre Debut, Julien Chaumond, and Thomas Wolf. 2020a. DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter. arxiv: 1910.01108 [cs.CL]"},{"key":"e_1_3_2_2_29_1","unstructured":"Victor Sanh Thomas Wolf and Alexander M Rush. 2020b. Movement pruning: Adaptive sparsity by fine-tuning. In NeurIPS.  Victor Sanh Thomas Wolf and Alexander M Rush. 2020b. Movement pruning: Adaptive sparsity by fine-tuning. In NeurIPS."},{"key":"e_1_3_2_2_30_1","unstructured":"Jasper Snoek Hugo Larochelle and Ryan P Adams. 2012. Practical Bayesian Optimization of Machine Learning Algorithms. In Advances in Neural Information Processing Systems.  Jasper Snoek Hugo Larochelle and Ryan P Adams. 2012. Practical Bayesian Optimization of Machine Learning Algorithms. In Advances in Neural Information Processing Systems."},{"key":"e_1_3_2_2_31_1","volume-title":"N: M sparse schemes from dense neural networks. Advances in Neural Information Processing Systems","author":"Sun Wei","year":"2021","unstructured":"Wei Sun , Aojun Zhou , Sander Stuijk , Rob Wijnhoven , Andrew O Nelson , Henk Corp oraal, 2021 . DominoSearch : Find layer-wise fine-grained N: M sparse schemes from dense neural networks. Advances in Neural Information Processing Systems , Vol. 34 (2021). Wei Sun, Aojun Zhou, Sander Stuijk, Rob Wijnhoven, Andrew O Nelson, Henk Corporaal, et al. 2021. DominoSearch: Find layer-wise fine-grained N: M sparse schemes from dense neural networks. Advances in Neural Information Processing Systems, Vol. 34 (2021)."},{"key":"e_1_3_2_2_32_1","unstructured":"Zhiqing Sun Hongkun Yu Xiaodan Song Renjie Liu Yiming Yang and Denny Zhou. 2020. MobileBERT: a Compact Task-Agnostic BERT for Resource-Limited Devices. In ACL.  Zhiqing Sun Hongkun Yu Xiaodan Song Renjie Liu Yiming Yang and Denny Zhou. 2020. MobileBERT: a Compact Task-Agnostic BERT for Resource-Limited Devices. 
In ACL."},{"key":"e_1_3_2_2_33_1","volume-title":"Well-Read Students Learn Better: The Impact of Student Initialization on Knowledge Distillation. CoRR","author":"Turc Iulia","year":"1908","unstructured":"Iulia Turc , Ming-Wei Chang , Kenton Lee , and Kristina Toutanova . 2019. Well-Read Students Learn Better: The Impact of Student Initialization on Knowledge Distillation. CoRR , Vol. abs\/ 1908 .08962. showeprint[arXiv] 1908 .08962 Iulia Turc, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. Well-Read Students Learn Better: The Impact of Student Initialization on Knowledge Distillation. CoRR, Vol. abs\/1908.08962. showeprint[arXiv]1908.08962"},{"key":"e_1_3_2_2_34_1","unstructured":"Ashish Vaswani Noam Shazeer Niki Parmar Jakob Uszkoreit Llion Jones Aidan N. Gomez and Lukasz Kaiser. 2017. Attention is All You Need. In NIPS.  Ashish Vaswani Noam Shazeer Niki Parmar Jakob Uszkoreit Llion Jones Aidan N. Gomez and Lukasz Kaiser. 2017. Attention is All You Need. In NIPS."},{"key":"e_1_3_2_2_35_1","doi-asserted-by":"crossref","unstructured":"Elena Voita David Talbot Fedor Moiseev Rico Sennrich and Ivan Titov. 2019. Analyzing multi-head self-attention: Specialized heads do the heavy lifting the rest can be pruned. In ACL.  Elena Voita David Talbot Fedor Moiseev Rico Sennrich and Ivan Titov. 2019. Analyzing multi-head self-attention: Specialized heads do the heavy lifting the rest can be pruned. In ACL.","DOI":"10.18653\/v1\/P19-1580"},{"key":"e_1_3_2_2_36_1","volume-title":"Bowman","author":"Wang Alex","year":"2020","unstructured":"Alex Wang , Yada Pruksachatkun , Nikita Nangia , Amanpreet Singh , Julian Michael , Felix Hill , Omer Levy , and Samuel R . Bowman . 2020 . SuperGLUE: A Stickier Benchmark for General-Purpose Language Understanding Systems . Alex Wang, Yada Pruksachatkun, Nikita Nangia, Amanpreet Singh, Julian Michael, Felix Hill, Omer Levy, and Samuel R. Bowman. 2020. 
SuperGLUE: A Stickier Benchmark for General-Purpose Language Understanding Systems."},{"key":"e_1_3_2_2_37_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.00635"},{"key":"e_1_3_2_2_38_1","doi-asserted-by":"crossref","unstructured":"Xiting Wang Xinwei Gu Jie Cao Zhihua Zhao Yulan Yan Bhuvan Middha and Xing Xie. 2021a. Reinforcing Pretrained Models for Generating Attractive Text Advertisements. In KDD.  Xiting Wang Xinwei Gu Jie Cao Zhihua Zhao Yulan Yan Bhuvan Middha and Xing Xie. 2021a. Reinforcing Pretrained Models for Generating Attractive Text Advertisements. In KDD.","DOI":"10.1145\/3447548.3467105"},{"key":"e_1_3_2_2_39_1","doi-asserted-by":"crossref","unstructured":"Ronald J. Williams. 1992. Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning. (1992).  Ronald J. Williams. 1992. Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning. (1992).","DOI":"10.1007\/978-1-4615-3618-5_2"},{"key":"e_1_3_2_2_40_1","unstructured":"Mengzhou Xia Zexuan Zhong and Danqi Chen. 2022. Structured Pruning Learns Compact and Accurate Models. In ACL.  Mengzhou Xia Zexuan Zhong and Danqi Chen. 2022. Structured Pruning Learns Compact and Accurate Models. In ACL."},{"key":"e_1_3_2_2_41_1","doi-asserted-by":"crossref","unstructured":"Jin Xu Xu Tan Renqian Luo Kaitao Song Jian Li Tao Qin and Tie-Yan Liu. 2021. NAS-BERT: Task-Agnostic and Adaptive-Size BERT Compression with Neural Architecture Search. In kdd.  Jin Xu Xu Tan Renqian Luo Kaitao Song Jian Li Tao Qin and Tie-Yan Liu. 2021. NAS-BERT: Task-Agnostic and Adaptive-Size BERT Compression with Neural Architecture Search. In kdd.","DOI":"10.1145\/3447548.3467262"},{"key":"e_1_3_2_2_42_1","volume-title":"Mahoney","author":"Yao Zhewei","year":"2021","unstructured":"Zhewei Yao , Linjian Ma , Sheng Shen , Kurt Keutzer , and Michael W . Mahoney . 2021 . MLPruning: A Multilevel Structured Pruning Framework for Transformer-based Models . 
arxiv: 2105.14636 [cs.CL] Zhewei Yao, Linjian Ma, Sheng Shen, Kurt Keutzer, and Michael W. Mahoney. 2021. MLPruning: A Multilevel Structured Pruning Framework for Transformer-based Models. arxiv: 2105.14636 [cs.CL]"},{"key":"e_1_3_2_2_43_1","volume-title":"Ranking Relevance in Yahoo Search (KDD '16)","author":"Yin Dawei","year":"2016","unstructured":"Dawei Yin , Yuening Hu , Jiliang Tang , Tim Daly , Mianwei Zhou , Hua Ouyang , Jianhui Chen , Changsung Kang , Hongbo Deng , Chikashi Nobata , Jean-Marc Langlois , and Yi Chang . 2016 . Ranking Relevance in Yahoo Search (KDD '16) . Association for Computing Machinery, 323--332. Dawei Yin, Yuening Hu, Jiliang Tang, Tim Daly, Mianwei Zhou, Hua Ouyang, Jianhui Chen, Changsung Kang, Hongbo Deng, Chikashi Nobata, Jean-Marc Langlois, and Yi Chang. 2016. Ranking Relevance in Yahoo Search (KDD '16). Association for Computing Machinery, 323--332."},{"key":"e_1_3_2_2_44_1","volume-title":"Q8bert: Quantized 8bit bert. arXiv preprint arXiv:1910.06188","author":"Zafrir Ofir","year":"2019","unstructured":"Ofir Zafrir , Guy Boudoukh , Peter Izsak , and Moshe Wasserblat . 2019. Q8bert: Quantized 8bit bert. arXiv preprint arXiv:1910.06188 ( 2019 ). Ofir Zafrir, Guy Boudoukh, Peter Izsak, and Moshe Wasserblat. 2019. Q8bert: Quantized 8bit bert. arXiv preprint arXiv:1910.06188 (2019)."},{"key":"e_1_3_2_2_45_1","volume-title":"Le","author":"Zoph Barret","year":"2017","unstructured":"Barret Zoph and Quoc V . Le . 2017 . Neural Architecture Search with Reinforcement Learning . Barret Zoph and Quoc V. Le. 2017. 
Neural Architecture Search with Reinforcement Learning."}],"event":{"name":"CIKM '22: The 31st ACM International Conference on Information and Knowledge Management","sponsor":["SIGWEB ACM Special Interest Group on Hypertext, Hypermedia, and Web","SIGIR ACM Special Interest Group on Information Retrieval"],"location":"Atlanta GA USA","acronym":"CIKM '22"},"container-title":["Proceedings of the 31st ACM International Conference on Information & Knowledge Management"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3511808.3557139","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3511808.3557139","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T19:30:57Z","timestamp":1750188657000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3511808.3557139"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,10,17]]},"references-count":45,"alternative-id":["10.1145\/3511808.3557139","10.1145\/3511808"],"URL":"https:\/\/doi.org\/10.1145\/3511808.3557139","relation":{},"subject":[],"published":{"date-parts":[[2022,10,17]]},"assertion":[{"value":"2022-10-17","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}