{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,6]],"date-time":"2026-03-06T18:40:06Z","timestamp":1772822406983,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":32,"publisher":"ACM","license":[{"start":{"date-parts":[[2020,8,20]],"date-time":"2020-08-20T00:00:00Z","timestamp":1597881600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"National Key Research and Development Program of China","award":["2018AAA0101100"],"award-info":[{"award-number":["2018AAA0101100"]}]},{"name":"National Science Foundation of China (NSFC)","award":["61822201, U1811463, 71531001"],"award-info":[{"award-number":["61822201, U1811463, 71531001"]}]},{"name":"Beijing Municipal Science and Technology Project","award":["Z191100002519012"],"award-info":[{"award-number":["Z191100002519012"]}]},{"name":"Singapore Ministry of Education (MOE) Academic Research Fund (AcRF) Tier 1 grant","award":["19-C220-SMU-012"],"award-info":[{"award-number":["19-C220-SMU-012"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2020,8,23]]},"DOI":"10.1145\/3394486.3403058","type":"proceedings-article","created":{"date-parts":[[2020,8,20]],"date-time":"2020-08-20T23:17:27Z","timestamp":1597965447000},"page":"155-164","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":20,"title":["Rethinking Pruning for Accelerating Deep Inference At the Edge"],"prefix":"10.1145","author":[{"given":"Dawei","family":"Gao","sequence":"first","affiliation":[{"name":"Beihang University, Beijing, China"}]},{"given":"Xiaoxi","family":"He","sequence":"additional","affiliation":[{"name":"ETH Z\u00fcrich, Z\u00fcrich, 
Switzerland"}]},{"given":"Zimu","family":"Zhou","sequence":"additional","affiliation":[{"name":"Singapore Management University, Singapore, Singapore"}]},{"given":"Yongxin","family":"Tong","sequence":"additional","affiliation":[{"name":"Beihang University, Beijing, China"}]},{"given":"Ke","family":"Xu","sequence":"additional","affiliation":[{"name":"Beihang University, Beijing, China"}]},{"given":"Lothar","family":"Thiele","sequence":"additional","affiliation":[{"name":"ETH Z\u00fcrich, Z\u00fcrich, Switzerland"}]}],"member":"320","published-online":{"date-parts":[[2020,8,20]]},"reference":[{"key":"e_1_3_2_2_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/2996864"},{"key":"e_1_3_2_2_2_1","volume-title":"Proceedings of International Conference on Machine Learning. ACM","author":"Dai Bin","year":"2018","unstructured":"Bin Dai , Chen Zhu , Baining Guo , and David Wipf . 2018 . Compressing neural networks using the variational information bottleneck . In Proceedings of International Conference on Machine Learning. ACM , New York, NY, USA, 1143--1152. Bin Dai, Chen Zhu, Baining Guo, and David Wipf. 2018. Compressing neural networks using the variational information bottleneck. In Proceedings of International Conference on Machine Learning. ACM, New York, NY, USA, 1143--1152."},{"key":"e_1_3_2_2_3_1","volume-title":"Proceedings of Advances In Neural Information Processing Systems. Curran Associates Inc.","author":"Denil Misha","year":"2013","unstructured":"Misha Denil , Babak Shakibi , Laurent Dinh , Nando De Freitas , 2013 . Predicting parameters in deep learning . In Proceedings of Advances In Neural Information Processing Systems. Curran Associates Inc. , Red Hook, NY, USA, 2148--2156. Misha Denil, Babak Shakibi, Laurent Dinh, Nando De Freitas, et al. 2013. Predicting parameters in deep learning. In Proceedings of Advances In Neural Information Processing Systems. 
Curran Associates Inc., Red Hook, NY, USA, 2148--2156."},{"key":"e_1_3_2_2_4_1","volume-title":"Proceedings of Advances in Neural Information Processing Systems. Curran Associates Inc.","author":"Dong Xin","year":"2017","unstructured":"Xin Dong , Shangyu Chen , and Sinno Pan . 2017 . Learning to prune deep neural networks via layer-wise optimal brain surgeon . In Proceedings of Advances in Neural Information Processing Systems. Curran Associates Inc. , Red Hook, NY, USA, 4860--4874. Xin Dong, Shangyu Chen, and Sinno Pan. 2017. Learning to prune deep neural networks via layer-wise optimal brain surgeon. In Proceedings of Advances in Neural Information Processing Systems. Curran Associates Inc., Red Hook, NY, USA, 4860--4874."},{"key":"e_1_3_2_2_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.1986.1168654"},{"key":"e_1_3_2_2_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/3081333.3081358"},{"key":"e_1_3_2_2_7_1","volume-title":"Supervised sequence labelling with recurrent neural networks","author":"Graves Alex","unstructured":"Alex Graves . 2012. Supervised sequence labelling . In Supervised sequence labelling with recurrent neural networks . Springer , 5--13. Alex Graves. 2012. Supervised sequence labelling. In Supervised sequence labelling with recurrent neural networks. Springer, 5--13."},{"key":"e_1_3_2_2_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/3007787.3001163"},{"key":"e_1_3_2_2_9_1","volume-title":"Proceedings of International Conference on Learning Representations.","author":"Han Song","year":"2016","unstructured":"Song Han , Huizi Mao , and William J Dally . 2016 b. Deep compression: Compressing deep neural networks with pruning, trained quantization and huffman coding . In Proceedings of International Conference on Learning Representations. Song Han, Huizi Mao, and William J Dally. 2016b. Deep compression: Compressing deep neural networks with pruning, trained quantization and huffman coding. 
In Proceedings of International Conference on Learning Representations."},{"key":"e_1_3_2_2_10_1","unstructured":"Awni Hannun Carl Case Jared Casper Bryan Catanzaro Greg Diamos Erich Elsen Ryan Prenger Sanjeev Satheesh Shubho Sengupta Adam Coates etal 2014. Deep speech: scaling up end-to-end speech recognition. arxiv: 1412.5567  Awni Hannun Carl Case Jared Casper Bryan Catanzaro Greg Diamos Erich Elsen Ryan Prenger Sanjeev Satheesh Shubho Sengupta Adam Coates et al. 2014. Deep speech: scaling up end-to-end speech recognition. arxiv: 1412.5567"},{"key":"e_1_3_2_2_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/MSP.2012.2205597"},{"key":"e_1_3_2_2_12_1","unstructured":"Zhiheng Huang Wei Xu and Kai Yu. 2015. Bidirectional LSTM-CRF Models for Sequence Tagging. arxiv: 1508.01991  Zhiheng Huang Wei Xu and Kai Yu. 2015. Bidirectional LSTM-CRF Models for Sequence Tagging. arxiv: 1508.01991"},{"key":"e_1_3_2_2_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/3037697.3037698"},{"key":"e_1_3_2_2_14_1","volume-title":"Proceedings of International Conference on Learning Representations.","author":"Diederik","unstructured":"Diederik P. Kingma and Jimmy Ba. 2015. Adam: A Method for Stochastic Optimization . In Proceedings of International Conference on Learning Representations. Diederik P. Kingma and Jimmy Ba. 2015. Adam: A Method for Stochastic Optimization. In Proceedings of International Conference on Learning Representations."},{"key":"e_1_3_2_2_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/IPSN.2016.7460664"},{"key":"e_1_3_2_2_16_1","volume-title":"Proceedings of International Conference on Learning Representations.","author":"Li Hao","year":"2017","unstructured":"Hao Li , Asim Kadav , Igor Durdanovic , Hanan Samet , and Hans Peter Graf . 2017 . Pruning filters for efficient convnets . In Proceedings of International Conference on Learning Representations. Hao Li, Asim Kadav, Igor Durdanovic, Hanan Samet, and Hans Peter Graf. 2017. 
Pruning filters for efficient convnets. In Proceedings of International Conference on Learning Representations."},{"key":"e_1_3_2_2_17_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.sysarc.2019.01.011"},{"key":"e_1_3_2_2_18_1","doi-asserted-by":"publisher","DOI":"10.1006\/csla.2001.0184"},{"key":"e_1_3_2_2_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2015.7178964"},{"key":"e_1_3_2_2_20_1","volume-title":"Proceedings of Conference on Empirical Methods in Natural Language Processing. 1532--1543","author":"Pennington Jeffrey","unstructured":"Jeffrey Pennington , Richard Socher , and Christopher D. Manning . 2014. Glove: Global Vectors for Word Representation . In Proceedings of Conference on Empirical Methods in Natural Language Processing. 1532--1543 . Jeffrey Pennington, Richard Socher, and Christopher D. Manning. 2014. Glove: Global Vectors for Word Representation. In Proceedings of Conference on Empirical Methods in Natural Language Processing. 1532--1543."},{"key":"e_1_3_2_2_21_1","volume-title":"Proceedings of Workshop on Automatic Speech Recognition and Understanding. IEEE Press","author":"Povey Daniel","year":"2011","unstructured":"Daniel Povey , Arnab Ghoshal , Gilles Boulianne , Lukas Burget , Ondrej Glembek , Nagendra Goel , Mirko Hannemann , Petr Motlicek , Yanmin Qian , Petr Schwarz , Jan Silovsky , Georg Stemmer , and Karel Vesely . 2011 . The Kaldi Speech Recognition Toolkit . In Proceedings of Workshop on Automatic Speech Recognition and Understanding. IEEE Press , Piscataway, NJ, USA. Daniel Povey, Arnab Ghoshal, Gilles Boulianne, Lukas Burget, Ondrej Glembek, Nagendra Goel, Mirko Hannemann, Petr Motlicek, Yanmin Qian, Petr Schwarz, Jan Silovsky, Georg Stemmer, and Karel Vesely. 2011. The Kaldi Speech Recognition Toolkit. In Proceedings of Workshop on Automatic Speech Recognition and Understanding. 
IEEE Press, Piscataway, NJ, USA."},{"key":"e_1_3_2_2_22_1","doi-asserted-by":"publisher","DOI":"10.21437\/Interspeech.2017-233"},{"key":"e_1_3_2_2_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2019.8683713"},{"key":"e_1_3_2_2_24_1","volume-title":"Proceedings of Conference on Natural Language Learning at HLT-NAACL. ACL","author":"Erik","unstructured":"Erik F. Tjong Kim Sang and Fien De Meulder. 2003. Introduction to the CoNLL-2003 Shared Task: Language-Independent Named Entity Recognition . In Proceedings of Conference on Natural Language Learning at HLT-NAACL. ACL , Stroudsburg, PA, USA, 142--147. Erik F. Tjong Kim Sang and Fien De Meulder. 2003. Introduction to the CoNLL-2003 Shared Task: Language-Independent Named Entity Recognition. In Proceedings of Conference on Natural Language Learning at HLT-NAACL. ACL, Stroudsburg, PA, USA, 142--147."},{"key":"e_1_3_2_2_25_1","volume-title":"Proceedings of Advances in Neural Information Processing Systems. Curran Associates Inc.","author":"Sutskever Ilya","year":"2014","unstructured":"Ilya Sutskever , Oriol Vinyals , and Quoc V Le . 2014 . Sequence to sequence learning with neural networks . In Proceedings of Advances in Neural Information Processing Systems. Curran Associates Inc. , Red Hook, NY, USA, 3104--3112. Ilya Sutskever, Oriol Vinyals, and Quoc V Le. 2014. Sequence to sequence learning with neural networks. In Proceedings of Advances in Neural Information Processing Systems. Curran Associates Inc., Red Hook, NY, USA, 3104--3112."},{"key":"e_1_3_2_2_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/JPROC.2017.2761740"},{"key":"e_1_3_2_2_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.515"},{"key":"e_1_3_2_2_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2017.7953159"},{"key":"e_1_3_2_2_29_1","volume-title":"Proceedings of International Conference on Computational Linguistics. ACL","author":"Yadav Vikas","year":"2018","unstructured":"Vikas Yadav and Steven Bethard . 2018 . 
A Survey on Recent Advances in Named Entity Recognition from Deep Learning models . In Proceedings of International Conference on Computational Linguistics. ACL , Santa Fe, NM, USA, 2145--2158. Vikas Yadav and Steven Bethard. 2018. A Survey on Recent Advances in Named Entity Recognition from Deep Learning models. In Proceedings of International Conference on Computational Linguistics. ACL, Santa Fe, NM, USA, 2145--2158."},{"key":"e_1_3_2_2_30_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISCA.2018.00071"},{"key":"e_1_3_2_2_31_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2018.8461404"},{"key":"e_1_3_2_2_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/JPROC.2019.2918951"}],"event":{"name":"KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining","location":"Virtual Event CA USA","acronym":"KDD '20","sponsor":["SIGMOD ACM Special Interest Group on Management of Data","SIGKDD ACM Special Interest Group on Knowledge Discovery in Data"]},"container-title":["Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data 
Mining"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3394486.3403058","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3394486.3403058","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T22:41:38Z","timestamp":1750200098000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3394486.3403058"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,8,20]]},"references-count":32,"alternative-id":["10.1145\/3394486.3403058","10.1145\/3394486"],"URL":"https:\/\/doi.org\/10.1145\/3394486.3403058","relation":{},"subject":[],"published":{"date-parts":[[2020,8,20]]},"assertion":[{"value":"2020-08-20","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}