{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:28:21Z","timestamp":1750220901117,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":31,"publisher":"ACM","license":[{"start":{"date-parts":[[2019,12,6]],"date-time":"2019-12-06T00:00:00Z","timestamp":1575590400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2019,12,6]]},"DOI":"10.1145\/3374587.3374641","type":"proceedings-article","created":{"date-parts":[[2020,3,4]],"date-time":"2020-03-04T18:16:31Z","timestamp":1583345791000},"page":"122-127","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["Research on the Current Status of Sparse Neural Network Acceleration Processing Technology in Deep Learning"],"prefix":"10.1145","author":[{"given":"Zhichen","family":"Wang","sequence":"first","affiliation":[{"name":"Jiangnan Institute of Computing Technology, Wuxi, China, Officers College of Pap, Chengdu, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Hongliang","family":"Li","sequence":"additional","affiliation":[{"name":"Jiangnan Institute of Computing Technology, Wuxi, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2020,3,4]]},"reference":[{"key":"e_1_3_2_1_1_1","first-page":"2074","volume-title":"Advances in Neural Information Processing Systems","author":"Wen Wei","year":"2016","unstructured":"Wei Wen , Chunpeng Wu , Yandan Wang , Yiran Chen , and Hai Li . Learning structured sparsity in deep neural networks . In Advances in Neural Information Processing Systems , pages 2074 -- 2082 , 2016 . Wei Wen, Chunpeng Wu, Yandan Wang, Yiran Chen, and Hai Li. Learning structured sparsity in deep neural networks. In Advances in Neural Information Processing Systems, pages 2074--2082, 2016."},{"key":"e_1_3_2_1_2_1","volume-title":"Gpu kernels for block-sparse weights. https:\/\/s3-us-west-2.amazonaws.com\/ openaiassets\/blocksparse\/blocksparsepaper.pdf","author":"Gray Scott","year":"2017","unstructured":"Scott Gray , Alec Radford , and Diederik Kingma . Gpu kernels for block-sparse weights. https:\/\/s3-us-west-2.amazonaws.com\/ openaiassets\/blocksparse\/blocksparsepaper.pdf , 2017 . [Online; accessed 12-January-2018] Scott Gray, Alec Radford, and Diederik Kingma. Gpu kernels for block-sparse weights. https:\/\/s3-us-west-2.amazonaws.com\/ openaiassets\/blocksparse\/blocksparsepaper.pdf, 2017. [Online; accessed 12-January-2018]"},{"key":"e_1_3_2_1_3_1","unstructured":"Yihui He Xiangyu Zhang and Jian Sun. Channel pruning for accelerating very deep neural networks.  Yihui He Xiangyu Zhang and Jian Sun. Channel pruning for accelerating very deep neural networks."},{"key":"e_1_3_2_1_4_1","volume-title":"Thinet: A filter level pruning method for deep neural network compression. arXiv preprint arXiv:1707.06342","author":"Luo Jian-Hao","year":"2017","unstructured":"Jian-Hao Luo , Jianxin Wu , and Weiyao Lin . Thinet: A filter level pruning method for deep neural network compression. arXiv preprint arXiv:1707.06342 , 2017 . Jian-Hao Luo, Jianxin Wu, and Weiyao Lin. Thinet: A filter level pruning method for deep neural network compression. arXiv preprint arXiv:1707.06342, 2017."},{"key":"e_1_3_2_1_5_1","volume-title":"Blocksparse recurrent neural networks. CoRR, abs\/1711.02782","author":"Narang Sharan","year":"2017","unstructured":"Sharan Narang , Eric Undersander , and Gregory F. Diamos . Blocksparse recurrent neural networks. CoRR, abs\/1711.02782 , 2017 . Sharan Narang, Eric Undersander, and Gregory F. Diamos. Blocksparse recurrent neural networks. CoRR, abs\/1711.02782, 2017."},{"key":"e_1_3_2_1_6_1","volume-title":"Condensenet: An efficient densenet using learned group convolutions. arXiv preprint arXiv:1711.09224","author":"Huang Gao","year":"2017","unstructured":"Gao Huang , Shichen Liu , Laurens van der Maaten , and Kilian Q Weinberger . Condensenet: An efficient densenet using learned group convolutions. arXiv preprint arXiv:1711.09224 , 2017 . Gao Huang, Shichen Liu, Laurens van der Maaten, and Kilian Q Weinberger. Condensenet: An efficient densenet using learned group convolutions. arXiv preprint arXiv:1711.09224, 2017."},{"volume-title":"ISCA '16:Proceedings of the 43rd International Symposium on Computer Architecture Seoul, Republic of Korea 2016 P1--13 1063-6897","author":"Cnvlutin A.","key":"e_1_3_2_1_7_1","unstructured":"Albericio, J; Judd, P; Hetherington, T; Aamodt, T; Jerger, NE; Moshovos, A. Cnvlutin : ineffectual-neuron-free deep neural network computing . ISCA '16:Proceedings of the 43rd International Symposium on Computer Architecture Seoul, Republic of Korea 2016 P1--13 1063-6897 Albericio, J; Judd, P; Hetherington, T; Aamodt, T; Jerger, NE; Moshovos, A. Cnvlutin: ineffectual-neuron-free deep neural network computing. ISCA '16:Proceedings of the 43rd International Symposium on Computer Architecture Seoul, Republic of Korea 2016 P1--13 1063-6897"},{"key":"e_1_3_2_1_8_1","volume-title":"Taiwan","author":"Zhang Shijin","year":"2016","unstructured":"Shijin Zhang , Zidong Du , Lei Zhang , Huiying Lan , Shaoli Liu , Ling Li , Qi Guo , Tianshi Chen , Yunji Chen . Cambricon-X: an accelerator for sparse neural networks MICRO-49:The 49th Annual IEEE\/ACM International Symposium on Microarchitecture Taipei , Taiwan 2016 . DOI= http:\/\/10.1109\/MICRO.2016.7783723 Shijin Zhang, Zidong Du, Lei Zhang, Huiying Lan, Shaoli Liu, Ling Li, Qi Guo, Tianshi Chen, Yunji Chen. Cambricon-X: an accelerator for sparse neural networks MICRO-49:The 49th Annual IEEE\/ACM International Symposium on Microarchitecture Taipei, Taiwan 2016. DOI= http:\/\/10.1109\/MICRO.2016.7783723"},{"volume-title":"Field Programmable Logic and Applications (FPL), 2016 26th International Conference on Lausanne, Switzerland 2016","author":"Li Huimin","key":"e_1_3_2_1_9_1","unstructured":"Huimin Li , Xitian Fan , Li Jiao , Wei Cao , Xuegong Zhou , Lingli Wang . A high performance FPGA-based accelerator for large-scale convolutional neural networks . Field Programmable Logic and Applications (FPL), 2016 26th International Conference on Lausanne, Switzerland 2016 Huimin Li, Xitian Fan, Li Jiao, Wei Cao, Xuegong Zhou, Lingli Wang. A high performance FPGA-based accelerator for large-scale convolutional neural networks. Field Programmable Logic and Applications (FPL), 2016 26th International Conference on Lausanne, Switzerland 2016"},{"key":"e_1_3_2_1_10_1","volume-title":"ACM\/SIGDA Thirteenth ACM International Symposium on Field Programmable Gate Arrays - FPGA2005 Monterey, CA, United States 2005","author":"Viktor","year":"2007","unstructured":"Zhuo, Ling, Prasanna, Viktor K. Sparse matrix-vector multiplication on FPGAs . ACM\/SIGDA Thirteenth ACM International Symposium on Field Programmable Gate Arrays - FPGA2005 Monterey, CA, United States 2005 . DOI= http:\/\/10.1109\/FCCM. 2007 .56 Zhuo, Ling, Prasanna, Viktor K. Sparse matrix-vector multiplication on FPGAs. ACM\/SIGDA Thirteenth ACM International Symposium on Field Programmable Gate Arrays - FPGA2005 Monterey, CA, United States 2005. DOI= http:\/\/10.1109\/FCCM.2007.56"},{"volume-title":"High-efficiency convolutional ternary neural networks with custom adder trees and weight compression. ACM Transactions on Reconfigurable Technology and Systems 2018","author":"Prost-Boucle Adrien","key":"e_1_3_2_1_11_1","unstructured":"Adrien Prost-Boucle , Alban Bourge . High-efficiency convolutional ternary neural networks with custom adder trees and weight compression. ACM Transactions on Reconfigurable Technology and Systems 2018 Vol. 11 No. 3-1936-7406; 1936--7414. Adrien Prost-Boucle, Alban Bourge. High-efficiency convolutional ternary neural networks with custom adder trees and weight compression. ACM Transactions on Reconfigurable Technology and Systems 2018 Vol.11 No. 3-1936-7406; 1936--7414."},{"key":"e_1_3_2_1_12_1","unstructured":"Zhouhan Lin Matthieu Courbariaux Roland Memisevic Yoshua Bengio. Neural networks with few multiplications. Computer Science 2015  Zhouhan Lin Matthieu Courbariaux Roland Memisevic Yoshua Bengio. Neural networks with few multiplications. Computer Science 2015"},{"key":"e_1_3_2_1_13_1","volume-title":"Massachusetts","author":"Ding Caiwen","year":"2017","unstructured":"Caiwen Ding , Siyu Liao , Yanzhi Wang , Zhe Li , Ning et al. CirCNN: accelerating and compressing deep neural networks using block-circulant weight matrices. MICRO-50:Proceedings of the 50th Annual IEEE\/ACM International Symposium on Microarchitecture Cambridge , Massachusetts 2017 . DOI= http:\/\/10.1145\/3123939.3124552 Caiwen Ding, Siyu Liao, Yanzhi Wang, Zhe Li, Ning et al. CirCNN: accelerating and compressing deep neural networks using block-circulant weight matrices. MICRO-50:Proceedings of the 50th Annual IEEE\/ACM International Symposium on Microarchitecture Cambridge, Massachusetts 2017. DOI= http:\/\/10.1145\/3123939.3124552"},{"volume-title":"ASPLOS '19 Proceedings of the Twenty-Fourth International Conference on Architectural Support for Programming Languages and Operating Systems Providence, RI, USA 2019","author":"Kung H.T.","key":"e_1_3_2_1_14_1","unstructured":"H.T. Kung , Bradley McDanel , Sai Qian Zhang . Packing sparse convolutional neural networks for efficient systolic array implementations: column combining under joint optimization . ASPLOS '19 Proceedings of the Twenty-Fourth International Conference on Architectural Support for Programming Languages and Operating Systems Providence, RI, USA 2019 H.T. Kung, Bradley McDanel, Sai Qian Zhang. Packing sparse convolutional neural networks for efficient systolic array implementations: column combining under joint optimization. ASPLOS '19 Proceedings of the Twenty-Fourth International Conference on Architectural Support for Programming Languages and Operating Systems Providence, RI, USA 2019"},{"volume-title":"Computer-Aided Design (ICCAD), 2016 IEEE\/ACM International Conference on Austin, TX, USA 2016","author":"JS.","key":"e_1_3_2_1_15_1","unstructured":"Kadetotad, D., Arunachalam, S., Chakrabarti, C., Seo, JS. Efficient memory compression in deep neural networks using coarse-grain sparsification for speech applications . Computer-Aided Design (ICCAD), 2016 IEEE\/ACM International Conference on Austin, TX, USA 2016 Kadetotad, D., Arunachalam, S., Chakrabarti, C., Seo, JS. Efficient memory compression in deep neural networks using coarse-grain sparsification for speech applications. Computer-Aided Design (ICCAD), 2016 IEEE\/ACM International Conference on Austin, TX, USA 2016"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISCA.2016.30"},{"key":"e_1_3_2_1_17_1","unstructured":"Song Han Huizi Mao William J. Dally. Deep compression - compressing deep neural networks with pruning trained quantization and huffman coding. Computer Science 2015  Song Han Huizi Mao William J. Dally. Deep compression - compressing deep neural networks with pruning trained quantization and huffman coding. Computer Science 2015"},{"volume-title":"Automation & Test in Europe Conference & Exhibition (DATE) Lausanne, Switzerland 2017","author":"Qiqieh Issa","key":"e_1_3_2_1_18_1","unstructured":"Issa Qiqieh , Rishad Shafik , Ghaith Tarawneh , Danil Sokolov , Alex Yakovlev . Energy-efficient approximate multiplier design using bit significance-driven logic compression. 2017 Design , Automation & Test in Europe Conference & Exhibition (DATE) Lausanne, Switzerland 2017 Issa Qiqieh, Rishad Shafik, Ghaith Tarawneh, Danil Sokolov, Alex Yakovlev. Energy-efficient approximate multiplier design using bit significance-driven logic compression. 2017 Design, Automation & Test in Europe Conference & Exhibition (DATE) Lausanne, Switzerland 2017"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/ASPDAC.2016.7428084"},{"volume-title":"Computer-Aided Design (ICCAD), 2016 IEEE\/ACM International Conference on Austin, TX, USA 2016. DOI= http:\/\/doi.acm.org\/10","author":"Mrazek Vojtech","key":"e_1_3_2_1_20_1","unstructured":"Vojtech Mrazek , Syed Shakib Sarwar , Lukas Sekanina , Zdenek Vasicek , Kaushik Roy . Design of power-efficient approximate multipliers for approximate artificial neural networks . Computer-Aided Design (ICCAD), 2016 IEEE\/ACM International Conference on Austin, TX, USA 2016. DOI= http:\/\/doi.acm.org\/10 .1145\/2966986.2967021 Vojtech Mrazek, Syed Shakib Sarwar, Lukas Sekanina, Zdenek Vasicek, Kaushik Roy. Design of power-efficient approximate multipliers for approximate artificial neural networks. Computer-Aided Design (ICCAD), 2016 IEEE\/ACM International Conference on Austin, TX, USA 2016. DOI= http:\/\/doi.acm.org\/10.1145\/2966986.2967021"},{"volume-title":"August 2016 ISLPED '16: Proceedings of the 2016 International Symposium on Low Power Electronics and Design","author":"Na Taesik","key":"e_1_3_2_1_21_1","unstructured":"Taesik Na , Saibal Mukhopadhyay . Speeding up convolutional neural network training with dynamic precision scaling and flexible multiplier-accumulator . August 2016 ISLPED '16: Proceedings of the 2016 International Symposium on Low Power Electronics and Design Taesik Na, Saibal Mukhopadhyay. Speeding up convolutional neural network training with dynamic precision scaling and flexible multiplier-accumulator. August 2016 ISLPED '16: Proceedings of the 2016 International Symposium on Low Power Electronics and Design"},{"volume-title":"32nd International Conference on Machine Learning, ICML 2015 Lile, France 2015","author":"S.","key":"e_1_3_2_1_22_1","unstructured":"Lederer, C., Altstadt, S., Andriamonje, S. et al. Deep learning with limited numerical precision . 32nd International Conference on Machine Learning, ICML 2015 Lile, France 2015 Lederer, C., Altstadt, S., Andriamonje, S. et al. Deep learning with limited numerical precision. 32nd International Conference on Machine Learning, ICML 2015 Lile, France 2015"},{"key":"e_1_3_2_1_23_1","unstructured":"Matthieu Courbariaux Yoshua Bengio Jean-Pierre David. Training deep neural networks with low precision multiplications. Computer Science 2014  Matthieu Courbariaux Yoshua Bengio Jean-Pierre David. Training deep neural networks with low precision multiplications. Computer Science 2014"},{"volume-title":"33rd International Conference on Machine Learning, ICML 2016 New York City, NY, United states 2016","author":"J.","key":"e_1_3_2_1_24_1","unstructured":"Lederer, C., Altstadt, S., Andriamonje, S., Andrzejewski, J. et al. Fixed point quantization of deep convolutional networks . 33rd International Conference on Machine Learning, ICML 2016 New York City, NY, United states 2016 Lederer, C., Altstadt, S., Andriamonje, S., Andrzejewski, J. et al. Fixed point quantization of deep convolutional networks. 33rd International Conference on Machine Learning, ICML 2016 New York City, NY, United states 2016"},{"volume-title":"28th Annual Conference on Neural Information Processing Systems 2014, NIPS 2014 Montreal, QC, Canada 2014","author":"J","key":"e_1_3_2_1_25_1","unstructured":"Lederer, C., Altstadt, S., Andriamonje, S., Andrzejewski, J et al. Exploiting linear structure within convolutional networks for efficient evaluation . 28th Annual Conference on Neural Information Processing Systems 2014, NIPS 2014 Montreal, QC, Canada 2014 Lederer, C., Altstadt, S., Andriamonje, S., Andrzejewski, J et al. Exploiting linear structure within convolutional networks for efficient evaluation. 28th Annual Conference on Neural Information Processing Systems 2014, NIPS 2014 Montreal, QC, Canada 2014"},{"volume-title":"DAC '17:Proceedings of the 54th Annual Design Automation Conference 2017 Austin, TX, USA 2017","author":"Ko Jong Hwan","key":"e_1_3_2_1_26_1","unstructured":"Jong Hwan Ko , Burhan Mudassar , Taesik Na , Saibal Mukhopadhyay . Design of an energy-efficient accelerator for training of convolutional neural networks using frequency-domain computation . DAC '17:Proceedings of the 54th Annual Design Automation Conference 2017 Austin, TX, USA 2017 Jong Hwan Ko, Burhan Mudassar, Taesik Na, Saibal Mukhopadhyay. Design of an energy-efficient accelerator for training of convolutional neural networks using frequency-domain computation. DAC '17:Proceedings of the 54th Annual Design Automation Conference 2017 Austin, TX, USA 2017"},{"key":"e_1_3_2_1_27_1","unstructured":"N Vasilache J Johnson M Mathieu S Chintala. Fast convolutional nets With fbfft-a GPU performance evaluation. 2014 arXiv preprint arXiv:1412.7580. DOI= http:\/\/hgpu.org\/?p=13280  N Vasilache J Johnson M Mathieu S Chintala. Fast convolutional nets With fbfft-a GPU performance evaluation. 2014 arXiv preprint arXiv:1412.7580. DOI= http:\/\/hgpu.org\/?p=13280"},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.435"},{"key":"e_1_3_2_1_29_1","first-page":"1","article-title":"Design and analysis of a neural network inference engine based on adaptive weight compression","volume":"38","author":"Ko Jong Hwan","year":"2019","unstructured":"Jong Hwan Ko , Duckhwan Kim , Taesik Na , Saibal Mukhopadhyay . Design and analysis of a neural network inference engine based on adaptive weight compression . IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems ( Volume: 38 , Issue: 1 , Jan. 2019 109--121 10.1109\/TCAD.2018.2801228. DOI= http:\/\/10.1109\/TCAD.2018.2801228 Jong Hwan Ko, Duckhwan Kim, Taesik Na, Saibal Mukhopadhyay. Design and analysis of a neural network inference engine based on adaptive weight compression. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (Volume: 38, Issue: 1, Jan. 2019 109--121 10.1109\/TCAD.2018.2801228. DOI= http:\/\/10.1109\/TCAD.2018.2801228","journal-title":"IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems ("},{"key":"e_1_3_2_1_30_1","volume-title":"STATISTICS","author":"Denil Misha","year":"2013","unstructured":"Misha Denil , Babak Shakibi , Laurent Dinh , Marc'Aurelio Ranzato , Nando de Freitas . Predicting parameters in deep learning . STATISTICS 2013 . Misha Denil, Babak Shakibi, Laurent Dinh, Marc'Aurelio Ranzato, Nando de Freitas. Predicting parameters in deep learning. STATISTICS 2013."},{"key":"e_1_3_2_1_31_1","volume-title":"29th Annual Conference on Neural Information Processing Systems, NIPS 2015 Montreal, QC","author":"J.","year":"2015","unstructured":"Lederer, C., Altstadt, S., Andriamonje, S., Andrzejewski, J. , Learning both weights and connections for efficient neural networks . 29th Annual Conference on Neural Information Processing Systems, NIPS 2015 Montreal, QC , Canada 2015 . Lederer, C., Altstadt, S., Andriamonje, S., Andrzejewski, J., Learning both weights and connections for efficient neural networks. 29th Annual Conference on Neural Information Processing Systems, NIPS 2015 Montreal, QC, Canada 2015."}],"event":{"name":"CSAI2019: 2019 3rd International Conference on Computer Science and Artificial Intelligence","sponsor":["Shenzhen University Shenzhen University"],"location":"Normal IL USA","acronym":"CSAI2019"},"container-title":["Proceedings of the 2019 3rd International Conference on Computer Science and Artificial Intelligence"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3374587.3374641","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3374587.3374641","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T23:44:45Z","timestamp":1750203885000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3374587.3374641"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,12,6]]},"references-count":31,"alternative-id":["10.1145\/3374587.3374641","10.1145\/3374587"],"URL":"https:\/\/doi.org\/10.1145\/3374587.3374641","relation":{},"subject":[],"published":{"date-parts":[[2019,12,6]]},"assertion":[{"value":"2020-03-04","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}