{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,11]],"date-time":"2026-03-11T01:54:49Z","timestamp":1773194089841,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":66,"publisher":"ACM","license":[{"start":{"date-parts":[[2020,3,9]],"date-time":"2020-03-09T00:00:00Z","timestamp":1583712000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["CNS-1739748,CCF-1937500,CCF-1919117,CCF-1901378,CCF-1919289"],"award-info":[{"award-number":["CNS-1739748,CCF-1937500,CCF-1919117,CCF-1901378,CCF-1919289"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2020,3,9]]},"DOI":"10.1145\/3373376.3378534","type":"proceedings-article","created":{"date-parts":[[2020,3,13]],"date-time":"2020-03-13T22:37:01Z","timestamp":1584139021000},"page":"907-922","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":202,"title":["PatDNN: Achieving Real-Time DNN Execution on Mobile Devices with Pattern-based Weight Pruning"],"prefix":"10.1145","author":[{"given":"Wei","family":"Niu","sequence":"first","affiliation":[{"name":"College of William and Mary, Williamsburg, VA, USA"}]},{"given":"Xiaolong","family":"Ma","sequence":"additional","affiliation":[{"name":"Northeastern University, Boston, MA, USA"}]},{"given":"Sheng","family":"Lin","sequence":"additional","affiliation":[{"name":"Northeastern University, Boston, MA, USA"}]},{"given":"Shihao","family":"Wang","sequence":"additional","affiliation":[{"name":"Northeastern University, Boston, MA, 
USA"}]},{"given":"Xuehai","family":"Qian","sequence":"additional","affiliation":[{"name":"University of Southern California, Los Angeles, CA, USA"}]},{"given":"Xue","family":"Lin","sequence":"additional","affiliation":[{"name":"Northeastern University, Boston, MA, USA"}]},{"given":"Yanzhi","family":"Wang","sequence":"additional","affiliation":[{"name":"Northeastern University, Boston, MA, USA"}]},{"given":"Bin","family":"Ren","sequence":"additional","affiliation":[{"name":"College of William and Mary, Williamsburg, VA, USA"}]}],"member":"320","published-online":{"date-parts":[[2020,3,13]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"Alibaba. 2019. MNN. https:\/\/github.com\/alibaba\/MNN  Alibaba. 2019. MNN. https:\/\/github.com\/alibaba\/MNN"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/PERCOMW.2016.7457169"},{"key":"e_1_3_2_1_3_1","volume-title":"Proceedings of the 9th International Conference of the Learning Sciences-Volume 1. International Society of the Learning Sciences, 500--507","author":"Boticki Ivica","year":"2010","unstructured":"Ivica Boticki and Hyo-Jeong So . 2010 . Quiet captures: A tool for capturing the evidence of seamless learning with mobile devices . In Proceedings of the 9th International Conference of the Learning Sciences-Volume 1. International Society of the Learning Sciences, 500--507 . Ivica Boticki and Hyo-Jeong So. 2010. Quiet captures: A tool for capturing the evidence of seamless learning with mobile devices. In Proceedings of the 9th International Conference of the Learning Sciences-Volume 1. International Society of the Learning Sciences, 500--507."},{"key":"e_1_3_2_1_4_1","volume-title":"Foundations and Trends\u00ae in Machine Learning","volume":"3","author":"Boyd Stephen","year":"2011","unstructured":"Stephen Boyd , Neal Parikh , Eric Chu , Borja Peleato , and Jonathan Eckstein . 2011 . Distributed optimization and statistical learning via the alternating direction method of multipliers . 
Foundations and Trends\u00ae in Machine Learning , Vol. 3 , 1 (2011), 1--122. Stephen Boyd, Neal Parikh, Eric Chu, Borja Peleato, and Jonathan Eckstein. 2011. Distributed optimization and statistical learning via the alternating direction method of multipliers. Foundations and Trends\u00ae in Machine Learning , Vol. 3, 1 (2011), 1--122."},{"key":"e_1_3_2_1_5_1","volume-title":"13th USENIX Symposium on Operating Systems Design and Implementation (OSDI 18)","author":"Chen Tianqi","year":"2018","unstructured":"Tianqi Chen , Thierry Moreau , Ziheng Jiang , Lianmin Zheng , Eddie Yan , Haichen Shen , Meghan Cowan , Leyuan Wang , Yuwei Hu , Luis Ceze , 2018 . TVM: An automated end-to-end optimizing compiler for deep learning . In 13th USENIX Symposium on Operating Systems Design and Implementation (OSDI 18) . 578--594. Tianqi Chen, Thierry Moreau, Ziheng Jiang, Lianmin Zheng, Eddie Yan, Haichen Shen, Meghan Cowan, Leyuan Wang, Yuwei Hu, Luis Ceze, et al. 2018. TVM: An automated end-to-end optimizing compiler for deep learning. In 13th USENIX Symposium on Operating Systems Design and Implementation (OSDI 18) . 578--594."},{"key":"e_1_3_2_1_6_1","volume-title":"Binaryconnect: Training deep neural networks with binary weights during propagations. In Advances in neural information processing systems. 3123--3131.","author":"Courbariaux Matthieu","year":"2015","unstructured":"Matthieu Courbariaux , Yoshua Bengio , and Jean-Pierre David . 2015 . Binaryconnect: Training deep neural networks with binary weights during propagations. In Advances in neural information processing systems. 3123--3131. Matthieu Courbariaux, Yoshua Bengio, and Jean-Pierre David. 2015. Binaryconnect: Training deep neural networks with binary weights during propagations. In Advances in neural information processing systems. 3123--3131."},{"key":"e_1_3_2_1_7_1","unstructured":"Matthieu Courbariaux Itay Hubara Daniel Soudry Ran El-Yaniv and Yoshua Bengio. 2016. 
Binarized neural networks: Training deep neural networks with weights and activations constrained to  Matthieu Courbariaux Itay Hubara Daniel Soudry Ran El-Yaniv and Yoshua Bengio. 2016. Binarized neural networks: Training deep neural networks with weights and activations constrained to"},{"key":"e_1_3_2_1_8_1","volume-title":"arXiv preprint arXiv:1602.02830","year":"2016","unstructured":"1 or-1. arXiv preprint arXiv:1602.02830 ( 2016 ). 1 or-1. arXiv preprint arXiv:1602.02830 (2016)."},{"key":"e_1_3_2_1_9_1","volume-title":"NeST: a neural network synthesis tool based on a grow-and-prune paradigm. arXiv preprint arXiv:1711.02017","author":"Dai Xiaoliang","year":"2017","unstructured":"Xiaoliang Dai , Hongxu Yin , and Niraj K Jha . 2017. NeST: a neural network synthesis tool based on a grow-and-prune paradigm. arXiv preprint arXiv:1711.02017 ( 2017 ). Xiaoliang Dai, Hongxu Yin, and Niraj K Jha. 2017. NeST: a neural network synthesis tool based on a grow-and-prune paradigm. arXiv preprint arXiv:1711.02017 (2017)."},{"key":"e_1_3_2_1_10_1","volume-title":"Deep Learning on Mobile Devices -- A Review. arXiv preprint arXiv:1904.09274","author":"Deng Yunbin","year":"2019","unstructured":"Yunbin Deng . 2019. Deep Learning on Mobile Devices -- A Review. arXiv preprint arXiv:1904.09274 ( 2019 ). Yunbin Deng. 2019. Deep Learning on Mobile Devices -- A Review. arXiv preprint arXiv:1904.09274 (2019)."},{"key":"e_1_3_2_1_11_1","unstructured":"Google. 2019. TensorFlow Lite. https:\/\/www.tensorflow.org\/mobile\/tflite\/  Google. 2019. TensorFlow Lite. https:\/\/www.tensorflow.org\/mobile\/tflite\/"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/2909437.2909442"},{"key":"e_1_3_2_1_13_1","unstructured":"Yiwen Guo Anbang Yao and Yurong Chen. 2016. Dynamic network surgery for efficient dnns. In Advances In Neural Information Processing Systems. 1379--1387.  Yiwen Guo Anbang Yao and Yurong Chen. 2016. Dynamic network surgery for efficient dnns. 
In Advances In Neural Information Processing Systems. 1379--1387."},{"key":"e_1_3_2_1_14_1","volume-title":"International Conference on Machine Learning . 1737--1746","author":"Gupta Suyog","year":"2015","unstructured":"Suyog Gupta , Ankur Agrawal , Kailash Gopalakrishnan , and Pritish Narayanan . 2015 . Deep learning with limited numerical precision . In International Conference on Machine Learning . 1737--1746 . Suyog Gupta, Ankur Agrawal, Kailash Gopalakrishnan, and Pritish Narayanan. 2015. Deep learning with limited numerical precision. In International Conference on Machine Learning . 1737--1746."},{"key":"e_1_3_2_1_15_1","volume-title":"Deep compression: Compressing deep neural networks with pruning, trained quantization and huffman coding. arXiv preprint arXiv:1510.00149","author":"Han Song","year":"2015","unstructured":"Song Han , Huizi Mao , and William J Dally . 2015a. Deep compression: Compressing deep neural networks with pruning, trained quantization and huffman coding. arXiv preprint arXiv:1510.00149 ( 2015 ). Song Han, Huizi Mao, and William J Dally. 2015a. Deep compression: Compressing deep neural networks with pruning, trained quantization and huffman coding. arXiv preprint arXiv:1510.00149 (2015)."},{"key":"e_1_3_2_1_16_1","unstructured":"Song Han Jeff Pool John Tran and William Dally. 2015b. Learning both weights and connections for efficient neural network. In Advances in Neural Information Processing Systems. 1135--1143.  Song Han Jeff Pool John Tran and William Dally. 2015b. Learning both weights and connections for efficient neural network. In Advances in Neural Information Processing Systems. 1135--1143."},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/2906388.2906396"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.90"},{"key":"e_1_3_2_1_19_1","volume-title":"AMC: AutoML for Model Compression and Acceleration on Mobile Devices. In European Conference on Computer Vision. 
Springer, 815--832","author":"He Yihui","year":"2018","unstructured":"Yihui He , Ji Lin , Zhijian Liu , Hanrui Wang , Li-Jia Li , and Song Han . 2018 . AMC: AutoML for Model Compression and Acceleration on Mobile Devices. In European Conference on Computer Vision. Springer, 815--832 . Yihui He, Ji Lin, Zhijian Liu, Hanrui Wang, Li-Jia Li, and Song Han. 2018. AMC: AutoML for Model Compression and Acceleration on Mobile Devices. In European Conference on Computer Vision. Springer, 815--832."},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.155"},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/3123939.3123970"},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1137\/140990309"},{"key":"e_1_3_2_1_23_1","unstructured":"Itay Hubara Matthieu Courbariaux Daniel Soudry Ran El-Yaniv and Yoshua Bengio. 2016. Binarized neural networks. In Advances in neural information processing systems. 4107--4115.  Itay Hubara Matthieu Courbariaux Daniel Soudry Ran El-Yaniv and Yoshua Bengio. 2016. Binarized neural networks. In Advances in neural information processing systems. 4107--4115."},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.5555\/3122009.3242044"},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/3081333.3081360"},{"key":"e_1_3_2_1_26_1","volume-title":"Batch normalization: Accelerating deep network training by reducing internal covariate shift. arXiv preprint arXiv:1502.03167","author":"Ioffe Sergey","year":"2015","unstructured":"Sergey Ioffe and Christian Szegedy . 2015. Batch normalization: Accelerating deep network training by reducing internal covariate shift. arXiv preprint arXiv:1502.03167 ( 2015 ). Sergey Ioffe and Christian Szegedy. 2015. Batch normalization: Accelerating deep network training by reducing internal covariate shift. 
arXiv preprint arXiv:1502.03167 (2015)."},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2014.223"},{"key":"e_1_3_2_1_28_1","volume-title":"Proceedings of the International Conference on Learning Representations (ICLR) .","author":"Diederik","unstructured":"Diederik P. Kingma and Jimmy Ba. 2014. Adam: A Method for Stochastic Optimization . In Proceedings of the International Conference on Learning Representations (ICLR) . Diederik P. Kingma and Jimmy Ba. 2014. Adam: A Method for Stochastic Optimization. In Proceedings of the International Conference on Learning Representations (ICLR) ."},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.5555\/2959355.2959378"},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/2820975.2820980"},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1109\/MPRV.2017.2940968"},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1145\/2750858.2804262"},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.435"},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.280"},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1145\/1553374.1553453"},{"key":"e_1_3_2_1_36_1","volume-title":"Extremely low bit neural network: Squeeze the last bit out with admm. arXiv preprint arXiv:1707.09870","author":"Leng Cong","year":"2017","unstructured":"Cong Leng , Hao Li , Shenghuo Zhu , and Rong Jin . 2017. Extremely low bit neural network: Squeeze the last bit out with admm. arXiv preprint arXiv:1707.09870 ( 2017 ). Cong Leng, Hao Li, Shenghuo Zhu, and Rong Jin. 2017. Extremely low bit neural network: Squeeze the last bit out with admm. arXiv preprint arXiv:1707.09870 (2017)."},{"key":"e_1_3_2_1_37_1","volume-title":"International Conference on Learning Representations (ICLR) .","author":"Li Hao","year":"2017","unstructured":"Hao Li , Asim Kadav , Igor Durdanovic , Hanan Samet , and Hans Peter Graf . 2017 . 
Pruning filters for efficient convnets . In International Conference on Learning Representations (ICLR) . Hao Li, Asim Kadav, Igor Durdanovic, Hanan Samet, and Hans Peter Graf. 2017. Pruning filters for efficient convnets. In International Conference on Learning Representations (ICLR) ."},{"key":"e_1_3_2_1_38_1","volume-title":"International Conference on Machine Learning. 2849--2858","author":"Lin Darryl","year":"2016","unstructured":"Darryl Lin , Sachin Talathi , and Sreekanth Annapureddy . 2016 . Fixed point quantization of deep convolutional networks . In International Conference on Machine Learning. 2849--2858 . Darryl Lin, Sachin Talathi, and Sreekanth Annapureddy. 2016. Fixed point quantization of deep convolutional networks. In International Conference on Machine Learning. 2849--2858."},{"key":"e_1_3_2_1_39_1","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 806--814","author":"Liu Baoyuan","year":"2015","unstructured":"Baoyuan Liu , Min Wang , Hassan Foroosh , Marshall Tappen , and Marianna Pensky . 2015 . Sparse convolutional neural networks . In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 806--814 . Baoyuan Liu, Min Wang, Hassan Foroosh, Marshall Tappen, and Marianna Pensky. 2015. Sparse convolutional neural networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 806--814."},{"key":"e_1_3_2_1_40_1","volume-title":"Zeroth-Order Online Alternating Direction Method of Multipliers: Convergence Analysis and Applications. In International Conference on Artificial Intelligence and Statistics. 288--297","author":"Liu Sijia","year":"2018","unstructured":"Sijia Liu , Jie Chen , Pin-Yu Chen , and Alfred Hero . 2018 a. Zeroth-Order Online Alternating Direction Method of Multipliers: Convergence Analysis and Applications. In International Conference on Artificial Intelligence and Statistics. 288--297 . Sijia Liu, Jie Chen, Pin-Yu Chen, and Alfred Hero. 
2018a. Zeroth-Order Online Alternating Direction Method of Multipliers: Convergence Analysis and Applications. In International Conference on Artificial Intelligence and Statistics. 288--297."},{"key":"e_1_3_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1145\/3210240.3210337"},{"key":"e_1_3_2_1_42_1","volume-title":"Pconv: The missing but desirable sparsity in dnn weight pruning for real-time execution on mobile devices. arXiv preprint arXiv:1909.05073","author":"Ma Xiaolong","year":"2019","unstructured":"Xiaolong Ma , Fu-Ming Guo , Wei Niu , Xue Lin , Jian Tang , Kaisheng Ma , Bin Ren , and Yanzhi Wang . 2019 . Pconv: The missing but desirable sparsity in dnn weight pruning for real-time execution on mobile devices. arXiv preprint arXiv:1909.05073 (2019). Xiaolong Ma, Fu-Ming Guo, Wei Niu, Xue Lin, Jian Tang, Kaisheng Ma, Bin Ren, and Yanzhi Wang. 2019. Pconv: The missing but desirable sparsity in dnn weight pruning for real-time execution on mobile devices. arXiv preprint arXiv:1909.05073 (2019)."},{"key":"e_1_3_2_1_43_1","volume-title":"Exploring the regularity of sparse structure in convolutional neural networks. arXiv preprint arXiv:1705.08922","author":"Mao Huizi","year":"2017","unstructured":"Huizi Mao , Song Han , Jeff Pool , Wenshuo Li , Xingyu Liu , Yu Wang , and William J Dally . 2017. Exploring the regularity of sparse structure in convolutional neural networks. arXiv preprint arXiv:1705.08922 ( 2017 ). Huizi Mao, Song Han, Jeff Pool, Wenshuo Li, Xingyu Liu, Yu Wang, and William J Dally. 2017. Exploring the regularity of sparse structure in convolutional neural networks. arXiv preprint arXiv:1705.08922 (2017)."},{"key":"e_1_3_2_1_44_1","first-page":"34","article-title":"Deep learning for mobile multimedia: A survey","volume":"13","author":"Ota Kaoru","year":"2017","unstructured":"Kaoru Ota , Minh Son Dao , Vasileios Mezaris , and Francesco GB De Natale . 2017 . Deep learning for mobile multimedia: A survey . 
ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM) , Vol. 13 , 3s (2017), 34 . Kaoru Ota, Minh Son Dao, Vasileios Mezaris, and Francesco GB De Natale. 2017. Deep learning for mobile multimedia: A survey. ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM) , Vol. 13, 3s (2017), 34.","journal-title":"ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM)"},{"key":"e_1_3_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1145\/3079856.3080254"},{"key":"e_1_3_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.761"},{"key":"e_1_3_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1109\/MASS.2011.52"},{"key":"e_1_3_2_1_48_1","unstructured":"Qualcomm. 2019. Snapdragon 855. https:\/\/www.qualcomm.com\/products\/snapdragon-855-mobile-platform  Qualcomm. 2019. Snapdragon 855. https:\/\/www.qualcomm.com\/products\/snapdragon-855-mobile-platform"},{"key":"e_1_3_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-46493-0_32"},{"key":"e_1_3_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.1145\/3297858.3304076"},{"key":"e_1_3_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.1109\/JSEN.2014.2357257"},{"key":"e_1_3_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00474"},{"key":"e_1_3_2_1_53_1","volume-title":"Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556","author":"Simonyan Karen","year":"2014","unstructured":"Karen Simonyan and Andrew Zisserman . 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 ( 2014 ). Karen Simonyan and Andrew Zisserman. 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014)."},{"key":"e_1_3_2_1_54_1","volume-title":"Tensor comprehensions: Framework-agnostic high-performance machine learning abstractions. 
arXiv preprint arXiv:1802.04730","author":"Vasilache Nicolas","year":"2018","unstructured":"Nicolas Vasilache , Oleksandr Zinenko , Theodoros Theodoridis , Priya Goyal , Zachary DeVito , William S Moses , Sven Verdoolaege , Andrew Adams , and Albert Cohen . 2018. Tensor comprehensions: Framework-agnostic high-performance machine learning abstractions. arXiv preprint arXiv:1802.04730 ( 2018 ). Nicolas Vasilache, Oleksandr Zinenko, Theodoros Theodoridis, Priya Goyal, Zachary DeVito, William S Moses, Sven Verdoolaege, Andrew Adams, and Albert Cohen. 2018. Tensor comprehensions: Framework-agnostic high-performance machine learning abstractions. arXiv preprint arXiv:1802.04730 (2018)."},{"key":"e_1_3_2_1_55_1","unstructured":"Wei Wen Chunpeng Wu Yandan Wang Yiran Chen and Hai Li. 2016. Learning structured sparsity in deep neural networks. In Advances in Neural Information Processing Systems. 2074--2082.  Wei Wen Chunpeng Wu Yandan Wang Yiran Chen and Hai Li. 2016. Learning structured sparsity in deep neural networks. In Advances in Neural Information Processing Systems. 2074--2082."},{"key":"e_1_3_2_1_56_1","volume-title":"Arithmetic complexity of computations","author":"Winograd Shmuel","unstructured":"Shmuel Winograd . 1980. Arithmetic complexity of computations . Vol. 33 . Siam . Shmuel Winograd. 1980. Arithmetic complexity of computations . Vol. 33. Siam."},{"key":"e_1_3_2_1_57_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.521"},{"key":"e_1_3_2_1_58_1","doi-asserted-by":"publisher","DOI":"10.1145\/3241539.3241563"},{"key":"e_1_3_2_1_59_1","volume-title":"Using goal-driven deep learning models to understand sensory cortex. Nature neuroscience","author":"Yamins Daniel LK","year":"2016","unstructured":"Daniel LK Yamins and James J DiCarlo . 2016. Using goal-driven deep learning models to understand sensory cortex. Nature neuroscience , Vol. 19 , 3 ( 2016 ), 356. Daniel LK Yamins and James J DiCarlo. 2016. 
Using goal-driven deep learning models to understand sensory cortex. Nature neuroscience , Vol. 19, 3 (2016), 356."},{"key":"e_1_3_2_1_60_1","doi-asserted-by":"publisher","DOI":"10.1073\/pnas.1403112111"},{"key":"e_1_3_2_1_61_1","doi-asserted-by":"publisher","DOI":"10.1145\/3038912.3052577"},{"key":"e_1_3_2_1_62_1","volume-title":"Progressive DNN Compression: A Key to Achieve Ultra-High Weight Pruning and Quantization Rates using ADMM. arXiv preprint arXiv:1903.09769","author":"Ye Shaokai","year":"2019","unstructured":"Shaokai Ye , Xiaoyu Feng , Tianyun Zhang , Xiaolong Ma , Sheng Lin , Zhengang Li , Kaidi Xu , Wujie Wen , Sijia Liu , Jian Tang , et al. 2019 . Progressive DNN Compression: A Key to Achieve Ultra-High Weight Pruning and Quantization Rates using ADMM. arXiv preprint arXiv:1903.09769 (2019). Shaokai Ye, Xiaoyu Feng, Tianyun Zhang, Xiaolong Ma, Sheng Lin, Zhengang Li, Kaidi Xu, Wujie Wen, Sijia Liu, Jian Tang, et al. 2019. Progressive DNN Compression: A Key to Achieve Ultra-High Weight Pruning and Quantization Rates using ADMM. arXiv preprint arXiv:1903.09769 (2019)."},{"key":"e_1_3_2_1_63_1","doi-asserted-by":"publisher","DOI":"10.1109\/MSP.2010.939038"},{"key":"e_1_3_2_1_64_1","doi-asserted-by":"publisher","DOI":"10.1109\/COMST.2019.2904897"},{"key":"e_1_3_2_1_65_1","volume-title":"Systematic Weight Pruning of DNNs using Alternating Direction Method of Multipliers. arXiv preprint arXiv:1802.05747","author":"Zhang Tianyun","year":"2018","unstructured":"Tianyun Zhang , Shaokai Ye , Yipeng Zhang , Yanzhi Wang , and Makan Fardad . 2018. Systematic Weight Pruning of DNNs using Alternating Direction Method of Multipliers. arXiv preprint arXiv:1802.05747 ( 2018 ). Tianyun Zhang, Shaokai Ye, Yipeng Zhang, Yanzhi Wang, and Makan Fardad. 2018. Systematic Weight Pruning of DNNs using Alternating Direction Method of Multipliers. 
arXiv preprint arXiv:1802.05747 (2018)."},{"key":"e_1_3_2_1_66_1","volume-title":"International Conference on Learning Representations (ICLR) .","author":"Zhou Aojun","year":"2017","unstructured":"Aojun Zhou , Anbang Yao , Yiwen Guo , Lin Xu , and Yurong Chen . 2017 . Incremental network quantization: Towards lossless cnns with low-precision weights . In International Conference on Learning Representations (ICLR) . Aojun Zhou, Anbang Yao, Yiwen Guo, Lin Xu, and Yurong Chen. 2017. Incremental network quantization: Towards lossless cnns with low-precision weights. In International Conference on Learning Representations (ICLR) ."}],"event":{"name":"ASPLOS '20: Architectural Support for Programming Languages and Operating Systems","location":"Lausanne Switzerland","acronym":"ASPLOS '20","sponsor":["SIGPLAN ACM Special Interest Group on Programming Languages","SIGOPS ACM Special Interest Group on Operating Systems","SIGARCH ACM Special Interest Group on Computer Architecture","SIGBED ACM Special Interest Group on Embedded Systems"]},"container-title":["Proceedings of the Twenty-Fifth International Conference on Architectural Support for Programming Languages and Operating 
Systems"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3373376.3378534","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3373376.3378534","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3373376.3378534","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T22:38:16Z","timestamp":1750199896000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3373376.3378534"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,3,9]]},"references-count":66,"alternative-id":["10.1145\/3373376.3378534","10.1145\/3373376"],"URL":"https:\/\/doi.org\/10.1145\/3373376.3378534","relation":{},"subject":[],"published":{"date-parts":[[2020,3,9]]},"assertion":[{"value":"2020-03-13","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}