{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,2]],"date-time":"2026-04-02T22:10:21Z","timestamp":1775167821656,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":35,"publisher":"ACM","license":[{"start":{"date-parts":[[2023,10,28]],"date-time":"2023-10-28T00:00:00Z","timestamp":1698451200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2023,10,28]]},"DOI":"10.1145\/3613424.3614312","type":"proceedings-article","created":{"date-parts":[[2023,12,8]],"date-time":"2023-12-08T17:22:15Z","timestamp":1702056135000},"page":"324-337","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":15,"title":["Eureka: Efficient Tensor Cores for One-sided Unstructured Sparsity in DNN Inference"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-3370-3576","authenticated-orcid":false,"given":"Ashish","family":"Gondimalla","sequence":"first","affiliation":[{"name":"Google, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4164-4542","authenticated-orcid":false,"given":"Mithuna","family":"Thottethodi","sequence":"additional","affiliation":[{"name":"Purdue University, United States of America"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6624-4372","authenticated-orcid":false,"given":"T. N.","family":"Vijaykumar","sequence":"additional","affiliation":[{"name":"Purdue University, United States of America"}]}],"member":"320","published-online":{"date-parts":[[2023,12,8]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/3123939.3123982"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/3007787.3001138"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISSCC.2016.7418007"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/3297858.3304041"},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2009.5206848"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/3352460.3358291"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/3007787.3001163"},{"key":"e_1_3_2_1_8_1","volume-title":"Deep Compression: Compressing Deep Neural Network with Pruning, Trained Quantization and Huffman Coding. In 4th International Conference on Learning Representations, ICLR","author":"Han Song","year":"2016","unstructured":"Song Han , Huizi Mao , and William\u00a0 J. Dally . 2016 . Deep Compression: Compressing Deep Neural Network with Pruning, Trained Quantization and Huffman Coding. In 4th International Conference on Learning Representations, ICLR 2016, San Juan, Puerto Rico , May 2-4, 2016, Conference Track Proceedings . http:\/\/arxiv.org\/abs\/1510.00149 Song Han, Huizi Mao, and William\u00a0J. Dally. 2016. Deep Compression: Compressing Deep Neural Network with Pruning, Trained Quantization and Huffman Coding. In 4th International Conference on Learning Representations, ICLR 2016, San Juan, Puerto Rico, May 2-4, 2016, Conference Track Proceedings. http:\/\/arxiv.org\/abs\/1510.00149"},{"key":"e_1_3_2_1_9_1","volume-title":"Advances in Neural Information Processing Systems 28, C.\u00a0Cortes, N.\u00a0D.","author":"Han Song","unstructured":"Song Han , Jeff Pool , John Tran , and William Dally . 2015. Learning both Weights and Connections for Efficient Neural Network . In Advances in Neural Information Processing Systems 28, C.\u00a0Cortes, N.\u00a0D. Lawrence , D.\u00a0D. Lee, M.\u00a0Sugiyama, and R.\u00a0Garnett (Eds.). Curran Associates, Inc ., 1135\u20131143. http:\/\/papers.nips.cc\/paper\/5784-learning-both-weights-and-connections-for-efficient-neural-network.pdf Song Han, Jeff Pool, John Tran, and William Dally. 2015. Learning both Weights and Connections for Efficient Neural Network. In Advances in Neural Information Processing Systems 28, C.\u00a0Cortes, N.\u00a0D. Lawrence, D.\u00a0D. Lee, M.\u00a0Sugiyama, and R.\u00a0Garnett (Eds.). Curran Associates, Inc., 1135\u20131143. http:\/\/papers.nips.cc\/paper\/5784-learning-both-weights-and-connections-for-efficient-neural-network.pdf"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA51647.2021.00017"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISCA52012.2021.00010"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/3079856.3080246"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISCA45697.2020.00047"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/3297858.3304028"},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/LCA.2020.2979965"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA53966.2022.00049"},{"key":"e_1_3_2_1_17_1","unstructured":"NCSU. [n. d.]. FreePDK45. https:\/\/www.eda.ncsu.edu\/wiki\/FreePDK15\/.  NCSU. [n. d.]. FreePDK45. https:\/\/www.eda.ncsu.edu\/wiki\/FreePDK15\/."},{"key":"e_1_3_2_1_18_1","unstructured":"Neural Magic. 2021. Sparse Zoo. https:\/\/docs.neuralmagic.com\/sparsezoo\/  Neural Magic. 2021. Sparse Zoo. https:\/\/docs.neuralmagic.com\/sparsezoo\/"},{"key":"e_1_3_2_1_19_1","unstructured":"Nvidia. [n. d.]. NVIDIA TESLA V100 GPU ARCHITECTURE. https:\/\/images.nvidia.com\/content\/volta-architecture\/pdf\/volta-architecture-whitepaper.pdf  Nvidia. [n. d.]. NVIDIA TESLA V100 GPU ARCHITECTURE. https:\/\/images.nvidia.com\/content\/volta-architecture\/pdf\/volta-architecture-whitepaper.pdf"},{"key":"e_1_3_2_1_20_1","unstructured":"Nvidia. 2022. Nvidia Deep Learning Performance documentation. https:\/\/docs.nvidia.com\/deeplearning\/performance\/dl-performance-convolutional\/index.html. Updated: 2022-May-17.  Nvidia. 2022. Nvidia Deep Learning Performance documentation. https:\/\/docs.nvidia.com\/deeplearning\/performance\/dl-performance-convolutional\/index.html. Updated: 2022-May-17."},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA.2018.00067"},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/3079856.3080254"},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISPASS.2019.00016"},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"crossref","unstructured":"Pranav Rajpurkar Jian Zhang Konstantin Lopyrev and Percy Liang. 2016. SQuAD: 100 000+ Questions for Machine Comprehension of Text. arxiv:1606.05250\u00a0[cs.CL]  Pranav Rajpurkar Jian Zhang Konstantin Lopyrev and Percy Liang. 2016. SQuAD: 100 000+ Questions for Machine Comprehension of Text. arxiv:1606.05250\u00a0[cs.CL]","DOI":"10.18653\/v1\/D16-1264"},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/3307650.3322255"},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/MICRO50266.2020.00068"},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.vlsi.2017.02.002"},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.308"},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISCA52012.2021.00088"},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPS47924.2020.00071"},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1109\/MICRO.2016.7783723"},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA47549.2020.00030"},{"key":"e_1_3_2_1_33_1","volume-title":"International Conference on Learning Representations. https:\/\/openreview.net\/forum?id=K9bw7vqp_s","author":"Zhou Aojun","year":"2021","unstructured":"Aojun Zhou , Yukun Ma , Junnan Zhu , Jianbo Liu , Zhijie Zhang , Kun Yuan , Wenxiu Sun , and Hongsheng Li . 2021 . Learning N:M Fine-grained Structured Sparse Neural Networks From Scratch . In International Conference on Learning Representations. https:\/\/openreview.net\/forum?id=K9bw7vqp_s Aojun Zhou, Yukun Ma, Junnan Zhu, Jianbo Liu, Zhijie Zhang, Kun Yuan, Wenxiu Sun, and Hongsheng Li. 2021. Learning N:M Fine-grained Structured Sparse Neural Networks From Scratch. In International Conference on Learning Representations. https:\/\/openreview.net\/forum?id=K9bw7vqp_s"},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1109\/MICRO.2018.00011"},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1145\/3352460.3358269"}],"event":{"name":"MICRO '23: 56th Annual IEEE\/ACM International Symposium on Microarchitecture","location":"Toronto ON Canada","acronym":"MICRO '23","sponsor":["SIGMICRO ACM Special Interest Group on Microarchitectural Research and Processing"]},"container-title":["56th Annual IEEE\/ACM International Symposium on Microarchitecture"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3613424.3614312","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3613424.3614312","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T16:36:30Z","timestamp":1750178190000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3613424.3614312"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,10,28]]},"references-count":35,"alternative-id":["10.1145\/3613424.3614312","10.1145\/3613424"],"URL":"https:\/\/doi.org\/10.1145\/3613424.3614312","relation":{},"subject":[],"published":{"date-parts":[[2023,10,28]]},"assertion":[{"value":"2023-12-08","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}