{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:17:13Z","timestamp":1750220233144,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":23,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,6,23]],"date-time":"2022-06-23T00:00:00Z","timestamp":1655942400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"The Open Project of State Key Laboratory of Plateau Ecology and Agriculture, Qinghai University","award":["No.2020-ZZ-03"],"award-info":[{"award-number":["No.2020-ZZ-03"]}]},{"name":"2021 Graduate Course Construction Project of Qinghai University","award":["No.qdyk-210413"],"award-info":[{"award-number":["No.qdyk-210413"]}]},{"name":"Natural Science Foundation of Qinghai Province","award":["No.2022-ZJ-701"],"award-info":[{"award-number":["No.2022-ZJ-701"]}]},{"name":"Youth Foundation Program of Qinghai University","award":["No.2021-QGY-13"],"award-info":[{"award-number":["No.2021-QGY-13"]}]},{"name":"the national Natural Science Foundation of China","award":["No.62062059"],"award-info":[{"award-number":["No.62062059"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,6,23]]},"DOI":"10.1145\/3546000.3546011","type":"proceedings-article","created":{"date-parts":[[2022,8,19]],"date-time":"2022-08-19T16:08:37Z","timestamp":1660925317000},"page":"72-76","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["Performance Optimization of Sparse Deep Neural Networks Based on GPU"],"prefix":"10.1145","author":[{"given":"yucheng","family":"shi","sequence":"first","affiliation":[{"name":"Department of Computer Technology and Applications, Qinghai university, 
China"}]},{"given":"long","family":"ren","sequence":"additional","affiliation":[{"name":"Informationization Technolog, Qinghai university, China"}]}],"member":"320","published-online":{"date-parts":[[2022,8,19]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"Karen Simonyan and Andrew Zisserman. 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556(2014)."},{"volume-title":"Neural Networks and Speech Processing","author":"Morgan P","key":"e_1_3_2_1_2_1","unstructured":"David\u00a0P Morgan and Christopher\u00a0L Scofield. 1991. Neural networks and speech processing. In Neural Networks and Speech Processing. Springer, 329\u2013348."},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/TNNLS.2020.2979670"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/AIAM48774.2019.00083"},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/2964284.2964325"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/TITS.2019.2962338"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPRW.2017.61"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"crossref","unstructured":"Jeremy Kepner, Simon Alford, Vijay Gadepally, Michael Jones, Lauren Milechin, Albert Reuther, Ryan Robinett, and Sid Samsi. 2020. Graphchallenge.org sparse deep neural network performance. arXiv preprint arXiv:2004.01181(2020).","DOI":"10.1109\/HPEC43674.2020.9286253"},
{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPEC.2019.8916506"},{"key":"e_1_3_2_1_10_1","volume-title":"The perceptron: a probabilistic model for information storage and organization in the brain. Psychological review 65, 6","author":"Rosenblatt Frank","year":"1958","unstructured":"Frank Rosenblatt. 1958. The perceptron: a probabilistic model for information storage and organization in the brain. Psychological review 65, 6 (1958), 386."},{"key":"e_1_3_2_1_11_1","volume-title":"International Conference on Machine Learning. PMLR, 5189\u20135200","author":"Kag Anil","year":"2021","unstructured":"Anil Kag and Venkatesh Saligrama. 2021. Training Recurrent Neural Networks via Forward Propagation Through Time. In International Conference on Machine Learning. PMLR, 5189\u20135200."},{"key":"e_1_3_2_1_12_1","volume-title":"International Conference on Machine Learning. PMLR, 6659\u20136667","author":"Wang Shengjie","year":"2019","unstructured":"Shengjie Wang, Tianyi Zhou, and Jeff Bilmes. 2019. Bias also matters: Bias attribution for deep neural network explanation. In International Conference on Machine Learning. PMLR, 6659\u20136667."},{"volume-title":"Advances in computer science and information engineering","author":"Li Jing","key":"e_1_3_2_1_13_1","unstructured":"Jing Li, Ji-hang Cheng, Jing-yuan Shi, and Fei Huang. 2012. Brief introduction of back propagation (BP) neural network algorithm and its improvement. In Advances in computer science and information engineering. Springer, 553\u2013558."},
{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/SC.2014.68"},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/HiPC.2012.6507483"},{"key":"e_1_3_2_1_16_1","unstructured":"Jonathan Passerat-Palmbach, Jonathan Caux, Pridi Siregar, Claude Mazel, and David Hill. 2015. Warp-level parallelism: Enabling multiple replications in parallel on GPU. arXiv preprint arXiv:1501.01405(2015)."},{"key":"e_1_3_2_1_17_1","volume-title":"International Conference on Parallel Processing and Applied Mathematics. Springer, 570\u2013579","author":"Bialas Piotr","year":"2015","unstructured":"Piotr Bialas and Adam Strzelecki. 2015. Benchmarking the cost of thread divergence in CUDA. In International Conference on Parallel Processing and Applied Mathematics. Springer, 570\u2013579."},{"volume-title":"Computer Graphics Forum, Vol.\u00a032","author":"Rosen Paul","key":"e_1_3_2_1_18_1","unstructured":"Paul Rosen. 2013. A visual approach to investigating shared and global memory behavior of CUDA kernels. In Computer Graphics Forum, Vol.\u00a032. Wiley Online Library, 161\u2013170."},
{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.cpc.2016.11.011"},{"key":"e_1_3_2_1_20_1","volume-title":"Third Workshop on Software Tools for MultiCore Systems. 33","author":"Boyer Michael","year":"2008","unstructured":"Michael Boyer, Kevin Skadron, and Westley Weimer. 2008. Automated dynamic analysis of CUDA programs. In Third Workshop on Software Tools for MultiCore Systems. 33."},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/ASSET.1999.756775"},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPS.2013.95"},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1002\/cpe.3973"}],"event":{"name":"HP3C'22: 2022 6th International Conference on High Performance Compilation, Computing and Communications","acronym":"HP3C'22","location":"Jilin China"},"container-title":["Proceedings of the 6th International Conference on High Performance Compilation, Computing and 
Communications"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3546000.3546011","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3546000.3546011","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T19:30:18Z","timestamp":1750188618000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3546000.3546011"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,6,23]]},"references-count":23,"alternative-id":["10.1145\/3546000.3546011","10.1145\/3546000"],"URL":"https:\/\/doi.org\/10.1145\/3546000.3546011","relation":{},"subject":[],"published":{"date-parts":[[2022,6,23]]},"assertion":[{"value":"2022-08-19","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}