{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,23]],"date-time":"2026-03-23T23:09:55Z","timestamp":1774307395228,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":42,"publisher":"ACM","license":[{"start":{"date-parts":[[2018,2,10]],"date-time":"2018-02-10T00:00:00Z","timestamp":1518220800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"IBM Ph.D. Fellowship Award"},{"name":"DOE Early Career Award","award":["DE-SC0013700"],"award-info":[{"award-number":["DE-SC0013700"]}]},{"DOI":"10.13039\/100000001","name":"National Science Foundation (NSF)","doi-asserted-by":"publisher","award":["CCF-1455404, CCF-1525609, CNS-1717425, CCF-1703487"],"award-info":[{"award-number":["CCF-1455404, CCF-1525609, CNS-1717425, CCF-1703487"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2018,2,10]]},"DOI":"10.1145\/3178487.3178495","type":"proceedings-article","created":{"date-parts":[[2018,2,6]],"date-time":"2018-02-06T13:12:23Z","timestamp":1517922743000},"page":"94-108","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":69,"title":["Bridging the gap between deep learning and sparse matrix format selection"],"prefix":"10.1145","author":[{"given":"Yue","family":"Zhao","sequence":"first","affiliation":[{"name":"North Carolina State University"}]},{"given":"Jiajia","family":"Li","sequence":"additional","affiliation":[{"name":"Georgia Institute of Technology"}]},{"given":"Chunhua","family":"Liao","sequence":"additional","affiliation":[{"name":"Lawrence Livermore National Laboratory"}]},{"given":"Xipeng","family":"Shen","sequence":"additional","affiliation":[{"name":"North Carolina State University"}]}],"member":"320","published-online":{"date-parts":[[2018,2,10]]},"reference":[{"key":"e_1_3_2_2_1_1","doi-asserted-by":"publisher","DOI":"10.1109\/CGO.2006.37"},{"key":"e_1_3_2_2_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/997163.997196"},{"key":"e_1_3_2_2_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/1542476.1542481"},{"key":"e_1_3_2_2_4_1","volume-title":"Efficiency of General Krylov Methods on GPUs - An Experimental Study. In 2016 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW). 683--691","author":"Anzt H.","unstructured":"H. Anzt , J. Dongarra , M. Kreutzer , G. Wellein , and M. K\u00c3\u0171hler . 2016 . Efficiency of General Krylov Methods on GPUs - An Experimental Study. In 2016 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW). 683--691 . H. Anzt, J. Dongarra, M. Kreutzer, G. Wellein, and M. K\u00c3\u0171hler. 2016. Efficiency of General Krylov Methods on GPUs - An Experimental Study. In 2016 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW). 683--691."},{"key":"e_1_3_2_2_5_1","doi-asserted-by":"publisher","DOI":"10.1137\/080716542"},{"key":"e_1_3_2_2_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/1654059.1654078"},{"key":"e_1_3_2_2_7_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-01970-8_45"},{"key":"e_1_3_2_2_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/1201775.882364"},{"key":"e_1_3_2_2_9_1","doi-asserted-by":"publisher","DOI":"10.5555\/297805.297827"},{"key":"e_1_3_2_2_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/1693453.1693471"},{"key":"e_1_3_2_2_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/2049662.2049663"},{"key":"e_1_3_2_2_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/2737924.2737969"},{"key":"e_1_3_2_2_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/MCSE.2006.105"},{"key":"e_1_3_2_2_14_1","volume-title":"Proceedings of the GCC Developers' Summit.","author":"Fursin Grigori","year":"2008","unstructured":"Grigori Fursin , Cupertino Miranda , Olivier Temam , Mircea Namolaru , Elad Yom-Tov , Ayal Zaks , Bilha Mendelson , Edwin Bonilla , John Thomson , Hugh Leather , Chris Williams , Michael O'Boyle , Phil Barnard , Elton Ashton , Eric Courtois , and Francois Bodin . 2008 . MILEPOST GCC: machine learning based research compiler . In Proceedings of the GCC Developers' Summit. Grigori Fursin, Cupertino Miranda, Olivier Temam, Mircea Namolaru, Elad Yom-Tov, Ayal Zaks, Bilha Mendelson, Edwin Bonilla, John Thomson, Hugh Leather, Chris Williams, Michael O'Boyle, Phil Barnard, Elton Ashton, Eric Courtois, and Francois Bodin. 2008. MILEPOST GCC: machine learning based research compiler. In Proceedings of the GCC Developers' Summit."},{"key":"e_1_3_2_2_15_1","volume-title":"Deep residual learning for image recognition. arXiv preprint arXiv:1512.03385","author":"He Kaiming","year":"2015","unstructured":"Kaiming He , Xiangyu Zhang , Shaoqing Ren , and Jian Sun . 2015. Deep residual learning for image recognition. arXiv preprint arXiv:1512.03385 ( 2015 ). Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2015. Deep residual learning for image recognition. arXiv preprint arXiv:1512.03385 (2015)."},{"key":"e_1_3_2_2_16_1","doi-asserted-by":"publisher","DOI":"10.1029\/2001GL013552"},{"key":"e_1_3_2_2_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/1941553.1941587"},{"key":"e_1_3_2_2_18_1","unstructured":"Alex Krizhevsky Ilya Sutskever and Geoffrey E Hinton. 2012. Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems. 1097--1105.   Alex Krizhevsky Ilya Sutskever and Geoffrey E Hinton. 2012. Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems. 1097--1105."},{"key":"e_1_3_2_2_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPDS.2015.2401575"},{"key":"e_1_3_2_2_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/2491956.2462181"},{"key":"e_1_3_2_2_21_1","unstructured":"Weifeng Liu. 2016. Benchmark SpMV using CSR5. https:\/\/github.com\/bhSPARSE\/Benchmark_SpMV_using_CSR5. (2016).  Weifeng Liu. 2016. Benchmark SpMV using CSR5. https:\/\/github.com\/bhSPARSE\/Benchmark_SpMV_using_CSR5. (2016)."},{"key":"e_1_3_2_2_22_1","doi-asserted-by":"publisher","DOI":"10.1002\/cpe.4244"},{"key":"e_1_3_2_2_23_1","doi-asserted-by":"publisher","DOI":"10.1145\/2751205.2751209"},{"key":"e_1_3_2_2_24_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.parco.2015.04.004"},{"key":"e_1_3_2_2_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/2464996.2465013"},{"key":"e_1_3_2_2_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/2851141.2851190"},{"key":"e_1_3_2_2_27_1","volume-title":"Machine learning: a probabilistic perspective","author":"Murphy Kevin P","unstructured":"Kevin P Murphy . 2012. Machine learning: a probabilistic perspective . MIT press . Kevin P Murphy. 2012. Machine learning: a probabilistic perspective. MIT press."},{"key":"e_1_3_2_2_28_1","volume-title":"CUSPARSE Library: A Set of Basic Linear Algebra Subroutines for Sparse Matrices. In GPU Technology Conference","volume":"2070","author":"Naumov M","year":"2010","unstructured":"M Naumov , LS Chien , P Vandermersch , and U Kapasi . 2010 . CUSPARSE Library: A Set of Basic Linear Algebra Subroutines for Sparse Matrices. In GPU Technology Conference , Vol. 2070 . M Naumov, LS Chien, P Vandermersch, and U Kapasi. 2010. CUSPARSE Library: A Set of Basic Linear Algebra Subroutines for Sparse Matrices. In GPU Technology Conference, Vol. 2070."},{"key":"e_1_3_2_2_29_1","volume-title":"IEEE\/ACM International Symposium on Code Generation and Optimization. 119 --129","author":"Park Eunjung","unstructured":"Eunjung Park , L.-N. Pouche , J. Cavazos , A. Cohen , and P. Sadayappan . 2011. Predictive modeling in a polyhedral optimization space . In IEEE\/ACM International Symposium on Code Generation and Optimization. 119 --129 . Eunjung Park, L.-N. Pouche, J. Cavazos, A. Cohen, and P. Sadayappan. 2011. Predictive modeling in a polyhedral optimization space. In IEEE\/ACM International Symposium on Code Generation and Optimization. 119 --129."},{"key":"e_1_3_2_2_30_1","volume-title":"CNN Features off-the-shelf: an Astounding Baseline for Recognition. CoRR abs\/1403.6382","author":"Razavian Ali Sharif","year":"2014","unstructured":"Ali Sharif Razavian , Hossein Azizpour , Josephine Sullivan , and Stefan Carlsson . 2014. CNN Features off-the-shelf: an Astounding Baseline for Recognition. CoRR abs\/1403.6382 ( 2014 ). http:\/\/arxiv.org\/abs\/1403.6382 Ali Sharif Razavian, Hossein Azizpour, Josephine Sullivan, and Stefan Carlsson. 2014. CNN Features off-the-shelf: an Astounding Baseline for Recognition. CoRR abs\/1403.6382 (2014). http:\/\/arxiv.org\/abs\/1403.6382"},{"key":"e_1_3_2_2_32_1","doi-asserted-by":"publisher","DOI":"10.1145\/2751205.2751244"},{"key":"e_1_3_2_2_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/2304576.2304624"},{"key":"e_1_3_2_2_34_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298594"},{"key":"e_1_3_2_2_35_1","volume-title":"Design and Implementation of Adaptive SpMV Library for Multicore and Manycore Architecture. ACM Trans. Math. Softw. (To appear)","author":"Tan Guangming","year":"2018","unstructured":"Guangming Tan , Junhong Liu , and Jiajia Li. 2018. Design and Implementation of Adaptive SpMV Library for Multicore and Manycore Architecture. ACM Trans. Math. Softw. (To appear) ( 2018 ). Guangming Tan, Junhong Liu, and Jiajia Li. 2018. Design and Implementation of Adaptive SpMV Library for Multicore and Manycore Architecture. ACM Trans. Math. Softw. (To appear) (2018)."},{"key":"e_1_3_2_2_36_1","doi-asserted-by":"publisher","DOI":"10.1145\/1869459.1869471"},{"key":"e_1_3_2_2_38_1","doi-asserted-by":"publisher","DOI":"10.1007\/11557654_91"},{"key":"e_1_3_2_2_39_1","volume-title":"High-Performance Computing on the Intel\u00ae Xeon Phi\u2122","author":"Wang Endong","unstructured":"Endong Wang , Qing Zhang , Bo Shen , Guangyong Zhang , Xiaowei Lu , Qing Wu , and Yajuan Wang . 2014. Intel math kernel library . In High-Performance Computing on the Intel\u00ae Xeon Phi\u2122 . Springer , 167--188. Endong Wang, Qing Zhang, Bo Shen, Guangyong Zhang, Xiaowei Lu, Qing Wu, and Yajuan Wang. 2014. Intel math kernel library. In High-Performance Computing on the Intel\u00ae Xeon Phi\u2122. Springer, 167--188."},{"key":"e_1_3_2_2_40_1","doi-asserted-by":"publisher","DOI":"10.1145\/3178487.3178513"},{"key":"e_1_3_2_2_41_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.parco.2008.12.006"},{"key":"e_1_3_2_2_42_1","volume-title":"CVR: Efficient SpMV Vectorization on X86 Processors. The 2018 International Symposium on Code Generation and Optimization (To appear)","author":"Xie Biwei","year":"2018","unstructured":"Biwei Xie , Jianfeng Zhan , Zhen Jia , Wanling Gao , Lixin Zhang , and Xu Liu . 2018 . CVR: Efficient SpMV Vectorization on X86 Processors. The 2018 International Symposium on Code Generation and Optimization (To appear) (2018). Biwei Xie, Jianfeng Zhan, Zhen Jia, Wanling Gao, Lixin Zhang, and Xu Liu. 2018. CVR: Efficient SpMV Vectorization on X86 Processors. The 2018 International Symposium on Code Generation and Optimization (To appear) (2018)."},{"key":"e_1_3_2_2_43_1","doi-asserted-by":"publisher","DOI":"10.1145\/2555243.2555255"},{"key":"e_1_3_2_2_44_1","volume-title":"How transferable are features in deep neural networks? CoRR abs\/1411.1792","author":"Yosinski Jason","year":"2014","unstructured":"Jason Yosinski , Jeff Clune , Yoshua Bengio , and Hod Lipson . 2014. How transferable are features in deep neural networks? CoRR abs\/1411.1792 ( 2014 ). http:\/\/arxiv.org\/abs\/1411.1792 Jason Yosinski, Jeff Clune, Yoshua Bengio, and Hod Lipson. 2014. How transferable are features in deep neural networks? CoRR abs\/1411.1792 (2014). http:\/\/arxiv.org\/abs\/1411.1792"}],"event":{"name":"PPoPP '18: 23nd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming","location":"Vienna Austria","acronym":"PPoPP '18","sponsor":["SIGPLAN ACM Special Interest Group on Programming Languages","SIGHPC ACM Special Interest Group on High Performance Computing, Special Interest Group on High Performance Computing"]},"container-title":["Proceedings of the 23rd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3178487.3178495","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3178487.3178495","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3178487.3178495","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T21:39:07Z","timestamp":1750196347000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3178487.3178495"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,2,10]]},"references-count":42,"alternative-id":["10.1145\/3178487.3178495","10.1145\/3178487"],"URL":"https:\/\/doi.org\/10.1145\/3178487.3178495","relation":{"is-identical-to":[{"id-type":"doi","id":"10.1145\/3200691.3178495","asserted-by":"object"}]},"subject":[],"published":{"date-parts":[[2018,2,10]]},"assertion":[{"value":"2018-02-10","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}