{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,9]],"date-time":"2026-06-09T08:43:43Z","timestamp":1780994623884,"version":"3.54.1"},"reference-count":86,"publisher":"Association for Computing Machinery (ACM)","issue":"1","license":[{"start":{"date-parts":[[2017,3,31]],"date-time":"2017-03-31T00:00:00Z","timestamp":1490918400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"Defense Advanced Research Projects Agency, STTR awards","award":["D14PC00023, D15PC00010"],"award-info":[{"award-number":["D14PC00023, D15PC00010"]}]},{"name":"Defense Advanced Research Projects Agency, XDATA Program, US Army award","award":["W911QX-12-C-0059"],"award-info":[{"award-number":["W911QX-12-C-0059"]}]},{"name":"UC Lab Fees Research Program Award","award":["12-LR-238449"],"award-info":[{"award-number":["12-LR-238449"]}]},{"name":"National Science Foundation awards","award":["CCF-1017399, OCI-1032859, CCF-1629657"],"award-info":[{"award-number":["CCF-1017399, OCI-1032859, CCF-1629657"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Parallel Comput."],"published-print":{"date-parts":[[2017,3,31]]},"abstract":"<jats:p>For large-scale graph analytics on the GPU, the irregularity of data access and control flow, and the complexity of programming GPUs, have presented two significant challenges to developing a programmable high-performance graph library. \u201cGunrock,\u201d our graph-processing system designed specifically for the GPU, uses a high-level, bulk-synchronous, data-centric abstraction focused on operations on a vertex or edge frontier. Gunrock achieves a balance between performance and expressiveness by coupling high-performance GPU computing primitives and optimization strategies with a high-level programming model that allows programmers to quickly develop new graph primitives with small code size and minimal GPU programming knowledge. We characterize the performance of various optimization strategies and evaluate Gunrock\u2019s overall performance on different GPU architectures on a wide range of graph primitives that span from traversal-based algorithms and ranking algorithms, to triangle counting and bipartite-graph-based algorithms. The results show that on a single GPU, Gunrock has on average at least an order of magnitude speedup over Boost and PowerGraph, comparable performance to the fastest GPU hardwired primitives and CPU shared-memory graph libraries, such as Ligra and Galois, and better performance than any other GPU high-level graph library.<\/jats:p>","DOI":"10.1145\/3108140","type":"journal-article","created":{"date-parts":[[2017,8,24]],"date-time":"2017-08-24T11:49:04Z","timestamp":1503575344000},"page":"1-49","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":115,"title":["Gunrock"],"prefix":"10.1145","volume":"4","author":[{"given":"Yangzihao","family":"Wang","sequence":"first","affiliation":[{"name":"University of California, Davis, CA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Yuechao","family":"Pan","sequence":"additional","affiliation":[{"name":"University of California, Davis, CA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Andrew","family":"Davidson","sequence":"additional","affiliation":[{"name":"University of California, Davis, CA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Yuduo","family":"Wu","sequence":"additional","affiliation":[{"name":"University of California, Davis, CA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Carl","family":"Yang","sequence":"additional","affiliation":[{"name":"University of California, Davis, CA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Leyuan","family":"Wang","sequence":"additional","affiliation":[{"name":"University of California, Davis, CA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Muhammad","family":"Osama","sequence":"additional","affiliation":[{"name":"University of California, Davis, CA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Chenshan","family":"Yuan","sequence":"additional","affiliation":[{"name":"University of California, Davis, CA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Weitang","family":"Liu","sequence":"additional","affiliation":[{"name":"University of California, Davis, CA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Andy T.","family":"Riffel","sequence":"additional","affiliation":[{"name":"University of California, Davis, CA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"John D.","family":"Owens","sequence":"additional","affiliation":[{"name":"University of California, Davis, CA"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"320","published-online":{"date-parts":[[2017,8,23]]},"reference":[{"key":"e_1_2_1_1_1","volume-title":"EmptyHeaded: Boolean algebra based graph processing. CoRR abs\/1503.02368","author":"Aberger Christopher R.","year":"2015","unstructured":"Christopher R. Aberger , Andres N\u00f6tzli , Kunle Olukotun , and Christopher R\u00e9. 2015. EmptyHeaded: Boolean algebra based graph processing. CoRR abs\/1503.02368 ( 2015 ). http:\/\/arxiv.org\/abs\/1503.02368 Christopher R. Aberger, Andres N\u00f6tzli, Kunle Olukotun, and Christopher R\u00e9. 2015. EmptyHeaded: Boolean algebra based graph processing. CoRR abs\/1503.02368 (2015). http:\/\/arxiv.org\/abs\/1503.02368"},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/2851141.2851169"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPS.2011.59"},{"key":"e_1_2_1_4_1","unstructured":"Sean Baxter. 2013. Modern GPU Multisets. Retrieved from https:\/\/nvlabs.github.io\/moderngpu\/sets.html.  Sean Baxter. 2013. Modern GPU Multisets. Retrieved from https:\/\/nvlabs.github.io\/moderngpu\/sets.html."},{"key":"e_1_2_1_5_1","unstructured":"Sean Baxter. 2013--2016. Moderngpu: Patterns and Behaviors for GPU Computing. Retrieved from http:\/\/moderngpu.github.io\/moderngpu.  Sean Baxter. 2013--2016. Moderngpu: Patterns and Behaviors for GPU Computing. Retrieved from http:\/\/moderngpu.github.io\/moderngpu."},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.5555\/2388996.2389013"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/1572769.1572795"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1080\/0022250X.2001.9990249"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/ASONAM.2010.80"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/IISWC.2012.6402918"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/IISWC.2012.6402918"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPDS.2014.2330597"},{"key":"e_1_2_1_13_1","volume-title":"Graphics Hardware","author":"Cederman Daniel","year":"2008","unstructured":"Daniel Cederman and Philippas Tsigas . 2008. On dynamic load-balancing on graphics processors . In Graphics Hardware 2008 . 57--64. Daniel Cederman and Philippas Tsigas. 2008. On dynamic load-balancing on graphics processors. In Graphics Hardware 2008. 57--64."},{"key":"e_1_2_1_14_1","volume-title":"Proceedings of the 49th Annual IEEE\/ACM International Symposium on Microarchitecture (MICRO\u201916)","author":"Chen Yunji","year":"2016","unstructured":"Yunji Chen , Tao Luo , Shaoli Liu , Shijin Zhang , Liqiang He , Jia Wang , Ling Li , Tianshi Chen , Zhiwei Xu , Ninghui Sun , and Olivier Temam . 2016 . Graphicionado: A high-performance and energy-efficient accelerator for graph analytics . In Proceedings of the 49th Annual IEEE\/ACM International Symposium on Microarchitecture (MICRO\u201916) . IEEE Computer Society, Washington, DC. Yunji Chen, Tao Luo, Shaoli Liu, Shijin Zhang, Liqiang He, Jia Wang, Ling Li, Tianshi Chen, Zhiwei Xu, Ninghui Sun, and Olivier Temam. 2016. Graphicionado: A high-performance and energy-efficient accelerator for graph analytics. In Proceedings of the 49th Annual IEEE\/ACM International Symposium on Microarchitecture (MICRO\u201916). IEEE Computer Society, Washington, DC."},{"key":"e_1_2_1_15_1","volume-title":"GPU Technology Conference.","author":"Cohen Jonathan","year":"2012","unstructured":"Jonathan Cohen and Patrice CastonGuay . 2012 . Efficient graph matching and coloring on the GPU . GPU Technology Conference. Retrieved from http:\/\/on-demand.gputechconf.com\/gtc\/2012\/presentations\/S0332-Efficient-Graph-Matching-and-Coloring-on-GPUs.pdf. Jonathan Cohen and Patrice CastonGuay. 2012. Efficient graph matching and coloring on the GPU. GPU Technology Conference. Retrieved from http:\/\/on-demand.gputechconf.com\/gtc\/2012\/presentations\/S0332-Efficient-Graph-Matching-and-Coloring-on-GPUs.pdf."},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/800195.805928"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPS.2014.45"},{"key":"e_1_2_1_18_1","volume-title":"Werneck","author":"Delling Daniel","year":"2010","unstructured":"Daniel Delling , Andrew V. Goldberg , Andreas Nowatzyk , and Renato F . Werneck . 2010 . PHAST : Hardware-accelerated shortest path trees. J. Parallel and Distrib. Comput . 73 (Sept. 2010), 940--952. Daniel Delling, Andrew V. Goldberg, Andreas Nowatzyk, and Renato F. Werneck. 2010. PHAST: Hardware-accelerated shortest path trees. J. Parallel and Distrib. Comput. 73 (Sept. 2010), 940--952."},{"key":"e_1_2_1_19_1","unstructured":"Erich Elsen and Vishal Vaidyanathan. 2013. A vertex-centric CUDA\/C++ API for large graph analytics on GPUs using the Gather-Apply-Scatter abstraction. Retrieved from http:\/\/www.github.com\/RoyalCaliber\/vertexAPI2.  Erich Elsen and Vishal Vaidyanathan. 2013. A vertex-centric CUDA\/C++ API for large graph analytics on GPUs using the Gather-Apply-Scatter abstraction. Retrieved from http:\/\/www.github.com\/RoyalCaliber\/vertexAPI2."},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/2621934.2621936"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/2660460.2660481"},{"key":"e_1_2_1_22_1","volume-title":"Scott Sallinen, and Matei Ripeanu.","author":"Gharaibeh Abdullah","year":"2014","unstructured":"Abdullah Gharaibeh , Tahsin Reza , Elizeu Santos-Neto , Lauro Beltrao Costa , Scott Sallinen, and Matei Ripeanu. 2014 . Efficient large-scale graph processing on hybrid CPU and GPU systems. CoRR abs\/1312.3018, 1312.3018v2 (Dec. 2014). arXiv:1312.3018v2 Abdullah Gharaibeh, Tahsin Reza, Elizeu Santos-Neto, Lauro Beltrao Costa, Scott Sallinen, and Matei Ripeanu. 2014. Efficient large-scale graph processing on hybrid CPU and GPU systems. CoRR abs\/1312.3018, 1312.3018v2 (Dec. 2014). arXiv:1312.3018v2"},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1287\/inte.2014.0784"},{"key":"e_1_2_1_24_1","volume-title":"Proceedings of the 10th USENIX Conference on Operating Systems Design and Implementation (OSDI\u201912)","author":"Gonzalez Joseph E.","year":"2012","unstructured":"Joseph E. Gonzalez , Yucheng Low , Haijie Gu , Danny Bickson , and Carlos Guestrin . 2012 . PowerGraph: Distributed graph-parallel computation on natural graphs . In Proceedings of the 10th USENIX Conference on Operating Systems Design and Implementation (OSDI\u201912) . USENIX Association, 17--30. Joseph E. Gonzalez, Yucheng Low, Haijie Gu, Danny Bickson, and Carlos Guestrin. 2012. PowerGraph: Distributed graph-parallel computation on natural graphs. In Proceedings of the 10th USENIX Conference on Operating Systems Design and Implementation (OSDI\u201912). USENIX Association, 17--30."},{"key":"e_1_2_1_25_1","volume-title":"Proceedings of the 11th USENIX Conference on Operating Systems Design and Implementation (OSDI\u201914)","author":"Gonzalez Joseph E.","year":"2014","unstructured":"Joseph E. Gonzalez , Reynold S. Xin , Ankur Dave , Daniel Crankshaw , Michael J. Franklin , and Ion Stoica . 2014 . GraphX: Graph processing in a distributed dataflow framework . In Proceedings of the 11th USENIX Conference on Operating Systems Design and Implementation (OSDI\u201914) . USENIX Association, Berkeley, CA, 599--613. Retrieved from http:\/\/dl.acm.org\/citation.cfm?id&equals;2685048.2685096 Joseph E. Gonzalez, Reynold S. Xin, Ankur Dave, Daniel Crankshaw, Michael J. Franklin, and Ion Stoica. 2014. GraphX: Graph processing in a distributed dataflow framework. In Proceedings of the 11th USENIX Conference on Operating Systems Design and Implementation (OSDI\u201914). USENIX Association, Berkeley, CA, 599--613. Retrieved from http:\/\/dl.acm.org\/citation.cfm?id&equals;2685048.2685096"},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/2304576.2304621"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/2567634.2567635"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.5555\/2688283.2688284"},{"key":"e_1_2_1_29_1","volume-title":"Proceedings of the Conference on Parallel Object-Oriented Scientific Computing (POOSC\u201905)","author":"Gregor Douglas","year":"2005","unstructured":"Douglas Gregor and Andrew Lumsdaine . 2005 . The parallel BGL: A generic library for distributed graph computations . In Proceedings of the Conference on Parallel Object-Oriented Scientific Computing (POOSC\u201905) . Douglas Gregor and Andrew Lumsdaine. 2005. The parallel BGL: A generic library for distributed graph computations. In Proceedings of the Conference on Parallel Object-Oriented Scientific Computing (POOSC\u201905)."},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/181014.181021"},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/2488388.2488433"},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1145\/2463676.2465300"},{"key":"e_1_2_1_33_1","volume-title":"Proceedings of the 14th International Conference on High Performance Computing (HiPC\u201907)","author":"Harish Pawan","unstructured":"Pawan Harish and P. J. Narayanan . 2007. Accelerating large graph algorithms on the GPU using CUDA . In Proceedings of the 14th International Conference on High Performance Computing (HiPC\u201907) . Springer-Verlag, Berlin, 197--208. Pawan Harish and P. J. Narayanan. 2007. Accelerating large graph algorithms on the GPU using CUDA. In Proceedings of the 14th International Conference on High Performance Computing (HiPC\u201907). Springer-Verlag, Berlin, 197--208."},{"key":"e_1_2_1_34_1","volume-title":"2009--2016","author":"Harris Mark","year":"2009","unstructured":"Mark Harris , John D. Owens , Shubho Sengupta , Yao Zhang , and Andrew Davidson . 2009--2016 . CUDPP : CUDA Data Parallel Primitives Library . ( 2009 --2016). Retrieved from http:\/\/cudpp.github.io\/. Mark Harris, John D. Owens, Shubho Sengupta, Yao Zhang, and Andrew Davidson. 2009--2016. CUDPP: CUDA Data Parallel Primitives Library. (2009--2016). Retrieved from http:\/\/cudpp.github.io\/."},{"key":"e_1_2_1_35_1","volume-title":"Proceedings of the 2010 IEEE International Symposium on Parallel and Distributed Processing (IPDPS\u201910)","author":"He Zhengyu","year":"2010","unstructured":"Zhengyu He and Bo Hong . 2010 . Dynamically tuned push-relabel algorithm for the maximum flow problem on CPU-GPU-hybrid platforms . In Proceedings of the 2010 IEEE International Symposium on Parallel and Distributed Processing (IPDPS\u201910) . Georgia Institute of Technology, Atlanta, United States. Zhengyu He and Bo Hong. 2010. Dynamically tuned push-relabel algorithm for the maximum flow problem on CPU-GPU-hybrid platforms. In Proceedings of the 2010 IEEE International Symposium on Parallel and Distributed Processing (IPDPS\u201910). Georgia Institute of Technology, Atlanta, United States."},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1145\/2150976.2151013"},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1145\/1941553.1941590"},{"key":"e_1_2_1_38_1","first-page":"15","article-title":"Edge v. node parallelism for graph centrality metrics. In GPU Computing Gems Jade Edition, Wen-mei W. Hwu (Ed.). Morgan Kaufmann","volume":"2","author":"Jia Yuntao","year":"2011","unstructured":"Yuntao Jia , Victor Lu , Jared Hoberock , Michael Garland , and John C. Hart . 2011 . Edge v. node parallelism for graph centrality metrics. In GPU Computing Gems Jade Edition, Wen-mei W. Hwu (Ed.). Morgan Kaufmann , Chapter 2 , 15 -- 28 . Yuntao Jia, Victor Lu, Jared Hoberock, Michael Garland, and John C. Hart. 2011. Edge v. node parallelism for graph centrality metrics. In GPU Computing Gems Jade Edition, Wen-mei W. Hwu (Ed.). Morgan Kaufmann, Chapter 2, 15--28.","journal-title":"Chapter"},{"key":"e_1_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1145\/2716282.2716289"},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1109\/MM.2011.89"},{"key":"e_1_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPEC.2016.7761646"},{"key":"e_1_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1145\/2600212.2600227"},{"key":"e_1_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1145\/1772690.1772751"},{"key":"e_1_2_1_44_1","volume-title":"Proceedings of the 10th USENIX Conference on Operating Systems Design and Implementation (OSDI\u201912)","author":"Kyrola Aapo","year":"2012","unstructured":"Aapo Kyrola , Guy Blelloch , and Carlos Guestrin . 2012 . GraphChi: Large-scale graph computation on just a PC . In Proceedings of the 10th USENIX Conference on Operating Systems Design and Implementation (OSDI\u201912) . USENIX Association, Berkeley, CA, 31--46. http:\/\/dl.acm.org\/citation.cfm?id&equals;2387880.2387884 Aapo Kyrola, Guy Blelloch, and Carlos Guestrin. 2012. GraphChi: Large-scale graph computation on just a PC. In Proceedings of the 10th USENIX Conference on Operating Systems Design and Implementation (OSDI\u201912). USENIX Association, Berkeley, CA, 31--46. http:\/\/dl.acm.org\/citation.cfm?id&equals;2387880.2387884"},{"key":"e_1_2_1_45_1","volume-title":"2009--2016","author":"Leskovec Jure","year":"2009","unstructured":"Jure Leskovec . 2009--2016 . SNAP : Stanford Large Network Dataset Collection ( 2009 --2016). Retrieved from http:\/\/snap.stanford.edu\/data\/. Jure Leskovec. 2009--2016. SNAP: Stanford Large Network Dataset Collection (2009--2016). Retrieved from http:\/\/snap.stanford.edu\/data\/."},{"key":"e_1_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1145\/2898361"},{"key":"e_1_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1145\/2807591.2807594"},{"key":"e_1_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1145\/2882903.2882959"},{"key":"e_1_2_1_49_1","volume-title":"Proceedings of the 26th Annual Conference on Uncertainty in Artificial Intelligence (UAI\u201910)","author":"Low Yucheng","unstructured":"Yucheng Low , Joseph Gonzalez , Aapo Kyrola , Danny Bickson , Carlos Guestrin , and Joseph M. Hellerstein . 2010. GraphLab: A new parallel framework for machine learning . In Proceedings of the 26th Annual Conference on Uncertainty in Artificial Intelligence (UAI\u201910) . 340--349. Yucheng Low, Joseph Gonzalez, Aapo Kyrola, Danny Bickson, Carlos Guestrin, and Joseph M. Hellerstein. 2010. GraphLab: A new parallel framework for machine learning. In Proceedings of the 26th Annual Conference on Uncertainty in Artificial Intelligence (UAI\u201910). 340--349."},{"key":"e_1_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.1145\/1807167.1807184"},{"key":"e_1_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.1145\/2567634.2567638"},{"key":"e_1_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.1109\/SC.2014.52"},{"key":"e_1_2_1_53_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICPADS.2015.10"},{"key":"e_1_2_1_54_1","volume-title":"Proceedings of the IEEE High Performance Extreme Computing Conference (HPEC\u201915)","author":"McLaughlin A.","unstructured":"A. McLaughlin , J. Riedy , and D. A. Bader . 2015. A fast, energy-efficient abstraction for simultaneous breadth-first searches . In Proceedings of the IEEE High Performance Extreme Computing Conference (HPEC\u201915) . 1--6. A. McLaughlin, J. Riedy, and D. A. Bader. 2015. A fast, energy-efficient abstraction for simultaneous breadth-first searches. In Proceedings of the IEEE High Performance Extreme Computing Conference (HPEC\u201915). 1--6."},{"key":"e_1_2_1_55_1","doi-asserted-by":"publisher","DOI":"10.1145\/2145816.2145832"},{"key":"e_1_2_1_56_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0196-6774(03)00076-2"},{"key":"e_1_2_1_57_1","doi-asserted-by":"publisher","DOI":"10.1145\/2517349.2522739"},{"key":"e_1_2_1_58_1","doi-asserted-by":"publisher","DOI":"10.1145\/2983990.2984015"},{"key":"e_1_2_1_59_1","volume-title":"Owens","author":"Pan Yuechao","year":"2016","unstructured":"Yuechao Pan , Yangzihao Wang , Yuduo Wu , Carl Yang , and John D . Owens . 2016 . Multi-GPU graph analytics. CoRR abs\/1504.04804, 1504.04804v3 (April 2016). arxiv:cs.DC\/1504.04804v3 Yuechao Pan, Yangzihao Wang, Yuduo Wu, Carl Yang, and John D. Owens. 2016. Multi-GPU graph analytics. CoRR abs\/1504.04804, 1504.04804v3 (April 2016). arxiv:cs.DC\/1504.04804v3"},{"key":"e_1_2_1_60_1","volume-title":"Proceedings of the IEEE Conference on High Performance Embedded Computing.","author":"Pushkar","unstructured":"Pushkar R. Pande and David A. Bader. 2011. Computing betweenness centrality for small world networks on a GPU . In Proceedings of the IEEE Conference on High Performance Embedded Computing. Pushkar R. Pande and David A. Bader. 2011. Computing betweenness centrality for small world networks on a GPU. In Proceedings of the IEEE Conference on High Performance Embedded Computing."},{"key":"e_1_2_1_61_1","doi-asserted-by":"publisher","DOI":"10.1109\/SC.2014.50"},{"key":"e_1_2_1_62_1","doi-asserted-by":"publisher","DOI":"10.1145\/1993498.1993501"},{"key":"e_1_2_1_63_1","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPSW.2016.108"},{"key":"e_1_2_1_64_1","doi-asserted-by":"publisher","DOI":"10.1145\/2517349.2522740"},{"key":"e_1_2_1_65_1","doi-asserted-by":"publisher","DOI":"10.1145\/2621934.2621938"},{"key":"e_1_2_1_66_1","volume-title":"Accelerating direction-optimized breadth first search on hybrid architectures. CoRR abs\/1503.04359, 1503.04359v1 (March","author":"Sallinen Scott","year":"2015","unstructured":"Scott Sallinen , Abdullah Gharaibeh , and Matei Ripeanu . 2015. Accelerating direction-optimized breadth first search on hybrid architectures. CoRR abs\/1503.04359, 1503.04359v1 (March 2015 ). arxiv:cs.DC\/1503.04359v1 Scott Sallinen, Abdullah Gharaibeh, and Matei Ripeanu. 2015. Accelerating direction-optimized breadth first search on hybrid architectures. CoRR abs\/1503.04359, 1503.04359v1 (March 2015). arxiv:cs.DC\/1503.04359v1"},{"key":"e_1_2_1_67_1","doi-asserted-by":"publisher","DOI":"10.1145\/2458523.2458531"},{"key":"e_1_2_1_68_1","doi-asserted-by":"publisher","DOI":"10.1007\/11427186_54"},{"key":"e_1_2_1_69_1","doi-asserted-by":"publisher","DOI":"10.1145\/2688500.2688526"},{"key":"e_1_2_1_70_1","doi-asserted-by":"publisher","DOI":"10.1145\/2688500.2688542"},{"key":"e_1_2_1_71_1","doi-asserted-by":"publisher","DOI":"10.1145\/2442516.2442530"},{"key":"e_1_2_1_72_1","first-page":"1063","volume-title":"Proceedings of the IEEE 31st International Conference on Data Engineering. 149--160","author":"Shun J.","unstructured":"J. Shun and K. Tangwongsan . 2015. Multicore triangle computations without tuning . In Proceedings of the IEEE 31st International Conference on Data Engineering. 149--160 . 1063 - 6382 J. Shun and K. Tangwongsan. 2015. Multicore triangle computations without tuning. In Proceedings of the IEEE 31st International Conference on Data Engineering. 149--160. 1063-6382"},{"key":"e_1_2_1_73_1","volume-title":"The Boost Graph Library: User Guide and Reference Manual","author":"Siek Jeremy G.","unstructured":"Jeremy G. Siek , Lie-Quan Lee , and Andrew Lumsdaine . 2001. The Boost Graph Library: User Guide and Reference Manual . Addison-Wesley . Jeremy G. Siek, Lie-Quan Lee, and Andrew Lumsdaine. 2001. The Boost Graph Library: User Guide and Reference Manual. Addison-Wesley."},{"key":"e_1_2_1_74_1","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPS.2014.64"},{"key":"e_1_2_1_75_1","volume-title":"Workshops and PhD Forum (IPDPSW\u201910)","author":"Soman Jyothish","unstructured":"Jyothish Soman , Kothapalli Kishore , and P. J. Narayanan . 2010. A fast GPU algorithm for graph connectivity. In pRO24th IEEE International Symposium on Parallel and Distributed Processing , Workshops and PhD Forum (IPDPSW\u201910) . 1--8. Jyothish Soman, Kothapalli Kishore, and P. J. Narayanan. 2010. A fast GPU algorithm for graph connectivity. In pRO24th IEEE International Symposium on Parallel and Distributed Processing, Workshops and PhD Forum (IPDPSW\u201910). 1--8."},{"key":"e_1_2_1_76_1","doi-asserted-by":"publisher","DOI":"10.14778\/2311906.2311907"},{"key":"e_1_2_1_77_1","doi-asserted-by":"publisher","DOI":"10.1007\/3-540-45571-X"},{"key":"e_1_2_1_78_1","doi-asserted-by":"publisher","DOI":"10.1109\/MC.2012.255"},{"key":"e_1_2_1_79_1","doi-asserted-by":"publisher","DOI":"10.1145\/79173.79181"},{"key":"e_1_2_1_80_1","volume-title":"Proceedings of the Conference on Innovative data Systems Research (CIDR\u201913)","author":"Wang Guozhang","year":"2013","unstructured":"Guozhang Wang , Wenlei Xie , Alan J. Demers , and Johannes Gehrke . 2013 . Asynchronous large-scale graph processing made easy . In Proceedings of the Conference on Innovative data Systems Research (CIDR\u201913) . www.cidrdb.org. Guozhang Wang, Wenlei Xie, Alan J. Demers, and Johannes Gehrke. 2013. Asynchronous large-scale graph processing made easy. In Proceedings of the Conference on Innovative data Systems Research (CIDR\u201913). www.cidrdb.org."},{"key":"e_1_2_1_81_1","doi-asserted-by":"publisher","DOI":"10.1145\/2915516.2915521"},{"key":"e_1_2_1_82_1","doi-asserted-by":"publisher","DOI":"10.1145\/2851141.2851145"},{"key":"e_1_2_1_83_1","doi-asserted-by":"publisher","DOI":"10.1145\/2442516.2442523"},{"key":"e_1_2_1_84_1","doi-asserted-by":"publisher","DOI":"10.1109\/IISWC.2015.13"},{"key":"e_1_2_1_85_1","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPSW.2015.77"},{"key":"e_1_2_1_86_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPDS.2013.111"}],"container-title":["ACM Transactions on Parallel Computing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3108140","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3108140","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T02:13:43Z","timestamp":1750212823000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3108140"}},"subtitle":["GPU Graph Analytics"],"short-title":[],"issued":{"date-parts":[[2017,3,31]]},"references-count":86,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2017,3,31]]}},"alternative-id":["10.1145\/3108140"],"URL":"https:\/\/doi.org\/10.1145\/3108140","relation":{},"ISSN":["2329-4949","2329-4957"],"issn-type":[{"value":"2329-4949","type":"print"},{"value":"2329-4957","type":"electronic"}],"subject":[],"published":{"date-parts":[[2017,3,31]]},"assertion":[{"value":"2016-12-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2017-08-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2017-08-23","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}