{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T04:53:43Z","timestamp":1750308823074,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":27,"publisher":"ACM","license":[{"start":{"date-parts":[[2011,3,5]],"date-time":"2011-03-05T00:00:00Z","timestamp":1299283200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/100000149","name":"Division of Engineering Education and Centers","doi-asserted-by":"publisher","award":["EEC-0946463"],"award-info":[{"award-number":["EEC-0946463"]}],"id":[{"id":"10.13039\/100000149","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2011,3,5]]},"DOI":"10.1145\/1964179.1964193","type":"proceedings-article","created":{"date-parts":[[2011,4,1]],"date-time":"2011-04-01T15:55:37Z","timestamp":1301673337000},"page":"1-8","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":19,"title":["Analyzing program flow within a many-kernel OpenCL application"],"prefix":"10.1145","author":[{"given":"Perhaad","family":"Mistry","sequence":"first","affiliation":[{"name":"Northeastern University, Boston, MA"}]},{"given":"Chris","family":"Gregg","sequence":"additional","affiliation":[{"name":"University of Virginia, Charlottesville, VA"}]},{"given":"Norman","family":"Rubin","sequence":"additional","affiliation":[{"name":"Advanced Micro Devices, Boxborough, MA"}]},{"given":"David","family":"Kaeli","sequence":"additional","affiliation":[{"name":"Northeastern University, Boston, MA"}]},{"given":"Kim","family":"Hazelwood","sequence":"additional","affiliation":[{"name":"University of Virginia, Charlottesville, VA"}]}],"member":"320","published-online":{"date-parts":[[2011,3,5]]},"reference":[{"volume-title":"version 2.0","author":"Guide CUDA","key":"e_1_3_2_1_1_1","unstructured":"CUDA programming Guide , version 2.0 . NVIDIA Corporation . CUDA programming Guide, version 2.0. NVIDIA Corporation."},{"key":"e_1_3_2_1_2_1","unstructured":"Cuda Visual Profiler. NVIDIA Corporation.  Cuda Visual Profiler. NVIDIA Corporation ."},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/265924.265925"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1007\/11744023_32"},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-006-0002-3"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/1186562.1015800"},{"key":"e_1_3_2_1_7_1","unstructured":"J. Burkardt. Example avi files. World Wide Web.  J. Burkardt. Example avi files. World Wide Web."},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICME.2007.4285037"},{"key":"e_1_3_2_1_9_1","volume-title":"World Wide Web","author":"Cowgill M.","year":"2009","unstructured":"M. Cowgill . Opensurf gpu enhancement . World Wide Web , 2009 . M. Cowgill. Opensurf gpu enhancement. World Wide Web, 2009."},{"key":"e_1_3_2_1_10_1","first-page":"749628","volume-title":"Proc. of SPIE Vol","volume":"7496","author":"Du G.","year":"2009","unstructured":"G. Du , F. Su , and A. Cai . Face recognition using SURF features . In Proc. of SPIE Vol , volume 7496 , pages 749628 -- 749621 , 2009 . G. Du, F. Su, and A. Cai. Face recognition using SURF features. In Proc. of SPIE Vol, volume 7496, pages 749628--1, 2009."},{"key":"e_1_3_2_1_12_1","volume-title":"ECE1724 Project Speeded-Up Speeded-Up Robust Features","author":"Furgale P.","year":"2009","unstructured":"P. Furgale , C. Tong , and G. Kenway . ECE1724 Project Speeded-Up Speeded-Up Robust Features . 2009 . P. Furgale, C. Tong, and G. Kenway. ECE1724 Project Speeded-Up Speeded-Up Robust Features. 2009."},{"key":"e_1_3_2_1_13_1","volume-title":"Proceedings of the International Conference on Parallel Computing (ParCo 07)","volume":"15","author":"Furlinger K.","unstructured":"K. Furlinger and S. Moore . Continuous runtime profiling of OpenMP applications . In Proceedings of the International Conference on Parallel Computing (ParCo 07) (Advances in Parallel Computing , volume 15 . K. Furlinger and S. Moore. Continuous runtime profiling of OpenMP applications. In Proceedings of the International Conference on Parallel Computing (ParCo 07)(Advances in Parallel Computing, volume 15."},{"key":"e_1_3_2_1_14_1","volume-title":"SIGGRAPH ASIA","author":"Gerstmann D.","year":"2009","unstructured":"D. Gerstmann . Opencl event model usage . SIGGRAPH ASIA 2009 . D. Gerstmann. Opencl event model usage. SIGGRAPH ASIA 2009."},{"issue":"39","key":"e_1_3_2_1_15_1","first-page":"851","article-title":"Parallel prefix sum (scan) with CUDA","volume":"3","author":"Harris M.","year":"2007","unstructured":"M. Harris , S. Sengupta , and J. Owens . Parallel prefix sum (scan) with CUDA . GPU Gems , 3 ( 39 ): 851 -- 876 , 2007 . M. Harris, S. Sengupta, and J. Owens. Parallel prefix sum (scan) with CUDA. GPU Gems, 3(39):851--876, 2007.","journal-title":"GPU Gems"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-92990-1_4"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2007.366305"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/1810085.1810105"},{"key":"e_1_3_2_1_19_1","volume-title":"The OpenCL specification version 1.1","author":"Munshi A.","year":"2010","unstructured":"A. Munshi . The OpenCL specification version 1.1 . Khronos OpenCL Working Group , 2010 . A. Munshi. The OpenCL specification version 1.1. Khronos OpenCL Working Group, 2010."},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/1836845.1836904"},{"key":"e_1_3_2_1_21_1","unstructured":"G. Remedy. 2010.  G. Remedy. 2010."},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/1345206.1345220"},{"key":"e_1_3_2_1_23_1","first-page":"122","volume-title":"Workshops","author":"Spafford K.","unstructured":"K. Spafford , J. Meredith , J. Vetter , J. Chen , R. Grout , and R. Sankaran . Accelerating S3D: A GPGPU Case Study. In Euro-Par 2009, Parallel Processing-Workshops. The Netherlands, August 25--28, 2009 , Workshops , page 122 . Not Avail, 2010. K. Spafford, J. Meredith, J. Vetter, J. Chen, R. Grout, and R. Sankaran. Accelerating S3D: A GPGPU Case Study. In Euro-Par 2009, Parallel Processing-Workshops. The Netherlands, August 25--28, 2009, Workshops, page 122. Not Avail, 2010."},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/IISWC.2009.5306788"},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/1513895.1513898"},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/IISWC.2009.5306794"},{"key":"e_1_3_2_1_27_1","volume-title":"TN","author":"Warn S.","year":"2010","unstructured":"S. Warn , W. Emeneker , J. Gauch , J. Cothren , and A. Apon . Accelerating image feature comparisons using cuda on commodity hardware. Knoxville , TN , July 2010 . Symposium on Application Accelerators in High Performance Computing (SAAHPC). S. Warn, W. Emeneker, J. Gauch, J. Cothren, and A. Apon. Accelerating image feature comparisons using cuda on commodity hardware. Knoxville, TN, July 2010. Symposium on Application Accelerators in High Performance Computing (SAAHPC)."},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10766-009-0122-9"}],"event":{"name":"GPGPU-4: Fourth Workshop on General Purpose Processing on Graphics Processing Units","acronym":"GPGPU-4","location":"Newport Beach California USA"},"container-title":["Proceedings of the Fourth Workshop on General Purpose Processing on Graphics Processing Units"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1964179.1964193","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/1964179.1964193","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T20:26:47Z","timestamp":1750278407000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1964179.1964193"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,3,5]]},"references-count":27,"alternative-id":["10.1145\/1964179.1964193","10.1145\/1964179"],"URL":"https:\/\/doi.org\/10.1145\/1964179.1964193","relation":{},"subject":[],"published":{"date-parts":[[2011,3,5]]},"assertion":[{"value":"2011-03-05","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}