{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T04:08:19Z","timestamp":1750306099128,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":38,"publisher":"ACM","license":[{"start":{"date-parts":[[2017,10,14]],"date-time":"2017-10-14T00:00:00Z","timestamp":1507939200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["CCF-1513201, CCF-1423108"],"award-info":[{"award-number":["CCF-1513201, CCF-1423108"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2017,10,14]]},"DOI":"10.1145\/3123939.3123976","type":"proceedings-article","created":{"date-parts":[[2017,11,20]],"date-time":"2017-11-20T14:31:12Z","timestamp":1511188272000},"page":"600-611","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":26,"title":["Wireframe"],"prefix":"10.1145","author":[{"given":"AmirAli","family":"Abdolrashidi","sequence":"first","affiliation":[{"name":"University of California"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Devashree","family":"Tripathy","sequence":"additional","affiliation":[{"name":"University of California"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Mehmet Esat","family":"Belviranli","sequence":"additional","affiliation":[{"name":"University of California"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Laxmi Narayan","family":"Bhuyan","sequence":"additional","affiliation":[{"name":"University of California"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Daniel","family":"Wong","sequence":"additional","affiliation":[{"name":"University of California"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2017,10,14]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"2012. Dynamic Parallelism in CUDA. http:\/\/developer.download.nvidia.com\/assets\/cuda\/docs\/TechBrief_Dynamic_Parallelism_in_CUDA_v2.pdf. (2012).  2012. Dynamic Parallelism in CUDA. http:\/\/developer.download.nvidia.com\/assets\/cuda\/docs\/TechBrief_Dynamic_Parallelism_in_CUDA_v2.pdf. (2012)."},{"key":"e_1_3_2_1_2_1","unstructured":"2016. CUDA Programming Guide. https:\/\/docs.nvidia.com\/cuda\/cuda-c-programming-guide\/. (2016). Accessed: 09-27-2016.  2016. CUDA Programming Guide. https:\/\/docs.nvidia.com\/cuda\/cuda-c-programming-guide\/. (2016). Accessed: 09-27-2016."},{"key":"e_1_3_2_1_3_1","unstructured":"2017. CUDA 9 Features Revealed: Volta Cooperative Groups and More. https:\/\/devblogs.nvidia.com\/parallelforall\/cuda-9-features-revealed\/. (2017).  2017. CUDA 9 Features Revealed: Volta Cooperative Groups and More. https:\/\/devblogs.nvidia.com\/parallelforall\/cuda-9-features-revealed\/. (2017)."},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/2768177.2768184"},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISPASS.2009.4919648"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/2608020.2608024"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/2751205.2751243"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/2398856.2364563"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/IVS.2010.5548142"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/2830772.2830818"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPS.2010.5470413"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.parco.2012.03.004"},{"volume-title":"KLAP: Kernel Launch Aggregation and Promotion for Optimizing Dynamic Parallelism. In MICRO'16","author":"El Izzat","key":"e_1_3_2_1_13_1","unstructured":"Izzat El Hajj et al. 2016 . KLAP: Kernel Launch Aggregation and Promotion for Optimizing Dynamic Parallelism. In MICRO'16 . Izzat El Hajj et al. 2016. KLAP: Kernel Launch Aggregation and Promotion for Optimizing Dynamic Parallelism. In MICRO'16."},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/MICRO.2010.13"},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10766-016-0426-5"},{"volume-title":"Support for dependency driven executions among openmp tasks","author":"Ghosh Priyanka","key":"e_1_3_2_1_16_1","unstructured":"Priyanka Ghosh , Yonghong Yan , and Barbara Chapman . 2012. Support for dependency driven executions among openmp tasks . IEEE. Priyanka Ghosh, Yonghong Yan, and Barbara Chapman. 2012. Support for dependency driven executions among openmp tasks. IEEE."},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-40698-0_10"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/2155620.2155628"},{"key":"e_1_3_2_1_19_1","volume-title":"Innovative Parallel Computing (InPar)","author":"Gupta Kshitij","year":"2012","unstructured":"Kshitij Gupta , Jeff A Stuart , and John D Owens . 2012. A study of persistent threads style GPU programming for GPGPU workloads . In Innovative Parallel Computing (InPar) , 2012 . IEEE , 1--14. Kshitij Gupta, Jeff A Stuart, and John D Owens. 2012. A study of persistent threads style GPU programming for GPGPU workloads. In Innovative Parallel Computing (InPar), 2012. IEEE, 1--14."},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/2451116.2451158"},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/360827.360844"},{"volume-title":"Dynamic Time Warping","author":"M\u00fcller Meinard","key":"e_1_3_2_1_22_1","unstructured":"Meinard M\u00fcller . 2007. Dynamic Time Warping . Springer Berlin Heidelberg , 69--84. Meinard M\u00fcller. 2007. Dynamic Time Warping. Springer Berlin Heidelberg, 69--84."},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-37410-4_23"},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/MICRO.2012.16"},{"volume-title":"CUDA by example: an introduction to general-purpose GPU programming","author":"Sanders Jason","key":"e_1_3_2_1_25_1","unstructured":"Jason Sanders and Edward Kandrot . 2010. CUDA by example: an introduction to general-purpose GPU programming . Addison-Wesley Professional . Jason Sanders and Edward Kandrot. 2010. CUDA by example: an introduction to general-purpose GPU programming. Addison-Wesley Professional."},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPDS.2012.194"},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.5555\/2830689.2830709"},{"volume-title":"Controlled Kernel Launch for Dynamic Parallelism in GPUs. In 2017 IEEE 23rd International Symposium on High Performance Computer Architecture (HPCA).","author":"Tang Xulong","key":"e_1_3_2_1_28_1","unstructured":"Xulong Tang , Ashutosh Pattnaik , Huaipan Jiang , Onur Kayiran , Adwait Jog , Sreepathi Pai , Mohamed Ibrahim , Mahmut T. Kandemir , and Chita R. Das . 2017 . Controlled Kernel Launch for Dynamic Parallelism in GPUs. In 2017 IEEE 23rd International Symposium on High Performance Computer Architecture (HPCA). Xulong Tang, Ashutosh Pattnaik, Huaipan Jiang, Onur Kayiran, Adwait Jog, Sreepathi Pai, Mohamed Ibrahim, Mahmut T. Kandemir, and Chita R. Das. 2017. Controlled Kernel Launch for Dynamic Parallelism in GPUs. In 2017 IEEE 23rd International Symposium on High Performance Computer Architecture (HPCA)."},{"key":"e_1_3_2_1_29_1","unstructured":"David Tarjan Kevin Skadron and Paulius Micikevicius. {n. d.}. The art of performance tuning for CUDA and manycore architectures. ({n. d.}).  David Tarjan Kevin Skadron and Paulius Micikevicius. {n. d.}. The art of performance tuning for CUDA and manycore architectures. ({n. d.})."},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1109\/MC.2012.255"},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-11454-5_2"},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPDS.2015.2487346"},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISCA.2016.57"},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1145\/2749469.2750393"},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1007\/BF01407876"},{"key":"e_1_3_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPS.2010.5470477"},{"key":"e_1_3_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1145\/2442516.2442539"},{"key":"e_1_3_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1145\/2692916.2555254"}],"event":{"name":"MICRO-50: The 50th Annual IEEE\/ACM International Symposium on Microarchitecture","sponsor":["SIGMICRO ACM Special Interest Group on Microarchitectural Research and Processing","IEEE-CS\\DATC IEEE Computer Society"],"location":"Cambridge Massachusetts","acronym":"MICRO-50"},"container-title":["Proceedings of the 50th Annual IEEE\/ACM International Symposium on Microarchitecture"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3123939.3123976","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3123939.3123976","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3123939.3123976","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T03:30:31Z","timestamp":1750217431000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3123939.3123976"}},"subtitle":["supporting data-dependent parallelism through dependency graph execution in GPUs"],"short-title":[],"issued":{"date-parts":[[2017,10,14]]},"references-count":38,"alternative-id":["10.1145\/3123939.3123976","10.1145\/3123939"],"URL":"https:\/\/doi.org\/10.1145\/3123939.3123976","relation":{},"subject":[],"published":{"date-parts":[[2017,10,14]]},"assertion":[{"value":"2017-10-14","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}