{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,9,24]],"date-time":"2025-09-24T09:43:30Z","timestamp":1758707010158,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":25,"publisher":"ACM","license":[{"start":{"date-parts":[[2023,4,18]],"date-time":"2023-04-18T00:00:00Z","timestamp":1681776000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2023,4,18]]},"DOI":"10.1145\/3585341.3585371","type":"proceedings-article","created":{"date-parts":[[2023,4,6]],"date-time":"2023-04-06T14:05:24Z","timestamp":1680789924000},"page":"1-9","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":3,"title":["Towards Alignment of Parallelism in SYCL and ISO C++"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-0237-3823","authenticated-orcid":false,"given":"Simon John","family":"Pennycook","sequence":"first","affiliation":[{"name":"Intel Corporation, United States"}]},{"ORCID":"https:\/\/orcid.org\/0009-0003-1511-2021","authenticated-orcid":false,"given":"Ben","family":"Ashbaugh","sequence":"additional","affiliation":[{"name":"Intel Corporation, United States"}]},{"ORCID":"https:\/\/orcid.org\/0009-0004-7563-1752","authenticated-orcid":false,"given":"James","family":"Brodman","sequence":"additional","affiliation":[{"name":"Intel Corporation, United States"}]},{"ORCID":"https:\/\/orcid.org\/0009-0005-0677-4060","authenticated-orcid":false,"given":"Michael","family":"Kinsner","sequence":"additional","affiliation":[{"name":"Intel Corporation, Canada"}]},{"ORCID":"https:\/\/orcid.org\/0009-0000-2522-0251","authenticated-orcid":false,"given":"Steffen","family":"Larsen","sequence":"additional","affiliation":[{"name":"Intel Corporation (UK) Limited, United Kingdom"}]},{"ORCID":"https:\/\/orcid.org\/0009-0005-4483-0692","authenticated-orcid":false,"given":"Greg","family":"Lueck","sequence":"additional","affiliation":[{"name":"Intel Corporation, United States"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1603-2413","authenticated-orcid":false,"given":"Roland","family":"Schulz","sequence":"additional","affiliation":[{"name":"Intel Corporation, United States"}]},{"ORCID":"https:\/\/orcid.org\/0009-0003-3904-0712","authenticated-orcid":false,"given":"Michael","family":"Voss","sequence":"additional","affiliation":[{"name":"Intel Corporation, United States"}]}],"member":"320","published-online":{"date-parts":[[2023,4,18]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"Ruslan Arutyunyan. 2022. C++17 Parallel Algorithms and P2300. Technical Report. http:\/\/wg21.link\/p2500  Ruslan Arutyunyan. 2022. C++17 Parallel Algorithms and P2300. Technical Report. http:\/\/wg21.link\/p2500"},{"key":"e_1_3_2_1_2_1","volume-title":"International Workshop on OpenCL(IWOCL\u201921)","author":"Ashbaugh Ben","year":"2021","unstructured":"Ben Ashbaugh , James\u00a0 C Brodman , Michael Kinsner , Gregory Lueck , John Pennycook , and Roland Schulz . 2021 . Toward a Better Defined SYCL Memory Consistency Model . In International Workshop on OpenCL(IWOCL\u201921) . Association for Computing Machinery, New York, NY, USA, Article 20, 3\u00a0pages. Ben Ashbaugh, James\u00a0C Brodman, Michael Kinsner, Gregory Lueck, John Pennycook, and Roland Schulz. 2021. Toward a Better Defined SYCL Memory Consistency Model. In International Workshop on OpenCL(IWOCL\u201921). Association for Computing Machinery, New York, NY, USA, Article 20, 3\u00a0pages."},{"key":"e_1_3_2_1_3_1","volume-title":"Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis","author":"Bauer Michael","year":"2011","unstructured":"Michael Bauer , Henry Cook , and Brucek Khailany . 2011 . CudaDMA: Optimizing GPU Memory Bandwidth via Warp Specialization . In Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis ( Seattle, Washington) (SC \u201911). Association for Computing Machinery, New York, NY, USA, Article 12, 11\u00a0pages. Michael Bauer, Henry Cook, and Brucek Khailany. 2011. CudaDMA: Optimizing GPU Memory Bandwidth via Warp Specialization. In Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (Seattle, Washington) (SC \u201911). Association for Computing Machinery, New York, NY, USA, Article 12, 11\u00a0pages."},{"key":"e_1_3_2_1_4_1","first-page":"8","article-title":"Singe","volume":"49","author":"Bauer Michael","year":"2014","unstructured":"Michael Bauer , Sean Treichler , and Alex Aiken . 2014 . Singe : Leveraging Warp Specialization for High Performance on GPUs. SIGPLAN Not. 49 , 8 (feb 2014), 119\u2013130. Michael Bauer, Sean Treichler, and Alex Aiken. 2014. Singe: Leveraging Warp Specialization for High Performance on GPUs. SIGPLAN Not. 49, 8 (feb 2014), 119\u2013130.","journal-title":"Leveraging Warp Specialization for High Performance on GPUs. SIGPLAN Not."},{"key":"e_1_3_2_1_5_1","volume-title":"RAJA: Portable Performance for Large-scale Scientific Applications. In 2019 IEEE\/ACM International Workshop on Performance, Portability and Productivity in HPC (P3HPC). IEEE, 71\u201381","author":"Beckingsale A","year":"2019","unstructured":"David\u00a0 A Beckingsale , Jason Burmark , Rich Hornung , Holger Jones , William Killian , Adam\u00a0 J Kunen , Olga Pearce , Peter Robinson , Brian\u00a0 S Ryujin , and Thomas\u00a0 RW Scogland . 2019 . RAJA: Portable Performance for Large-scale Scientific Applications. In 2019 IEEE\/ACM International Workshop on Performance, Portability and Productivity in HPC (P3HPC). IEEE, 71\u201381 . David\u00a0A Beckingsale, Jason Burmark, Rich Hornung, Holger Jones, William Killian, Adam\u00a0J Kunen, Olga Pearce, Peter Robinson, Brian\u00a0S Ryujin, and Thomas\u00a0RW Scogland. 2019. RAJA: Portable Performance for Large-scale Scientific Applications. In 2019 IEEE\/ACM International Workshop on Performance, Portability and Productivity in HPC (P3HPC). IEEE, 71\u201381."},{"key":"e_1_3_2_1_6_1","unstructured":"OpenMP Architecture\u00a0Review Board. 2022. OpenMP Application Programming Interface Version 6.0 Preview 1.  OpenMP Architecture\u00a0Review Board. 2022. OpenMP Application Programming Interface Version 6.0 Preview 1."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/MM.2018.022071134"},{"key":"e_1_3_2_1_8_1","volume-title":"Retrieved","author":"Intel Corporation","year":"2021","unstructured":"Intel Corporation . 2021 . sycl_ext_oneapi_properties . Retrieved January 12, 2023 from https:\/\/github.com\/intel\/llvm\/blob\/sycl\/sycl\/doc\/extensions\/experimental\/sycl_ext_oneapi_properties.asciidoc Intel Corporation. 2021. sycl_ext_oneapi_properties. Retrieved January 12, 2023 from https:\/\/github.com\/intel\/llvm\/blob\/sycl\/sycl\/doc\/extensions\/experimental\/sycl_ext_oneapi_properties.asciidoc"},{"key":"e_1_3_2_1_9_1","volume-title":"Retrieved","author":"Intel Corporation","year":"2022","unstructured":"Intel Corporation . 2022 . sycl_ext_oneapi_kernel_properties . Retrieved January 12, 2023 from https:\/\/github.com\/intel\/llvm\/blob\/sycl\/sycl\/doc\/extensions\/experimental\/sycl_ext_oneapi_properties.asciidoc Intel Corporation. 2022. sycl_ext_oneapi_kernel_properties. Retrieved January 12, 2023 from https:\/\/github.com\/intel\/llvm\/blob\/sycl\/sycl\/doc\/extensions\/experimental\/sycl_ext_oneapi_properties.asciidoc"},{"key":"e_1_3_2_1_10_1","volume-title":"Retrieved","author":"NVIDIA Corporation","year":"2022","unstructured":"NVIDIA Corporation . 2022 . Tuning CUDA Applications for Volta . Retrieved January 12, 2023 from https:\/\/docs.nvidia.com\/cuda\/pdf\/Volta_Tuning_Guide.pdf NVIDIA Corporation. 2022. Tuning CUDA Applications for Volta. Retrieved January 12, 2023 from https:\/\/docs.nvidia.com\/cuda\/pdf\/Volta_Tuning_Guide.pdf"},{"key":"e_1_3_2_1_11_1","volume-title":"Analyzing Reduction Abstraction Capabilities. In 2021 International Workshop on Performance, Portability and Productivity in HPC (P3HPC). 33\u201344","author":"Deakin Tom","year":"2021","unstructured":"Tom Deakin , Simon McIntosh-Smith , S.\u00a0 John Pennycook , and Jason Sewall . 2021 . Analyzing Reduction Abstraction Capabilities. In 2021 International Workshop on Performance, Portability and Productivity in HPC (P3HPC). 33\u201344 . Tom Deakin, Simon McIntosh-Smith, S.\u00a0John Pennycook, and Jason Sewall. 2021. Analyzing Reduction Abstraction Capabilities. In 2021 International Workshop on Performance, Portability and Productivity in HPC (P3HPC). 33\u201344."},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.jpdc.2014.07.003"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/3078155.3078160"},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/P3HPC51967.2020.00008"},{"key":"e_1_3_2_1_16_1","unstructured":"Khronos OpenCL\u00a0Working Group. 2015. The OpenCL Specification Version 2.0 Revision 29.  Khronos OpenCL\u00a0Working Group. 2015. The OpenCL Specification Version 2.0 Revision 29."},{"key":"e_1_3_2_1_17_1","unstructured":"Khronos OpenCL\u00a0Working Group. 2022. The OpenCL Specification Version 3.0.  Khronos OpenCL\u00a0Working Group. 2022. The OpenCL Specification Version 3.0."},{"key":"e_1_3_2_1_18_1","volume-title":"SYCL 2020 Specification (revision 6).","author":"Khronos SYCL\u00a0Working Group","year":"2022","unstructured":"Khronos SYCL\u00a0Working Group . 2022 . SYCL 2020 Specification (revision 6). Khronos SYCL\u00a0Working Group. 2022. SYCL 2020 Specification (revision 6)."},{"key":"e_1_3_2_1_19_1","unstructured":"The Khronos\u00a0Group Inc.2023. SYCL Overview. Retrieved January 11 2023 from https:\/\/www.khronos.org\/sycl\/  The Khronos\u00a0Group Inc.2023. SYCL Overview. Retrieved January 11 2023 from https:\/\/www.khronos.org\/sycl\/"},{"volume-title":"ISO. 2017. ISO\/IEC 14882:2017 Information technology \u2014 Programming languages \u2014 C++","key":"e_1_3_2_1_20_1","unstructured":"ISO. 2017. ISO\/IEC 14882:2017 Information technology \u2014 Programming languages \u2014 C++ ( fifth ed.). 1605 pages. https:\/\/www.iso.org\/standard\/68564.html ISO. 2017. ISO\/IEC 14882:2017 Information technology \u2014 Programming languages \u2014 C++ (fifth ed.). 1605 pages. https:\/\/www.iso.org\/standard\/68564.html"},{"key":"e_1_3_2_1_21_1","volume-title":"Efficient Fork-Join on GPUs Through Warp Specialization. In 2017 IEEE 24th International Conference on High Performance Computing (HiPC). 358\u2013367","author":"Jacob Arpith\u00a0Chacko","year":"2017","unstructured":"Arpith\u00a0Chacko Jacob , Alexandre\u00a0 E Eichenberger , Hyojin Sung , Samuel\u00a0 F Antao , Gheorghe-Teodor Bercea , Carlo Bertolli , Alexey Bataev , Tian Jin , Tong Chen , Zehra Sura , Georgios Rokos , and Kevin O\u2019Brien . 2017 . Efficient Fork-Join on GPUs Through Warp Specialization. In 2017 IEEE 24th International Conference on High Performance Computing (HiPC). 358\u2013367 . Arpith\u00a0Chacko Jacob, Alexandre\u00a0E Eichenberger, Hyojin Sung, Samuel\u00a0F Antao, Gheorghe-Teodor Bercea, Carlo Bertolli, Alexey Bataev, Tian Jin, Tong Chen, Zehra Sura, Georgios Rokos, and Kevin O\u2019Brien. 2017. Efficient Fork-Join on GPUs Through Warp Specialization. In 2017 IEEE 24th International Conference on High Performance Computing (HiPC). 358\u2013367."},{"key":"e_1_3_2_1_22_1","unstructured":"John Kessenich Boaz Ouriel and Raun Krisch. 2022. SPIR-V Specification Version 1.6 Revision 2.  John Kessenich Boaz Ouriel and Raun Krisch. 2022. SPIR-V Specification Version 1.6 Revision 2."},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"crossref","first-page":"28","DOI":"10.1109\/MCSE.2021.3097276","article-title":"Navigating Performance, Portability, and Productivity","volume":"23","author":"Pennycook John","year":"2021","unstructured":"S.\u00a0 John Pennycook , Jason\u00a0 D. Sewall , Douglas\u00a0 W. Jacobsen , Tom Deakin , and Simon McIntosh-Smith . 2021 . Navigating Performance, Portability, and Productivity . Computing in Science & Engineering 23 , 5 (2021), 28 \u2013 38 . S.\u00a0John Pennycook, Jason\u00a0D. Sewall, Douglas\u00a0W. Jacobsen, Tom Deakin, and Simon McIntosh-Smith. 2021. Navigating Performance, Portability, and Productivity. Computing in Science & Engineering 23, 5 (2021), 28\u201338.","journal-title":"Computing in Science & Engineering"},{"key":"e_1_3_2_1_24_1","first-page":"10","article-title":"Portable Inter-Workgroup Barrier Synchronisation for GPUs","volume":"51","author":"Sorensen Tyler","year":"2016","unstructured":"Tyler Sorensen , Alastair\u00a0 F. Donaldson , Mark Batty , Ganesh Gopalakrishnan , and Zvonimir Rakamari\u0107 . 2016 . Portable Inter-Workgroup Barrier Synchronisation for GPUs . SIGPLAN Not. 51 , 10 (oct 2016), 39\u201358. Tyler Sorensen, Alastair\u00a0F. Donaldson, Mark Batty, Ganesh Gopalakrishnan, and Zvonimir Rakamari\u0107. 2016. Portable Inter-Workgroup Barrier Synchronisation for GPUs. SIGPLAN Not. 51, 10 (oct 2016), 39\u201358.","journal-title":"SIGPLAN Not."},{"key":"e_1_3_2_1_25_1","volume-title":"29th International Conference on Concurrency Theory (CONCUR","author":"Sorensen Tyler","year":"2018","unstructured":"Tyler Sorensen , Hugues Evrard , and Alastair\u00a0 F. Donaldson . 2018 . GPU Schedulers: How Fair Is Fair Enough? . In 29th International Conference on Concurrency Theory (CONCUR 2018). 23:1\u201323:17. Tyler Sorensen, Hugues Evrard, and Alastair\u00a0F. Donaldson. 2018. GPU Schedulers: How Fair Is Fair Enough?. In 29th International Conference on Concurrency Theory (CONCUR 2018). 23:1\u201323:17."},{"key":"e_1_3_2_1_26_1","volume-title":"Proc. ACM Program. Lang. 5, OOPSLA, Article 131 (oct","author":"Sorensen Tyler","year":"2021","unstructured":"Tyler Sorensen , Lucas\u00a0 F. Salvador , Harmit Raval , Hugues Evrard , John Wickerson , Margaret Martonosi , and Alastair\u00a0 F. Donaldson . 2021 . Specifying and Testing GPU Workgroup Progress Models . Proc. ACM Program. Lang. 5, OOPSLA, Article 131 (oct 2021), 30\u00a0pages. Tyler Sorensen, Lucas\u00a0F. Salvador, Harmit Raval, Hugues Evrard, John Wickerson, Margaret Martonosi, and Alastair\u00a0F. Donaldson. 2021. Specifying and Testing GPU Workgroup Progress Models. Proc. ACM Program. Lang. 5, OOPSLA, Article 131 (oct 2021), 30\u00a0pages."}],"event":{"name":"IWOCL '23: International Workshop on OpenCL","acronym":"IWOCL '23","location":"Cambridge United Kingdom"},"container-title":["International Workshop on OpenCL"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3585341.3585371","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T16:37:56Z","timestamp":1750178276000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3585341.3585371"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,4,18]]},"references-count":25,"alternative-id":["10.1145\/3585341.3585371","10.1145\/3585341"],"URL":"https:\/\/doi.org\/10.1145\/3585341.3585371","relation":{},"subject":[],"published":{"date-parts":[[2023,4,18]]},"assertion":[{"value":"2023-04-18","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}