{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,24]],"date-time":"2026-02-24T18:15:14Z","timestamp":1771956914010,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":13,"publisher":"ACM","license":[{"start":{"date-parts":[[2021,4,27]],"date-time":"2021-04-27T00:00:00Z","timestamp":1619481600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,4,27]]},"DOI":"10.1145\/3456669.3456684","type":"proceedings-article","created":{"date-parts":[[2021,4,27]],"date-time":"2021-04-27T15:22:31Z","timestamp":1619536951000},"page":"1-4","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":9,"title":["Extending DPC++ with Support for Huawei Ascend AI Chipset"],"prefix":"10.1145","author":[{"given":"Wilson","family":"Feng","sequence":"first","affiliation":[{"name":"Huawei Heterogeneous Compiler Lab, CA"}]},{"given":"Rasool","family":"Maghareh","sequence":"additional","affiliation":[{"name":"Huawei Heterogeneous Compiler Lab, CA"}]},{"given":"Kai-Ting Amy","family":"Wang","sequence":"additional","affiliation":[{"name":"Huawei Heterogeneous Compiler Lab, CA"}]}],"member":"320","published-online":{"date-parts":[[2021,4,27]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"2020. Auto Kernel Generator. https:\/\/gitee.com\/mindspore\/akg\/."},{"key":"e_1_3_2_1_2_1","unstructured":"2020. Intel Implicit SPMD Program Compiler. https:\/\/ispc.github.io\/."},{"key":"e_1_3_2_1_3_1","volume-title":"Revision Date","year":"2020","unstructured":"2020. 
Khronos SYCL Working Group: SYCL Specification - Generic heterogeneous computing for modern C++. https:\/\/www.khronos.org\/registry\/SYCL\/specs\/sycl-2020-provisional.pdf. Revision Date: June 30, 2020."},{"key":"e_1_3_2_1_4_1","unstructured":"2020. Single Instruction Multiple Data. https:\/\/support.huawei.com\/enterprise\/de\/doc\/EDOC1100164852\/8e920796\/vector-computation#EN-US_TOPIC_0279011964."},{"key":"e_1_3_2_1_5_1","unstructured":"2020. Single Instruction Multiple Thread. https:\/\/www.hardwaretimes.com\/simd-vs-simt-vs-smt-whats-the-difference-between-parallel-processing-models\/."},{"key":"e_1_3_2_1_6_1","unstructured":"2021. Atlas Data Center Solution V100R020C00 Application Software Development Guide 01. https:\/\/support.huawei.com\/enterprise\/br\/doc\/EDOC1100155021\/27a61ce6\/acl-api-reference. Accessed: 2021-03-18."},{"key":"e_1_3_2_1_7_1","unstructured":"2021. CUDA Zone - NVIDIA Developer. https:\/\/developer.nvidia.com\/cuda-zone. Accessed: 2021-01-10."},{"key":"e_1_3_2_1_8_1","unstructured":"2021. DaVinci: A Scalable Architecture for Neural Network Computing. 
https:\/\/www.cmc.ca\/wp-content\/uploads\/2020\/03\/Zhan-Xu-Huawei.pdf. Accessed: 2021-03-19."},{"key":"e_1_3_2_1_9_1","unstructured":"2021. Intel oneAPI DPC++\/C++ Compiler. https:\/\/software.intel.com\/content\/www\/us\/en\/develop\/tools\/oneapi\/components\/dpc-compiler.html. Accessed: 2021-01-10."},{"key":"e_1_3_2_1_10_1","unstructured":"2021. oneAPI Programming Model. https:\/\/www.oneapi.com\/. Accessed: 2021-01-10."},{"key":"e_1_3_2_1_11_1","volume-title":"13th USENIX Symposium on Operating Systems Design and Implementation (OSDI18)","author":"Chen Tianqi","year":"2018","unstructured":"Tianqi Chen, Thierry Moreau, Ziheng Jiang, Lianmin Zheng, Eddie Yan, Haichen Shen, Meghan Cowan, Leyuan Wang, Yuwei Hu, Luis Ceze, 2018. TVM: An automated end-to-end optimizing compiler for deep learning. In 13th USENIX Symposium on Operating Systems Design and Implementation (OSDI18). 
578\u2013594."},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/DSD51259.2020.00091"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/MICRO50266.2020.00044"}],"event":{"name":"IWOCL'21: International Workshop on OpenCL","location":"Munich Germany","acronym":"IWOCL'21"},"container-title":["International Workshop on OpenCL"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3456669.3456684","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3456669.3456684","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T20:46:55Z","timestamp":1750193215000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3456669.3456684"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,4,27]]},"references-count":13,"alternative-id":["10.1145\/3456669.3456684","10.1145\/3456669"],"URL":"https:\/\/doi.org\/10.1145\/3456669.3456684","relation":{},"subject":[],"published":{"date-parts":[[2021,4,27]]},"assertion":[{"value":"2021-04-27","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}