{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,24]],"date-time":"2026-02-24T18:13:57Z","timestamp":1771956837508,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":16,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,4,3]],"date-time":"2022-04-03T00:00:00Z","timestamp":1648944000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,4,3]]},"DOI":"10.1145\/3530390.3532733","type":"proceedings-article","created":{"date-parts":[[2022,5,18]],"date-time":"2022-05-18T22:15:49Z","timestamp":1652912149000},"page":"1-6","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":3,"title":["Systematically extending a high-level code generator with support for tensor cores"],"prefix":"10.1145","author":[{"given":"Lukas","family":"Siefke","sequence":"first","affiliation":[{"name":"University of M\u00fcnster, M\u00fcnster, Germany"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5271-6893","authenticated-orcid":false,"given":"Bastian","family":"K\u00f6pcke","sequence":"additional","affiliation":[{"name":"University of M\u00fcnster, M\u00fcnster, Germany"}]},{"given":"Sergei","family":"Gorlatch","sequence":"additional","affiliation":[{"name":"University of M\u00fcnster, M\u00fcnster, Germany"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5048-0741","authenticated-orcid":false,"given":"Michel","family":"Steuwer","sequence":"additional","affiliation":[{"name":"The University of Edinburgh, Edinburgh, United Kingdom"}]}],"member":"320","published-online":{"date-parts":[[2022,5,18]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"crossref","unstructured":"Manuel M. T. Chakravarty Gabriele Keller Sean Lee Trevor L. 
McDonell and Vinod Grover. 2011. Accelerating Haskell array codes with multicore GPUs. In DAMP.","DOI":"10.1145\/1926354.1926358"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"crossref","unstructured":"Abdul Dakkak Cheng Li Isaac Gelado Jinjun Xiong and Wen-Mei W. Hwu. 2018. Accelerating Reduction and Scan Using Tensor Core Units. arXiv:1811.09736","DOI":"10.1145\/3330345.3331057"},{"key":"e_1_3_2_1_3_1","volume-title":"Henrik Barthels, Rastislav Bod\u00edk, and Vinod Grover.","author":"Hagedorn Bastian","year":"2020","unstructured":"Bastian Hagedorn, Archibald Samuel Elliott, Henrik Barthels, Rastislav Bod\u00edk, and Vinod Grover. 2020. Fireiron: A Data-Movement-Aware Scheduling Language for GPUs. In PACT."},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"crossref","unstructured":"Bastian Hagedorn Johannes Lenfers Thomas Koehler Xueying Qin Sergei Gorlatch and Michel Steuwer. 2020. Achieving high-performance the functional way: a functional pearl on expressing high-performance optimizations as rewrite strategies. In ICFP.","DOI":"10.1145\/3410227"},{"key":"e_1_3_2_1_5_1","volume-title":"Patterson","author":"Hennessy John L.","year":"2019","unstructured":"John L. Hennessy and David A. Patterson. 2019. A new golden age for computer architecture. 
Commun. ACM 62, 2 (2019)."},{"key":"e_1_3_2_1_6_1","volume-title":"Oancea","author":"Henriksen Troels","year":"2017","unstructured":"Troels Henriksen, Niels G. W. Serup, Martin Elsman, Fritz Henglein, and Cosmin E. Oancea. 2017. Futhark: purely func. GPU-programming with nested parallelism and in-place array updates. In PLDI."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"crossref","unstructured":"Thomas Koehler and Michel Steuwer. 2021. Towards a Domain-Extensible Compiler: Optimizing an Image Processing Pipeline on Mobile CPUs. In CGO.","DOI":"10.1109\/CGO51591.2021.9370337"},{"key":"e_1_3_2_1_8_1","unstructured":"Thomas Koehler Phil Trinder and Michel Steuwer. 2021. Sketch-Guided Equality Saturation: Scaling Equality Saturation to Complex Optimizations in Languages with Bindings. arXiv:2111.13040"},{"key":"e_1_3_2_1_9_1","volume-title":"Performance & Precision. In IPDPS Workshops.","author":"Markidis Stefano","unstructured":"Stefano Markidis, Steven Wei Der Chien, Erwin Laure, Ivy Bo Peng, and Jeffrey S. Vetter. 2018. NVIDIA Tensor Core Programmability, Performance & Precision. 
In IPDPS Workshops."},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPDS.2020.3011893"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"crossref","unstructured":"Adam Paszke Daniel D. Johnson David Duvenaud Dimitrios Vytiniotis Alexey Radul Matthew J. Johnson Jonathan Ragan-Kelley and Dougal Maclaurin. 2021. Getting to the point: index sets and parallelism-preserving autodiff for pointful array prog.. In ICFP.","DOI":"10.1145\/3473593"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"crossref","unstructured":"Jonathan Ragan-Kelley Andrew Adams Dillon Sharlet Connelly Barnes Sylvain Paris Marc Levoy Saman P. Amarasinghe and Fr\u00e9do Durand. 2018. Halide: decoupling algorithms from schedules for high-performance image processing. Commun. ACM 61, 1 (2018).","DOI":"10.1145\/3150211"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"crossref","unstructured":"Albert Reuther Peter Michaleas Michael Jones Vijay Gadepally Siddharth Samsi and Jeremy Kepner. 2021. AI Accelerator Survey and Trends. In HPEC.","DOI":"10.1109\/HPEC49654.2021.9622867"},{"key":"e_1_3_2_1_14_1","volume-title":"Optimizing the FFT Using Mixed Precision on Tensor Core Hardware. In HiPCW Workshops.","author":"Sorna Anumeena","year":"2018","unstructured":"Anumeena Sorna, Xiaohe Cheng, Eduardo F. D'Azevedo, Kwai Wong, and Stanimire Tomov. 2018. 
Optimizing the FFT Using Mixed Precision on Tensor Core Hardware. In HiPCW Workshops."},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"crossref","unstructured":"Michel Steuwer Christian Fensch Sam Lindley and Christophe Dubach. 2015. Generating perf. portable code using rewrite rules: from high-level func. expr. to high-perf. OpenCL code. In ICFP.","DOI":"10.1145\/2784731.2784754"},{"key":"e_1_3_2_1_16_1","unstructured":"Michel Steuwer Thomas Koehler Bastian K\u00f6pcke and Federico Pizzuti. 2022. RISE & Shine: Language-Oriented Compiler Design. 
arXiv:2201.03611"}],"event":{"name":"PPoPP '22: 27th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming","location":"Seoul Republic of Korea","acronym":"PPoPP '22","sponsor":["SIGPLAN ACM Special Interest Group on Programming Languages","SIGHPC ACM Special Interest Group on High Performance Computing, Special Interest Group on High Performance Computing"]},"container-title":["Proceedings of the 14th Workshop on General Purpose Processing Using GPU"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3530390.3532733","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3530390.3532733","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T18:09:24Z","timestamp":1750183764000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3530390.3532733"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,4,3]]},"references-count":16,"alternative-id":["10.1145\/3530390.3532733","10.1145\/3530390"],"URL":"https:\/\/doi.org\/10.1145\/3530390.3532733","relation":{},"subject":[],"published":{"date-parts":[[2022,4,3]]},"assertion":[{"value":"2022-05-18","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}