{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,11]],"date-time":"2026-03-11T01:35:17Z","timestamp":1773192917683,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":30,"publisher":"ACM","license":[{"start":{"date-parts":[[2021,4,17]],"date-time":"2021-04-17T00:00:00Z","timestamp":1618617600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/100000185","name":"Defense Advanced Research Projects Agency","doi-asserted-by":"publisher","award":["HR0011-18-3-0007"],"award-info":[{"award-number":["HR0011-18-3-0007"]}],"id":[{"id":"10.13039\/100000185","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000015","name":"U.S. Department of Energy, Office of Science, Office of Advanced Scientific Computing Research","doi-asserted-by":"publisher","award":["DESC0018121"],"award-info":[{"award-number":["DESC0018121"]}],"id":[{"id":"10.13039\/100000015","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,4,19]]},"DOI":"10.1145\/3445814.3446692","type":"proceedings-article","created":{"date-parts":[[2021,4,11]],"date-time":"2021-04-11T17:06:26Z","timestamp":1618160786000},"page":"902-914","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":36,"title":["VeGen: a vectorizer generator for SIMD and beyond"],"prefix":"10.1145","author":[{"given":"Yishen","family":"Chen","sequence":"first","affiliation":[{"name":"Massachusetts Institute of Technology, USA"}]},{"given":"Charith","family":"Mendis","sequence":"additional","affiliation":[{"name":"University of Illinois at Urbana-Champaign, USA"}]},{"given":"Michael","family":"Carbin","sequence":"additional","affiliation":[{"name":"Massachusetts Institute of Technology, USA"}]},{"given":"Saman","family":"Amarasinghe","sequence":"additional","affiliation":[{"name":"Massachusetts Institute of Technology, USA"}]}],"member":"320","published-online":{"date-parts":[[2021,4,17]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"Intel Intrinsics Guide. URL https:\/\/software.intel.com\/sites\/landingpage\/IntrinsicsGuide\/.  Intel Intrinsics Guide. URL https:\/\/software.intel.com\/sites\/landingpage\/IntrinsicsGuide\/."},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/29873.29875"},{"key":"e_1_3_2_1_3_1","first-page":"96","volume-title":"Proceedings. 13th International Conference on Parallel Architecture and Compilation Techniques, 2004. PACT 2004.","author":"Bachega Leonardo","unstructured":"Leonardo Bachega , Siddhartha Chatterjee , Kenneth A Dockser , John A Gunnels , Manish Gupta , Fred G Gustavson , Christopher A Lapkowski , Gary K Liu , Mark P Mendell , Charles D Wait , A high-performance SIMD floating point unit for BlueGene\/L: Architecture, compilation, and algorithm design . In Proceedings. 13th International Conference on Parallel Architecture and Compilation Techniques, 2004. PACT 2004. , pages 85? 96 . IEEE, 2004. Leonardo Bachega, Siddhartha Chatterjee, Kenneth A Dockser, John A Gunnels, Manish Gupta, Fred G Gustavson, Christopher A Lapkowski, Gary K Liu, Mark P Mendell, Charles D Wait, et al. A high-performance SIMD floating point unit for BlueGene\/L: Architecture, compilation, and algorithm design. In Proceedings. 13th International Conference on Parallel Architecture and Compilation Techniques, 2004. PACT 2004., pages 85?96. IEEE, 2004."},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/2908080.2908111"},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/1168857.1168906"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/2442516.2442529"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/3168821"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/357094.357097"},{"key":"e_1_3_2_1_9_1","first-page":"594","volume-title":"13th $\\$USENIX$\\$ Symposium on Operating Systems Design and Implementation ($\\$OSDI$\\$ 18)","author":"Chen Tianqi","year":"2018","unstructured":"Tianqi Chen , Thierry Moreau , Ziheng Jiang , Lianmin Zheng , Eddie Yan , Haichen Shen , Meghan Cowan , Leyuan Wang , Yuwei Hu , Luis Ceze , $\\$TVM$\\$: An automated end-to-end optimizing compiler for deep learning . In 13th $\\$USENIX$\\$ Symposium on Operating Systems Design and Implementation ($\\$OSDI$\\$ 18) , pages 578? 594 , 2018 . Tianqi Chen, Thierry Moreau, Ziheng Jiang, Lianmin Zheng, Eddie Yan, Haichen Shen, Meghan Cowan, Leyuan Wang, Yuwei Hu, Luis Ceze, et al. $\\$TVM$\\$: An automated end-to-end optimizing compiler for deep learning. In 13th $\\$USENIX$\\$ Symposium on Operating Systems Design and Implementation ($\\$OSDI$\\$ 18), pages 578?594, 2018."},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.5555\/1792734.1792766"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/996841.996853"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPS.2002.1015494"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/356893.356897"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/349299.349320"},{"key":"e_1_3_2_1_15_1","volume-title":"Proceedings of the International Symposium on Code Generation and Optimization: Feedback-directed and Runtime Optimization, CGO '04","author":"Lattner Chris","unstructured":"Chris Lattner and Vikram Adve . LLVM : A compilation framework for lifelong program analysis & transformation . In Proceedings of the International Symposium on Code Generation and Optimization: Feedback-directed and Runtime Optimization, CGO '04 , pages 75?, Washington, DC, USA, 2004. IEEE Computer Society. ISBN 0-7695-2102-9. URL http:\/\/dl.acm.org\/citation.cfm?id=977395.977673. Chris Lattner and Vikram Adve. LLVM: A compilation framework for lifelong program analysis & transformation. In Proceedings of the International Symposium on Code Generation and Optimization: Feedback-directed and Runtime Optimization, CGO '04, pages 75?, Washington, DC, USA, 2004. IEEE Computer Society. ISBN 0-7695-2102-9. URL http:\/\/dl.acm.org\/citation.cfm?id=977395.977673."},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/2254064.2254106"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/1995896.1995938"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/3276480"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/1454115.1454119"},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/1133981.1133997"},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/2872362.2872387"},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/PACT.2015.32"},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/CGO.2015.7054199"},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1145\/3243176.3243189"},{"key":"e_1_3_2_1_25_1","first-page":"216","volume-title":"Proceedings of the 2019 IEEE\/ACM International Symposium on Code Generation and Optimization, CGO 2019","author":"Porpodas Vasileios","unstructured":"Vasileios Porpodas , Rodrigo C. O. Rocha , Evgueni Brevnov , Lu\u00eds F. W. G\\'oes, and Timothy Mattson . Super-Node SLP : Optimized vectorization for code sequences containing operators and their inverse elements . In Proceedings of the 2019 IEEE\/ACM International Symposium on Code Generation and Optimization, CGO 2019 , pages 206? 216 . IEEE Press, 2019. ISBN 9781728114361. Vasileios Porpodas, Rodrigo C. O. Rocha, Evgueni Brevnov, Lu\u00eds F. W. G\\'oes, and Timothy Mattson. Super-Node SLP: Optimized vectorization for code sequences containing operators and their inverse elements. In Proceedings of the 2019 IEEE\/ACM International Symposium on Code Generation and Optimization, CGO 2019, pages 206?216. IEEE Press, 2019. ISBN 9781728114361."},{"key":"e_1_3_2_1_26_1","volume-title":"Proceedings of the IEEE, 93\\penalty0 (2):\\penalty0 232?275","author":"Puschel Markus","year":"2005","unstructured":"Markus Puschel , Jos\\'e MF Moura , Jeremy R Johnson , David Padua , Manuela M Veloso , Bryan W Singer , Jianxin Xiong , Franz Franchetti , Aca Gacic , Yevgen Voronenko , : Code generation for dsp transforms . Proceedings of the IEEE, 93\\penalty0 (2):\\penalty0 232?275 , 2005 . Markus Puschel, Jos\\'e MF Moura, Jeremy R Johnson, David Padua, Manuela M Veloso, Bryan W Singer, Jianxin Xiong, Franz Franchetti, Aca Gacic, Yevgen Voronenko, et al. Spiral: Code generation for dsp transforms. Proceedings of the IEEE, 93\\penalty0 (2):\\penalty0 232?275, 2005."},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/256167.256225"},{"key":"e_1_3_2_1_28_1","volume-title":"GCC Developers Summit. Citeseer","author":"Rosen Ira","year":"2007","unstructured":"Ira Rosen , Dorit Nuzman , and Ayal Zaks . Loop-aware SLP in gcc . In GCC Developers Summit. Citeseer , 2007 . Ira Rosen, Dorit Nuzman, and Ayal Zaks. Loop-aware SLP in gcc. In GCC Developers Summit. Citeseer, 2007."},{"key":"e_1_3_2_1_29_1","volume-title":"Souper: A synthesizing superoptimizer. CoRR, abs\/1711.04422","author":"Sasnauskas Raimondas","year":"2017","unstructured":"Raimondas Sasnauskas , Yang Chen , Peter Collingbourne , Jeroen Ketema , Jubi Taneja , and John Regehr . Souper: A synthesizing superoptimizer. CoRR, abs\/1711.04422 , 2017 . URL http:\/\/arxiv.org\/abs\/1711.04422. Raimondas Sasnauskas, Yang Chen, Peter Collingbourne, Jeroen Ketema, Jubi Taneja, and John Regehr. Souper: A synthesizing superoptimizer. CoRR, abs\/1711.04422, 2017. URL http:\/\/arxiv.org\/abs\/1711.04422."},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/2451116.2451150"}],"event":{"name":"ASPLOS '21: 26th ACM International Conference on Architectural Support for Programming Languages and Operating Systems","location":"Virtual USA","acronym":"ASPLOS '21","sponsor":["SIGPLAN ACM Special Interest Group on Programming Languages"]},"container-title":["Proceedings of the 26th ACM International Conference on Architectural Support for Programming Languages and Operating Systems"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3445814.3446692","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/abs\/10.1145\/3445814.3446692","content-type":"text\/html","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3445814.3446692","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3445814.3446692","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T21:28:14Z","timestamp":1750195694000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3445814.3446692"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,4,17]]},"references-count":30,"alternative-id":["10.1145\/3445814.3446692","10.1145\/3445814"],"URL":"https:\/\/doi.org\/10.1145\/3445814.3446692","relation":{},"subject":[],"published":{"date-parts":[[2021,4,17]]},"assertion":[{"value":"2021-04-17","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}