{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T04:47:02Z","timestamp":1750308422053,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":57,"publisher":"ACM","license":[{"start":{"date-parts":[[2021,6,3]],"date-time":"2021-06-03T00:00:00Z","timestamp":1622678400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,6,3]]},"DOI":"10.1145\/3447818.3461665","type":"proceedings-article","created":{"date-parts":[[2021,6,4]],"date-time":"2021-06-04T15:09:36Z","timestamp":1622819376000},"page":"1-12","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["Inter-loop optimization in RAJA using loop chains"],"prefix":"10.1145","author":[{"given":"Brandon","family":"Neth","sequence":"first","affiliation":[{"name":"University of Arizona"}]},{"given":"Thomas R. W.","family":"Scogland","sequence":"additional","affiliation":[{"name":"Lawrence Livermore National Laboratory"}]},{"given":"Bronis R.","family":"de Supinski","sequence":"additional","affiliation":[{"name":"Lawrence Livermore National Laboratory"}]},{"given":"Michelle Mills","family":"Strout","sequence":"additional","affiliation":[{"name":"University of Arizona"}]}],"member":"320","published-online":{"date-parts":[[2021,6,4]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"Top500 list - june 2020 June 2020.  Top500 list - june 2020 June 2020."},{"key":"e_1_3_2_1_2_1","first-page":"265","volume-title":"Proceedings of the 12th USENIX Conference on Operating Systems Design and Implementation, OSDI'16","author":"Abadi M.","year":"2016","unstructured":"M. Abadi , P. Barham , J. Chen , Z. Chen , A. Davis , J. Dean , M. Devin , S. Ghemawat , G. Irving , M. Isard , M. Kudlur , J. Levenberg , R. Monga , S. Moore , D. G. Murray , B. Steiner , P. Tucker , V. Vasudevan , P. Warden , M. Wicke , Y. Yu , and X. Zheng . Tensorflow: A system for large-scale machine learning . In Proceedings of the 12th USENIX Conference on Operating Systems Design and Implementation, OSDI'16 , page 265 - 283 , USA, 2016 . USENIX Association. M. Abadi, P. Barham, J. Chen, Z. Chen, A. Davis, J. Dean, M. Devin, S. Ghemawat, G. Irving, M. Isard, M. Kudlur, J. Levenberg, R. Monga, S. Moore, D. G. Murray, B. Steiner, P. Tucker, V. Vasudevan, P. Warden, M. Wicke, Y. Yu, and X. Zheng. Tensorflow: A system for large-scale machine learning. In Proceedings of the 12th USENIX Conference on Operating Systems Design and Implementation, OSDI'16, page 265-283, USA, 2016. USENIX Association."},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/3306346.3322967"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.5555\/370049.370476"},{"key":"e_1_3_2_1_5_1","volume-title":"In Proceedings of 4th Globus Retreat","author":"Allen G.","year":"2000","unstructured":"G. Allen , T. Dramlitsch , I. Foster , T. Goodale , N. Karonis , M. Ripeanu , E. Seidel , and B. Toonen . Cactus-g toolkit: Supporting efficient execution in heterogeneous distributed computing environments . In In Proceedings of 4th Globus Retreat , 2000 . G. Allen, T. Dramlitsch, I. Foster, T. Goodale, N. Karonis, M. Ripeanu, E. Seidel, and B. Toonen. Cactus-g toolkit: Supporting efficient execution in heterogeneous distributed computing environments. In In Proceedings of 4th Globus Retreat, 2000."},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/CGO.2019.8661197"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1007\/3-540-49372-7_10"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/P3HPC49587.2019.00012"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/WACCPD.2016.010"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1504\/IJHPCN.2019.097053"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/1375581.1375595"},{"key":"e_1_3_2_1_12_1","first-page":"578","volume-title":"13th USENIX Symposium on Operating Systems Design and Implementation (OSDI 18)","author":"Chen T.","year":"2018","unstructured":"T. Chen , T. Moreau , Z. Jiang , L. Zheng , E. Yan , H. Shen , M. Cowan , L. Wang , Y. Hu , L. Ceze , C. Guestrin , and A. Krishnamurthy . TVM: An automated end-to-end optimizing compiler for deep learning . In 13th USENIX Symposium on Operating Systems Design and Implementation (OSDI 18) , pages 578 -- 594 , Carlsbad, CA , Oct. 2018 . USENIX Association. T. Chen, T. Moreau, Z. Jiang, L. Zheng, E. Yan, H. Shen, M. Cowan, L. Wang, Y. Hu, L. Ceze, C. Guestrin, and A. Krishnamurthy. TVM: An automated end-to-end optimizing compiler for deep learning. In 13th USENIX Symposium on Operating Systems Design and Implementation (OSDI 18), pages 578--594, Carlsbad, CA, Oct. 2018. USENIX Association."},{"key":"e_1_3_2_1_13_1","volume-title":"Learning to optimize tensor programs. Technical report, arXiv:1805.08166v4","author":"Chen T.","year":"2019","unstructured":"T. Chen , L. Zheng , E. Yan , Z. Jiang , T. Moreau , L. Ceze , C. Guestrin , and A. Krishnamurthy . Learning to optimize tensor programs. Technical report, arXiv:1805.08166v4 , 8 January 2019 . T. Chen, L. Zheng, E. Yan, Z. Jiang, T. Moreau, L. Ceze, C. Guestrin, and A. Krishnamurthy. Learning to optimize tensor programs. Technical report, arXiv:1805.08166v4, 8 January 2019."},{"key":"e_1_3_2_1_14_1","volume-title":"Ccnav: Understanding compiler optimizations in binary code","author":"Devkota S.","year":"2020","unstructured":"S. Devkota , P. Aschwanden , A. Kunen , M. Legendre , and K. E. Isaacs . Ccnav: Understanding compiler optimizations in binary code . IEEE transactions on visualization and computer graphics, 2020 . S. Devkota, P. Aschwanden, A. Kunen, M. Legendre, and K. E. Isaacs. Ccnav: Understanding compiler optimizations in binary code. IEEE transactions on visualization and computer graphics, 2020."},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/582034.582084"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.jpdc.2014.07.003"},{"key":"e_1_3_2_1_17_1","first-page":"1","volume-title":"Proceedings of the First International Workshop on Polyhedral Compilation Techniques (IMPACT)","volume":"2011","author":"Grosser T.","year":"2011","unstructured":"T. Grosser , H. Zheng , R. Aloor , A. Simb\u00fcrger , A. Gr\u00f6\u00dflinger , and L.-N. Pouchet . Polly-polyhedral optimization in llvm . In Proceedings of the First International Workshop on Polyhedral Compilation Techniques (IMPACT) , volume 2011 , page 1 , 2011 . T. Grosser, H. Zheng, R. Aloor, A. Simb\u00fcrger, A. Gr\u00f6\u00dflinger, and L.-N. Pouchet. Polly-polyhedral optimization in llvm. In Proceedings of the First International Workshop on Polyhedral Compilation Techniques (IMPACT), volume 2011, page 1, 2011."},{"key":"e_1_3_2_1_18_1","volume-title":"Sycl specification, version 1.2.1 revision 6. Technical report","author":"K. O. W. Group","year":"2019","unstructured":"K. O. W. Group . Sycl specification, version 1.2.1 revision 6. Technical report , The Knronos Group , November 2019 . K. O. W. Group. Sycl specification, version 1.2.1 revision 6. Technical report, The Knronos Group, November 2019."},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/2807591.2807627"},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/3168824"},{"key":"e_1_3_2_1_21_1","volume-title":"SUNDIALS: Suite of nonlinear and differential\/algebraic equation solvers. ACM Transactions on Mathematical Software (TOMS), 31(3):363--396","author":"Hindmarsh A. C.","year":"2005","unstructured":"A. C. Hindmarsh , P. N. Brown , K. E. Grant , S. L. Lee , R. Serban , D. E. Shumaker , and C. S. Woodward . SUNDIALS: Suite of nonlinear and differential\/algebraic equation solvers. ACM Transactions on Mathematical Software (TOMS), 31(3):363--396 , 2005 . A. C. Hindmarsh, P. N. Brown, K. E. Grant, S. L. Lee, R. Serban, D. E. Shumaker, and C. S. Woodward. SUNDIALS: Suite of nonlinear and differential\/algebraic equation solvers. ACM Transactions on Mathematical Software (TOMS), 31(3):363--396, 2005."},{"key":"e_1_3_2_1_22_1","volume-title":"Lawrence Livermore National Lab.(LLNL)","author":"Hornung R. D.","year":"2019","unstructured":"R. D. Hornung and D. A. Beckingsale . ECP ST Project 2.3.1.06-STPM08-RAJA. Technical report , Lawrence Livermore National Lab.(LLNL) , Livermore, CA ( United States) , 2019 . R. D. Hornung and D. A. Beckingsale. ECP ST Project 2.3.1.06-STPM08-RAJA. Technical report, Lawrence Livermore National Lab.(LLNL), Livermore, CA (United States), 2019."},{"key":"e_1_3_2_1_23_1","volume-title":"Lawrence Livermore National Lab.(LLNL)","author":"Hornung R. D.","year":"2017","unstructured":"R. D. Hornung , H. E. Hones , Raja performance suite. Technical report , Lawrence Livermore National Lab.(LLNL) , Livermore, CA ( United States) , 2017 . R. D. Hornung, H. E. Hones, et al. Raja performance suite. Technical report, Lawrence Livermore National Lab.(LLNL), Livermore, CA (United States), 2017."},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/3178487.3178507"},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICAPP.1995.472180"},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPSW.2013.68"},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/1250734.1250761"},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/3272127.3275055"},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/3302256"},{"key":"e_1_3_2_1_32_1","volume-title":"Architecture and performance of devito, a system for automated stencil computation. CoRR, abs\/1807.03032","author":"Luporini F.","year":"2018","unstructured":"F. Luporini , M. Lange , M. Louboutin , N. Kukreja , J. H\u00fcckelheim , C. Yount , P. A. Witte , P. H. J. Kelly , G. J. Gorman , and F. J. Herrmann . Architecture and performance of devito, a system for automated stencil computation. CoRR, abs\/1807.03032 , 2018 . F. Luporini, M. Lange, M. Louboutin, N. Kukreja, J. H\u00fcckelheim, C. Yount, P. A. Witte, P. H. J. Kelly, G. J. Gorman, and F. J. Herrmann. Architecture and performance of devito, a system for automated stencil computation. CoRR, abs\/1807.03032, 2018."},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/2897824.2925952"},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1145\/2694344.2694364"},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1109\/SC.2014.70"},{"key":"e_1_3_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1147\/JRD.2019.2960218"},{"key":"e_1_3_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1109\/P3HPC.2018.00007"},{"key":"e_1_3_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1145\/2185520.2185528"},{"key":"e_1_3_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1145\/2491956.2462176"},{"key":"e_1_3_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1109\/JPROC.2018.2862896"},{"key":"e_1_3_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1007\/3-540-44681-8_114"},{"key":"e_1_3_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1145\/2184319.2184345"},{"key":"e_1_3_2_1_44_1","volume-title":"Lawrence Livermore National Lab.(LLNL)","author":"Settgast R. R.","year":"2018","unstructured":"R. R. Settgast , J. White , B. Corbett , A. Vargas , C. Sherman , P. Fu , and C. Annavarapu . Geosx simulation framework. Technical report , Lawrence Livermore National Lab.(LLNL) , Livermore, CA ( United States) , 2018 . R. R. Settgast, J. White, B. Corbett, A. Vargas, C. Sherman, P. Fu, and C. Annavarapu. Geosx simulation framework. Technical report, Lawrence Livermore National Lab.(LLNL), Livermore, CA (United States), 2018."},{"key":"e_1_3_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1109\/5992.805137"},{"key":"e_1_3_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPS.2014.118"},{"key":"e_1_3_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1145\/2584665"},{"key":"e_1_3_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.1145\/1989493.1989508"},{"key":"e_1_3_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPS.2009.5161054"},{"key":"e_1_3_2_1_51_1","volume-title":"GCC Research Opportunities Workshop (GROW'10)","author":"Trifunovic K.","year":"2010","unstructured":"K. Trifunovic , A. Cohen , D. Edelsohn , F. Li , T. Grosser , H. Jagasia , R. Ladelsky , S. Pop , J. Sj\u00f6din , and R. Upadrasta . Graphite two years after: First lessons learned from real-world polyhedral compilation . In GCC Research Opportunities Workshop (GROW'10) , 2010 . K. Trifunovic, A. Cohen, D. Edelsohn, F. Li, T. Grosser, H. Jagasia, R. Ladelsky, S. Pop, J. Sj\u00f6din, and R. Upadrasta. Graphite two years after: First lessons learned from real-world polyhedral compilation. In GCC Research Opportunities Workshop (GROW'10), 2010."},{"key":"e_1_3_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.1145\/2908080.2908105"},{"key":"e_1_3_2_1_53_1","volume-title":"Tensor comprehensions: Framework-agnostic high-performance machine learning abstractions. Technical report, arXiv","author":"Vasilache N.","year":"1802","unstructured":"N. Vasilache , O. Zinenko , T. Theodoridis , P. Goyal , Z. DeVito , W. S. Moses , S. Verdoolaege , A. Adams , and A. Cohen . Tensor comprehensions: Framework-agnostic high-performance machine learning abstractions. Technical report, arXiv : 1802 .04730, 2018. N. Vasilache, O. Zinenko, T. Theodoridis, P. Goyal, Z. DeVito, W. S. Moses, S. Verdoolaege, A. Adams, and A. Cohen. Tensor comprehensions: Framework-agnostic high-performance machine learning abstractions. Technical report, arXiv: 1802.04730, 2018."},{"key":"e_1_3_2_1_54_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-57172-5_2"},{"key":"e_1_3_2_1_55_1","volume-title":"Proceedings of the SIAM Workshop on Object Oriented Methods for Inter-operable Scientific and Engineering Computing (OO'98)","author":"Veldhuizen T. L.","year":"1998","unstructured":"T. L. Veldhuizen and D. Gannon . Active libraries: Rethinking the roles of compilers and libraries . In Proceedings of the SIAM Workshop on Object Oriented Methods for Inter-operable Scientific and Engineering Computing (OO'98) . SIAM Press , 1998 . T. L. Veldhuizen and D. Gannon. Active libraries: Rethinking the roles of compilers and libraries. In Proceedings of the SIAM Workshop on Object Oriented Methods for Inter-operable Scientific and Engineering Computing (OO'98). SIAM Press, 1998."},{"key":"e_1_3_2_1_56_1","doi-asserted-by":"publisher","DOI":"10.1109\/SC.2014.21"},{"key":"e_1_3_2_1_57_1","doi-asserted-by":"publisher","DOI":"10.5555\/846234.849346"},{"key":"e_1_3_2_1_58_1","volume-title":"Proceedings of the 3rd International Workshop on Polyhedral Compilation Techniques (IMPACT)","author":"Wonnacott D. G.","year":"2013","unstructured":"D. G. Wonnacott and M. M. Strout . On the scalability of loop tiling techniques . In Proceedings of the 3rd International Workshop on Polyhedral Compilation Techniques (IMPACT) , January 2013 . D. G. Wonnacott and M. M. Strout. On the scalability of loop tiling techniques. In Proceedings of the 3rd International Workshop on Polyhedral Compilation Techniques (IMPACT), January 2013."},{"key":"e_1_3_2_1_59_1","doi-asserted-by":"publisher","DOI":"10.1145\/3373376.3378514"},{"key":"e_1_3_2_1_60_1","doi-asserted-by":"publisher","DOI":"10.1109\/WOLFHPC.2016.08"},{"key":"e_1_3_2_1_61_1","first-page":"17","volume-title":"International Workshop on Languages and Compilers for Parallel Computing","author":"Yuki T.","year":"2012","unstructured":"T. Yuki , G. Gupta , D. Kim , T. Pathan , and S. Rajopadhye . Alphaz: A system for design space exploration in the polyhedral model . In International Workshop on Languages and Compilers for Parallel Computing , pages 17 -- 31 . Springer , 2012 . T. Yuki, G. Gupta, D. Kim, T. Pathan, and S. Rajopadhye. Alphaz: A system for design space exploration in the polyhedral model. In International Workshop on Languages and Compilers for Parallel Computing, pages 17--31. Springer, 2012."}],"event":{"name":"ICS '21: 2021 International Conference on Supercomputing","sponsor":["SIGARCH ACM Special Interest Group on Computer Architecture"],"location":"Virtual Event USA","acronym":"ICS '21"},"container-title":["Proceedings of the ACM International Conference on Supercomputing"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3447818.3461665","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3447818.3461665","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T17:49:27Z","timestamp":1750268967000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3447818.3461665"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,6,3]]},"references-count":57,"alternative-id":["10.1145\/3447818.3461665","10.1145\/3447818"],"URL":"https:\/\/doi.org\/10.1145\/3447818.3461665","relation":{},"subject":[],"published":{"date-parts":[[2021,6,3]]},"assertion":[{"value":"2021-06-04","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}