{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,9,26]],"date-time":"2025-09-26T13:28:53Z","timestamp":1758893333709,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":41,"publisher":"ACM","license":[{"start":{"date-parts":[[2021,7,17]],"date-time":"2021-07-17T00:00:00Z","timestamp":1626480000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/100008562","name":"University of Texas at Austin","doi-asserted-by":"publisher","award":["UTA19-001215"],"award-info":[{"award-number":["UTA19-001215"]}],"id":[{"id":"10.13039\/100008562","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Department of Energy, National Nuclear Security Administration","award":["DE-NA0002375"],"award-info":[{"award-number":["DE-NA0002375"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,7,17]]},"DOI":"10.1145\/3437359.3465581","type":"proceedings-article","created":{"date-parts":[[2021,7,18]],"date-time":"2021-07-18T04:08:46Z","timestamp":1626581326000},"page":"1-9","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":5,"title":["A Heterogeneous MPI+PPL Task Scheduling Approach for Asynchronous Many-Task Runtime Systems"],"prefix":"10.1145","author":[{"given":"John","family":"Holmen","sequence":"first","affiliation":[{"name":"Scientific Computing and Imaging Institute, University of Utah, USA"}]},{"given":"Damodar","family":"Sahasrabudhe","sequence":"additional","affiliation":[{"name":"Scientific Computing and Imaging Institute, University of Utah, USA"}]},{"given":"Martin","family":"Berzins","sequence":"additional","affiliation":[{"name":"Scientific Computing and Imaging Institute, University of Utah, USA"}]}],"member":"320","published-online":{"date-parts":[[2021,7,17]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"2019. Aurora. https:\/\/aurora.alcf.anl.gov\/.  2019. Aurora. https:\/\/aurora.alcf.anl.gov\/."},{"key":"e_1_3_2_1_2_1","unstructured":"2019. Frontier. https:\/\/www.olcf.ornl.gov\/frontier\/.  2019. Frontier. https:\/\/www.olcf.ornl.gov\/frontier\/."},{"key":"e_1_3_2_1_3_1","unstructured":"2019. Kokkos: The C++ Performance Portability Programming Model Wiki. https:\/\/github.com\/kokkos\/kokkos\/wiki.  2019. Kokkos: The C++ Performance Portability Programming Model Wiki. https:\/\/github.com\/kokkos\/kokkos\/wiki."},{"key":"e_1_3_2_1_4_1","unstructured":"2019. Tutorials for the Kokkos C++ Performance Portability Programming EcoSystem. https:\/\/github.com\/kokkos\/kokkos-tutorials.  2019. Tutorials for the Kokkos C++ Performance Portability Programming EcoSystem. https:\/\/github.com\/kokkos\/kokkos-tutorials."},{"key":"e_1_3_2_1_5_1","unstructured":"2020. November 2020 - TOP500 Supercomputer Sites. https:\/\/top500.org\/lists\/top500\/2020\/11\/.  2020. November 2020 - TOP500 Supercomputer Sites. https:\/\/top500.org\/lists\/top500\/2020\/11\/."},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1002\/cpe.1631"},{"key":"e_1_3_2_1_7_1","volume-title":"Hedgehog: Understandable Scheduler-Free Heterogeneous Asynchronous Multithreaded Data-Flow Graphs. In 2020 IEEE\/ACM 3rd Annual Parallel Applications Workshop: Alternatives To MPI+X (PAW-ATM). 1\u201315","author":"Bardakoff A.","year":"1920","unstructured":"A. Bardakoff , B. Bachelet , T. Blattner , W. Keyrouz , G.\u00a0 C. Kroiz , and L. Yon . 2020 . Hedgehog: Understandable Scheduler-Free Heterogeneous Asynchronous Multithreaded Data-Flow Graphs. In 2020 IEEE\/ACM 3rd Annual Parallel Applications Workshop: Alternatives To MPI+X (PAW-ATM). 1\u201315 . https:\/\/doi.org\/10.1109\/PAWATM5 1920 .2020.00006 A. Bardakoff, B. Bachelet, T. Blattner, W. Keyrouz, G.\u00a0C. Kroiz, and L. Yon. 2020. Hedgehog: Understandable Scheduler-Free Heterogeneous Asynchronous Multithreaded Data-Flow Graphs. In 2020 IEEE\/ACM 3rd Annual Parallel Applications Workshop: Alternatives To MPI+X (PAW-ATM). 1\u201315. https:\/\/doi.org\/10.1109\/PAWATM51920.2020.00006"},{"key":"e_1_3_2_1_8_1","volume-title":"Legion: Expressing locality and independence with logical regions. In Proceedings of the international conference on high performance computing, networking, storage and analysis","author":"Bauer M.","year":"2012","unstructured":"M. Bauer , S. Treichler , E. Slaughter , and A. Aiken . 2012 . Legion: Expressing locality and independence with logical regions. In Proceedings of the international conference on high performance computing, networking, storage and analysis . IEEE Computer Society Press , 66. M. Bauer, S. Treichler, E. Slaughter, and A. Aiken. 2012. Legion: Expressing locality and independence with logical regions. In Proceedings of the international conference on high performance computing, networking, storage and analysis. IEEE Computer Society Press, 66."},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1137\/15M1023270"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/MCSE.2013.98"},{"volume-title":"Proceedings of the 20th European MPI Users\u2019 Group Meeting. 13\u201318","author":"Dinan J.","key":"e_1_3_2_1_11_1","unstructured":"J. Dinan , P. Balaji , D. Goodell , D. Miller , M. Snir , and R. Thakur . 2013. Enabling MPI interoperability through flexible communication endpoints . In Proceedings of the 20th European MPI Users\u2019 Group Meeting. 13\u201318 . J. Dinan, P. Balaji, D. Goodell, D. Miller, M. Snir, and R. Thakur. 2013. Enabling MPI interoperability through flexible communication endpoints. In Proceedings of the 20th European MPI Users\u2019 Group Meeting. 13\u201318."},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.jpdc.2014.07.003"},{"volume-title":"Numerical Solution of Partial Differential Equations on Parallel Computers, Are\u00a0Magnus Bruaset and Aslak Tveito (Eds.)","author":"Falgout D.","key":"e_1_3_2_1_13_1","unstructured":"R.\u00a0 D. Falgout , J.\u00a0 E. Jones , and U.\u00a0 M. Yang . 2006. The Design and Implementation of hypre, a Library of Parallel High Performance Preconditioners . In Numerical Solution of Partial Differential Equations on Parallel Computers, Are\u00a0Magnus Bruaset and Aslak Tveito (Eds.) . Springer Berlin Heidelberg , Berlin, Heidelberg , 267\u2013294. R.\u00a0D. Falgout, J.\u00a0E. Jones, and U.\u00a0M. Yang. 2006. The Design and Implementation of hypre, a Library of Parallel High Performance Preconditioners. In Numerical Solution of Partial Differential Equations on Parallel Computers, Are\u00a0Magnus Bruaset and Aslak Tveito (Eds.). Springer Berlin Heidelberg, Berlin, Heidelberg, 267\u2013294."},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"crossref","unstructured":"J.\u00a0K. Holmen A. Humphrey and M. Berzins. 2015. Chapter 13 - Exploring Use of the Reserved Core. In High Performance Parallelism Pearls Volume Two: Multicore and Many-core Programming Approaches J.\u00a0Reinders and J.\u00a0Jeffers (Eds.). Vol.\u00a02. Morgan Kaufmann Boston MA USA 229 \u2013 242.  J.\u00a0K. Holmen A. Humphrey and M. Berzins. 2015. Chapter 13 - Exploring Use of the Reserved Core. In High Performance Parallelism Pearls Volume Two: Multicore and Many-core Programming Approaches J.\u00a0Reinders and J.\u00a0Jeffers (Eds.). Vol.\u00a02. Morgan Kaufmann Boston MA USA 229 \u2013 242.","DOI":"10.1016\/B978-0-12-803819-2.00010-0"},{"key":"e_1_3_2_1_15_1","first-page":"1","article-title":"Improving Uintah\u2019s Scalability Through the Use of Portable Kokkos-Based Data Parallel Tasks. In Proceedings of the Practice and Experience in Advanced Research Computing 2017 on Sustainability, Success and Impact (New Orleans, LA, USA) (PEARC17). ACM, New York, NY, USA","volume":"27","author":"Holmen K.","year":"2017","unstructured":"J.\u00a0 K. Holmen , A. Humphrey , D. Sunderland , and M. Berzins . 2017 . Improving Uintah\u2019s Scalability Through the Use of Portable Kokkos-Based Data Parallel Tasks. In Proceedings of the Practice and Experience in Advanced Research Computing 2017 on Sustainability, Success and Impact (New Orleans, LA, USA) (PEARC17). ACM, New York, NY, USA , Article 27 , 27: 1 \u2013 27 :8\u00a0pages. J.\u00a0K. Holmen, A. Humphrey, D. Sunderland, and M. Berzins. 2017. Improving Uintah\u2019s Scalability Through the Use of Portable Kokkos-Based Data Parallel Tasks. In Proceedings of the Practice and Experience in Advanced Research Computing 2017 on Sustainability, Success and Impact (New Orleans, LA, USA) (PEARC17). ACM, New York, NY, USA, Article 27, 27:1\u201327:8\u00a0pages.","journal-title":"Article"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/P3HPC49587.2019.00009"},{"key":"e_1_3_2_1_17_1","volume-title":"Technical Report UUSCI-2019-001. SCI Institute.","author":"Holmen K.","year":"2019","unstructured":"J.\u00a0 K. Holmen , B. Peterson , A. Humphrey , D. Sunderland , O.\u00a0 H. Diaz-Ibarra , J.\u00a0 N. Thornock , and M. Berzins . 2019 . Portably Improving Uintah\u2019s Readiness for Exascale Systems Through the Use of Kokkos . Technical Report UUSCI-2019-001. SCI Institute. J.\u00a0K. Holmen, B. Peterson, A. Humphrey, D. Sunderland, O.\u00a0H. Diaz-Ibarra, J.\u00a0N. Thornock, and M. Berzins. 2019. Portably Improving Uintah\u2019s Readiness for Exascale Systems Through the Use of Kokkos. Technical Report UUSCI-2019-001. SCI Institute."},{"volume-title":"The RAJA portability layer: overview and status","author":"Hornung D.","key":"e_1_3_2_1_18_1","unstructured":"R.\u00a0 D. Hornung and J.\u00a0 A. Keasler . 2014. The RAJA portability layer: overview and status . Technical Report. Lawrence Livermore National Laboratory (LLNL), Livermore, CA. R.\u00a0D. Hornung and J.\u00a0A. Keasler. 2014. The RAJA portability layer: overview and status. Technical Report. Lawrence Livermore National Laboratory (LLNL), Livermore, CA."},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"crossref","unstructured":"A. Humphrey T. Harman M. Berzins and P. Smith. 2015. A Scalable Algorithm for Radiative Heat Transfer Using Reverse Monte Carlo Ray Tracing. In High Performance Computing Julian\u00a0M. Kunkel and Thomas Ludwig (Eds.). Lecture Notes in Computer Science Vol.\u00a09137. Springer International Publishing 212\u2013230.  A. Humphrey T. Harman M. Berzins and P. Smith. 2015. A Scalable Algorithm for Radiative Heat Transfer Using Reverse Monte Carlo Ray Tracing. In High Performance Computing Julian\u00a0M. Kunkel and Thomas Ludwig (Eds.). Lecture Notes in Computer Science Vol.\u00a09137. Springer International Publishing 212\u2013230.","DOI":"10.1007\/978-3-319-20119-1_16"},{"volume-title":"Proceedings of the first conference of the Extreme Science and Engineering Discovery Environment (XSEDE\u201912)","author":"Humphrey A.","key":"e_1_3_2_1_21_1","unstructured":"A. Humphrey , Q. Meng , M. Berzins , and T. Harman . 2012. Radiation Modeling Using the Uintah Heterogeneous CPU\/GPU Runtime System . In Proceedings of the first conference of the Extreme Science and Engineering Discovery Environment (XSEDE\u201912) . Association for Computing Machinery. A. Humphrey, Q. Meng, M. Berzins, and T. Harman. 2012. Radiation Modeling Using the Uintah Heterogeneous CPU\/GPU Runtime System. In Proceedings of the first conference of the Extreme Science and Engineering Discovery Environment (XSEDE\u201912). Association for Computing Machinery."},{"volume-title":"2016 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW). 1222\u20131231","author":"Humphrey A.","key":"e_1_3_2_1_22_1","unstructured":"A. Humphrey , D. Sunderland , T. Harman , and M. Berzins . 2016. Radiative Heat Transfer Calculation on 16384 GPUs Using a Reverse Monte Carlo Ray Tracing Approach with Adaptive Mesh Refinement . In 2016 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW). 1222\u20131231 . A. Humphrey, D. Sunderland, T. Harman, and M. Berzins. 2016. Radiative Heat Transfer Calculation on 16384 GPUs Using a Reverse Monte Carlo Ray Tracing Approach with Adaptive Mesh Refinement. In 2016 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW). 1222\u20131231."},{"key":"e_1_3_2_1_23_1","unstructured":"A. Johnson. 2020. Area Exam: General-Purpose Performance Portable Programming Models for Productive Exascale Computing. (2020).  A. Johnson. 2020. Area Exam: General-Purpose Performance Portable Programming Models for Productive Exascale Computing. (2020)."},{"volume-title":"Proceedings of the 8th International Conference on Partitioned Global Address Space Programming Models","author":"Kaiser H.","key":"e_1_3_2_1_24_1","unstructured":"H. Kaiser , T. Heller , B. Adelstein-Lelbach , A. Serio , and D. Fey . 2014. HPX: A Task Based Programming Model in a Global Address Space . In Proceedings of the 8th International Conference on Partitioned Global Address Space Programming Models ( Eugene, OR, USA) (PGAS \u201914). ACM, New York, NY, USA, Article 6, 11\u00a0pages. H. Kaiser, T. Heller, B. Adelstein-Lelbach, A. Serio, and D. Fey. 2014. HPX: A Task Based Programming Model in a Global Address Space. In Proceedings of the 8th International Conference on Partitioned Global Address Space Programming Models (Eugene, OR, USA) (PGAS \u201914). ACM, New York, NY, USA, Article 6, 11\u00a0pages."},{"volume-title":"Proceedings of the Eighth Annual Conference on Object-oriented Programming Systems, Languages, and Applications (Washington, D.C., USA) (OOPSLA \u201993)","author":"Kale V.","key":"e_1_3_2_1_25_1","unstructured":"L.\u00a0 V. Kale and S. Krishnan . 1993. CHARM++: A Portable Concurrent Object Oriented System Based on C++ . In Proceedings of the Eighth Annual Conference on Object-oriented Programming Systems, Languages, and Applications (Washington, D.C., USA) (OOPSLA \u201993) . ACM, New York, NY, USA, 91\u2013108. L.\u00a0V. Kale and S. Krishnan. 1993. CHARM++: A Portable Concurrent Object Oriented System Based on C++. In Proceedings of the Eighth Annual Conference on Object-oriented Programming Systems, Languages, and Applications (Washington, D.C., USA) (OOPSLA \u201993). ACM, New York, NY, USA, 91\u2013108."},{"key":"e_1_3_2_1_26_1","unstructured":"R. Keryell M. Rovatsou and L. Howes. 2019. Khronos Group SYCL 1.2.1 Specification. https:\/\/www.khronos.org\/registry\/SYCL\/specs\/sycl-1.2.1.pdf.  R. Keryell M. Rovatsou and L. Howes. 2019. Khronos Group SYCL 1.2.1 Specification. https:\/\/www.khronos.org\/registry\/SYCL\/specs\/sycl-1.2.1.pdf."},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"crossref","unstructured":"S. Kumar A. Humphrey W. Usher S. Petruzza B. Peterson J.\u00a0A. Schmidt D. Harris B. Isaac J. Thornock T. Harman V. Pascucci and M. Berzins. 2018. Scalable Data Management of the Uintah Simulation Framework for Next-Generation Engineering Problems with Radiation. In Supercomputing Frontiers Rio Yokota and Weigang Wu (Eds.). Springer International Publishing 219\u2013240. https:\/\/doi.org\/10.1007\/978-3-319-69953-0_13  S. Kumar A. Humphrey W. Usher S. Petruzza B. Peterson J.\u00a0A. Schmidt D. Harris B. Isaac J. Thornock T. Harman V. Pascucci and M. Berzins. 2018. Scalable Data Management of the Uintah Simulation Framework for Next-Generation Engineering Problems with Radiation. In Supercomputing Frontiers Rio Yokota and Weigang Wu (Eds.). Springer International Publishing 219\u2013240. https:\/\/doi.org\/10.1007\/978-3-319-69953-0_13","DOI":"10.1007\/978-3-319-69953-0_13"},{"key":"e_1_3_2_1_28_1","volume-title":"OCCA: A unified approach to multi-threading languages. arXiv preprint arXiv:1403.0968(2014).","author":"Medina S.","year":"2014","unstructured":"D.\u00a0 S. Medina , A. St-Cyr , and T. Warburton . 2014 . OCCA: A unified approach to multi-threading languages. arXiv preprint arXiv:1403.0968(2014). D.\u00a0S. Medina, A. St-Cyr, and T. Warburton. 2014. OCCA: A unified approach to multi-threading languages. arXiv preprint arXiv:1403.0968(2014)."},{"volume-title":"Proceedings of the TeraGrid 2011 Conference","author":"Meng Q.","key":"e_1_3_2_1_29_1","unstructured":"Q. Meng , M. Berzins , and J. Schmidt . 2011. Using Hybrid Parallelism to improve memory use in Uintah . In Proceedings of the TeraGrid 2011 Conference ( Salt Lake City, Utah). ACM. Q. Meng, M. Berzins, and J. Schmidt. 2011. Using Hybrid Parallelism to improve memory use in Uintah. In Proceedings of the TeraGrid 2011 Conference (Salt Lake City, Utah). ACM."},{"key":"e_1_3_2_1_30_1","volume-title":"Storage and Analysis (SCC)","author":"Meng Q.","year":"2012","unstructured":"Q. Meng , A. Humphrey , and M. Berzins . 2012. The Uintah Framework: A Unified Heterogeneous Task Scheduling and Runtime System. In High Performance Computing, Networking , Storage and Analysis (SCC) , 2012 SC Companion:. 2441\u20132448. Q. Meng, A. Humphrey, and M. Berzins. 2012. The Uintah Framework: A Unified Heterogeneous Task Scheduling and Runtime System. In High Performance Computing, Networking, Storage and Analysis (SCC), 2012 SC Companion:. 2441\u20132448."},{"volume-title":"Proceedings of the Conference on Extreme Science and Engineering Discovery Environment: Gateway to Discovery (XSEDE 2013)","author":"Meng Q.","key":"e_1_3_2_1_31_1","unstructured":"Q. Meng , A. Humphrey , J. Schmidt , and M. Berzins . 2013. Preliminary Experiences with the Uintah Framework on Intel Xeon Phi and Stampede . In Proceedings of the Conference on Extreme Science and Engineering Discovery Environment: Gateway to Discovery (XSEDE 2013) (San Diego, California). 48:1\u201348:8. Q. Meng, A. Humphrey, J. Schmidt, and M. Berzins. 2013. Preliminary Experiences with the Uintah Framework on Intel Xeon Phi and Stampede. In Proceedings of the Conference on Extreme Science and Engineering Discovery Environment: Gateway to Discovery (XSEDE 2013) (San Diego, California). 48:1\u201348:8."},{"volume-title":"Proceedings of the 5th International Workshop on Domain-Specific Languages and High-Level Frameworks for High Performance Computing","author":"Peterson B.","key":"e_1_3_2_1_32_1","unstructured":"B. Peterson , H. Dasari , A. Humphrey , J. Sutherland , T. Saad , and M. Berzins . 2015. Reducing Overhead in the Uintah Framework to Support Short-lived Tasks on GPU-heterogeneous Architectures . In Proceedings of the 5th International Workshop on Domain-Specific Languages and High-Level Frameworks for High Performance Computing ( Austin, Texas) (WOLFHPC \u201915). ACM, New York, NY, USA, Article 4, 8\u00a0pages. https:\/\/doi.org\/10.1145\/2830018.2830023 B. Peterson, H. Dasari, A. Humphrey, J. Sutherland, T. Saad, and M. Berzins. 2015. Reducing Overhead in the Uintah Framework to Support Short-lived Tasks on GPU-heterogeneous Architectures. In Proceedings of the 5th International Workshop on Domain-Specific Languages and High-Level Frameworks for High Performance Computing (Austin, Texas) (WOLFHPC \u201915). ACM, New York, NY, USA, Article 4, 8\u00a0pages. https:\/\/doi.org\/10.1145\/2830018.2830023"},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.jocs.2018.06.005"},{"volume-title":"Addressing Global Data Dependencies in Heterogeneous Asynchronous Runtime Systems on GPUs. In Third International Workshop on Extreme Scale Programming Models and Middleware(ESPM2). IEEE Press.","author":"Peterson B.","key":"e_1_3_2_1_34_1","unstructured":"B. Peterson , A. Humphrey , J. Schmidt , and M. Berzins . 2017 . Addressing Global Data Dependencies in Heterogeneous Asynchronous Runtime Systems on GPUs. In Third International Workshop on Extreme Scale Programming Models and Middleware(ESPM2). IEEE Press. B. Peterson, A. Humphrey, J. Schmidt, and M. Berzins. 2017. Addressing Global Data Dependencies in Heterogeneous Asynchronous Runtime Systems on GPUs. In Third International Workshop on Extreme Scale Programming Models and Middleware(ESPM2). IEEE Press."},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"crossref","unstructured":"B. Peterson A. Humphrey D. Sunderland J. Sutherland T. Saad H. Dasari and M. Berzins. 2018. Automatic Halo Management for the Uintah GPU-Heterogeneous Asynchronous Many-Task Runtime. International Journal of Parallel Programming (Dec 2018). https:\/\/doi.org\/10.1007\/s10766-018-0619-1  B. Peterson A. Humphrey D. Sunderland J. Sutherland T. Saad H. Dasari and M. Berzins. 2018. Automatic Halo Management for the Uintah GPU-Heterogeneous Asynchronous Many-Task Runtime. International Journal of Parallel Programming (Dec 2018). https:\/\/doi.org\/10.1007\/s10766-018-0619-1","DOI":"10.1007\/s10766-018-0619-1"},{"key":"e_1_3_2_1_36_1","unstructured":"B. Peterson N. Xiao J.\u00a0K. Holmen S. Chaganti A. Pakki J. Schmidt D. Sunderland A. Humphrey and M. Berzins. 2015. Developing Uintah\u2019s Runtime System For Forthcoming Architectures. Technical Report. SCI Institute.  B. Peterson N. Xiao J.\u00a0K. Holmen S. Chaganti A. Pakki J. Schmidt D. Sunderland A. Humphrey and M. Berzins. 2015. Developing Uintah\u2019s Runtime System For Forthcoming Architectures. Technical Report. SCI Institute."},{"key":"e_1_3_2_1_37_1","doi-asserted-by":"crossref","unstructured":"J. Reinders B. Ashbaugh J. Brodman M. Kinsner J. Pennycook and X. Tian. 2021. Data Parallel C++: Mastering DPC++ for Programming of Heterogeneous Systems using C++ and SYCL. Springer Nature.  J. Reinders B. Ashbaugh J. Brodman M. Kinsner J. Pennycook and X. Tian. 2021. Data Parallel C++: Mastering DPC++ for Programming of Heterogeneous Systems using C++ and SYCL. Springer Nature.","DOI":"10.1007\/978-1-4842-5574-2"},{"key":"e_1_3_2_1_38_1","doi-asserted-by":"crossref","unstructured":"D. Sahasrabudhe and M. Berzins. 2020. Improving Performance of the Hypre Iterative Solver for Uintah Combustion Codes on Manycore Architectures Using MPI Endpoints and Kernel Consolidation. In Computational Science \u2013 ICCS 2020. Springer International Publishing Cham 175\u2013190.  D. Sahasrabudhe and M. Berzins. 2020. Improving Performance of the Hypre Iterative Solver for Uintah Combustion Codes on Manycore Architectures Using MPI Endpoints and Kernel Consolidation. In Computational Science \u2013 ICCS 2020. Springer International Publishing Cham 175\u2013190.","DOI":"10.1007\/978-3-030-50371-0_13"},{"key":"e_1_3_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.jocs.2020.101279"},{"key":"e_1_3_2_1_40_1","volume-title":"2013 13th IEEE\/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid). 458\u2013465","author":"Schmidt J.","year":"2013","unstructured":"J. Schmidt , M. Berzins , J. Thornock , T. Saad , and J. Sutherland . 2013. Large Scale Parallel Solution of Incompressible Flow Problems using Uintah and hypre . In 2013 13th IEEE\/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid). 458\u2013465 . https:\/\/doi.org\/10.1109\/CCGrid. 2013 .10 J. Schmidt, M. Berzins, J. Thornock, T. Saad, and J. Sutherland. 2013. Large Scale Parallel Solution of Incompressible Flow Problems using Uintah and hypre. In 2013 13th IEEE\/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid). 458\u2013465. https:\/\/doi.org\/10.1109\/CCGrid.2013.10"},{"volume-title":"16th AIAA Computational Fluid Dynamics Conference. 3697","author":"Smith J.","key":"e_1_3_2_1_41_1","unstructured":"P.\u00a0 J. Smith , R. Rawat , J. Spinti , S. Kumar , S. Borodai , and A. Violi . 2003. Large eddy simulations of accidental fires using massively parallel computers . In 16th AIAA Computational Fluid Dynamics Conference. 3697 . P.\u00a0J. Smith, R. Rawat, J. Spinti, S. Kumar, S. Borodai, and A. Violi. 2003. Large eddy simulations of accidental fires using massively parallel computers. In 16th AIAA Computational Fluid Dynamics Conference. 3697."},{"volume-title":"Proceedings of the Second Internationsl Workshop on Extreme Scale Programming Models and Middleware","author":"Sunderland D.","key":"e_1_3_2_1_42_1","unstructured":"D. Sunderland , B. Peterson , J. Schmidt , A. Humphrey , J. Thornock , and M. Berzins . 2016. An Overview of Performance Portability in the Uintah Runtime System Through the Use of Kokkos . In Proceedings of the Second Internationsl Workshop on Extreme Scale Programming Models and Middleware ( Salt Lake City, Utah) (ESPM2). IEEE Press, Piscataway, NJ, USA, 44\u201347. D. Sunderland, B. Peterson, J. Schmidt, A. Humphrey, J. Thornock, and M. Berzins. 2016. An Overview of Performance Portability in the Uintah Runtime System Through the Use of Kokkos. In Proceedings of the Second Internationsl Workshop on Extreme Scale Programming Models and Middleware (Salt Lake City, Utah) (ESPM2). IEEE Press, Piscataway, NJ, USA, 44\u201347."}],"event":{"name":"PEARC '21: Practice and Experience in Advanced Research Computing","sponsor":["SIGAPP ACM Special Interest Group on Applied Computing","SIGHPC ACM Special Interest Group on High Performance Computing, Special Interest Group on High Performance Computing"],"location":"Boston MA USA","acronym":"PEARC '21"},"container-title":["Practice and Experience in Advanced Research Computing"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3437359.3465581","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3437359.3465581","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T22:03:31Z","timestamp":1750197811000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3437359.3465581"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,7,17]]},"references-count":41,"alternative-id":["10.1145\/3437359.3465581","10.1145\/3437359"],"URL":"https:\/\/doi.org\/10.1145\/3437359.3465581","relation":{},"subject":[],"published":{"date-parts":[[2021,7,17]]},"assertion":[{"value":"2021-07-17","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}