{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,1]],"date-time":"2026-05-01T22:54:56Z","timestamp":1777676096108,"version":"3.51.4"},"reference-count":42,"publisher":"SAGE Publications","issue":"1","license":[{"start":{"date-parts":[[2025,9,27]],"date-time":"2025-09-27T00:00:00Z","timestamp":1758931200000},"content-version":"vor","delay-in-days":365,"URL":"http:\/\/www.sagepub.com\/licence-information-for-chorus"}],"funder":[{"DOI":"10.13039\/100000015","name":"U.S. Department of Energy","doi-asserted-by":"publisher","award":["17-SC-20-SC"],"award-info":[{"award-number":["17-SC-20-SC"]}],"id":[{"id":"10.13039\/100000015","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["2004541"],"award-info":[{"award-number":["2004541"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["journals.sagepub.com"],"crossmark-restriction":true},"short-container-title":["The International Journal of High Performance Computing Applications"],"published-print":{"date-parts":[[2025,1]]},"abstract":"<jats:p>SLATE (Software for Linear Algebra Targeting Exascale) is a distributed, dense linear algebra library targeting both CPU-only and GPU-accelerated systems, developed over the course of the Exascale Computing Project (ECP). While it began with several documents setting out its initial design, significant design changes occurred throughout its development. In some cases, these were anticipated: an early version used a simple consistency flag that was later replaced with a full-featured consistency protocol. In other cases, performance limitations and software and hardware changes prompted a redesign. 
Sequential communication tasks were parallelized; host-to-host MPI calls were replaced with GPU device-to-device MPI calls; more advanced algorithms such as Communication Avoiding LU and the Random Butterfly Transform (RBT) were introduced. Early choices that turned out to be cumbersome, error prone, or inflexible have been replaced with simpler, more intuitive, or more flexible designs. Applications have been a driving force, prompting a lighter weight queue class, nonuniform tile sizes, and more flexible MPI process grids. Of paramount importance has been building a portable library that works across several different GPU architectures \u2013 AMD, Intel, and NVIDIA \u2013 while keeping a clean and maintainable codebase. Here we explore the evolving design choices and their effects, both in terms of performance and software sustainability.<\/jats:p>","DOI":"10.1177\/10943420241286531","type":"journal-article","created":{"date-parts":[[2024,9,28]],"date-time":"2024-09-28T00:33:19Z","timestamp":1727483599000},"page":"3-17","update-policy":"https:\/\/doi.org\/10.1177\/sage-journals-update-policy","source":"Crossref","is-referenced-by-count":5,"title":["Evolution of the SLATE linear algebra library"],"prefix":"10.1177","volume":"39","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-2996-1641","authenticated-orcid":false,"given":"Mark","family":"Gates","sequence":"first","affiliation":[{"name":"Innovative Computing Laboratory, University of Tennessee, Knoxville, TN, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5054-4784","authenticated-orcid":false,"given":"Ahmad","family":"Abdelfattah","sequence":"additional","affiliation":[{"name":"Innovative Computing Laboratory, University of Tennessee, Knoxville, TN, 
USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1057-1590","authenticated-orcid":false,"given":"Kadir","family":"Akbudak","sequence":"additional","affiliation":[{"name":"ECRC (Extreme Computing Research Center), King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Mohammed","family":"Al Farhan","sequence":"additional","affiliation":[{"name":"Hewlett Packard Enterprise (HPE), Spring, TX, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9911-6094","authenticated-orcid":false,"given":"Rabab","family":"Alomairy","sequence":"additional","affiliation":[{"name":"Massachusetts Institute of Technology (MIT), Cambridge, MA, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5731-368X","authenticated-orcid":false,"given":"Daniel","family":"Bielich","sequence":"additional","affiliation":[{"name":"Ansys, Canonsburg, PA, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Treece","family":"Burgess","sequence":"additional","affiliation":[{"name":"Innovative Computing Laboratory, University of Tennessee, Knoxville, TN, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3740-8985","authenticated-orcid":false,"given":"S\u00e9bastien","family":"Cayrols","sequence":"additional","affiliation":[{"name":"NVIDIA, Santa Clara, CA, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Neil","family":"Lindquist","sequence":"additional","affiliation":[{"name":"NVIDIA, Santa Clara, CA, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Dalal","family":"Sukkari","sequence":"additional","affiliation":[{"name":"Innovative Computing Laboratory, University of Tennessee, Knoxville, TN, 
USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3901-9695","authenticated-orcid":false,"given":"Asim","family":"YarKhan","sequence":"additional","affiliation":[{"name":"Oak Ridge National Laboratory, Oak Ridge, TN, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"179","published-online":{"date-parts":[[2024,9,27]]},"reference":[{"key":"bibr1-10943420241286531","volume-title":"Technical Report ICL-UT-17-02, SLATE Working Note 1, Innovative Computing Laboratory","author":"Abdelfattah A","year":"2017"},{"key":"bibr2-10943420241286531","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2408.04440"},{"key":"bibr3-10943420241286531","doi-asserted-by":"publisher","DOI":"10.1016\/B978-0-12-385963-1.00034-4"},{"key":"bibr4-10943420241286531","doi-asserted-by":"publisher","DOI":"10.1109\/TPDS.2017.2766064"},{"key":"bibr5-10943420241286531","doi-asserted-by":"publisher","DOI":"10.1137\/1.9780898719604"},{"key":"bibr6-10943420241286531","doi-asserted-by":"publisher","DOI":"10.1016\/j.parco.2013.12.003"},{"key":"bibr7-10943420241286531","doi-asserted-by":"publisher","DOI":"10.1145\/3447818.3461616"},{"key":"bibr8-10943420241286531","doi-asserted-by":"publisher","DOI":"10.1137\/1.9780898719642"},{"key":"bibr9-10943420241286531","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPS.2011.299"},{"key":"bibr10-10943420241286531","doi-asserted-by":"publisher","DOI":"10.1016\/j.jpdc.2014.07.003"},{"key":"bibr11-10943420241286531","doi-asserted-by":"publisher","DOI":"10.1145\/3581784.3607066"},{"key":"bibr12-10943420241286531","doi-asserted-by":"publisher","DOI":"10.1137\/080731992"},{"key":"bibr13-10943420241286531","doi-asserted-by":"publisher","DOI":"10.1063\/5.0198797"},{"key":"bibr14-10943420241286531","doi-asserted-by":"publisher","DOI":"10.1137\/18M1218212"},{"key":"bibr15-10943420241286531","doi-asserted-by":"publisher","DOI":"10.1145\/2807591.2807623"},{"key":"bibr16-10943420241286531","doi-asserted-by":"p
ublisher","DOI":"10.1145\/3295500.3356223"},{"key":"bibr17-10943420241286531","volume-title":"SLATE Users\u2019 Guide, SWAN No. 10. Technical Report ICL-UT-19-01, Innovative Computing Laboratory","author":"Gates M","year":"2020"},{"key":"bibr18-10943420241286531","doi-asserted-by":"publisher","DOI":"10.1109\/P3HPC56579.2022.00009"},{"key":"bibr19-10943420241286531","doi-asserted-by":"publisher","DOI":"10.1016\/j.parco.2022.102897"},{"key":"bibr20-10943420241286531","doi-asserted-by":"publisher","DOI":"10.56021\/9781421407944"},{"key":"bibr21-10943420241286531","doi-asserted-by":"publisher","DOI":"10.1109\/SC.2008.5214287"},{"key":"bibr22-10943420241286531","volume-title":"2020 Collegeville Workshop on Scientific Software Whitepapers","author":"Heroux M","year":"2020"},{"key":"bibr23-10943420241286531","author":"Hoemmen M","year":"2023","journal-title":"Technical report"},{"key":"bibr24-10943420241286531","unstructured":"Intel (2024) Intel oneAPI programming guide \u2014 C\/C++ OpenMP and SYCL composability. https:\/\/www.intel.com\/content\/www\/us\/en\/docs\/oneapi\/programming-guide\/2024-1\/c-c-openmp-and-sycl-composability.html."},{"key":"bibr25-10943420241286531","doi-asserted-by":"publisher","DOI":"10.1145\/327070.327237"},{"key":"bibr26-10943420241286531","doi-asserted-by":"publisher","DOI":"10.1021\/acs.chemrev.0c00998"},{"key":"bibr27-10943420241286531","volume-title":"SLATE Working Note 3: Designing SLATE: Software for Linear Algebra Targeting Exascale. Technical Report ICL-UT-17-06, Innovative Computing Laboratory","author":"Kurzak J","year":"2017"},{"key":"bibr28-10943420241286531","unstructured":"Lawrence Livermore National Laboratory (2024) Hydrogen. 
https:\/\/github.com\/LLNL\/Elemental."},{"key":"bibr29-10943420241286531","doi-asserted-by":"publisher","DOI":"10.1109\/ScalA51936.2020.00010"},{"key":"bibr30-10943420241286531","doi-asserted-by":"publisher","DOI":"10.1109\/ScalAH56622.2022.00010"},{"key":"bibr31-10943420241286531","unstructured":"Lindquist N, Luszczek P, Dongarra J (2023a) Generalizing random butterfly transforms to arbitrary matrix sizes. https:\/\/arxiv.org\/abs\/2312.09376."},{"key":"bibr32-10943420241286531","doi-asserted-by":"publisher","DOI":"10.1145\/3577193.3593731"},{"key":"bibr33-10943420241286531","doi-asserted-by":"publisher","DOI":"10.1088\/0953-8984\/26\/21\/213201"},{"key":"bibr34-10943420241286531","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-43229-4_7"},{"key":"bibr35-10943420241286531","unstructured":"OLCF (2024a) Frontier user guide. https:\/\/docs.olcf.ornl.gov\/systems\/frontier_user_guide.html."},{"key":"bibr36-10943420241286531","unstructured":"OLCF (2024b) Summit user guide. https:\/\/docs.olcf.ornl.gov\/systems\/summit_user_guide.html."},{"key":"bibr37-10943420241286531","unstructured":"Parker DS (1995) Random butterfly transformations with applications in computational linear algebra. Los Angeles, CA, USA: UCLA Computer Science Department. 
Technical Report CSD-950023."},{"key":"bibr38-10943420241286531","doi-asserted-by":"publisher","DOI":"10.1145\/2427023.2427030"},{"key":"bibr39-10943420241286531","doi-asserted-by":"publisher","DOI":"10.1137\/140993478"},{"key":"bibr40-10943420241286531","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-23397-5_10"},{"key":"bibr41-10943420241286531","doi-asserted-by":"publisher","DOI":"10.1137\/14M0973773"},{"key":"bibr42-10943420241286531","doi-asserted-by":"publisher","DOI":"10.1016\/j.parco.2021.102837"}],"container-title":["The International Journal of High Performance Computing Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/10943420241286531","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/full-xml\/10.1177\/10943420241286531","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/10943420241286531","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/10943420241286531","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,29]],"date-time":"2026-04-29T08:17:39Z","timestamp":1777450659000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.1177\/10943420241286531"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,9,27]]},"references-count":42,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2025,1]]}},"alternative-id":["10.1177\/10943420241286531"],"URL":"https:\/\/doi.org\/10.1177\/10943420241286531","relation":{},"ISSN":["1094-3420","1741-2846"],"issn-type":[{"value":"1094-3420","type":"print"},{"value":"1741-2846","type":"electronic"}],"subject":[],"published"
:{"date-parts":[[2024,9,27]]}}}