{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,9]],"date-time":"2026-05-09T11:09:06Z","timestamp":1778324946446,"version":"3.51.4"},"reference-count":70,"publisher":"SAGE Publications","issue":"5","license":[{"start":{"date-parts":[[2024,6,27]],"date-time":"2024-06-27T00:00:00Z","timestamp":1719446400000},"content-version":"vor","delay-in-days":366,"URL":"http:\/\/www.sagepub.com\/licence-information-for-chorus"}],"funder":[{"DOI":"10.13039\/100006132","name":"Office of Science","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100006132","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["journals.sagepub.com"],"crossmark-restriction":true},"short-container-title":["The International Journal of High Performance Computing Applications"],"published-print":{"date-parts":[[2023,9]]},"abstract":"<jats:p>High-resolution simulations of polar ice sheets play a crucial role in the ongoing effort to develop more accurate and reliable Earth system models for probabilistic sea-level projections. These simulations often require a massive amount of memory and computation from large supercomputing clusters to provide sufficient accuracy and resolution; therefore, it has become essential to ensure performance on these platforms. Many of today\u2019s supercomputers contain a diverse set of computing architectures and require specific programming interfaces in order to obtain optimal efficiency. In an effort to avoid architecture-specific programming and maintain productivity across platforms, the ice-sheet modeling code known as MPAS-Albany Land Ice (MALI) uses high-level abstractions to integrate Trilinos libraries and the Kokkos programming model for performance portable code across a variety of different architectures. In this article, we analyze the performance portable features of MALI via a performance analysis on current CPU-based and GPU-based supercomputers. The analysis highlights not only the performance portable improvements made in finite element assembly and multigrid preconditioning within MALI with speedups between 1.26 and 1.82x across CPU and GPU architectures but also identifies the need to further improve performance in software coupling and preconditioning on GPUs. We perform a weak scalability study and show that simulations on GPU-based machines perform 1.24\u20131.92x faster when utilizing the GPUs. The best performance is found in finite element assembly, which achieved a speedup of up to 8.65x and a weak scaling efficiency of 82.6% with GPUs. We additionally describe an automated performance testing framework developed for this code base using a changepoint detection method. The framework is used to make actionable decisions about performance within MALI. We provide several concrete examples of scenarios in which the framework has identified performance regressions, improvements, and algorithm differences over the course of 2\u00a0years of development.<\/jats:p>","DOI":"10.1177\/10943420231183688","type":"journal-article","created":{"date-parts":[[2023,6,27]],"date-time":"2023-06-27T03:48:58Z","timestamp":1687837738000},"page":"600-625","update-policy":"https:\/\/doi.org\/10.1177\/sage-journals-update-policy","source":"Crossref","is-referenced-by-count":8,"title":["Performance portable ice-sheet modeling with MALI"],"prefix":"10.1177","volume":"37","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-9853-1747","authenticated-orcid":false,"given":"Jerry","family":"Watkins","sequence":"first","affiliation":[{"name":"Sandia National Laboratories, Livermore, CA, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Max","family":"Carlson","sequence":"additional","affiliation":[{"name":"Sandia National Laboratories, Livermore, CA, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Kyle","family":"Shan","sequence":"additional","affiliation":[{"name":"Micron Technology, Boise, ID, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Irina","family":"Tezaur","sequence":"additional","affiliation":[{"name":"Sandia National Laboratories, Livermore, CA, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2671-8032","authenticated-orcid":false,"given":"Mauro","family":"Perego","sequence":"additional","affiliation":[{"name":"Sandia National Laboratories, Albuquerque, NM, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Luca","family":"Bertagna","sequence":"additional","affiliation":[{"name":"Sandia National Laboratories, Albuquerque, NM, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Carolyn","family":"Kao","sequence":"additional","affiliation":[{"name":"TSMC, Hsinchu, Taiwan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Matthew J","family":"Hoffman","sequence":"additional","affiliation":[{"name":"Los Alamos National Laboratory, Los Alamos, NM, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Stephen F","family":"Price","sequence":"additional","affiliation":[{"name":"Los Alamos National Laboratory, Los Alamos, NM, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"179","published-online":{"date-parts":[[2023,6,27]]},"reference":[{"key":"bibr1-10943420231183688","doi-asserted-by":"publisher","DOI":"10.1007\/s10115-016-0987-z"},{"key":"bibr2-10943420231183688","doi-asserted-by":"publisher","DOI":"10.1155\/2012\/693861"},{"key":"bibr3-10943420231183688","doi-asserted-by":"publisher","DOI":"10.1155\/2012\/243875"},{"key":"bibr4-10943420231183688","doi-asserted-by":"publisher","DOI":"10.2172\/1491860"},{"key":"bibr5-10943420231183688","doi-asserted-by":"publisher","DOI":"10.3189\/S002214300001621X"},{"key":"bibr6-10943420231183688","first-page":"3","volume":"8","author":"Bonferroni C","year":"1936","journal-title":"Pubblicazioni del R Istituto Superiore di Scienze Economiche e Commericiali di Firenze"},{"key":"bibr7-10943420231183688","doi-asserted-by":"publisher","DOI":"10.1016\/j.cageo.2014.07.019"},{"key":"bibr8-10943420231183688","doi-asserted-by":"publisher","DOI":"10.1201\/9781315367989"},{"key":"bibr9-10943420231183688","doi-asserted-by":"publisher","DOI":"10.1137\/110834512"},{"key":"bibr10-10943420231183688","first-page":"177","author":"Carlson M","year":"2020","journal-title":"CSRI Summer Proceedings"},{"key":"bibr11-10943420231183688","doi-asserted-by":"publisher","DOI":"10.1016\/j.jcp.2019.07.024"},{"key":"bibr12-10943420231183688","doi-asserted-by":"publisher","DOI":"10.1016\/j.jcp.2012.08.037"},{"key":"bibr13-10943420231183688","volume-title":"The Physics of Glaciers","author":"Cuffey KM","year":"2010"},{"key":"bibr14-10943420231183688","doi-asserted-by":"crossref","unstructured":"Daly D, Brown W, Ingo H, et al. (2020) The use of change point detection to identify software performance regressions in a continuous integration system Proceedings of the ACM\/SPEC International Conference on Performance Engineering, pp. 67\u201375.","DOI":"10.1145\/3358960.3375791"},{"key":"bibr15-10943420231183688","doi-asserted-by":"publisher","DOI":"10.1177\/1094342017749957"},{"key":"bibr16-10943420231183688","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-27140-8_10"},{"key":"bibr17-10943420231183688","doi-asserted-by":"publisher","DOI":"10.3189\/002214310792447851"},{"key":"bibr18-10943420231183688","doi-asserted-by":"publisher","DOI":"10.1016\/j.jpdc.2014.07.003"},{"key":"bibr19-10943420231183688","doi-asserted-by":"publisher","DOI":"10.1038\/s41586-021-03302-y"},{"key":"bibr20-10943420231183688","first-page":"1","author":"Fischler Y","year":"2021","journal-title":"Geoscientific Model Development Discussions"},{"key":"bibr21-10943420231183688","first-page":"741","volume-title":"Climate change 2013: the physical science basis. Contribution of Working Group I to the Fifth Assessment Report of the Intergovernmental Panel on Climate Change","author":"Flato G","year":"2014"},{"key":"bibr22-10943420231183688","doi-asserted-by":"publisher","DOI":"10.1145\/3454122.3454124"},{"key":"bibr23-10943420231183688","doi-asserted-by":"publisher","DOI":"10.5194\/gmd-6-1299-2013"},{"key":"bibr24-10943420231183688","doi-asserted-by":"publisher","DOI":"10.5194\/tc-14-3071-2020"},{"key":"bibr25-10943420231183688","doi-asserted-by":"publisher","DOI":"10.1080\/00224065.2003.11980233"},{"key":"bibr26-10943420231183688","doi-asserted-by":"publisher","DOI":"10.1137\/21M1395260"},{"key":"bibr27-10943420231183688","doi-asserted-by":"publisher","DOI":"10.1145\/1089014.1089021"},{"key":"bibr28-10943420231183688","doi-asserted-by":"crossref","unstructured":"Hoefler T, Belli R (2015) Scientific benchmarking of parallel computing systems: twelve ways to tell the masses when reporting performance results Proceedings of the international conference for high performance computing, networking, storage and analysis, pp. 1\u201312.","DOI":"10.1145\/2807591.2807644"},{"key":"bibr29-10943420231183688","doi-asserted-by":"publisher","DOI":"10.5194\/gmd-11-3747-2018"},{"key":"bibr30-10943420231183688","doi-asserted-by":"publisher","DOI":"10.1137\/140974407"},{"key":"bibr31-10943420231183688","doi-asserted-by":"publisher","DOI":"10.1016\/j.infsof.2014.05.006"},{"issue":"1","key":"bibr32-10943420231183688","volume":"117","author":"Larour E","year":"2012","journal-title":"Journal of Geophysical Research: Earth Surface"},{"key":"bibr33-10943420231183688","doi-asserted-by":"publisher","DOI":"10.5194\/esd-11-35-2020"},{"key":"bibr34-10943420231183688","doi-asserted-by":"publisher","DOI":"10.1016\/j.jesp.2013.03.013"},{"key":"bibr35-10943420231183688","doi-asserted-by":"publisher","DOI":"10.2172\/1332474"},{"key":"bibr36-10943420231183688","doi-asserted-by":"publisher","DOI":"10.1098\/rspa.1957.0026"},{"key":"bibr37-10943420231183688","author":"Oppenheimer M","year":"2019","journal-title":"Coasts and Communities"},{"issue":"8","key":"bibr38-10943420231183688","volume":"108","author":"Pattyn F","year":"2003","journal-title":"Journal of Geophysical Research: Solid Earth"},{"key":"bibr39-10943420231183688","doi-asserted-by":"publisher","DOI":"10.1007\/s40641-017-0069-7"},{"key":"bibr40-10943420231183688","doi-asserted-by":"publisher","DOI":"10.1155\/2012\/202071"},{"key":"bibr41-10943420231183688","doi-asserted-by":"publisher","DOI":"10.1155\/2012\/818262"},{"key":"bibr42-10943420231183688","doi-asserted-by":"publisher","DOI":"10.1029\/2020gl091741"},{"key":"bibr43-10943420231183688","first-page":"2825","volume":"12","author":"Pedregosa F","year":"2011","journal-title":"Journal of Machine Learning Research"},{"key":"bibr44-10943420231183688","doi-asserted-by":"publisher","DOI":"10.1016\/j.jocs.2021.101347"},{"key":"bibr45-10943420231183688","volume":"90","author":"Pennycook S","year":"2017","journal-title":"Future Generation Computer Systems"},{"key":"bibr46-10943420231183688","unstructured":"Pennycook SJ, Sewall J, Lee V (2016) A metric for performance portability.\n                      arXiv preprint arXiv:1611.07409\n                      ."},{"key":"bibr47-10943420231183688","doi-asserted-by":"publisher","DOI":"10.1109\/MCSE.2021.3097276"},{"key":"bibr48-10943420231183688","doi-asserted-by":"publisher","DOI":"10.1002\/2014JF003181"},{"key":"bibr49-10943420231183688","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-30023-3_28"},{"key":"bibr50-10943420231183688","doi-asserted-by":"crossref","unstructured":"Prokopenko A, Siefert CM, Hu JJ, Hoemmen M, Klinvex A (2016) Ifpack2 users guide 1.0. Technical report SAND2016-5338, Sandia National Labs, 2016.","DOI":"10.2172\/1259544"},{"key":"bibr51-10943420231183688","first-page":"589","volume-title":"Climate change 2007: The physical science basis. Contribution of Working Group I to the Fourth Assessment Report of the IPCC (FAR)","author":"Randall DA","year":"2007"},{"key":"bibr52-10943420231183688","doi-asserted-by":"publisher","DOI":"10.5194\/gmd-13-955-2020"},{"key":"bibr53-10943420231183688","doi-asserted-by":"publisher","DOI":"10.5194\/gmd-2021-411"},{"key":"bibr54-10943420231183688","doi-asserted-by":"publisher","DOI":"10.1016\/j.ocemod.2013.04.010"},{"issue":"2","key":"bibr55-10943420231183688","volume":"114","author":"Rutt IC","year":"2009","journal-title":"Journal of Geophysical Research: Earth Surface"},{"key":"bibr56-10943420231183688","doi-asserted-by":"publisher","DOI":"10.1615\/IntJMultCompEng.2016017040"},{"key":"bibr57-10943420231183688","doi-asserted-by":"publisher","DOI":"10.1093\/qjmam\/hbp025"},{"key":"bibr58-10943420231183688","doi-asserted-by":"publisher","DOI":"10.5194\/tc-14-3033-2020"},{"key":"bibr59-10943420231183688","doi-asserted-by":"publisher","DOI":"10.1201\/b17279"},{"key":"bibr60-10943420231183688","doi-asserted-by":"publisher","DOI":"10.5194\/gmd-8-1197-2015"},{"key":"bibr61-10943420231183688","doi-asserted-by":"publisher","DOI":"10.1016\/j.procs.2015.05.467"},{"key":"bibr62-10943420231183688","volume-title":"The NOX and LOCA Project Website","author":"The NOX and LOCA Project Team","year":"2022"},{"key":"bibr63-10943420231183688","volume-title":"The Teuchos Project Website","author":"The Teuchos Project Team","year":"2022"},{"key":"bibr64-10943420231183688","volume-title":"June 2021 TOP500 List","author":"TOP500","year":"2021"},{"key":"bibr65-10943420231183688","doi-asserted-by":"publisher","DOI":"10.1109\/MCSE.2021.3098509"},{"key":"bibr66-10943420231183688","doi-asserted-by":"publisher","DOI":"10.1109\/TPDS.2021.3097283"},{"key":"bibr67-10943420231183688","doi-asserted-by":"publisher","DOI":"10.1137\/15M1040839"},{"key":"bibr68-10943420231183688","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-30705-9_16"},{"key":"bibr69-10943420231183688","doi-asserted-by":"publisher","DOI":"10.5194\/tc-5-715-2011"},{"key":"bibr70-10943420231183688","doi-asserted-by":"publisher","DOI":"10.1109\/P3HPC.2018.00005"}],"container-title":["The International Journal of High Performance Computing Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/10943420231183688","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/full-xml\/10.1177\/10943420231183688","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/10943420231183688","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/10943420231183688","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,29]],"date-time":"2026-04-29T08:17:31Z","timestamp":1777450651000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.1177\/10943420231183688"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,6,27]]},"references-count":70,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2023,9]]}},"alternative-id":["10.1177\/10943420231183688"],"URL":"https:\/\/doi.org\/10.1177\/10943420231183688","relation":{},"ISSN":["1094-3420","1741-2846"],"issn-type":[{"value":"1094-3420","type":"print"},{"value":"1741-2846","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,6,27]]}}}