{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,20]],"date-time":"2026-01-20T00:39:32Z","timestamp":1768869572943,"version":"3.49.0"},"reference-count":35,"publisher":"SAGE Publications","issue":"2","license":[{"start":{"date-parts":[[2021,5,17]],"date-time":"2021-05-17T00:00:00Z","timestamp":1621209600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/journals.sagepub.com\/page\/policies\/text-and-data-mining-license"}],"content-domain":{"domain":["journals.sagepub.com"],"crossmark-restriction":true},"short-container-title":["The International Journal of High Performance Computing Applications"],"published-print":{"date-parts":[[2022,3]]},"abstract":"<jats:p> The solution of linear systems of equations is a central task in a number of scientific and engineering applications. In many cases the solution of linear systems may take most of the simulation time thus representing a major bottleneck in the further development of scientific and technical software. For large scale simulations, nowadays accounting for several millions or even billions of unknowns, it is quite common to resort to preconditioned iterative solvers for exploiting their low memory requirements and, at least potential, parallelism. Approximate inverses have been shown to be robust and effective preconditioners in various contexts. In this work, we show how adaptive Factored Sparse Approximate Inverse (aFSAI), characterized by a very high degree of parallelism, can be successfully implemented on a distributed memory computer equipped with GPU accelerators. Taking advantage of GPUs in adaptive FSAI set-up is not a trivial task, nevertheless we show through an extensive numerical experimentation how the proposed approach outperforms more traditional preconditioners and results in a close-to-ideal behavior in challenging linear algebra problems. <\/jats:p>","DOI":"10.1177\/10943420211017188","type":"journal-article","created":{"date-parts":[[2021,5,17]],"date-time":"2021-05-17T10:27:05Z","timestamp":1621247225000},"page":"153-166","update-policy":"https:\/\/doi.org\/10.1177\/sage-journals-update-policy","source":"Crossref","is-referenced-by-count":8,"title":["A GPU-accelerated adaptive FSAI preconditioner for massively parallel simulations"],"prefix":"10.1177","volume":"36","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-1548-2378","authenticated-orcid":false,"given":"Giovanni","family":"Isotton","sequence":"first","affiliation":[{"name":"M3E S.r.l., via Giambellino 7, 35129 Paova, Italy"}]},{"given":"Carlo","family":"Janna","sequence":"additional","affiliation":[{"name":"M3E S.r.l., via Giambellino 7, 35129 Paova, Italy"}]},{"given":"Massimo","family":"Bernaschi","sequence":"additional","affiliation":[{"name":"Institute for Applied Computing, CNR, 00185 Rome, Italy"}]}],"member":"179","published-online":{"date-parts":[[2021,5,17]]},"reference":[{"key":"bibr1-10943420211017188","doi-asserted-by":"publisher","DOI":"10.1016\/j.parco.2017.10.003"},{"key":"bibr2-10943420211017188","unstructured":"Balay S, Abhyankar S, Adams MF, et al. (2019) PETSc Web page. Available at: https:\/\/www.mcs.anl.gov\/petsc (accessed 2021)."},{"key":"bibr3-10943420211017188","doi-asserted-by":"publisher","DOI":"10.1137\/110838844"},{"key":"bibr4-10943420211017188","doi-asserted-by":"publisher","DOI":"10.1006\/jcph.2002.7176"},{"key":"bibr5-10943420211017188","doi-asserted-by":"publisher","DOI":"10.1016\/S0168-9274(98)00118-4"},{"key":"bibr6-10943420211017188","doi-asserted-by":"publisher","DOI":"10.1137\/S1064827594271421"},{"key":"bibr7-10943420211017188","doi-asserted-by":"publisher","DOI":"10.1137\/15M1027826"},{"key":"bibr8-10943420211017188","doi-asserted-by":"publisher","DOI":"10.1137\/18M1197461"},{"key":"bibr9-10943420211017188","first-page":"1","volume":"92","author":"Bernaschi M","year":"2019","journal-title":"Parallel Computing"},{"key":"bibr10-10943420211017188","first-page":"183","volume":"25","author":"Bertaccini D","year":"2014","journal-title":"Advances in Parallel Computing"},{"key":"bibr11-10943420211017188","doi-asserted-by":"publisher","DOI":"10.1137\/S1064827594270415"},{"key":"bibr12-10943420211017188","doi-asserted-by":"publisher","DOI":"10.1007\/3-540-47789-6_66"},{"key":"bibr13-10943420211017188","doi-asserted-by":"publisher","DOI":"10.1016\/j.cma.2019.04.034"},{"key":"bibr14-10943420211017188","doi-asserted-by":"publisher","DOI":"10.1016\/j.camwa.2014.08.022"},{"key":"bibr15-10943420211017188","doi-asserted-by":"publisher","DOI":"10.1137\/S1064827594276552"},{"key":"bibr16-10943420211017188","doi-asserted-by":"publisher","DOI":"10.1002\/cpe.5484"},{"key":"bibr17-10943420211017188","doi-asserted-by":"publisher","DOI":"10.1137\/100810368"},{"key":"bibr18-10943420211017188","doi-asserted-by":"publisher","DOI":"10.1145\/2629475"},{"key":"bibr19-10943420211017188","unstructured":"Janna C, Isotton G, Frigo M (2020) Chronos Web page. Available at: https:\/\/www.m3eweb.it\/chronos\/. URL https:\/\/www.m3eweb.it\/chronos\/ (accessed 2021)."},{"key":"bibr20-10943420211017188","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevLett.106.194502"},{"key":"bibr21-10943420211017188","doi-asserted-by":"publisher","DOI":"10.1002\/nla.1680010208"},{"key":"bibr22-10943420211017188","doi-asserted-by":"publisher","DOI":"10.1137\/0614004"},{"key":"bibr23-10943420211017188","doi-asserted-by":"publisher","DOI":"10.1016\/j.compstruc.2014.05.009"},{"key":"bibr24-10943420211017188","doi-asserted-by":"publisher","DOI":"10.1137\/120872735"},{"key":"bibr25-10943420211017188","doi-asserted-by":"publisher","DOI":"10.1007\/s11227-012-0825-3"},{"key":"bibr26-10943420211017188","doi-asserted-by":"publisher","DOI":"10.1177\/1094342020905637"},{"key":"bibr27-10943420211017188","doi-asserted-by":"publisher","DOI":"10.1007\/s11227-020-03201-5"},{"key":"bibr28-10943420211017188","doi-asserted-by":"publisher","DOI":"10.1007\/s10237-016-0828-8"},{"key":"bibr29-10943420211017188","doi-asserted-by":"publisher","DOI":"10.1002\/cnm.3308"},{"key":"bibr30-10943420211017188","doi-asserted-by":"publisher","DOI":"10.1137\/140980260"},{"key":"bibr31-10943420211017188","doi-asserted-by":"publisher","DOI":"10.1137\/17M1161178"},{"key":"bibr32-10943420211017188","unstructured":"Strohmaier E, Dongarra J, Simon H, et al. (2020) Top500: the list of the 500 most powerful computer systems. Available at: https:\/\/www.top500.org (accessed 2021)."},{"key":"bibr33-10943420211017188","unstructured":"Trilinos Project Team T (2020) The Trilinos Project Website. Available at: https:\/\/trilinos.github.io (accessed 22 May 2020)."},{"key":"bibr34-10943420211017188","doi-asserted-by":"publisher","DOI":"10.1017\/S0962492917000083"},{"key":"bibr35-10943420211017188","doi-asserted-by":"publisher","DOI":"10.1016\/j.finel.2010.11.005"}],"container-title":["The International Journal of High Performance Computing Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/10943420211017188","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/full-xml\/10.1177\/10943420211017188","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/10943420211017188","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,3,1]],"date-time":"2025-03-01T05:39:45Z","timestamp":1740807585000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.1177\/10943420211017188"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,5,17]]},"references-count":35,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2022,3]]}},"alternative-id":["10.1177\/10943420211017188"],"URL":"https:\/\/doi.org\/10.1177\/10943420211017188","relation":{},"ISSN":["1094-3420","1741-2846"],"issn-type":[{"value":"1094-3420","type":"print"},{"value":"1741-2846","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,5,17]]}}}