{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,3,1]],"date-time":"2025-03-01T05:46:56Z","timestamp":1740808016075,"version":"3.38.0"},"reference-count":25,"publisher":"SAGE Publications","issue":"2","license":[{"start":{"date-parts":[[2016,7,28]],"date-time":"2016-07-28T00:00:00Z","timestamp":1469664000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/journals.sagepub.com\/page\/policies\/text-and-data-mining-license"}],"content-domain":{"domain":["journals.sagepub.com"],"crossmark-restriction":true},"short-container-title":["The International Journal of High Performance Computing Applications"],"published-print":{"date-parts":[[2017,3]]},"abstract":"<jats:p> We present a performance analysis of a parallel implementation for both preconditioned conjugate gradient and preconditioned bi-conjugate gradient solvers running on graphic processing units (GPUs) with CUDA programming model. The solvers were mainly optimized for the solution of sparse systems of algebraic equations at complex entries, arising from the three-dimensional edge-finite element analysis of the electromagnetic phenomena involved in the open-bound earth diffusion of currents under time-harmonic excitation. We used a shifted incomplete Cholesky (IC) factorization as preconditioner. Results show a significant speedup by using either a single-GPU or a multi-GPU device, compared to a serial central processing unit (CPU) implementation, thereby allowing the simulations of large-scale problems in low-cost personal computers. Additional experiments of the optimized solvers show that its use can be extended successfully to other complex systems of equations arising in electrical engineering, such as those obtained in power\u2013system analysis. <\/jats:p>","DOI":"10.1177\/1094342015584476","type":"journal-article","created":{"date-parts":[[2015,5,13]],"date-time":"2015-05-13T01:32:08Z","timestamp":1431480728000},"page":"119-133","update-policy":"https:\/\/doi.org\/10.1177\/sage-journals-update-policy","source":"Crossref","is-referenced-by-count":1,"title":["GPU-accelerated iterative solution of complex-entry systems issued from 3D edge-FEA of electromagnetics in the frequency domain"],"prefix":"10.1177","volume":"31","author":[{"given":"Ana Fl\u00e1via P.","family":"Camargos","sequence":"first","affiliation":[{"name":"Instituto Federal de Minas Gerais, Formiga\u2013MG, Brasil"},{"name":"Escola Polit\u00e9cnica da Universidade de S\u00e3o Paulo\u2013SP, S\u00e3o Paulo, Brasil"}]},{"given":"Viviane C.","family":"Silva","sequence":"additional","affiliation":[{"name":"Escola Polit\u00e9cnica da Universidade de S\u00e3o Paulo\u2013SP, S\u00e3o Paulo, Brasil"}]},{"given":"Jean-M.","family":"Guichon","sequence":"additional","affiliation":[{"name":"Grenoble G\u00e9nie Electrique Laboratoire, CNRS, France"}]},{"given":"G\u00e9rard","family":"Meunier","sequence":"additional","affiliation":[{"name":"Grenoble G\u00e9nie Electrique Laboratoire, CNRS, France"}]}],"member":"179","published-online":{"date-parts":[[2016,7,28]]},"reference":[{"key":"bibr1-1094342015584476","doi-asserted-by":"publisher","DOI":"10.1109\/PDP.2010.51"},{"key":"bibr2-1094342015584476","doi-asserted-by":"publisher","DOI":"10.1137\/1.9781611971538"},{"volume-title":"Efficient sparse matrix-vector multiplication on CUDA","year":"2008","author":"Bell N","key":"bibr3-1094342015584476"},{"key":"bibr4-1094342015584476","doi-asserted-by":"publisher","DOI":"10.1109\/TMAG.2013.2285091"},{"key":"bibr5-1094342015584476","doi-asserted-by":"publisher","DOI":"10.1109\/PDP.2014.95"},{"key":"bibr6-1094342015584476","doi-asserted-by":"publisher","DOI":"10.1109\/20.250792"},{"key":"bibr7-1094342015584476","doi-asserted-by":"publisher","DOI":"10.1016\/j.cam.2011.04.025"},{"key":"bibr8-1094342015584476","doi-asserted-by":"publisher","DOI":"10.6028\/jres.049.044"},{"key":"bibr9-1094342015584476","doi-asserted-by":"publisher","DOI":"10.1109\/TPWRS.2013.2252631"},{"key":"bibr10-1094342015584476","doi-asserted-by":"publisher","DOI":"10.1109\/TMAG.2011.2175375"},{"key":"bibr11-1094342015584476","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-32683-7_8"},{"key":"bibr12-1094342015584476","unstructured":"Li R, Saad Y (n.d.) GPU-accelerated preconditioned iterative linear solvers. Available at: http:\/\/citeseerx.ist.psu.edu\/index (accessed April 2015)."},{"key":"bibr13-1094342015584476","unstructured":"Lumsdanie A, Siek J (1998a) MTL: The Matrix Template Library 2. Available at: http:\/\/osl.iu.edu\/research\/mtl\/mtl2.php3 (accessed May 2015)."},{"key":"bibr14-1094342015584476","unstructured":"Lumsdanie A, Siek J (2014) ITL: The Iterative Template Library. Available at: www.osl.iu.edu\/research\/itl (accessed May 2015)."},{"key":"bibr15-1094342015584476","doi-asserted-by":"publisher","DOI":"10.1109\/TMAG.2013.2282360"},{"key":"bibr16-1094342015584476","doi-asserted-by":"publisher","DOI":"10.1109\/20.312533"},{"key":"bibr17-1094342015584476","doi-asserted-by":"publisher","DOI":"10.1109\/CEFC.2010.5481252"},{"key":"bibr18-1094342015584476","unstructured":"Naumov M (2011) Incomplete-LU and Cholesky preconditioned iterative methods using CUSPARSE and CUBLAS. NVIDIA Corporation. Available at: https:\/\/developer.nvidia.com (accessed June 2011)."},{"key":"bibr19-1094342015584476","doi-asserted-by":"publisher","DOI":"10.1109\/JPROC.2008.917757"},{"key":"bibr20-1094342015584476","doi-asserted-by":"publisher","DOI":"10.1109\/TMAG.2011.2179527"},{"key":"bibr21-1094342015584476","doi-asserted-by":"publisher","DOI":"10.1137\/1.9780898718003"},{"volume-title":"An introduction to the conjugate gradient method without the agonizing pain","year":"1994","author":"Shewchuk JR","key":"bibr22-1094342015584476"},{"key":"bibr23-1094342015584476","doi-asserted-by":"publisher","DOI":"10.1109\/TMAG.2010.2074188"},{"key":"bibr24-1094342015584476","doi-asserted-by":"publisher","DOI":"10.1109\/TAP.2004.835265"},{"key":"bibr25-1094342015584476","doi-asserted-by":"publisher","DOI":"10.1007\/BF01389450"}],"container-title":["The International Journal of High Performance Computing Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/1094342015584476","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/full-xml\/10.1177\/1094342015584476","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/1094342015584476","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,2,28]],"date-time":"2025-02-28T18:13:55Z","timestamp":1740766435000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.1177\/1094342015584476"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2016,7,28]]},"references-count":25,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2017,3]]}},"alternative-id":["10.1177\/1094342015584476"],"URL":"https:\/\/doi.org\/10.1177\/1094342015584476","relation":{},"ISSN":["1094-3420","1741-2846"],"issn-type":[{"type":"print","value":"1094-3420"},{"type":"electronic","value":"1741-2846"}],"subject":[],"published":{"date-parts":[[2016,7,28]]}}}