{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T04:26:53Z","timestamp":1750307213973,"version":"3.41.0"},"reference-count":12,"publisher":"Association for Computing Machinery (ACM)","issue":"4","license":[{"start":{"date-parts":[[2011,12,19]],"date-time":"2011-12-19T00:00:00Z","timestamp":1324252800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["SIGARCH Comput. Archit. News"],"published-print":{"date-parts":[[2011,12,19]]},"abstract":"<jats:p>This paper describes electromagnetical field simulation using the 3D-FDTD method for antenna designing on a CUDAcompatible GPU. We use the Split Perfectly Matched Layer as an absorbing boundary condition. As is well known, the 3D-FDTD method is a kind of stencil computation and is considered better at GPU implementation. In order to find the best blocking size for the target GPU architecture, we empirically explore a design space of blocking size. We also propose a kernel fusing method as one of the efficient optimization methods, which improves the total performance about 10% at the cost of a small increase in memory usage. As a result of evaluation, our implementation of the 3D-FDTD method on a GeForce GTX295 platform achieves about 130 times performance improvement compared to a simple CPU implementation, which is expected to be faster than an ideally parallelized CPU implementation using multicore and SIMD instructions.<\/jats:p>","DOI":"10.1145\/2082156.2082163","type":"journal-article","created":{"date-parts":[[2011,12,27]],"date-time":"2011-12-27T15:22:22Z","timestamp":1324999342000},"page":"26-31","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["GPU implementation and optimization of electromagnetic simulation using the FDTD method for antenna designing"],"prefix":"10.1145","volume":"39","author":[{"given":"Keisuke","family":"Dohi","sequence":"first","affiliation":[{"name":"Nagasaki University, Japan"}]},{"given":"Yuichiro","family":"Shibata","sequence":"additional","affiliation":[{"name":"Nagasaki University, Japan"}]},{"given":"Kiyoshi","family":"Oguri","sequence":"additional","affiliation":[{"name":"Nagasaki University, Japan"}]},{"given":"Takafumi","family":"Fujimoto","sequence":"additional","affiliation":[{"name":"Nagasaki University, Japan"}]}],"member":"320","published-online":{"date-parts":[[2011,12,19]]},"reference":[{"volume-title":"on antennas and propagation","author":"Yee K.","key":"e_1_2_1_1_1","unstructured":"K. Yee , \"Numerical solution of initial boundary value problems involving Maxwell's equations in isotropic media,\" IEEE Trans . on antennas and propagation , vol. 14 , no. 3, pp. 302--307, 1966. K. Yee, \"Numerical solution of initial boundary value problems involving Maxwell's equations in isotropic media,\" IEEE Trans. on antennas and propagation, vol. 14, no. 3, pp. 302--307, 1966."},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMTT.1975.1128640"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/8.546249"},{"volume-title":"Convolution PML (CPML): An efficient FDTD implementation of the CFS-PML for arbitrary media,\" Microwave and optical technology lett","author":"Roden J.","key":"e_1_2_1_4_1","unstructured":"J. Roden and S. Gedney , \" Convolution PML (CPML): An efficient FDTD implementation of the CFS-PML for arbitrary media,\" Microwave and optical technology lett ., vol. 27 , no. 5, pp. 334--339, 2000. J. Roden and S. Gedney, \"Convolution PML (CPML): An efficient FDTD implementation of the CFS-PML for arbitrary media,\" Microwave and optical technology lett., vol. 27, no. 5, pp. 334--339, 2000."},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/LMWC.2003.810124"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1006\/jcph.1994.1159"},{"key":"e_1_2_1_7_1","first-page":"327","volume-title":"A GPU-based calculation using the three-dimensional FDTD method for electromagnetic field analysis,\" in Annu. Int'l Conf. of the IEEE Engineering in Medicine and Biology Society (EMBC)","author":"Nagaoka T.","year":"2010","unstructured":"T. Nagaoka and S. Watanabe , \" A GPU-based calculation using the three-dimensional FDTD method for electromagnetic field analysis,\" in Annu. Int'l Conf. of the IEEE Engineering in Medicine and Biology Society (EMBC) , pp. 327 -- 330 , IEEE , 2010 . T. Nagaoka and S. Watanabe, \"A GPU-based calculation using the three-dimensional FDTD method for electromagnetic field analysis,\" in Annu. Int'l Conf. of the IEEE Engineering in Medicine and Biology Society (EMBC), pp. 327--330, IEEE, 2010."},{"key":"e_1_2_1_8_1","first-page":"5255","volume-title":"GPU based FDTD solver with CPML boundaries,\" in IEEE Antennas and Propagation Society Int'l Symp","author":"Inman M.","year":"2007","unstructured":"M. Inman , A. Elsherbeni , J. Maloney , and B. Baker , \" GPU based FDTD solver with CPML boundaries,\" in IEEE Antennas and Propagation Society Int'l Symp ., pp. 5255 -- 5258 , IEEE , 2007 . M. Inman, A. Elsherbeni, J. Maloney, and B. Baker, \"GPU based FDTD solver with CPML boundaries,\" in IEEE Antennas and Propagation Society Int'l Symp., pp. 5255--5258, IEEE, 2007."},{"key":"e_1_2_1_9_1","first-page":"1","volume-title":"Parallel 3D Finite Difference Time Domain Simulations on Graphics Processors with Cuda,\" in Int'l Conf. on Computational Intelligence and Software Engineering","author":"Kang L.","year":"2009","unstructured":"L. Kang and K. Fanmin , \" Parallel 3D Finite Difference Time Domain Simulations on Graphics Processors with Cuda,\" in Int'l Conf. on Computational Intelligence and Software Engineering , pp. 1 -- 4 , IEEE , 2009 . L. Kang and K. Fanmin, \"Parallel 3D Finite Difference Time Domain Simulations on Graphics Processors with Cuda,\" in Int'l Conf. on Computational Intelligence and Software Engineering, pp. 1--4, IEEE, 2009."},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-13119-6_38"},{"key":"e_1_2_1_11_1","first-page":"1009","volume-title":"Digest","author":"Railton C.","year":"1989","unstructured":"C. Railton and J. McGeehan , \" Analysis of microstrip discontinuities using the finite difference time domain technique,\" in IEEE MTT-S Int'l Microwave Symp . Digest , pp. 1009 -- 1012 , IEEE, 1989 . C. Railton and J. McGeehan, \"Analysis of microstrip discontinuities using the finite difference time domain technique,\" in IEEE MTT-S Int'l Microwave Symp. Digest, pp. 1009--1012, IEEE, 1989."},{"issue":"7","key":"e_1_2_1_12_1","first-page":"1544","article-title":"Analysis of Computation Error in Antenna's Simulation by Using Non-Uniform Mesh FDTD","volume":"83","author":"Huiling J.","year":"2000","unstructured":"J. Huiling and A. Hiroyuki , \" Analysis of Computation Error in Antenna's Simulation by Using Non-Uniform Mesh FDTD ,\" IEICE Trans. on communications , vol. 83 , no. 7 , pp. 1544 -- 1553 , 2000 -07-25. J. Huiling and A. Hiroyuki, \"Analysis of Computation Error in Antenna's Simulation by Using Non-Uniform Mesh FDTD,\" IEICE Trans. on communications, vol. 83, no. 7, pp. 1544--1553, 2000-07-25.","journal-title":"IEICE Trans. on communications"}],"container-title":["ACM SIGARCH Computer Architecture News"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2082156.2082163","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2082156.2082163","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T10:06:42Z","timestamp":1750241202000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2082156.2082163"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,12,19]]},"references-count":12,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2011,12,19]]}},"alternative-id":["10.1145\/2082156.2082163"],"URL":"https:\/\/doi.org\/10.1145\/2082156.2082163","relation":{},"ISSN":["0163-5964"],"issn-type":[{"type":"print","value":"0163-5964"}],"subject":[],"published":{"date-parts":[[2011,12,19]]},"assertion":[{"value":"2011-12-19","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}