{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,9,29]],"date-time":"2025-09-29T08:21:45Z","timestamp":1759134105439,"version":"3.38.0"},"reference-count":27,"publisher":"SAGE Publications","issue":"1","license":[{"start":{"date-parts":[[2013,6,11]],"date-time":"2013-06-11T00:00:00Z","timestamp":1370908800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/journals.sagepub.com\/page\/policies\/text-and-data-mining-license"}],"content-domain":{"domain":["journals.sagepub.com"],"crossmark-restriction":true},"short-container-title":["The International Journal of High Performance Computing Applications"],"published-print":{"date-parts":[[2014,2]]},"abstract":"<jats:p> Plasma turbulence research based on five-dimensional (5D) gyrokinetic simulations is one of the most critical and demanding issues in fusion science. To pioneer new physics regimes both in problem sizes and in timescales, an improvement of strong scaling is essential. Overlap of computations and communications using non-blocking MPI communication schemes is a promising approach to improving strong scaling, but it often fails on practical applications with conventional MPI libraries. In this work, this classical issue is resolved by developing communication-overlap techniques with additional MPI support for non-blocking communication routines and with heterogeneous OpenMP threads, which work even on conventional MPI libraries and network hardware. These techniques dramatically improved the parallel efficiency of a gyrokinetic toroidal 5D Eulerian code GT5D on the K-computer, which has a dedicated network, and on the Helios system which has a commodity network. On the K-computer, excellent strong scaling was achieved beyond 100k cores whilst keeping a sustained performance of [Formula: see text]10% ([Formula: see text]307 TFlops using 196,608 cores), and simulations for next-generation large-scale fusion experiments are significantly accelerated. This performance is 16[Formula: see text] sped up compared with the maximum performance reported at the 2011 International Conference for High Performance Computing, Networking, Storage and Analysis ([Formula: see text]19 TFlops using 16,384 cores of the BX900 cluster) (Idomura, 2011). <\/jats:p>","DOI":"10.1177\/1094342013490973","type":"journal-article","created":{"date-parts":[[2013,6,12]],"date-time":"2013-06-12T03:51:36Z","timestamp":1371009096000},"page":"73-86","update-policy":"https:\/\/doi.org\/10.1177\/sage-journals-update-policy","source":"Crossref","is-referenced-by-count":23,"title":["Communication-overlap techniques for improved strong scaling of gyrokinetic Eulerian code beyond 100k cores on the K-computer"],"prefix":"10.1177","volume":"28","author":[{"given":"Yasuhiro","family":"Idomura","sequence":"first","affiliation":[{"name":"Center for Computational Science and e-Systems, Japan Atomic Energy Agency, Japan"},{"name":"Fusion Research and Development Directorate, Japan Atomic Energy Agency, Japan"}]},{"given":"Motoki","family":"Nakata","sequence":"additional","affiliation":[{"name":"Fusion Research and Development Directorate, Japan Atomic Energy Agency, Japan"}]},{"given":"Susumu","family":"Yamada","sequence":"additional","affiliation":[{"name":"Center for Computational Science and e-Systems, Japan Atomic Energy Agency, Japan"}]},{"given":"Masahiko","family":"Machida","sequence":"additional","affiliation":[{"name":"Center for Computational Science and e-Systems, Japan Atomic Energy Agency, Japan"}]},{"given":"Toshiyuki","family":"Imamura","sequence":"additional","affiliation":[{"name":"Advanced Institute for Computational Science, RIKEN, Japan"}]},{"given":"Tomohiko","family":"Watanabe","sequence":"additional","affiliation":[{"name":"National Institute for Fusion Science, Japan"}]},{"given":"Masanori","family":"Nunami","sequence":"additional","affiliation":[{"name":"National Institute for Fusion Science, Japan"}]},{"given":"Hikaru","family":"Inoue","sequence":"additional","affiliation":[{"name":"Fujitsu Limited, Japan"}]},{"given":"Shigenobu","family":"Tsutsumi","sequence":"additional","affiliation":[{"name":"Fujitsu Kyusyu Systems Limited, Japan"}]},{"given":"Ikuo","family":"Miyoshi","sequence":"additional","affiliation":[{"name":"Fujitsu Limited, Japan"}]},{"given":"Naoyuki","family":"Shida","sequence":"additional","affiliation":[{"name":"Fujitsu Limited, Japan"}]}],"member":"179","published-online":{"date-parts":[[2013,6,11]]},"reference":[{"key":"bibr1-1094342013490973","doi-asserted-by":"publisher","DOI":"10.1103\/RevModPhys.79.421"},{"key":"bibr2-1094342013490973","doi-asserted-by":"publisher","DOI":"10.1088\/0029-5515\/51\/7\/073039"},{"key":"bibr3-1094342013490973","doi-asserted-by":"publisher","DOI":"10.1063\/1.1695358"},{"key":"bibr4-1094342013490973","doi-asserted-by":"publisher","DOI":"10.1063\/1.873896"},{"key":"bibr5-1094342013490973","doi-asserted-by":"publisher","DOI":"10.1137\/0720023"},{"key":"bibr6-1094342013490973","doi-asserted-by":"publisher","DOI":"10.1088\/0029-5515\/50\/4\/043002"},{"key":"bibr7-1094342013490973","doi-asserted-by":"publisher","DOI":"10.1016\/0167-8191(96)00024-5"},{"key":"bibr8-1094342013490973","doi-asserted-by":"publisher","DOI":"10.1016\/j.cpc.2008.04.005"},{"key":"bibr9-1094342013490973","doi-asserted-by":"publisher","DOI":"10.1016\/j.jcp.2007.04.013"},{"key":"bibr10-1094342013490973","doi-asserted-by":"publisher","DOI":"10.1145\/2063348.2063354"},{"volume-title":"23rd international atomic energy agency fusion energy conference","year":"2010","author":"Idomura Y","key":"bibr11-1094342013490973"},{"key":"bibr12-1094342013490973","doi-asserted-by":"publisher","DOI":"10.1088\/0029-5515\/49\/6\/065029"},{"key":"bibr13-1094342013490973","doi-asserted-by":"publisher","DOI":"10.1016\/j.crhy.2006.06.007"},{"key":"bibr14-1094342013490973","doi-asserted-by":"publisher","DOI":"10.1088\/0029-5515\/52\/2\/023026"},{"key":"bibr15-1094342013490973","doi-asserted-by":"publisher","DOI":"10.1016\/j.jcp.2011.01.029"},{"key":"bibr16-1094342013490973","doi-asserted-by":"publisher","DOI":"10.1088\/0029-5515\/45\/8\/026"},{"key":"bibr17-1094342013490973","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevLett.88.195004"},{"journal-title":"JSST2012 International Conference on Simulation Technology","year":"2012","author":"Maeyama S","key":"bibr18-1094342013490973"},{"key":"bibr19-1094342013490973","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevLett.105.155001"},{"volume-title":"24th international atomic energy agency fusion energy conference","year":"2012","author":"Nakata M","key":"bibr20-1094342013490973"},{"key":"bibr21-1094342013490973","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevLett.74.1763"},{"key":"bibr22-1094342013490973","doi-asserted-by":"publisher","DOI":"10.1145\/2063384.2071033"},{"key":"bibr23-1094342013490973","doi-asserted-by":"publisher","DOI":"10.1016\/j.cpc.2010.02.014"},{"key":"bibr24-1094342013490973","doi-asserted-by":"publisher","DOI":"10.1145\/1122971.1122978"},{"key":"bibr25-1094342013490973","doi-asserted-by":"publisher","DOI":"10.1063\/1.859862"},{"key":"bibr26-1094342013490973","doi-asserted-by":"publisher","DOI":"10.1109\/SC.2005.1"},{"key":"bibr27-1094342013490973","doi-asserted-by":"publisher","DOI":"10.1006\/jcph.1996.0193"}],"container-title":["The International Journal of High Performance Computing Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/1094342013490973","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/full-xml\/10.1177\/1094342013490973","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/1094342013490973","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,3,1]],"date-time":"2025-03-01T11:13:41Z","timestamp":1740827621000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.1177\/1094342013490973"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2013,6,11]]},"references-count":27,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2014,2]]}},"alternative-id":["10.1177\/1094342013490973"],"URL":"https:\/\/doi.org\/10.1177\/1094342013490973","relation":{},"ISSN":["1094-3420","1741-2846"],"issn-type":[{"type":"print","value":"1094-3420"},{"type":"electronic","value":"1741-2846"}],"subject":[],"published":{"date-parts":[[2013,6,11]]}}}