{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,20]],"date-time":"2025-12-20T22:20:23Z","timestamp":1766269223026,"version":"3.38.0"},"reference-count":26,"publisher":"SAGE Publications","issue":"3","license":[{"start":{"date-parts":[[2013,10,17]],"date-time":"2013-10-17T00:00:00Z","timestamp":1381968000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/journals.sagepub.com\/page\/policies\/text-and-data-mining-license"}],"content-domain":{"domain":["journals.sagepub.com"],"crossmark-restriction":true},"short-container-title":["The International Journal of High Performance Computing Applications"],"published-print":{"date-parts":[[2014,8]]},"abstract":"<jats:p> Silicon nanowires are potentially useful in next-generation field-effect transistors, and it is important to clarify the electron states of silicon nanowires to know the behavior of new devices. Computer simulations are promising tools for calculating electron states. Real-space density functional theory (RSDFT) code performs first-principles electronic structure calculations. To obtain higher performance, we applied various optimization techniques to the code: multi-level parallelization, load balance management, sub-mesh\/torus allocation, and a message-passing interface library tuned for the K computer. We measured and evaluated the performance of the modified RSDFT code on the K computer. A 5.48 petaflops (PFLOPS) sustained performance was measured for an iteration of a self-consistent field calculation for a 107,292-atom Si nanowire simulation using 82,944 compute nodes, which is 51.67% of the K computer\u2019s peak performance of 10.62 PFLOPS. This scale of simulation enables analysis of the behavior of a silicon nanowire with a diameter of 10\u201320 nm. <\/jats:p>","DOI":"10.1177\/1094342013508163","type":"journal-article","created":{"date-parts":[[2013,10,18]],"date-time":"2013-10-18T01:56:45Z","timestamp":1382061405000},"page":"335-355","update-policy":"https:\/\/doi.org\/10.1177\/sage-journals-update-policy","source":"Crossref","is-referenced-by-count":42,"title":["Performance evaluation of ultra-large-scale first-principles electronic structure calculation code on the K computer"],"prefix":"10.1177","volume":"28","author":[{"given":"Yukihiro","family":"Hasegawa","sequence":"first","affiliation":[{"name":"RIKEN Advanced Institute for Computational Science, Kobe, Japan"}]},{"given":"Jun-Ichi","family":"Iwata","sequence":"additional","affiliation":[{"name":"Department of Applied Physics, School of Engineering, The University of Tokyo, Tokyo, Japan"}]},{"given":"Miwako","family":"Tsuji","sequence":"additional","affiliation":[{"name":"RIKEN Advanced Institute for Computational Science, Kobe, Japan"}]},{"given":"Daisuke","family":"Takahashi","sequence":"additional","affiliation":[{"name":"Center for Computational Sciences, University of Tsukuba, Tsukuba, Japan"}]},{"given":"Atsushi","family":"Oshiyama","sequence":"additional","affiliation":[{"name":"Department of Applied Physics, School of Engineering, The University of Tokyo, Tokyo, Japan"}]},{"given":"Kazuo","family":"Minami","sequence":"additional","affiliation":[{"name":"RIKEN Advanced Institute for Computational Science, Kobe, Japan"}]},{"given":"Taisuke","family":"Boku","sequence":"additional","affiliation":[{"name":"Center for Computational Sciences, University of Tsukuba, Tsukuba, Japan"}]},{"given":"Hikaru","family":"Inoue","sequence":"additional","affiliation":[{"name":"Computational Science and Engineering Solution Division, Technical Computing Solution Unit, Fujitsu Ltd, Chiba, Japan"}]},{"given":"Yoshito","family":"Kitazawa","sequence":"additional","affiliation":[{"name":"IT Solution Unit, CAE Simulation Department, Fujitsu Systems East Ltd, Nagano, Japan"}]},{"given":"Ikuo","family":"Miyoshi","sequence":"additional","affiliation":[{"name":"PA Project, Next Generation Technical Computing Unit, Fujitsu Ltd, Kawasaki, Japan"}]},{"given":"Mitsuo","family":"Yokokawa","sequence":"additional","affiliation":[{"name":"RIKEN Advanced Institute for Computational Science, Kobe, Japan"},{"name":"Graduate School of System Informatics, Kobe University, Kobe, Japan"}]}],"member":"179","published-online":{"date-parts":[[2013,10,17]]},"reference":[{"key":"bibr1-1094342013508163","doi-asserted-by":"publisher","DOI":"10.1007\/s00450-012-0211-7"},{"key":"bibr2-1094342013508163","doi-asserted-by":"publisher","DOI":"10.1109\/MC.2009.370"},{"key":"bibr3-1094342013508163","doi-asserted-by":"publisher","DOI":"10.1109\/IEDM.2009.5424364"},{"key":"bibr4-1094342013508163","doi-asserted-by":"publisher","DOI":"10.1109\/CCGRID.2006.78"},{"key":"bibr5-1094342013508163","doi-asserted-by":"publisher","DOI":"10.1109\/VLSIT.2010.5556217"},{"key":"bibr6-1094342013508163","doi-asserted-by":"publisher","DOI":"10.1145\/77626.79170"},{"volume-title":"SPARC64 VIIIfx Extensions","year":"2008","author":"Fujitsu Ltd","key":"bibr7-1094342013508163"},{"volume-title":"Proceedings of 2006 International Conference for High Performance Computing, Networking, Storage and Analysis","year":"2006","author":"Gygi F","key":"bibr8-1094342013508163"},{"key":"bibr9-1094342013508163","doi-asserted-by":"publisher","DOI":"10.1145\/2063384.2063386"},{"journal-title":"Proceedings of Joint International Conference on Supercomputing in Nuclear Applications and Monte Carlo 2010 (SNA + MC2010)","year":"2010","author":"Imamura T","key":"bibr10-1094342013508163"},{"key":"bibr11-1094342013508163","doi-asserted-by":"publisher","DOI":"10.1016\/j.jcp.2009.11.038"},{"key":"bibr12-1094342013508163","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevB.56.14985"},{"key":"bibr13-1094342013508163","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevB.77.085301"},{"key":"bibr14-1094342013508163","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevB.43.2213"},{"key":"bibr15-1094342013508163","volume":"21","author":"Maruyama T","year":"2009","journal-title":"Hot Chips"},{"key":"bibr16-1094342013508163","doi-asserted-by":"publisher","DOI":"10.1109\/MM.2010.40"},{"key":"bibr17-1094342013508163","doi-asserted-by":"publisher","DOI":"10.1109\/ICKS.2008.6"},{"key":"bibr18-1094342013508163","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevB.71.113101"},{"key":"bibr19-1094342013508163","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevB.23.5048"},{"key":"bibr20-1094342013508163","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevLett.68.1858"},{"key":"bibr21-1094342013508163","volume":"22","author":"Toyoshima T","year":"2010","journal-title":"Hot Chips"},{"key":"bibr22-1094342013508163","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevB.52.5573"},{"key":"bibr23-1094342013508163","doi-asserted-by":"publisher","DOI":"10.1109\/ICPPW.2009.73"},{"key":"bibr24-1094342013508163","doi-asserted-by":"publisher","DOI":"10.1145\/1188455.1188504"},{"key":"bibr25-1094342013508163","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevB.54.5586"},{"key":"bibr26-1094342013508163","first-page":"37","author":"Yokozawa T","year":"2006","journal-title":"Proceedings of Fourth International Workshop on Parallel matrix Algorithms and Applications (PMAA\u201906)"}],"container-title":["The International Journal of High Performance Computing Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/1094342013508163","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/full-xml\/10.1177\/1094342013508163","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/1094342013508163","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,3,2]],"date-time":"2025-03-02T15:01:38Z","timestamp":1740927698000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.1177\/1094342013508163"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2013,10,17]]},"references-count":26,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2014,8]]}},"alternative-id":["10.1177\/1094342013508163"],"URL":"https:\/\/doi.org\/10.1177\/1094342013508163","relation":{},"ISSN":["1094-3420","1741-2846"],"issn-type":[{"type":"print","value":"1094-3420"},{"type":"electronic","value":"1741-2846"}],"subject":[],"published":{"date-parts":[[2013,10,17]]}}}