{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,7]],"date-time":"2026-05-07T15:45:33Z","timestamp":1778168733357,"version":"3.51.4"},"reference-count":20,"publisher":"SAGE Publications","issue":"1","license":[{"start":{"date-parts":[[2015,10,11]],"date-time":"2015-10-11T00:00:00Z","timestamp":1444521600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/journals.sagepub.com\/page\/policies\/text-and-data-mining-license"}],"content-domain":{"domain":["journals.sagepub.com"],"crossmark-restriction":true},"short-container-title":["The International Journal of High Performance Computing Applications"],"published-print":{"date-parts":[[2016,2]]},"abstract":"<jats:p>The high-performance conjugate gradient (HPCG) is new benchmark software for supercomputers that provides a more realistic performance metric than existing benchmarks, such as the LINPACK benchmark. The HPCG measures the speed of solving symmetric sparse linear system equations using the conjugate gradient method preconditioned by a multigrid symmetric Gauss\u2013Seidel smoother. The combination of a sparse linear system and a preconditioned conjugate gradient method is widely used in many scientific and engineering computer applications. This study introduces a tuning method for the K computer. According to weak-scaling measurements on the K computer, it has good parallel scalability. Therefore, our tuning strategy focuses on single CPU performance rather than parallel performance. Single CPU performance strongly depends on memory throughput and multicore utilization. Therefore, we attempt to improve memory\/cache access performance and multithreading efficiency. As a result, a HPCG score obtained with the K computer achieved second place at SC\u201914.<\/jats:p>","DOI":"10.1177\/1094342015607950","type":"journal-article","created":{"date-parts":[[2015,10,13]],"date-time":"2015-10-13T17:47:06Z","timestamp":1444758426000},"page":"55-70","update-policy":"https:\/\/doi.org\/10.1177\/sage-journals-update-policy","source":"Crossref","is-referenced-by-count":21,"title":["High-performance conjugate gradient performance improvement on the K computer"],"prefix":"10.1177","volume":"30","author":[{"given":"Kiyoshi","family":"Kumahata","sequence":"first","affiliation":[{"name":"Software Development Team, Operations and Computer Technologies Division, RIKEN AICS, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Kazuo","family":"Minami","sequence":"additional","affiliation":[{"name":"Software Development Team, Operations and Computer Technologies Division, RIKEN AICS, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Naoya","family":"Maruyama","sequence":"additional","affiliation":[{"name":"HPC Programming Framework Research Team, RIKEN AICS, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"179","published-online":{"date-parts":[[2015,10,11]]},"reference":[{"key":"bibr1-1094342015607950","doi-asserted-by":"publisher","DOI":"10.1109\/MC.2009.370"},{"key":"bibr2-1094342015607950","author":"Dongarra J","year":"2014","journal-title":"Performance of various computers using standard linear equations software"},{"key":"bibr3-1094342015607950","author":"Dongarra J","year":"2014","journal-title":"SC\u201914 HPCG BoF"},{"key":"bibr4-1094342015607950","author":"Dongarra J","year":"2013","journal-title":"SC\u201913 Top500 BoF"},{"key":"bibr5-1094342015607950","author":"Dongarra J","year":"2014","journal-title":"ISC\u201914 Top500 BoF"},{"key":"bibr6-1094342015607950","doi-asserted-by":"publisher","DOI":"10.1002\/cpe.728"},{"key":"bibr7-1094342015607950","unstructured":"Fujitsu Ltd (2008) SPARC64 VIIIfx Extensions. Architecture manual, Fujitsu Ltd."},{"key":"bibr8-1094342015607950","doi-asserted-by":"publisher","DOI":"10.2172\/1113870"},{"key":"bibr9-1094342015607950","volume-title":"proceedings of 26th IEEE international parallel & distributed processing symposium","author":"Iwashita T","year":"2012"},{"key":"bibr10-1094342015607950","doi-asserted-by":"publisher","DOI":"10.1016\/j.compfluid.2005.07.006"},{"key":"bibr11-1094342015607950","doi-asserted-by":"publisher","DOI":"10.1016\/j.procs.2013.05.427"},{"key":"bibr12-1094342015607950","volume":"21","author":"Maruyama T","year":"2009","journal-title":"Hot Chips"},{"key":"bibr13-1094342015607950","doi-asserted-by":"publisher","DOI":"10.1109\/MM.2010.40"},{"key":"bibr14-1094342015607950","unstructured":"McCalpin J (1995) Memory bandwidth and machine balance in current high performance computers. IEEE Computer Society Technical Committee on Computer Architecture (TCCA) newsletter, December. Available at: http:\/\/www.cs.virginia.edu\/stream\/.Accessed on 2015.03.25"},{"key":"bibr15-1094342015607950","author":"Park J","year":"2014","journal-title":"SC14"},{"key":"bibr16-1094342015607950","author":"Phillips E","year":"2014","journal-title":"SC\u201914 HPCG BoF"},{"key":"bibr17-1094342015607950","doi-asserted-by":"publisher","DOI":"10.1145\/2503210.2504565"},{"key":"bibr18-1094342015607950","volume":"22","author":"Toyoshima T","year":"2010","journal-title":"Hot Chips"},{"key":"bibr19-1094342015607950","doi-asserted-by":"publisher","DOI":"10.1145\/1498765.1498785"},{"key":"bibr20-1094342015607950","first-page":"28","author":"Xianyi Z","year":"2014","journal-title":"Algorithms and Architectures for Parallel Processing Part I"}],"container-title":["The International Journal of High Performance Computing Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/1094342015607950","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/full-xml\/10.1177\/1094342015607950","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/1094342015607950","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,29]],"date-time":"2026-04-29T08:19:34Z","timestamp":1777450774000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.1177\/1094342015607950"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2015,10,11]]},"references-count":20,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2016,2]]}},"alternative-id":["10.1177\/1094342015607950"],"URL":"https:\/\/doi.org\/10.1177\/1094342015607950","relation":{},"ISSN":["1094-3420","1741-2846"],"issn-type":[{"value":"1094-3420","type":"print"},{"value":"1741-2846","type":"electronic"}],"subject":[],"published":{"date-parts":[[2015,10,11]]}}}