{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,7]],"date-time":"2026-03-07T19:06:55Z","timestamp":1772910415867,"version":"3.50.1"},"reference-count":25,"publisher":"Association for Computing Machinery (ACM)","issue":"1","license":[{"start":{"date-parts":[[2016,12,20]],"date-time":"2016-12-20T00:00:00Z","timestamp":1482192000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by-nc-nd\/4.0\/"}],"funder":[{"name":"DOE\/SC","award":["DE-AC05-00OR22725"],"award-info":[{"award-number":["DE-AC05-00OR22725"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["Commun. ACM"],"published-print":{"date-parts":[[2016,12,20]]},"abstract":"<jats:p>Supercomputing is evolving toward hybrid and accelerator-based architectures with millions of cores. The Hardware\/Hybrid Accelerated Cosmology Code (HACC) framework exploits this diverse landscape at the largest scales of problem size, obtaining high scalability and sustained performance. Developed to satisfy the science requirements of cosmological surveys, HACC melds particle and grid methods using a novel algorithmic structure that flexibly maps across architectures, including CPU\/GPU, multi\/many-core, and Blue Gene systems. In this Research Highlight, we demonstrate the success of HACC on two very different machines, the CPU\/GPU system Titan and the BG\/Q systems Sequoia and Mira, attaining very high levels of scalable performance. We demonstrate strong and weak scaling on Titan, obtaining up to 99.2% parallel efficiency, evolving 1.1 trillion particles. On Sequoia, we reach 13.94 PFlops (69.2% of peak) and 90% parallel efficiency on 1,572,864 cores, with 3.6 trillion particles, the largest cosmological benchmark yet performed. HACC design concepts are applicable to several other supercomputer applications.<\/jats:p>","DOI":"10.1145\/3015569","type":"journal-article","created":{"date-parts":[[2016,12,21]],"date-time":"2016-12-21T14:45:06Z","timestamp":1482331506000},"page":"97-104","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":62,"title":["HACC"],"prefix":"10.1145","volume":"60","author":[{"given":"Salman","family":"Habib","sequence":"first","affiliation":[{"name":"Argonne National Laboratory, Lemont, IL"}]},{"given":"Vitali","family":"Morozov","sequence":"additional","affiliation":[{"name":"Argonne National Laboratory, Lemont, IL"}]},{"given":"Nicholas","family":"Frontiere","sequence":"additional","affiliation":[{"name":"Argonne National Laboratory, Lemont, IL"}]},{"given":"Hal","family":"Finkel","sequence":"additional","affiliation":[{"name":"Argonne National Laboratory, Lemont, IL"}]},{"given":"Adrian","family":"Pope","sequence":"additional","affiliation":[{"name":"Argonne National Laboratory, Lemont, IL"}]},{"given":"Katrin","family":"Heitmann","sequence":"additional","affiliation":[{"name":"Argonne National Laboratory, Lemont, IL"}]},{"given":"Kalyan","family":"Kumaran","sequence":"additional","affiliation":[{"name":"Argonne National Laboratory, Lemont, IL"}]},{"given":"Venkatram","family":"Vishwanath","sequence":"additional","affiliation":[{"name":"Argonne National Laboratory, Lemont, IL"}]},{"given":"Tom","family":"Peterka","sequence":"additional","affiliation":[{"name":"Argonne National Laboratory, Lemont, IL"}]},{"given":"Joe","family":"Insley","sequence":"additional","affiliation":[{"name":"Argonne National Laboratory, Lemont, IL"}]},{"given":"David","family":"Daniel","sequence":"additional","affiliation":[{"name":"Los Alamos National Laboratory, Los Alamos, New Mexico"}]},{"given":"Patricia","family":"Fasel","sequence":"additional","affiliation":[{"name":"Los Alamos National Laboratory, Los Alamos, New Mexico"}]},{"given":"Zarija","family":"Luki\u0107","sequence":"additional","affiliation":[{"name":"Lawrence Berkeley National Laboratory, Berkeley, CA"}]}],"member":"320","published-online":{"date-parts":[[2016,12,20]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1088\/0004-637X\/766\/1\/32"},{"key":"e_1_2_1_2_1","volume-title":"12th Kingston Meeting on Theoretical Astrophysics, Proceedings of Meeting Held in Halifax; Nova Scotia (ASP Conference Series # 123)","author":"Bryan G.L.","year":"1996"},{"key":"e_1_2_1_3_1","doi-asserted-by":"crossref","unstructured":"Couchman H.M.P. Thomas P.A. Pearce F.R. Hydra: An adaptive-mesh implementation of P 3M-SPH Astrophys. J. 452 797 (1995).  Couchman H.M.P. Thomas P.A. Pearce F.R. Hydra: An adaptive-mesh implementation of P 3M-SPH Astrophys. J. 452 797 (1995).","DOI":"10.1086\/176348"},{"key":"e_1_2_1_4_1","first-page":"229","article-title":"a review of cosmological simulation methods, see also Dolag, K., Borgani, S., Schindler, S., Diaferio, A., Bykov","volume":"134","author":"For","year":"2008","journal-title":"A.M. Space Sci. Rev."},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1086\/317361"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1111\/j.1365-2966.2011.19528.x"},{"key":"e_1_2_1_7_1","unstructured":"Habib S. Morozov V. Finkel H. Pope A. Heitmann K. Kumaran K. Peterka T. Insley J. Daniel D. Fasel P. Frontiere N. Luki\u0107 Z. arXiv:1211.4864 Supercomputing 2012.  Habib S. Morozov V. Finkel H. Pope A. Heitmann K. Kumaran K. Peterka T. Insley J. Daniel D. Fasel P. Frontiere N. Luki\u0107 Z. arXiv:1211.4864 Supercomputing 2012."},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.newast.2015.06.003"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1088\/1742-6596\/180\/1\/012019"},{"key":"e_1_2_1_10_1","unstructured":"Hamming R.W. Digital Filters. Dover Publications Mineola New York 1998.  Hamming R.W. Digital Filters. Dover Publications Mineola New York 1998."},{"key":"e_1_2_1_11_1","doi-asserted-by":"crossref","unstructured":"Heitmann K. Frontiere N. Sewell C. Habib S. Pope A. Finkel H. Rizzi S. Insley J. Bhattacharya S. The Q continuum simulation: Harnessing the power of GPU accelerated supercomputers. J. - Astrophys. J. Supp. 219 (2015) 34 arXiv: 1411.3396 {astro-ph.CO}.  Heitmann K. Frontiere N. Sewell C. Habib S. Pope A. Finkel H. Rizzi S. Insley J. Bhattacharya S. The Q continuum simulation: Harnessing the power of GPU accelerated supercomputers. J. - Astrophys. J. Supp. 219 (2015) 34 arXiv:1411.3396 {astro-ph.CO}.","DOI":"10.1088\/0067-0049\/219\/2\/34"},{"key":"e_1_2_1_12_1","doi-asserted-by":"crossref","unstructured":"Heitmann K. Higdon D. White M. Habib S. Williams B.J. Lawrence E. Wagner C. The Coyote universe. II. Cosmological models and precision emulation of the nonlinear matter power spectrum Astrophys. J. 705 (2009) 156.  Heitmann K. Higdon D. White M. Habib S. Williams B.J. Lawrence E. Wagner C. The Coyote universe. II. Cosmological models and precision emulation of the nonlinear matter power spectrum Astrophys. J. 705 (2009) 156.","DOI":"10.1088\/0004-637X\/705\/1\/156"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1088\/1749-4699\/1\/1\/015003"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1086\/432646"},{"key":"e_1_2_1_15_1","volume-title":"Adam Hilger","author":"Hockney R.W.","year":"1988"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1086\/110933"},{"key":"e_1_2_1_17_1","unstructured":"Peebles P.J.E. The Large-Scale Structure of the Universe. Princeton University Press Princeton New Jersey 1980.  Peebles P.J.E. The Large-Scale Structure of the Universe. Princeton University Press Princeton New Jersey 1980."},{"key":"e_1_2_1_18_1","first-page":"446","article-title":"Many-Body Tree Methods in Physics. Cambridge University Press, 1996; see also Barnes, J., Hut","volume":"324","author":"Pfalzner S.","year":"1986","journal-title":"P. Nature"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/MCSE.2010.28"},{"key":"e_1_2_1_20_1","first-page":"14","article-title":"Power, C., Navarro, J.F., Jenkins, A., Frenk, C.S., White, S.D.M., Springel, V., Stadel, J., Quinn, T. The inner structure of ACDM haloes - I. A numerical convergence study","volume":"338","author":"The","year":"2003","journal-title":"Mon. Not. R. Astron. Soc."},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1103\/RevModPhys.61.185"},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1111\/j.1365-2966.2005.09655.x"},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1051\/0004-6361:20011817"},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1051\/0004-6361:20000357"},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1088\/0004-637X\/713\/1\/383"}],"container-title":["Communications of the ACM"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3015569","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3015569","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:24:14Z","timestamp":1750220654000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3015569"}},"subtitle":["extreme scaling and performance across diverse architectures"],"short-title":[],"issued":{"date-parts":[[2016,12,20]]},"references-count":25,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2016,12,20]]}},"alternative-id":["10.1145\/3015569"],"URL":"https:\/\/doi.org\/10.1145\/3015569","relation":{},"ISSN":["0001-0782","1557-7317"],"issn-type":[{"value":"0001-0782","type":"print"},{"value":"1557-7317","type":"electronic"}],"subject":[],"published":{"date-parts":[[2016,12,20]]},"assertion":[{"value":"2016-12-20","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}