{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,1]],"date-time":"2026-05-01T23:01:30Z","timestamp":1777676490553,"version":"3.51.4"},"reference-count":34,"publisher":"SAGE Publications","issue":"4","license":[{"start":{"date-parts":[[2011,11,17]],"date-time":"2011-11-17T00:00:00Z","timestamp":1321488000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/journals.sagepub.com\/page\/policies\/text-and-data-mining-license"}],"content-domain":{"domain":["journals.sagepub.com"],"crossmark-restriction":true},"short-container-title":["The International Journal of High Performance Computing Applications"],"published-print":{"date-parts":[[2012,11]]},"abstract":"<jats:p>We present the implementation and performance of a class of directionally unsplit Riemann-solver-based hydrodynamic schemes on graphics processing units (GPUs). These schemes, including the MUSCL-Hancock method, a variant of the MUSCL-Hancock method, and the corner-transport-upwind method, are embedded into the adaptive-mesh-refinement (AMR) code GAMER. Furthermore, a hybrid MPI\/OpenMP model is investigated, which enables the full exploitation of the computing power in a heterogeneous CPU\/GPU cluster and significantly improves the overall performance. Performance benchmarks are conducted on the Dirac GPU cluster at NERSC\/LBNL using up to 32 Tesla C2050 GPUs. A single GPU achieves speed-ups of 101 (25) and 84 (22) for uniform-mesh and AMR simulations, respectively, as compared with the performance using one (four) CPU core(s), and the excellent performance persists in multi-GPU tests. In addition, we make a direct comparison between GAMER and the widely adopted CPU code Athena in adiabatic hydrodynamic tests and demonstrate that, with the same accuracy, GAMER is able to achieve two orders of magnitude performance speed-up.<\/jats:p>","DOI":"10.1177\/1094342011428146","type":"journal-article","created":{"date-parts":[[2011,11,18]],"date-time":"2011-11-18T20:25:16Z","timestamp":1321647916000},"page":"367-377","update-policy":"https:\/\/doi.org\/10.1177\/sage-journals-update-policy","source":"Crossref","is-referenced-by-count":14,"title":["Directionally unsplit hydrodynamic schemes with hybrid MPI\/OpenMP\/GPU parallelization in AMR"],"prefix":"10.1177","volume":"26","author":[{"given":"Hsi-Yu","family":"Schive","sequence":"first","affiliation":[{"name":"Department of Physics, National Taiwan University, Taipei, Taiwan"},{"name":"Center for Theoretical Sciences, National Taiwan University, Taipei, Taiwan"},{"name":"Leung Center for Cosmology and Particle Astrophysics (LeCosPA), National Taiwan University, 10617, Taipei, Taiwan"}]},{"given":"Ui-Han","family":"Zhang","sequence":"additional","affiliation":[{"name":"Department of Physics, National Taiwan University, Taipei, Taiwan"},{"name":"Center for Theoretical Sciences, National Taiwan University, Taipei, Taiwan"},{"name":"Leung Center for Cosmology and Particle Astrophysics (LeCosPA), National Taiwan University, 10617, Taipei, Taiwan"}]},{"given":"Tzihong","family":"Chiueh","sequence":"additional","affiliation":[{"name":"Department of Physics, National Taiwan University, Taipei, Taiwan"},{"name":"Center for Theoretical Sciences, National Taiwan University, Taipei, Taiwan"},{"name":"Leung Center for Cosmology and Particle Astrophysics (LeCosPA), National Taiwan University, 10617, Taipei, Taiwan"}]}],"member":"179","published-online":{"date-parts":[[2011,11,17]]},"reference":[{"key":"bibr1-1094342011428146","first-page":"244","volume":"724","author":"Aubert D","year":"2010","journal-title":"I. On the structure of the ultraviolet radiation field. ApJ"},{"key":"bibr2-1094342011428146","doi-asserted-by":"publisher","DOI":"10.1137\/0915008"},{"key":"bibr3-1094342011428146","doi-asserted-by":"publisher","DOI":"10.1016\/0021-9991(89)90035-1"},{"key":"bibr4-1094342011428146","doi-asserted-by":"publisher","DOI":"10.1016\/0021-9991(84)90073-1"},{"key":"bibr5-1094342011428146","first-page":"363","volume-title":"Computational Astrophysics; 12th Kingston Meeting on Theoretical Astrophysics (ASP Conference Series","volume":"123","author":"Bryan GL","year":"1996"},{"key":"bibr6-1094342011428146","doi-asserted-by":"publisher","DOI":"10.1016\/0021-9991(90)90233-Q"},{"key":"bibr7-1094342011428146","doi-asserted-by":"publisher","DOI":"10.1016\/0021-9991(84)90143-8"},{"key":"bibr8-1094342011428146","doi-asserted-by":"publisher","DOI":"10.1088\/0067-0049\/186\/2\/308"},{"key":"bibr9-1094342011428146","doi-asserted-by":"publisher","DOI":"10.1016\/0021-9991(91)90211-3"},{"key":"bibr10-1094342011428146","doi-asserted-by":"publisher","DOI":"10.1093\/mnras\/250.3.581"},{"key":"bibr11-1094342011428146","doi-asserted-by":"publisher","DOI":"10.1086\/317361"},{"key":"bibr12-1094342011428146","doi-asserted-by":"publisher","DOI":"10.1016\/j.newast.2009.03.002"},{"key":"bibr13-1094342011428146","doi-asserted-by":"publisher","DOI":"10.1016\/j.jcp.2007.12.017"},{"key":"bibr14-1094342011428146","first-page":"21","volume":"24","author":"Hamada T","year":"2009","journal-title":"CSRD"},{"key":"bibr15-1094342011428146","doi-asserted-by":"publisher","DOI":"10.1016\/j.newast.2010.07.009"},{"key":"bibr16-1094342011428146","doi-asserted-by":"publisher","DOI":"10.1002\/cpa.3160480303"},{"key":"bibr17-1094342011428146","volume-title":"The OpenCL Specification (Version 1.1)","author":"Khronos Group","year":"2011"},{"key":"bibr18-1094342011428146","doi-asserted-by":"publisher","DOI":"10.1006\/jcph.2000.6575"},{"key":"bibr19-1094342011428146","volume-title":"Petascale Computing: Algorithms and Applications","author":"Norman ML","year":"2007"},{"key":"bibr20-1094342011428146","volume-title":"NVIDIA CUDA C Programming Guide (Version 4.0)","author":"NVIDIA","year":"2011"},{"key":"bibr21-1094342011428146","volume-title":"Numerical Recipes. The Art of Scientific Computing","author":"Press WH","year":"2007","edition":"3"},{"key":"bibr22-1094342011428146","doi-asserted-by":"publisher","DOI":"10.1086\/526425"},{"key":"bibr23-1094342011428146","doi-asserted-by":"publisher","DOI":"10.1016\/0021-9991(81)90128-5"},{"key":"bibr24-1094342011428146","doi-asserted-by":"publisher","DOI":"10.1016\/j.newast.2007.12.005"},{"key":"bibr25-1094342011428146","doi-asserted-by":"publisher","DOI":"10.1088\/0067-0049\/186\/2\/457"},{"key":"bibr26-1094342011428146","first-page":"401","volume":"270","author":"Schive H","year":"2011","journal-title":"Proc IAUS"},{"key":"bibr27-1094342011428146","author":"Shukla H","year":"2011","journal-title":"Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC 2011)"},{"key":"bibr28-1094342011428146","doi-asserted-by":"publisher","DOI":"10.1002\/9781118130506.ch3"},{"key":"bibr29-1094342011428146","doi-asserted-by":"publisher","DOI":"10.1086\/588755"},{"key":"bibr30-1094342011428146","doi-asserted-by":"publisher","DOI":"10.1016\/j.newast.2008.06.003"},{"key":"bibr31-1094342011428146","doi-asserted-by":"publisher","DOI":"10.1007\/b79761"},{"key":"bibr32-1094342011428146","doi-asserted-by":"publisher","DOI":"10.1016\/j.newast.2009.10.002"},{"key":"bibr33-1094342011428146","doi-asserted-by":"publisher","DOI":"10.1111\/j.1365-2966.2011.18646.x"},{"key":"bibr34-1094342011428146","first-page":"5202","volume":"1102","author":"Zink B","year":"2011","journal-title":"Preprint arXiv"}],"container-title":["The International Journal of High Performance Computing Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/1094342011428146","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/full-xml\/10.1177\/1094342011428146","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/1094342011428146","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,29]],"date-time":"2026-04-29T08:19:07Z","timestamp":1777450747000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.1177\/1094342011428146"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,11,17]]},"references-count":34,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2012,11]]}},"alternative-id":["10.1177\/1094342011428146"],"URL":"https:\/\/doi.org\/10.1177\/1094342011428146","relation":{},"ISSN":["1094-3420","1741-2846"],"issn-type":[{"value":"1094-3420","type":"print"},{"value":"1741-2846","type":"electronic"}],"subject":[],"published":{"date-parts":[[2011,11,17]]}}}