{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,3,27]],"date-time":"2025-03-27T22:41:50Z","timestamp":1743115310252,"version":"3.40.3"},"publisher-location":"Singapore","reference-count":18,"publisher":"Springer Singapore","isbn-type":[{"type":"print","value":"9789811576829"},{"type":"electronic","value":"9789811576836"}],"license":[{"start":{"date-parts":[[2020,11,20]],"date-time":"2020-11-20T00:00:00Z","timestamp":1605830400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2020,11,20]],"date-time":"2020-11-20T00:00:00Z","timestamp":1605830400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2021]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>This chapter presents the XcalableMP on the Fugaku supercomputer, the Japanese flagship supercomputer developed by FLAGSHIP2020 project in RIKEN R-CCS. The porting and the performance evaluation were done as a part of this project, and the XcalableMP is available for the Fugaku users for improving the productivity and performance of parallel programing. The performance of XcalableMP on the Fugaku is enhanced by the manycore processor and a new Tofu-D interconnect. We are now working on the next version, XcalableMP 2.0, for cutting-edge high-performance systems with manycore processors by multithreading and multi-tasking with integrations of PGAS model and synchronization models. We conclude this book with retrospectives and challenges for future PGAS models.<\/jats:p>","DOI":"10.1007\/978-981-15-7683-6_10","type":"book-chapter","created":{"date-parts":[[2020,11,19]],"date-time":"2020-11-19T13:11:21Z","timestamp":1605791481000},"page":"245-262","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":2,"title":["XcalableMP 2.0 and Future Directions"],"prefix":"10.1007","author":[{"given":"Mitsuhisa","family":"Sato","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Hitoshi","family":"Murai","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Masahiro","family":"Nakao","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Keisuke","family":"Tsugane","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Tesuya","family":"Odajima","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jinpil","family":"Lee","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2020,11,20]]},"reference":[{"key":"10_CR1","unstructured":"Flagship 2020 Project (Supercomputer Fugaku), https:\/\/www.r-ccs.riken.jp\/en\/overview\/exascalepj\/"},{"key":"10_CR2","unstructured":"H. Murai, M. Sato, An efficient implementation of stencil communication for the XcalableMP PGAS parallel programming language, in 7th International Conference on PGAS Programming Models, Edinburgh (2013)"},{"key":"10_CR3","unstructured":"FUJITSU Ltd., Development Studio uTofu User\u2019s Guide (2020)"},{"key":"10_CR4","unstructured":"RIKEN Advanced Institute for Computational Science (RIKEN AICS), Fiber Miniapp Suite (2104), http:\/\/www.fiber-miniapp.github.io"},{"key":"10_CR5","doi-asserted-by":"crossref","unstructured":"H. Murai, M. Nakao, H. Iwashita, M. Sato, Preliminary performance evaluation of coarray-based implementation of fiber Miniapp suite using XcalableMP PGAS language, in Second Annual PGAS Applications Workshop (PAW), Denver, CO (2017)","DOI":"10.1145\/3144779.3144780"},{"key":"10_CR6","unstructured":"CCS QCD Solver benchmark program, https:\/\/www.ccs.tsukuba.ac.jp\/qcd\/ccsqcdsolverbenchmic\/"},{"key":"10_CR7","unstructured":"NTChem Overview, https:\/\/www.r-ccs.riken.jp\/software_center\/software\/ntchem\/overview\/"},{"key":"10_CR8","unstructured":"K. Tsugane, J. Lee, H. Murai, M. Sato Multi-tasking execution in PGAS language XcalableMP and communication optimization on many-core clusters, in HPC Asia 2018, Tokyo (2018), pp. 75\u201385"},{"key":"10_CR9","unstructured":"Joint Center for Advanced High Performance Computing (JCAHPC), Basic Specification of Oakforest-PACS, http:\/\/jcahpc.jp\/files\/OFP-basic.pdf"},{"key":"10_CR10","unstructured":"D. Alejandro, A. Eduard, B. Rosa M, L. Jesus, M. Luis, M. Xavier, P. Judit, OmpSs: a proposal for programming heterogeneous multi-core architectures. Parallel Process. Lett. 21, 173\u2013193 (2011)"},{"key":"10_CR11","unstructured":"Joint Center for Advanced High Performance Computing (JCAHPC), Basic Specification of Oakforest-PACS, http:\/\/jcahpc.jp\/files\/OFP-basic.pdf"},{"key":"10_CR12","unstructured":"OSU Micro-Benchmarks, http:\/\/mvapich.cse.ohio-state.edu\/benchmarks\/"},{"key":"10_CR13","doi-asserted-by":"crossref","unstructured":"H. Iwashita, M. Nakao, H. Murai, M. Sato, A source-to-source translation of coarray Fortran with MPI for high performance, in HPC Asia 2018, Tokyo (2018)","DOI":"10.1145\/3149457.3155888"},{"key":"10_CR14","unstructured":"XcalableMP Language Specification v 1.4, https:\/\/xcalablemp.org\/download\/spec\/xmp-spec-1.4.pdf"},{"key":"10_CR15","unstructured":"OpenShmem, http:\/\/www.openshmem.org\/site\/"},{"key":"10_CR16","unstructured":"Global Arrays, https:\/\/hpc.pnl.gov\/globalarrays\/"},{"key":"10_CR17","unstructured":"Y. Zheng, A. Kamil, M.B. Driscoll, H. Shan, K. Yelick, UPC++: a PGAS extension for C++, in 2014 IEEE 28th International Parallel and Distributed Processing Symposium (2014), pp. 1105\u20131114"},{"key":"10_CR18","doi-asserted-by":"publisher","unstructured":"K. Fuerlinger, T. Fuchs, R. Kowalewski, DASH: a C++ PGAS library for distributed data structures and parallel algorithms, in 2016 IEEE 18th International Conference on High Performance Computing and Communications; IEEE 14th International Conference on Smart City; IEEE 2nd International Conference on Data Science and Systems (HPCC\/SmartCity\/DSS), Sydney, NSW (2016), pp. 983\u2013990. https:\/\/doi.org\/10.1109\/HPCC-SmartCity-DSS.2016.0140","DOI":"10.1109\/HPCC-SmartCity-DSS.2016.0140"}],"container-title":["XcalableMP PGAS Programming Language"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/978-981-15-7683-6_10","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2020,11,19]],"date-time":"2020-11-19T13:15:35Z","timestamp":1605791735000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1007\/978-981-15-7683-6_10"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,11,20]]},"ISBN":["9789811576829","9789811576836"],"references-count":18,"URL":"https:\/\/doi.org\/10.1007\/978-981-15-7683-6_10","relation":{},"subject":[],"published":{"date-parts":[[2020,11,20]]},"assertion":[{"value":"20 November 2020","order":1,"name":"first_online","label":"First Online","group":{"name":"ChapterHistory","label":"Chapter History"}}]}}