{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,3,26]],"date-time":"2025-03-26T20:04:49Z","timestamp":1743019489454,"version":"3.40.3"},"publisher-location":"Singapore","reference-count":11,"publisher":"Springer Singapore","isbn-type":[{"type":"print","value":"9789811576829"},{"type":"electronic","value":"9789811576836"}],"license":[{"start":{"date-parts":[[2020,11,20]],"date-time":"2020-11-20T00:00:00Z","timestamp":1605830400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2020,11,20]],"date-time":"2020-11-20T00:00:00Z","timestamp":1605830400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2021]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>This chapter describes the implementation and performance evaluation of Omni compiler, which is a reference implementation of the compiler for XcalableMP. For performance evaluation, this chapter also presents how to implement the HPC Challenge benchmarks, which is a benchmark suite for an HPC parallel language. The results show that the performance of XMP is comparable to that of MPI in many cases.<\/jats:p>","DOI":"10.1007\/978-981-15-7683-6_2","type":"book-chapter","created":{"date-parts":[[2020,11,19]],"date-time":"2020-11-19T13:11:21Z","timestamp":1605791481000},"page":"73-96","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["Implementation and Performance Evaluation of Omni Compiler"],"prefix":"10.1007","author":[{"given":"Masahiro","family":"Nakao","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Hitoshi","family":"Murai","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2020,11,20]]},"reference":[{"key":"2_CR1","unstructured":"Programming Environment Research Team, https:\/\/pro-env.riken.jp"},{"key":"2_CR2","unstructured":"High Performance Computing System laboratory, University of Tsukuba, Japan, https:\/\/www.hpcs.cs.tsukuba.ac.jp"},{"key":"2_CR3","unstructured":"M. Sato et al., Omni compiler and XcodeML: an infrastructure for source-to-source transformation, in Platform for Advanced Scientific Computing Conference (PASC16), Lausanne (2016)"},{"key":"2_CR4","unstructured":"M. Nakao et al., Performance evaluation for Omni XcalableMP compiler on many-core cluster system based on knights landing, in IXPUG Workshop Asia 2018, Tokyo (2018), pp. 52\u201358"},{"issue":"1","key":"2_CR5","doi-asserted-by":"publisher","first-page":"110","DOI":"10.1177\/1094342017698214","volume":"33","author":"M Nakao","year":"2017","unstructured":"M. Nakao et al., Implementation and evaluation of the HPC challenge benchmark in the XcalableMP PGAS language. Int. J. High Perform. Comput. Appl. 33(1), 110\u2013123 (2017)","journal-title":"Int. J. High Perform. Comput. Appl."},{"key":"2_CR6","unstructured":"BLAS: Basic Linear Algebra Subprograms, http:\/\/www.netlib.org\/blas\/ (2016)"},{"key":"2_CR7","doi-asserted-by":"publisher","first-page":"23","DOI":"10.1007\/BF00162341","volume":"4","author":"DH Bailey","year":"1990","unstructured":"D.H. Bailey, FFTs in external or hierarchical memory. J. Supercomput. 4, 23\u201335 (1990)","journal-title":"J. Supercomput."},{"key":"2_CR8","doi-asserted-by":"publisher","DOI":"10.1137\/1.9781611970999","volume-title":"Computational Frameworks for the Fast Fourier Transform","author":"C Van Loan","year":"1992","unstructured":"C. Van Loan, Computational Frameworks for the Fast Fourier Transform (Society for Industrial and Applied Mathematics, Philadelphia, 1992)"},{"key":"2_CR9","unstructured":"D. Takahashi, A Fast Fourier Transform Package, http:\/\/www.ffte.jp (2014)"},{"key":"2_CR10","unstructured":"R. Ponnusamy et al., Communication overhead on the CM5: an experimental performance evaluation, in Fourth Symposium on the Frontiers of Massively Parallel Computation (1992), pp.108\u2013115"},{"key":"2_CR11","unstructured":"HPL Algorithm Panel Broadcast, http:\/\/www.netlib.org\/benchmark\/hpl\/algorithm.html (2016)"}],"container-title":["XcalableMP PGAS Programming Language"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/978-981-15-7683-6_2","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2020,11,19]],"date-time":"2020-11-19T13:13:25Z","timestamp":1605791605000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1007\/978-981-15-7683-6_2"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,11,20]]},"ISBN":["9789811576829","9789811576836"],"references-count":11,"URL":"https:\/\/doi.org\/10.1007\/978-981-15-7683-6_2","relation":{},"subject":[],"published":{"date-parts":[[2020,11,20]]},"assertion":[{"value":"20 November 2020","order":1,"name":"first_online","label":"First Online","group":{"name":"ChapterHistory","label":"Chapter History"}}]}}