{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,2,20]],"date-time":"2025-02-20T05:16:59Z","timestamp":1740028619670,"version":"3.37.3"},"reference-count":0,"publisher":"IOS Press","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2014]]},"abstract":"<jats:p>State-of-the-art high performance computing (HPC) applications have to scale over an increasing number of processing elements, meanwhile application developers recently have to face the programming of special accelerator hardware. Although computing languages like CUDA and programming standards like OpenACC provide a fairly easy way to exploit the computational power of general purpose graphics processing units (GPGPUs), their programming is still challenging. Performance analysis is a vital procedure to efficiently use the available hardware and programming models. This paper presents the Vampir performance analysis capabilities by taking the example of a molecular dynamics code, which uses message passing (MPI), threading (OpenMP) and offloading to accelerators (OpenACC and CUDA). It is shown that the Vampir tool-set allows a holistic view on the combined usage of all commonly utilized programming paradigms in heterogeneous HPC applications.<\/jats:p>","DOI":"10.3233\/978-1-61499-381-0-793","type":"book-chapter","created":{"date-parts":[[2025,2,19]],"date-time":"2025-02-19T15:30:51Z","timestamp":1739979051000},"source":"Crossref","is-referenced-by-count":0,"title":["A Case Study: Holistic Performance Analysis on Heterogeneous Architectures using the Vampir Toolchain"],"prefix":"10.3233","author":[{"family":"Dietrich Robert","sequence":"additional","affiliation":[]},{"family":"Winkler Frank","sequence":"additional","affiliation":[]},{"family":"William Thomas","sequence":"additional","affiliation":[]},{"family":"Stolle Jonas","sequence":"additional","affiliation":[]},{"family":"Henschel Robert","sequence":"additional","affiliation":[]},{"family":"Berry Donald K.","sequence":"additional","affiliation":[]}],"member":"7437","container-title":["Advances in Parallel Computing","Parallel Computing: Accelerating Computational Science and Engineering (CSE)"],"original-title":[],"deposited":{"date-parts":[[2025,2,19]],"date-time":"2025-02-19T15:47:17Z","timestamp":1739980037000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.medra.org\/servlet\/aliasResolver?alias=iospressISSNISBN&issn=0927-5452&volume=25&spage=793"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2014]]},"references-count":0,"URL":"https:\/\/doi.org\/10.3233\/978-1-61499-381-0-793","relation":{},"ISSN":["0927-5452"],"issn-type":[{"value":"0927-5452","type":"print"}],"subject":[],"published":{"date-parts":[[2014]]}}}