Performance Analysis for Large Scale GPU Applications and DL Frameworks


Juckeland, G.; Henschel, R.

Get your hands on the latest versions of Score-P and Vampir to profile the execution behavior of your large-scale GPU-accelerated applications. See how these HPC community tools pick up where other tools (such as NVVP, the NVIDIA Visual Profiler) drop off once your application spans multiple compute nodes. Whether your application uses CUDA, OpenACC, OpenMP, or OpenCL for acceleration, and whether it is written in C, C++, Fortran, or Python, you will get a high-resolution timeline view of all program activity alongside the standard profiles, helping you identify hot spots and avenues for optimization. The new Python support also enables performance studies of the inner workings of deep learning frameworks.

  • Lecture (Conference)
    GPU Technology Conference 2019, 17-21 March 2019, San Jose, CA, USA

Permalink: https://www.hzdr.de/publications/Publ-29070
Publ.-Id: 29070