Workshop Title:

Parallel Performance Evaluation Using TAU

Date/Time:

June 21–22, 2021

Morning: 9am PT/10am MT - noon PT/1pm MT 

  • Afternoon 1:30pm PT/2:30pm PT  - 4:30pm PT/5:30pm MT

A Webex link will be sent to all registrants.

Description:

This two-day workshop will cover performance evaluation of applications on LLNL, Sandia, and LANL OCF platforms and consulting sessions.

This workshop will focus on performance data collection, analysis, and performance optimization. After describing and demonstrating how performance data (both profile and trace data) can be collected in a straightforward manner using TAU’s (Tuning and Analysis Utilities) automated instrumentation, the workshop will cover how to analyze the performance data collected and drill down to find performance bottlenecks and determine their causes.

The workshop will include some sample codes that illustrate the different instrumentation and measurement choices available to the users. Topics will cover generating performance profiles and traces with memory utilization and headroom, I/O, and hardware performance counters data using PAPI. Hardware counter data can show not only which routines are taking the most time, but why? For example, because of cache misses, TLB misses, excess address arithmetic, or poor branch prediction behavior.

The workshop will cover instrumentation of OpenMP programs using OMPT and measurement of a program’s memory footprint. It will demonstrate scalable tracing using OTF2 and visualization using the Vampir trace analysis tool. Performance data analysis using ParaProf and PerfExplorer will be demonstrated using the performance data management framework (TAUdb) that includes TAU’s performance database.

The workshop will also feature cross experiment analysis including comparing the effects of multi-core architectures on code performance. The demonstrations will include using TAU on programming paradigms such as ROCm, Intel oneAPI (DPC++/SYCL), OpenCL, OpenACC, and CUDA on current and emerging GPU enabled HPC systems.

The workshop will also cover using TAU in the ECP Extreme-scale Scientific Software Stack (E4S) [https://e4s.io] using container technology and Amazon Web Services (AWS) instances. We will attempt to collect and analyze performance data for additional user codes during the hands-on portion of the workshop. Users and developers are welcome to contact the instructor ahead of time to begin collecting data to discuss at the workshop.

The second day will be devoted to discussions with application developers, by appointment only. Please contact Sameer Shende - sameer@paratools.com to request an appointment.