Author
Abstract
Modern high-performance computing (HPC) depends on an ever-evolving hardware landscape. Supercomputers, typically composed of hundreds to thousands of heterogeneous computing units, are further complicated by a variety of available memory and storage architectures. Consequently, efficient HPC-oriented parallel data processing and computation are becoming increasingly complex, even for experienced users. To address this important challenge and streamline the deployment of modern HPC resources, academia and industry have created various performance analysis and profiling tools. The aim of these tools is to collect information on program execution, enabling informed decisions and program optimization. A recent study naively applied a variety of profiling tools on well-known MPI-centric HPC proxy applications and compared the generated runtime overhead, memory consumption, and call path data. Results demonstrated that instrumentation-based tools, like Score-P and TAU, have limitations. We reproduced the experiments and identified the causes for these disadvantages. First, we enhanced libunwind, responsible for collecting backtraces from a signal handler, to make it more reliable. These enhancements enabled Score-P to produce results for all experiment configurations, but also improved the overhead for one benchmark. Second, we show that a proxy application exercises a common MPI communication pattern that induces a high overhead for instrumentation-based tools. We re-ran all of the experiments using a version of this proxy benchmark that was adjusted for ORNL’s Frontier system. This adjusted version also reduced the overhead of the instrumentation-centric tools. Finally, if the tools are used as advertised by the tool developers, all tools work with acceptable runtime overhead, regardless of the profiling technique used.
Suggested Citation
Mikhail Zarubin & Bert Wesarg, 2026.
"Reasoning the Runtime Overhead of Profiling Tools,"
Springer Books, in: Christoph Niethammer & Hartmut Mix & Wolfgang E. Nagel & Michael M. Resch (ed.), Tools for High Performance Computing 2023, pages 1-17,
Springer.
Handle:
RePEc:spr:sprchp:978-3-032-16397-4_1
DOI: 10.1007/978-3-032-16397-4_1
Download full text from publisher
To our knowledge, this item is not available for
download. To find whether it is available, there are three
options:
1. Check below whether another version of this item is available online.
2. Check on the provider's
web page
whether it is in fact available.
3. Perform a
for a similarly titled item that would be
available.
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:sprchp:978-3-032-16397-4_1. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.