Author
Listed:
- Bernd Mohr
(Forschungszentrum Jülich GmbH, Jülich Supercomputing Centre)
- Vladimir Voevodin
(Moscow State University, RCC)
- Judit Giménez
(Barcelona Supercomputing Centre)
- Erik Hagersten
(Rogue Wave Software AB)
- Andreas Knüpfer
(Technical University Dresden)
- Dmitry A. Nikitenko
(Moscow State University, RCC)
- Mats Nilsson
(Rogue Wave Software AB)
- Harald Servat
(Barcelona Supercomputing Centre)
- Aamer Shah
(German Research School for Simulation Sciences GmbH / RWTH Aachen University)
- Frank Winkler
(Technical University Dresden)
- Felix Wolf
(German Research School for Simulation Sciences GmbH / RWTH Aachen University)
- Ilya Zhukov
(Forschungszentrum Jülich GmbH, Jülich Supercomputing Centre)
Abstract
To maximise the scientific output of a high-performance computing system, different stakeholders pursue different strategies. While individual application developers are trying to shorten the time to solution by optimising their codes, system administrators are tuning the configuration of the overall system to increase its throughput. Yet, the complexity of today’s machines with their strong interrelationship between application and system performance presents serious challenges to achieving these goals. The HOPSA project (HOlistic Performance System Analysis) therefore sets out to create an integrated diagnostic infrastructure for combined application and system-level tuning – with the former provided by the EU and the latter by the Russian project partners. Starting from system-wide basic performance screening of individual jobs, an automated workflow routes findings on potential bottlenecks either to application developers or system administrators with recommendations on how to identify their root cause using more powerful diagnostic tools. Developers can choose from a variety of mature performance-analysis tools developed by our consortium. Within this project, the tools will be further integrated and enhanced with respect to scalability, depth of analysis, and support for asynchronous tasking, a node-level paradigm playing an increasingly important role in hybrid programs on emerging hierarchical and heterogeneous systems.
Suggested Citation
Bernd Mohr & Vladimir Voevodin & Judit Giménez & Erik Hagersten & Andreas Knüpfer & Dmitry A. Nikitenko & Mats Nilsson & Harald Servat & Aamer Shah & Frank Winkler & Felix Wolf & Ilya Zhukov, 2013.
"The HOPSA Workflow and Tools,"
Springer Books, in: Alexey Cheptsov & Steffen Brinkmann & José Gracia & Michael M. Resch & Wolfgang E. Nagel (ed.), Tools for High Performance Computing 2012, edition 127, pages 127-146,
Springer.
Handle:
RePEc:spr:sprchp:978-3-642-37349-7_9
DOI: 10.1007/978-3-642-37349-7_9
Download full text from publisher
To our knowledge, this item is not available for
download. To find whether it is available, there are three
options:
1. Check below whether another version of this item is available online.
2. Check on the provider's
web page
whether it is in fact available.
3. Perform a
for a similarly titled item that would be
available.
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:sprchp:978-3-642-37349-7_9. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.