Author
Listed:
- M. L. Becoña
(University of Santiago de Compostela, CiTIUS Centro de Investigación en Tecnoloxías Intelixentes)
- O. G. Lorenzo
(University of Santiago de Compostela, CiTIUS Centro de Investigación en Tecnoloxías Intelixentes)
- T. F. Pena
(University of Santiago de Compostela, CiTIUS Centro de Investigación en Tecnoloxías Intelixentes)
- J. C. Cabaleiro
(University of Santiago de Compostela, CiTIUS Centro de Investigación en Tecnoloxías Intelixentes)
- F. F. Rivera
(University of Santiago de Compostela, CiTIUS Centro de Investigación en Tecnoloxías Intelixentes)
- J. A. Lorenzo
(ETIS Laboratory, CY Cergy Paris Université)
Abstract
Multicore systems present on-board memory hierarchies and communication networks that influence performance when executing shared-memory parallel codes. Characterising this influence is complex, and understanding the effect of particular hardware configurations on different codes is of paramount importance. In this context, precise monitoring information can be extracted from hardware counters (HC) at runtime to characterise the behaviour of each thread of a parallel code. This technology provides high accuracy with a low overhead. In particular, we introduce a new tool to get this information from hardware counters in terms of number of floating point operations per second, operational intensity, latency of memory access, and energy consumption. Note the first two parameters define the well-known Roofline Model, an intuitive visual performance model used to provide performance estimates of applications running on multi-core architectures. The third parameter quantifies data locality and the fourth one is related to the load of each node of the system. All this information is accessed through the perf $$\_$$ _ events interface provided by Linux, with the aid of the libpfm library. This tool can be used to utilise its monitoring information to optimise execution efficiency in NUMA systems by balancing or scheduling the workloads, guiding thread and page migration strategies in order to increase locality and affinity. The designated migrations are based on optimisation strategies, supported by runtime information provided by hardware counters. Overall, the profiling application is launched from a terminal as a background process, it does not require superuser permissions to run properly, and can lead to performance optimization in multithreaded applications and power saving in NUMA systems.
Suggested Citation
M. L. Becoña & O. G. Lorenzo & T. F. Pena & J. C. Cabaleiro & F. F. Rivera & J. A. Lorenzo, 2021.
"A Tool for Runtime Analysis of Performance and Energy Usage in NUMA Systems,"
Springer Books, in: Hartmut Mix & Christoph Niethammer & Huan Zhou & Wolfgang E. Nagel & Michael M. Resch (ed.), Tools for High Performance Computing 2018 / 2019, pages 85-99,
Springer.
Handle:
RePEc:spr:sprchp:978-3-030-66057-4_4
DOI: 10.1007/978-3-030-66057-4_4
Download full text from publisher
To our knowledge, this item is not available for
download. To find whether it is available, there are three
options:
1. Check below whether another version of this item is available online.
2. Check on the provider's
web page
whether it is in fact available.
3. Perform a
for a similarly titled item that would be
available.
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:sprchp:978-3-030-66057-4_4. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.