IDEAS home Printed from https://ideas.repec.org/h/spr/sprchp/978-3-319-56702-0_6.html
   My bibliography  Save this book chapter

Trace-Based Detection of Lock Contention in MPI One-Sided Communication

In: Tools for High Performance Computing 2016

Author

Listed:
  • Marc-André Hermanns

    (Forschungszentrum Jülich GmbH, JARA-HPC, Jülich Supercomputing Centre)

  • Markus Geimer

    (Forschungszentrum Jülich GmbH, Jülich Supercomputing Centre)

  • Bernd Mohr

    (Forschungszentrum Jülich GmbH, JARA-HPC, Jülich Supercomputing Centre)

  • Felix Wolf

    (TU Darmstadt, Parallel Programming)

Abstract

Performance analysis is an essential part of the development process of HPC applications. Thus, developers need adequate tools to evaluate design and implementation decisions to effectively develop efficient parallel applications. Therefore, it is crucial that tools provide an as complete support as possible for the available language and library features to ensure that design decisions are not negatively influenced by the level of available tool support. The message passing interface (MPI) supports three basic communication paradigms: point-to-point, collective, and one-sided. Each of these targets and excels at a specific application scenario. While current performance tools support the first two quite well, one-sided communication is often neglected. In our earlier work, we were able to reduce this gap by showing how wait states in MPI one-sided communication using active-target synchronization can be detected at large scale using our trace-based message replay technique. Further extending our work on the detection of progress-related wait states in ARMCI, this paper presents an improved infrastructure that is capable of not only detecting progress-related wait states, but also wait states due to lock contention in MPI passive-target synchronization. We present an event-based definition of lock contention, the trace-based algorithm to detect it, as well as initial results with a micro-benchmark and an application kernel scaling up to 65,536 processes.

Suggested Citation

  • Marc-André Hermanns & Markus Geimer & Bernd Mohr & Felix Wolf, 2017. "Trace-Based Detection of Lock Contention in MPI One-Sided Communication," Springer Books, in: Christoph Niethammer & José Gracia & Tobias Hilbrich & Andreas Knüpfer & Michael M. Resch & Wolfgang (ed.), Tools for High Performance Computing 2016, pages 97-114, Springer.
  • Handle: RePEc:spr:sprchp:978-3-319-56702-0_6
    DOI: 10.1007/978-3-319-56702-0_6
    as

    Download full text from publisher

    To our knowledge, this item is not available for download. To find whether it is available, there are three options:
    1. Check below whether another version of this item is available online.
    2. Check on the provider's web page whether it is in fact available.
    3. Perform a
    for a similarly titled item that would be available.

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:sprchp:978-3-319-56702-0_6. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.