High-Level Topology-Oblivious Optimization of MPI Broadcast Algorithms on Extreme-Scale Platforms

Khalid Hasanov, Jean-Noël Quintin, Alexey Lastovetsky

Research output: Chapter in Book/Report/Conference proceedingChapter

4 Scopus citations

Abstract

There has been a significant research in collective communication operations, in particular in MPI broadcast, on distributed memory platforms. Most of the research works are done to optimize the collective operations for particular architectures by taking into account either their topology or platform parameters. In this work we propose a very simple and at the same time general approach to optimize legacy MPI broadcast algorithms, which are widely used in MPICH and OpenMPI. Theoretical analysis and experimental results on IBM BlueGene/P and a cluster of Grid’5000 platform are presented.
Original languageEnglish (US)
Title of host publicationEuro-Par 2014: Parallel Processing Workshops
PublisherSpringer Nature
Pages412-424
Number of pages13
ISBN (Print)9783319143125
DOIs
StatePublished - 2014
Externally publishedYes

Bibliographical note

KAUST Repository Item: Exported on 2020-10-01
Acknowledgements: This work has emanated from research conducted withthe financial support of IRCSET (Irish Research Council for Science, Engineeringand Technology) and IBM, grant number EPSG/2011/188 and Science Founda-tion Ireland, grant number 08/IN.1/I2054.Some of the experiments presented in this publication were carried out us-ing the Grid’5000 experimental testbed, being developed under the INRIA AL-ADDIN development action with support from CNRS, RENATER and severalUniversities as well as other funding bodies (seehttps://www.grid5000.fr)Another part of the experiments were carried out using the resources of the Su-percomputing Laboratory at King Abdullah University of Science&Technology(KAUST) in Thuwal, Saudi Arabia.
This publication acknowledges KAUST support, but has no KAUST affiliated authors.

Fingerprint

Dive into the research topics of 'High-Level Topology-Oblivious Optimization of MPI Broadcast Algorithms on Extreme-Scale Platforms'. Together they form a unique fingerprint.

Cite this