Abstract
There has been a significant research in collective communication operations, in particular in MPI broadcast, on distributed memory platforms. Most of the research works are done to optimize the collective operations for particular architectures by taking into account either their topology or platform parameters. In this work we propose a very simple and at the same time general approach to optimize legacy MPI broadcast algorithms, which are widely used in MPICH and OpenMPI. Theoretical analysis and experimental results on IBM BlueGene/P and a cluster of Grid’5000 platform are presented.
Original language | English (US) |
---|---|
Title of host publication | Euro-Par 2014: Parallel Processing Workshops |
Publisher | Springer Nature |
Pages | 412-424 |
Number of pages | 13 |
ISBN (Print) | 9783319143125 |
DOIs | |
State | Published - 2014 |
Externally published | Yes |
Bibliographical note
KAUST Repository Item: Exported on 2020-10-01Acknowledgements: This work has emanated from research conducted withthe financial support of IRCSET (Irish Research Council for Science, Engineeringand Technology) and IBM, grant number EPSG/2011/188 and Science Founda-tion Ireland, grant number 08/IN.1/I2054.Some of the experiments presented in this publication were carried out us-ing the Grid’5000 experimental testbed, being developed under the INRIA AL-ADDIN development action with support from CNRS, RENATER and severalUniversities as well as other funding bodies (seehttps://www.grid5000.fr)Another part of the experiments were carried out using the resources of the Su-percomputing Laboratory at King Abdullah University of Science&Technology(KAUST) in Thuwal, Saudi Arabia.
This publication acknowledges KAUST support, but has no KAUST affiliated authors.