An overview of concerns observed in allowing for reproducibility in parallel applications that heavily depend on the three dimensional distributed memory fast Fourier transform are summarized. Suggestions for reproducibility categories for benchmark results are given.
|Original language||English (US)|
|Title of host publication||Companion of the 2019 ACM/SPEC International Conference on Performance Engineering - ICPE '19|
|Publisher||Association for Computing Machinery (ACM)|
|Number of pages||4|
|State||Published - Apr 5 2019|
Bibliographical noteKAUST Repository Item: Exported on 2020-10-01
Acknowledgements: We thank all the authors of  and those who have given a presentation on their use of the FFT in the ongoing discussion at www.fft.report. We also thank Robert Henschel for an overview of the SPEC benchmarking process at the benchmarking in the data center workshop at HPC Asia 2019. We thank RIKEN for the use of the K computer, HLRS for the use of Kabuki and Hazelhen, and the KAUST Supercomputing Laboratory for the use of Shaheen II. B.K.M. was partially supported by HPC Europa 3 (INFRAIA-2016-1-730897). B.K.M. thanks H. Berger, A. Chepstov, J. Gracia and A. Jocksch for helpful hints and discussions.