In this paper we present scaling results of a FFT library, FFTK, and a pseudospectral code, Tarang, on grid resolutions up to 81923 grid using 65536 cores of Blue Gene/P and 196608 cores of Cray XC40 supercomputers. We observe that communication dominates computation, more so on the Cray XC40. The computation time scales as Tcomp∼p−1, and the communication time as Tcomm∼n−γ2 with γ2 ranging from 0.7 to 0.9 for Blue Gene/P, and from 0.43 to 0.73 for Cray XC40. FFTK, and the fluid and convection solvers of Tarang exhibit weak as well as strong scaling nearly up to 196608 cores of Cray XC40. We perform a comparative study of the performance on the Blue Gene/P and Cray XC40 clusters.
|Original language||English (US)|
|Number of pages||15|
|Journal||Journal of Parallel and Distributed Computing|
|State||Published - Mar 2018|
Bibliographical noteFunding Information:
We thank Shashwat Bhattacharya for his valuable help on plotting. Our numerical simulations were performed at Cray XC40 Shaheen II at KAUST supercomputing laboratory (KSL), Saudi Arabia. We thank KAUST scientists for the kind support while performing our simulations. This work was supported by research grants SERB/F/3279 from Science and Engineering Research Board India , research grant K1052 and baseline research grant BAS/1/1349-01-01 from KAUST , and R&D/20130307 from IIT Kanpur ; and by KAUST baseline research funds of Ravi Samtaney.
© 2017 Elsevier Inc.
- Extreme-resolution turbulence simulation
- Fast Fourier transform
- Pseudospectral method
- Turbulence simulation
ASJC Scopus subject areas
- Theoretical Computer Science
- Hardware and Architecture
- Computer Networks and Communications
- Artificial Intelligence