Abstract
We describe code optimization and parallelization procedures applied to the sequential overland flow solver FullSWOF2D. Major difficulties when simulating overland flows comprise dealing with high resolution datasets of large scale areas which either cannot be computed on a single node either due to limited amount of memory or due to too many (time step) iterations resulting from the CFL condition. We address these issues in terms of two major contributions. First, we demonstrate a generic step-by-step transformation of the second order finite volume scheme in FullSWOF2D towards MPI parallelization. Second, the computational kernels are optimized by the use of templates and a portable vectorization approach. We discuss the load imbalance of the flux computation due to dry and wet cells and propose a solution using an efficient cell counting approach. Finally, scalability results are shown for different test scenarios along with a flood simulation benchmark using the Shaheen II supercomputer.
Original language | English (US) |
---|---|
Pages (from-to) | 110-125 |
Number of pages | 16 |
Journal | Computers & Mathematics with Applications |
Volume | 74 |
Issue number | 1 |
DOIs | |
State | Published - Jan 25 2017 |
Externally published | Yes |
Bibliographical note
KAUST Repository Item: Exported on 2020-10-01Acknowledgements: We thank for financial support by the Kompetenznetzwerk für Wissenschaftliches Höchstleistungsrechnen in Bayern (KONWIHR) for the Multicore-Software-Initiative with the project “Optimization of a multi-functional shallow water solver for complex overland flows” (KONWIHR-IV) and KAUST Supercomputing Laboratory for providing the access to supercomputer Shaheen 2 (project k1050). We thank Ralf-Peter Mundani and Florian Mintgen from the Department of Civil, Geo and Environmental Engineering of the Technical University of Munich for access to the Glasgow scenario datasets as well as for the fruitful discussions on this scenario.
This publication acknowledges KAUST support, but has no KAUST affiliated authors.