Multiple GPUs using MPI and OpenACC

Saber Feki, Ahmed Al-Jarro, Hakan Bagci

Research output: Contribution to journalArticlepeer-review


A scalable parallelization algorithm to port an explicit marching-on-in-time (MOT)-based time domain volume integral equation (TDVIE) solver onto multi-GPUs is described. The algorithm makes use of MPI and OpenACC for efficient implementation. The MPI processes are responsible for synchronizing and communicating the distributed compute kernels of the MOT-TDVIE solver between the GPUs, where one MPI task is assigned to one GPU. The compiler directives of the OpenACC are responsible for the data transfer and kernels’ offloading from the CPU to the GPU and their execution on the GPU. The speedups achieved against the MPI/OpenMP code execution on multiple CPUs and parallel efficiencies are presented.

Original languageEnglish (US)
Pages (from-to)164-167
Number of pages4
JournalApplied Computational Electromagnetics Society Journal
Issue number2
StatePublished - Feb 2018

Bibliographical note

Publisher Copyright:


  • Explicit marching-on-in-time scheme
  • GPU
  • MPI
  • OpenACC
  • Time-domain volume integral equation

ASJC Scopus subject areas

  • Astronomy and Astrophysics
  • Electrical and Electronic Engineering


Dive into the research topics of 'Multiple GPUs using MPI and OpenACC'. Together they form a unique fingerprint.

Cite this