IMPLEMENTATION OF FDTD-COMPATIBLE GREEN'S FUNCTION ON HETEROGENEOUS CPU-GPU PARALLEL PROCESSING SYSTEM

Bibliographic Details
Published in: Electromagnetic waves (Cambridge, Mass.), 2013-03, Vol.135, p.297-316
Main Author: Stefanski, Tomasz P
Format: Article
Language:English
Description
Summary: This paper presents an implementation of the FDTD-compatible Green's function on a heterogeneous parallel processing system. The implementation uses the central processing unit (CPU) and the graphics processing unit (GPU) simultaneously, assigning to each the computational tasks best suited to its architecture. A closed-form expression for this discrete Green's function (DGF) was recently derived, which facilitates its application in FDTD simulations of radiation and scattering problems. Unfortunately, implementing the new DGF formula in software requires multiple-precision arithmetic and may lead to long runtimes. Therefore, the DGF computations were accelerated on a heterogeneous CPU-GPU parallel processing system using multiple-precision arithmetic and the OpenMP and CUDA parallel programming interfaces. The method avoids the drawbacks of the CPU-only and GPU-only accelerated implementations of the DGF, namely the long runtime on the CPU for long DGF waveforms and the significant GPU initialization overhead for short ones. As a result, a sevenfold speedup was obtained relative to the reference DGF implementation on a multicore CPU, which significantly improves the applicability of the DGF in FDTD simulations.
ISSN:1559-8985
1070-4698
DOI:10.2528/PIER12111702