
IMPLEMENTATION OF FDTD-COMPATIBLE GREEN'S FUNCTION ON HETEROGENEOUS CPU-GPU PARALLEL PROCESSING SYSTEM

This paper presents an implementation of the FDTD-compatible Green's function on a heterogeneous parallel processing system. The developed implementation simultaneously utilizes computational power of the central processing unit (CPU) and the graphics processing unit (GPU) to the computational...

Bibliographic Details
Published in:Electromagnetic waves (Cambridge, Mass.), 2013-03, Vol.135, p.297-316
Main Author: Stefanski, Tomasz P
Format: Article
Language:English
container_end_page 316
container_issue
container_start_page 297
container_title Electromagnetic waves (Cambridge, Mass.)
container_volume 135
creator Stefanski, Tomasz P
description This paper presents an implementation of the FDTD-compatible Green's function on a heterogeneous parallel processing system. The implementation uses the central processing unit (CPU) and the graphics processing unit (GPU) simultaneously, assigning to each the computational tasks best suited to its architecture. A closed-form expression for this discrete Green's function (DGF) was recently derived, which facilitates its application in FDTD simulations of radiation and scattering problems. Unfortunately, implementing the new DGF formula in software requires multiple-precision arithmetic and may lead to long runtimes. Therefore, DGF computations were accelerated on a heterogeneous CPU-GPU parallel processing system using multiple-precision arithmetic together with the OpenMP and CUDA parallel programming interfaces. The method avoids the drawbacks of CPU-only and GPU-only accelerated implementations of the DGF, namely the long runtime on the CPU for long DGF waveforms and the significant overhead of GPU initialization for short DGF waveforms. As a result, a sevenfold speedup was obtained relative to the reference DGF implementation on a multicore CPU, which significantly improves the applicability of the DGF in FDTD simulations.
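The trade-off named in the description (GPU initialization overhead dominating for short DGF waveforms, CPU runtime dominating for long ones) can be illustrated with a small cost model. The following is only a sketch under stated assumptions and is not the method of the paper: the class `CostModel`, its fields, the linear cost in waveform length, and all numeric values are hypothetical and chosen purely for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CostModel:
    """Hypothetical timing model; the linear cost and all values are assumptions."""
    cpu_seconds_per_sample: float  # assumed CPU cost per DGF time sample
    gpu_seconds_per_sample: float  # assumed GPU cost per DGF time sample
    gpu_init_seconds: float        # assumed one-off GPU initialization overhead

    def cpu_time(self, n_samples: int) -> float:
        # CPU has no start-up cost in this model.
        return self.cpu_seconds_per_sample * n_samples

    def gpu_time(self, n_samples: int) -> float:
        # GPU pays the initialization overhead once, then a per-sample cost.
        return self.gpu_init_seconds + self.gpu_seconds_per_sample * n_samples

    def choose_device(self, n_samples: int) -> str:
        """Return "cpu" or "gpu", whichever the model predicts to be faster."""
        if self.cpu_time(n_samples) <= self.gpu_time(n_samples):
            return "cpu"
        return "gpu"

    def break_even_samples(self) -> float:
        """Waveform length at which both devices are predicted to tie."""
        rate_gap = self.cpu_seconds_per_sample - self.gpu_seconds_per_sample
        if rate_gap <= 0:
            # GPU never catches up if it is not faster per sample.
            return float("inf")
        return self.gpu_init_seconds / rate_gap


# Illustrative numbers only; they are not measurements from the paper.
model = CostModel(cpu_seconds_per_sample=0.5,
                  gpu_seconds_per_sample=0.125,
                  gpu_init_seconds=3.0)
short_choice = model.choose_device(4)    # short waveform
long_choice = model.choose_device(16)    # long waveform
```

With these assumed values the model prefers the CPU for the short waveform and the GPU for the long one, which mirrors the qualitative argument in the description. A real heterogeneous implementation would split work between both devices rather than pick one.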
doi_str_mv 10.2528/PIER12111702
format article
publisher Electromagnetics Academy
rights COPYRIGHT 2013 Electromagnetics Academy
eissn 1559-8985
tpages 20
oa free_for_read
toplevel peer_reviewed; online_resources
fulltext fulltext
identifier ISSN: 1559-8985
ispartof Electromagnetic waves (Cambridge, Mass.), 2013-03, Vol.135, p.297-316
issn 1559-8985
1070-4698
language eng
recordid cdi_gale_infotracmisc_A348311388
source Publicly Available Content (ProQuest); EZB Electronic Journals Library
subjects Central processing units
Fourier transformations
Graphics coprocessors
Microprocessors
Parallel processing
Potential theory (Mathematics)
Properties
Time-domain analysis
title IMPLEMENTATION OF FDTD-COMPATIBLE GREEN'S FUNCTION ON HETEROGENEOUS CPU-GPU PARALLEL PROCESSING SYSTEM
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-07T23%3A02%3A56IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=IMPLEMENTATION%20OF%20FDTD-COMPATIBLE%20GREEN'S%20FUNCTION%20ON%20HETEROGENEOUS%20CPU-GPU%20PARALLEL%20PROCESSING%20SYSTEM&rft.jtitle=Electromagnetic%20waves%20(Cambridge,%20Mass.)&rft.au=Stefanski,%20Tomasz%20P&rft.date=2013-03-01&rft.volume=135&rft.spage=297&rft.epage=316&rft.pages=297-316&rft.issn=1559-8985&rft.eissn=1559-8985&rft_id=info:doi/10.2528/PIER12111702&rft_dat=%3Cgale_cross%3EA348311388%3C/gale_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c374t-c681a357fc010e65f563ad82d670bb0302db731dfb224ab227a344011059a7cf3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rft_galeid=A348311388&rfr_iscdi=true