Loading…
IMPLEMENTATION OF FDTD-COMPATIBLE GREEN'S FUNCTION ON HETEROGENEOUS CPU-GPU PARALLEL PROCESSING SYSTEM
This paper presents an implementation of the FDTD-compatible Green's function on a heterogeneous parallel processing system. The developed implementation simultaneously utilizes computational power of the central processing unit (CPU) and the graphics processing unit (GPU) to the computational...
Saved in:
Published in: | Electromagnetic waves (Cambridge, Mass.) Mass.), 2013-03, Vol.135, p.297-316 |
---|---|
Main Author: | |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c374t-c681a357fc010e65f563ad82d670bb0302db731dfb224ab227a344011059a7cf3 |
---|---|
cites | |
container_end_page | 316 |
container_issue | |
container_start_page | 297 |
container_title | Electromagnetic waves (Cambridge, Mass.) |
container_volume | 135 |
creator | Stefanski, Tomasz P |
description | This paper presents an implementation of the FDTD-compatible Green's function on a heterogeneous parallel processing system. The developed implementation simultaneously utilizes computational power of the central processing unit (CPU) and the graphics processing unit (GPU) to the computational tasks best suited for each architecture. Recently, closed-form expression for this discrete Green's function (DGF) was derived, which facilitates its applications in the FDTD simulations of radiation and scattering problems. Unfortunately, implementation of the new DGF formula in software requires a multiple precision arithmetic and may cause long runtimes. Therefore, an acceleration of the DGF computations on a CPU-GPU heterogeneous parallel processing system was developed using the multiple precision arithmetic and the OpenMP and CUDA parallel programming interfaces. The method avoids drawbacks of the CPU-and GPU-only accelerated implementations of the DGF, i.e., long runtime on the CPU and significant overhead of the GPU initialization respectively for long and short length of the DGF waveform. As a result, the sevenfold speedup was obtained relative to the reference DGF implementation on a multicore CPU thus applicability of the DGF in FDTD simulations was significantly improved. |
doi_str_mv | 10.2528/PIER12111702 |
format | article |
fullrecord | <record><control><sourceid>gale_cross</sourceid><recordid>TN_cdi_gale_infotracmisc_A348311388</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A348311388</galeid><sourcerecordid>A348311388</sourcerecordid><originalsourceid>FETCH-LOGICAL-c374t-c681a357fc010e65f563ad82d670bb0302db731dfb224ab227a344011059a7cf3</originalsourceid><addsrcrecordid>eNptkUtLw0AUhYMoWGp3_oABFyKYOo9MMl3GdJIG8iKPhaswmWRKpA_JRNB_b6QuWvBeuPdy-M7dHMO4R3CJKWYvWchzhBFCDsRXxgxRujLZitHrs_vWWGj9DqeilkMgmhkqjLOIxzwp3TJME5D6wF-Xa9NL42xSXiMOgpzz5LEAfpV4JyYBG17yPA14wtOqAF5WmUFWgczN3SjiEcjy1ONFESYBKN6Kksd3xo0SO90t_vbcqHxeehszSoPQcyNTEscaTWkzJAh1lIQIdjZV1CaiZbi1Hdg0kEDcNg5BrWowtsQ0HEEsCyIE6Uo4UpG58XD6uxW7ru4P6jgOQu57LWuXWIwgRBibqOU_1NRtt-_l8dCpftIvDE8XhokZu69xKz61rsMiv2SfT6wcjloPnao_hn4vhu8awfo3qfo8KfID9PN6GA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>IMPLEMENTATION OF FDTD-COMPATIBLE GREEN'S FUNCTION ON HETEROGENEOUS CPU-GPU PARALLEL PROCESSING SYSTEM</title><source>Publicly Available Content (ProQuest)</source><source>EZB Electronic Journals Library</source><creator>Stefanski, Tomasz P</creator><creatorcontrib>Stefanski, Tomasz P</creatorcontrib><description>This paper presents an implementation of the FDTD-compatible Green's function on a heterogeneous parallel processing system. The developed implementation simultaneously utilizes computational power of the central processing unit (CPU) and the graphics processing unit (GPU) to the computational tasks best suited for each architecture. Recently, closed-form expression for this discrete Green's function (DGF) was derived, which facilitates its applications in the FDTD simulations of radiation and scattering problems. Unfortunately, implementation of the new DGF formula in software requires a multiple precision arithmetic and may cause long runtimes. Therefore, an acceleration of the DGF computations on a CPU-GPU heterogeneous parallel processing system was developed using the multiple precision arithmetic and the OpenMP and CUDA parallel programming interfaces. The method avoids drawbacks of the CPU-and GPU-only accelerated implementations of the DGF, i.e., long runtime on the CPU and significant overhead of the GPU initialization respectively for long and short length of the DGF waveform. As a result, the sevenfold speedup was obtained relative to the reference DGF implementation on a multicore CPU thus applicability of the DGF in FDTD simulations was significantly improved.</description><identifier>ISSN: 1559-8985</identifier><identifier>ISSN: 1070-4698</identifier><identifier>EISSN: 1559-8985</identifier><identifier>DOI: 10.2528/PIER12111702</identifier><language>eng</language><publisher>Electromagnetics Academy</publisher><subject>Central processing units ; Fourier transformations ; Graphics coprocessors ; Microprocessors ; Parallel processing ; Potential theory (Mathematics) ; Properties ; Time-domain analysis</subject><ispartof>Electromagnetic waves (Cambridge, Mass.), 2013-03, Vol.135, p.297-316</ispartof><rights>COPYRIGHT 2013 Electromagnetics Academy</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c374t-c681a357fc010e65f563ad82d670bb0302db731dfb224ab227a344011059a7cf3</citedby></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids></links><search><creatorcontrib>Stefanski, Tomasz P</creatorcontrib><title>IMPLEMENTATION OF FDTD-COMPATIBLE GREEN'S FUNCTION ON HETEROGENEOUS CPU-GPU PARALLEL PROCESSING SYSTEM</title><title>Electromagnetic waves (Cambridge, Mass.)</title><description>This paper presents an implementation of the FDTD-compatible Green's function on a heterogeneous parallel processing system. The developed implementation simultaneously utilizes computational power of the central processing unit (CPU) and the graphics processing unit (GPU) to the computational tasks best suited for each architecture. Recently, closed-form expression for this discrete Green's function (DGF) was derived, which facilitates its applications in the FDTD simulations of radiation and scattering problems. Unfortunately, implementation of the new DGF formula in software requires a multiple precision arithmetic and may cause long runtimes. Therefore, an acceleration of the DGF computations on a CPU-GPU heterogeneous parallel processing system was developed using the multiple precision arithmetic and the OpenMP and CUDA parallel programming interfaces. The method avoids drawbacks of the CPU-and GPU-only accelerated implementations of the DGF, i.e., long runtime on the CPU and significant overhead of the GPU initialization respectively for long and short length of the DGF waveform. As a result, the sevenfold speedup was obtained relative to the reference DGF implementation on a multicore CPU thus applicability of the DGF in FDTD simulations was significantly improved.</description><subject>Central processing units</subject><subject>Fourier transformations</subject><subject>Graphics coprocessors</subject><subject>Microprocessors</subject><subject>Parallel processing</subject><subject>Potential theory (Mathematics)</subject><subject>Properties</subject><subject>Time-domain analysis</subject><issn>1559-8985</issn><issn>1070-4698</issn><issn>1559-8985</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2013</creationdate><recordtype>article</recordtype><recordid>eNptkUtLw0AUhYMoWGp3_oABFyKYOo9MMl3GdJIG8iKPhaswmWRKpA_JRNB_b6QuWvBeuPdy-M7dHMO4R3CJKWYvWchzhBFCDsRXxgxRujLZitHrs_vWWGj9DqeilkMgmhkqjLOIxzwp3TJME5D6wF-Xa9NL42xSXiMOgpzz5LEAfpV4JyYBG17yPA14wtOqAF5WmUFWgczN3SjiEcjy1ONFESYBKN6Kksd3xo0SO90t_vbcqHxeehszSoPQcyNTEscaTWkzJAh1lIQIdjZV1CaiZbi1Hdg0kEDcNg5BrWowtsQ0HEEsCyIE6Uo4UpG58XD6uxW7ru4P6jgOQu57LWuXWIwgRBibqOU_1NRtt-_l8dCpftIvDE8XhokZu69xKz61rsMiv2SfT6wcjloPnao_hn4vhu8awfo3qfo8KfID9PN6GA</recordid><startdate>20130301</startdate><enddate>20130301</enddate><creator>Stefanski, Tomasz P</creator><general>Electromagnetics Academy</general><scope>AAYXX</scope><scope>CITATION</scope><scope>ISR</scope></search><sort><creationdate>20130301</creationdate><title>IMPLEMENTATION OF FDTD-COMPATIBLE GREEN'S FUNCTION ON HETEROGENEOUS CPU-GPU PARALLEL PROCESSING SYSTEM</title><author>Stefanski, Tomasz P</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c374t-c681a357fc010e65f563ad82d670bb0302db731dfb224ab227a344011059a7cf3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2013</creationdate><topic>Central processing units</topic><topic>Fourier transformations</topic><topic>Graphics coprocessors</topic><topic>Microprocessors</topic><topic>Parallel processing</topic><topic>Potential theory (Mathematics)</topic><topic>Properties</topic><topic>Time-domain analysis</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Stefanski, Tomasz P</creatorcontrib><collection>CrossRef</collection><collection>Gale In Context: Science</collection><jtitle>Electromagnetic waves (Cambridge, Mass.)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Stefanski, Tomasz P</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>IMPLEMENTATION OF FDTD-COMPATIBLE GREEN'S FUNCTION ON HETEROGENEOUS CPU-GPU PARALLEL PROCESSING SYSTEM</atitle><jtitle>Electromagnetic waves (Cambridge, Mass.)</jtitle><date>2013-03-01</date><risdate>2013</risdate><volume>135</volume><spage>297</spage><epage>316</epage><pages>297-316</pages><issn>1559-8985</issn><issn>1070-4698</issn><eissn>1559-8985</eissn><abstract>This paper presents an implementation of the FDTD-compatible Green's function on a heterogeneous parallel processing system. The developed implementation simultaneously utilizes computational power of the central processing unit (CPU) and the graphics processing unit (GPU) to the computational tasks best suited for each architecture. Recently, closed-form expression for this discrete Green's function (DGF) was derived, which facilitates its applications in the FDTD simulations of radiation and scattering problems. Unfortunately, implementation of the new DGF formula in software requires a multiple precision arithmetic and may cause long runtimes. Therefore, an acceleration of the DGF computations on a CPU-GPU heterogeneous parallel processing system was developed using the multiple precision arithmetic and the OpenMP and CUDA parallel programming interfaces. The method avoids drawbacks of the CPU-and GPU-only accelerated implementations of the DGF, i.e., long runtime on the CPU and significant overhead of the GPU initialization respectively for long and short length of the DGF waveform. As a result, the sevenfold speedup was obtained relative to the reference DGF implementation on a multicore CPU thus applicability of the DGF in FDTD simulations was significantly improved.</abstract><pub>Electromagnetics Academy</pub><doi>10.2528/PIER12111702</doi><tpages>20</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1559-8985 |
ispartof | Electromagnetic waves (Cambridge, Mass.), 2013-03, Vol.135, p.297-316 |
issn | 1559-8985 1070-4698 1559-8985 |
language | eng |
recordid | cdi_gale_infotracmisc_A348311388 |
source | Publicly Available Content (ProQuest); EZB Electronic Journals Library |
subjects | Central processing units Fourier transformations Graphics coprocessors Microprocessors Parallel processing Potential theory (Mathematics) Properties Time-domain analysis |
title | IMPLEMENTATION OF FDTD-COMPATIBLE GREEN'S FUNCTION ON HETEROGENEOUS CPU-GPU PARALLEL PROCESSING SYSTEM |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-07T23%3A02%3A56IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=IMPLEMENTATION%20OF%20FDTD-COMPATIBLE%20GREEN'S%20FUNCTION%20ON%20HETEROGENEOUS%20CPU-GPU%20PARALLEL%20PROCESSING%20SYSTEM&rft.jtitle=Electromagnetic%20waves%20(Cambridge,%20Mass.)&rft.au=Stefanski,%20Tomasz%20P&rft.date=2013-03-01&rft.volume=135&rft.spage=297&rft.epage=316&rft.pages=297-316&rft.issn=1559-8985&rft.eissn=1559-8985&rft_id=info:doi/10.2528/PIER12111702&rft_dat=%3Cgale_cross%3EA348311388%3C/gale_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c374t-c681a357fc010e65f563ad82d670bb0302db731dfb224ab227a344011059a7cf3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rft_galeid=A348311388&rfr_iscdi=true |