Loading…

GPU optimized computation of stencil based algorithms

The paper describes an optimized GPU based approach for stencil based algorithms. The simulations have been performed for a two dimensional steady state heat conduction problem, which has been solved through the red black point successive over relaxation method. Two kernels have been developed and t...

Full description

Saved in:
Bibliographic Details
Main Authors: Itu, L. M., Suciu, C., Moldoveanu, F., Postelnicu, A.
Format: Conference Proceeding
Language:eng ; jpn
Subjects:
Online Access:Request full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites
container_end_page 6
container_issue
container_start_page 1
container_title
container_volume
creator Itu, L. M.
Suciu, C.
Moldoveanu, F.
Postelnicu, A.
description The paper describes an optimized GPU based approach for stencil based algorithms. The simulations have been performed for a two dimensional steady state heat conduction problem, which has been solved through the red black point successive over relaxation method. Two kernels have been developed and their performance has been greatly improved through coalesced memory accesses and special shared memory approaches. The approach described in the paper does not only represent a step forward for the steady state heat conduction problem but also for any other algorithm which performs the numerical solution of partial differential equations or which is stencil based. The paper not only describes the various code versions but also the process which has lead to these improvements. Also the optimized GPU code version has been compared with the corresponding CPU version. The testing results show that the GPU algorithm always leads to an improvement. The value of the improvement though greatly depends on the number of grid points on which the computations are performed.
doi_str_mv 10.1109/RoEduNet.2011.5993693
format conference_proceeding
fullrecord <record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_5993693</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>5993693</ieee_id><sourcerecordid>5993693</sourcerecordid><originalsourceid>FETCH-LOGICAL-i156t-f4210c200dd3e876052bfce985c0d478065cab9b0c12589ac44f5822745de1893</originalsourceid><addsrcrecordid>eNpVj81KAzEURiMqWOo8gQjzAjPem_8spdRWKCpi1yWTZDTSaYZJutCnV7AbVx-HAwc-Qm4RWkQwd69p6Y9PobQUEFthDJOGnZHKKI1cKIWUCXX-jxm7IDMKUjcITF-RKudPAEAjlaRiRsTqZVunscQhfgdfuzSMx2JLTIc69XUu4eDivu5s_pV2_56mWD6GfE0ue7vPoTrtnGwflm-LdbN5Xj0u7jdNRCFL03OK4CiA9yxoJUHQrnfBaOHAc6VBCmc704FDKrSxjvNeaEoVFz6gNmxObv66MYSwG6c42OlrdzrOfgDH6Eph</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>GPU optimized computation of stencil based algorithms</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Itu, L. M. ; Suciu, C. ; Moldoveanu, F. ; Postelnicu, A.</creator><creatorcontrib>Itu, L. M. ; Suciu, C. ; Moldoveanu, F. ; Postelnicu, A.</creatorcontrib><description>The paper describes an optimized GPU based approach for stencil based algorithms. The simulations have been performed for a two dimensional steady state heat conduction problem, which has been solved through the red black point successive over relaxation method. Two kernels have been developed and their performance has been greatly improved through coalesced memory accesses and special shared memory approaches. The approach described in the paper does not only represent a step forward for the steady state heat conduction problem but also for any other algorithm which performs the numerical solution of partial differential equations or which is stencil based. The paper not only describes the various code versions but also the process which has lead to these improvements. Also the optimized GPU code version has been compared with the corresponding CPU version. The testing results show that the GPU algorithm always leads to an improvement. The value of the improvement though greatly depends on the number of grid points on which the computations are performed.</description><identifier>ISSN: 2068-1038</identifier><identifier>ISBN: 9781457712333</identifier><identifier>ISBN: 1457712334</identifier><identifier>EISBN: 9781457712357</identifier><identifier>EISBN: 1457712342</identifier><identifier>EISBN: 9781457712340</identifier><identifier>EISBN: 1457712350</identifier><identifier>DOI: 10.1109/RoEduNet.2011.5993693</identifier><language>eng ; jpn</language><publisher>IEEE</publisher><subject>GPU ; Graphics processing unit ; heat conduction ; Heating ; Instruction sets ; Kernel ; optimization ; Performance evaluation ; shared memory ; speed-up ; Steady-state ; successive over relaxation ; Throughput</subject><ispartof>2011 RoEduNet International Conference 10th Edition: Networking in Education and Research, 2011, p.1-6</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/5993693$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,776,780,785,786,2052,27902,54530,54895,54907</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/5993693$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Itu, L. M.</creatorcontrib><creatorcontrib>Suciu, C.</creatorcontrib><creatorcontrib>Moldoveanu, F.</creatorcontrib><creatorcontrib>Postelnicu, A.</creatorcontrib><title>GPU optimized computation of stencil based algorithms</title><title>2011 RoEduNet International Conference 10th Edition: Networking in Education and Research</title><addtitle>RoEduNet</addtitle><description>The paper describes an optimized GPU based approach for stencil based algorithms. The simulations have been performed for a two dimensional steady state heat conduction problem, which has been solved through the red black point successive over relaxation method. Two kernels have been developed and their performance has been greatly improved through coalesced memory accesses and special shared memory approaches. The approach described in the paper does not only represent a step forward for the steady state heat conduction problem but also for any other algorithm which performs the numerical solution of partial differential equations or which is stencil based. The paper not only describes the various code versions but also the process which has lead to these improvements. Also the optimized GPU code version has been compared with the corresponding CPU version. The testing results show that the GPU algorithm always leads to an improvement. The value of the improvement though greatly depends on the number of grid points on which the computations are performed.</description><subject>GPU</subject><subject>Graphics processing unit</subject><subject>heat conduction</subject><subject>Heating</subject><subject>Instruction sets</subject><subject>Kernel</subject><subject>optimization</subject><subject>Performance evaluation</subject><subject>shared memory</subject><subject>speed-up</subject><subject>Steady-state</subject><subject>successive over relaxation</subject><subject>Throughput</subject><issn>2068-1038</issn><isbn>9781457712333</isbn><isbn>1457712334</isbn><isbn>9781457712357</isbn><isbn>1457712342</isbn><isbn>9781457712340</isbn><isbn>1457712350</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2011</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><recordid>eNpVj81KAzEURiMqWOo8gQjzAjPem_8spdRWKCpi1yWTZDTSaYZJutCnV7AbVx-HAwc-Qm4RWkQwd69p6Y9PobQUEFthDJOGnZHKKI1cKIWUCXX-jxm7IDMKUjcITF-RKudPAEAjlaRiRsTqZVunscQhfgdfuzSMx2JLTIc69XUu4eDivu5s_pV2_56mWD6GfE0ue7vPoTrtnGwflm-LdbN5Xj0u7jdNRCFL03OK4CiA9yxoJUHQrnfBaOHAc6VBCmc704FDKrSxjvNeaEoVFz6gNmxObv66MYSwG6c42OlrdzrOfgDH6Eph</recordid><startdate>201106</startdate><enddate>201106</enddate><creator>Itu, L. M.</creator><creator>Suciu, C.</creator><creator>Moldoveanu, F.</creator><creator>Postelnicu, A.</creator><general>IEEE</general><scope>6IE</scope><scope>6IL</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIL</scope></search><sort><creationdate>201106</creationdate><title>GPU optimized computation of stencil based algorithms</title><author>Itu, L. M. ; Suciu, C. ; Moldoveanu, F. ; Postelnicu, A.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i156t-f4210c200dd3e876052bfce985c0d478065cab9b0c12589ac44f5822745de1893</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng ; jpn</language><creationdate>2011</creationdate><topic>GPU</topic><topic>Graphics processing unit</topic><topic>heat conduction</topic><topic>Heating</topic><topic>Instruction sets</topic><topic>Kernel</topic><topic>optimization</topic><topic>Performance evaluation</topic><topic>shared memory</topic><topic>speed-up</topic><topic>Steady-state</topic><topic>successive over relaxation</topic><topic>Throughput</topic><toplevel>online_resources</toplevel><creatorcontrib>Itu, L. M.</creatorcontrib><creatorcontrib>Suciu, C.</creatorcontrib><creatorcontrib>Moldoveanu, F.</creatorcontrib><creatorcontrib>Postelnicu, A.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Xplore</collection><collection>IEEE Proceedings Order Plans (POP All) 1998-Present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Itu, L. M.</au><au>Suciu, C.</au><au>Moldoveanu, F.</au><au>Postelnicu, A.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>GPU optimized computation of stencil based algorithms</atitle><btitle>2011 RoEduNet International Conference 10th Edition: Networking in Education and Research</btitle><stitle>RoEduNet</stitle><date>2011-06</date><risdate>2011</risdate><spage>1</spage><epage>6</epage><pages>1-6</pages><issn>2068-1038</issn><isbn>9781457712333</isbn><isbn>1457712334</isbn><eisbn>9781457712357</eisbn><eisbn>1457712342</eisbn><eisbn>9781457712340</eisbn><eisbn>1457712350</eisbn><abstract>The paper describes an optimized GPU based approach for stencil based algorithms. The simulations have been performed for a two dimensional steady state heat conduction problem, which has been solved through the red black point successive over relaxation method. Two kernels have been developed and their performance has been greatly improved through coalesced memory accesses and special shared memory approaches. The approach described in the paper does not only represent a step forward for the steady state heat conduction problem but also for any other algorithm which performs the numerical solution of partial differential equations or which is stencil based. The paper not only describes the various code versions but also the process which has lead to these improvements. Also the optimized GPU code version has been compared with the corresponding CPU version. The testing results show that the GPU algorithm always leads to an improvement. The value of the improvement though greatly depends on the number of grid points on which the computations are performed.</abstract><pub>IEEE</pub><doi>10.1109/RoEduNet.2011.5993693</doi><tpages>6</tpages></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 2068-1038
ispartof 2011 RoEduNet International Conference 10th Edition: Networking in Education and Research, 2011, p.1-6
issn 2068-1038
language eng ; jpn
recordid cdi_ieee_primary_5993693
source IEEE Electronic Library (IEL) Conference Proceedings
subjects GPU
Graphics processing unit
heat conduction
Heating
Instruction sets
Kernel
optimization
Performance evaluation
shared memory
speed-up
Steady-state
successive over relaxation
Throughput
title GPU optimized computation of stencil based algorithms
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-08T06%3A38%3A45IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=GPU%20optimized%20computation%20of%20stencil%20based%20algorithms&rft.btitle=2011%20RoEduNet%20International%20Conference%2010th%20Edition:%20Networking%20in%20Education%20and%20Research&rft.au=Itu,%20L.%20M.&rft.date=2011-06&rft.spage=1&rft.epage=6&rft.pages=1-6&rft.issn=2068-1038&rft.isbn=9781457712333&rft.isbn_list=1457712334&rft_id=info:doi/10.1109/RoEduNet.2011.5993693&rft.eisbn=9781457712357&rft.eisbn_list=1457712342&rft.eisbn_list=9781457712340&rft.eisbn_list=1457712350&rft_dat=%3Cieee_6IE%3E5993693%3C/ieee_6IE%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-i156t-f4210c200dd3e876052bfce985c0d478065cab9b0c12589ac44f5822745de1893%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=5993693&rfr_iscdi=true