Loading…

The Design of NoC-Side Memory Access Scheduling for Energy-Efficient GPGPUs

Memory access scheduling schemes, often performed in memory controllers, have a marked impact on alleviating the heavy burden placed on memory systems of GPGPUs. Existing out-of-order scheduling schemes, like FR-FCFS, improve memory access efficiency by reordering memory request sequences at the des...

Full description

Saved in:
Bibliographic Details
Published in:International journal of parallel programming 2018-08, Vol.46 (4), p.722-735
Main Authors: Liu, Wenjie, Ma, Sheng, Huang, Libo, Wang, Zhiying
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites cdi_FETCH-LOGICAL-c268t-859a00e813dc03b77378e31a8de3af5537a81f120ac9e733c6569bf7323af8f13
container_end_page 735
container_issue 4
container_start_page 722
container_title International journal of parallel programming
container_volume 46
creator Liu, Wenjie
Ma, Sheng
Huang, Libo
Wang, Zhiying
description Memory access scheduling schemes, often performed in memory controllers, have a marked impact on alleviating the heavy burden placed on memory systems of GPGPUs. Existing out-of-order scheduling schemes, like FR-FCFS, improve memory access efficiency by reordering memory request sequences at the destination. Their effectiveness, however, is at the expense of complex logics and high power consumption. In this paper, we propose a NoC-side memory access scheduling based on the key insight that the transmission of on-chip networks is the dominating factor in destroying the row access locality and causing poor memory access efficiency. With appropriate NoC-side optimization, the straight-forward in-order scheduling can be used in memory controllers to simplify scheduling logics and alleviate the tight power envelope. Moreover, we introduce several light-weight optimizations to further improve the system performance. Experimental results on memory-intensive applications show that, comparing with FR-FCFS, our proposed scheme increases the overall system performance by 10.5%, reduces the power consumption by 20% and improves the energy efficiency by 36.9%.
doi_str_mv 10.1007/s10766-017-0521-2
format article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2089088615</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2089088615</sourcerecordid><originalsourceid>FETCH-LOGICAL-c268t-859a00e813dc03b77378e31a8de3af5537a81f120ac9e733c6569bf7323af8f13</originalsourceid><addsrcrecordid>eNp1kMFOAjEQhhujiYg-gLcmnqvTlm67R4KIRlQS4Nws3emyBHaxhQNvb8maePI0h_m_fzIfIfccHjmAfoocdJYx4JqBEpyJC9LjSkumswFckh4Yo5geKHNNbmLcAECujemR98Ua6TPGumpo6-lnO2LzukT6gbs2nOjQOYyRzt0ay-O2birq20DHDYbqxMbe167G5kAns8lsGW_JlS-2Ee9-Z58sX8aL0Subfk3eRsMpcyIzB2ZUXgCg4bJ0IFdaS21Q8sKUKAuvlNSF4Z4LKFyOWkqXqSxfeS1FWhvPZZ88dL370H4fMR7spj2GJp20AkyeXs24SinepVxoYwzo7T7UuyKcLAd7dmY7ZzY5s2dnViRGdExM2abC8Nf8P_QDFLBsXg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2089088615</pqid></control><display><type>article</type><title>The Design of NoC-Side Memory Access Scheduling for Energy-Efficient GPGPUs</title><source>ABI/INFORM Global</source><source>Springer Nature</source><creator>Liu, Wenjie ; Ma, Sheng ; Huang, Libo ; Wang, Zhiying</creator><creatorcontrib>Liu, Wenjie ; Ma, Sheng ; Huang, Libo ; Wang, Zhiying</creatorcontrib><description>Memory access scheduling schemes, often performed in memory controllers, have a marked impact on alleviating the heavy burden placed on memory systems of GPGPUs. Existing out-of-order scheduling schemes, like FR-FCFS, improve memory access efficiency by reordering memory request sequences at the destination. Their effectiveness, however, is at the expense of complex logics and high power consumption. In this paper, we propose a NoC-side memory access scheduling based on the key insight that the transmission of on-chip networks is the dominating factor in destroying the row access locality and causing poor memory access efficiency. With appropriate NoC-side optimization, the straight-forward in-order scheduling can be used in memory controllers to simplify scheduling logics and alleviate the tight power envelope. Moreover, we introduce several light-weight optimizations to further improve the system performance. Experimental results on memory-intensive applications show that, comparing with FR-FCFS, our proposed scheme increases the overall system performance by 10.5%, reduces the power consumption by 20% and improves the energy efficiency by 36.9%.</description><identifier>ISSN: 0885-7458</identifier><identifier>EISSN: 1573-7640</identifier><identifier>DOI: 10.1007/s10766-017-0521-2</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Computer memory ; Computer Science ; Connectivity ; Controllers ; Efficiency ; Energy efficiency ; Power consumption ; Processor Architectures ; Scheduling ; Servers ; Software Engineering/Programming and Operating Systems ; Special issue on Network and Parallel Computing for New Architectures and Applications ; Theory of Computation ; Weight reduction</subject><ispartof>International journal of parallel programming, 2018-08, Vol.46 (4), p.722-735</ispartof><rights>Springer Science+Business Media, LLC 2017</rights><rights>International Journal of Parallel Programming is a copyright of Springer, (2017). All Rights Reserved.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c268t-859a00e813dc03b77378e31a8de3af5537a81f120ac9e733c6569bf7323af8f13</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.proquest.com/docview/2089088615/fulltextPDF?pq-origsite=primo$$EPDF$$P50$$Gproquest$$H</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/2089088615?pq-origsite=primo$$EHTML$$P50$$Gproquest$$H</linktohtml><link.rule.ids>314,780,784,11688,27924,27925,36060,44363,74895</link.rule.ids></links><search><creatorcontrib>Liu, Wenjie</creatorcontrib><creatorcontrib>Ma, Sheng</creatorcontrib><creatorcontrib>Huang, Libo</creatorcontrib><creatorcontrib>Wang, Zhiying</creatorcontrib><title>The Design of NoC-Side Memory Access Scheduling for Energy-Efficient GPGPUs</title><title>International journal of parallel programming</title><addtitle>Int J Parallel Prog</addtitle><description>Memory access scheduling schemes, often performed in memory controllers, have a marked impact on alleviating the heavy burden placed on memory systems of GPGPUs. Existing out-of-order scheduling schemes, like FR-FCFS, improve memory access efficiency by reordering memory request sequences at the destination. Their effectiveness, however, is at the expense of complex logics and high power consumption. In this paper, we propose a NoC-side memory access scheduling based on the key insight that the transmission of on-chip networks is the dominating factor in destroying the row access locality and causing poor memory access efficiency. With appropriate NoC-side optimization, the straight-forward in-order scheduling can be used in memory controllers to simplify scheduling logics and alleviate the tight power envelope. Moreover, we introduce several light-weight optimizations to further improve the system performance. Experimental results on memory-intensive applications show that, comparing with FR-FCFS, our proposed scheme increases the overall system performance by 10.5%, reduces the power consumption by 20% and improves the energy efficiency by 36.9%.</description><subject>Computer memory</subject><subject>Computer Science</subject><subject>Connectivity</subject><subject>Controllers</subject><subject>Efficiency</subject><subject>Energy efficiency</subject><subject>Power consumption</subject><subject>Processor Architectures</subject><subject>Scheduling</subject><subject>Servers</subject><subject>Software Engineering/Programming and Operating Systems</subject><subject>Special issue on Network and Parallel Computing for New Architectures and Applications</subject><subject>Theory of Computation</subject><subject>Weight reduction</subject><issn>0885-7458</issn><issn>1573-7640</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><sourceid>M0C</sourceid><recordid>eNp1kMFOAjEQhhujiYg-gLcmnqvTlm67R4KIRlQS4Nws3emyBHaxhQNvb8maePI0h_m_fzIfIfccHjmAfoocdJYx4JqBEpyJC9LjSkumswFckh4Yo5geKHNNbmLcAECujemR98Ua6TPGumpo6-lnO2LzukT6gbs2nOjQOYyRzt0ay-O2birq20DHDYbqxMbe167G5kAns8lsGW_JlS-2Ee9-Z58sX8aL0Subfk3eRsMpcyIzB2ZUXgCg4bJ0IFdaS21Q8sKUKAuvlNSF4Z4LKFyOWkqXqSxfeS1FWhvPZZ88dL370H4fMR7spj2GJp20AkyeXs24SinepVxoYwzo7T7UuyKcLAd7dmY7ZzY5s2dnViRGdExM2abC8Nf8P_QDFLBsXg</recordid><startdate>20180801</startdate><enddate>20180801</enddate><creator>Liu, Wenjie</creator><creator>Ma, Sheng</creator><creator>Huang, Libo</creator><creator>Wang, Zhiying</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7SC</scope><scope>7WY</scope><scope>7WZ</scope><scope>7XB</scope><scope>87Z</scope><scope>8AL</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FK</scope><scope>8FL</scope><scope>8G5</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BEZIV</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FRNLG</scope><scope>F~G</scope><scope>GNUQQ</scope><scope>GUQSH</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K60</scope><scope>K6~</scope><scope>K7-</scope><scope>L.-</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>M0C</scope><scope>M0N</scope><scope>M2O</scope><scope>MBDVC</scope><scope>P5Z</scope><scope>P62</scope><scope>PQBIZ</scope><scope>PQBZA</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>Q9U</scope></search><sort><creationdate>20180801</creationdate><title>The Design of NoC-Side Memory Access Scheduling for Energy-Efficient GPGPUs</title><author>Liu, Wenjie ; Ma, Sheng ; Huang, Libo ; Wang, Zhiying</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c268t-859a00e813dc03b77378e31a8de3af5537a81f120ac9e733c6569bf7323af8f13</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>Computer memory</topic><topic>Computer Science</topic><topic>Connectivity</topic><topic>Controllers</topic><topic>Efficiency</topic><topic>Energy efficiency</topic><topic>Power consumption</topic><topic>Processor Architectures</topic><topic>Scheduling</topic><topic>Servers</topic><topic>Software Engineering/Programming and Operating Systems</topic><topic>Special issue on Network and Parallel Computing for New Architectures and Applications</topic><topic>Theory of Computation</topic><topic>Weight reduction</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Liu, Wenjie</creatorcontrib><creatorcontrib>Ma, Sheng</creatorcontrib><creatorcontrib>Huang, Libo</creatorcontrib><creatorcontrib>Wang, Zhiying</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Computer and Information Systems Abstracts</collection><collection>ABI/INFORM Complete</collection><collection>ABI/INFORM Global (PDF only)</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>ABI/INFORM Global (Alumni Edition)</collection><collection>Computing Database (Alumni Edition)</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ABI/INFORM Collection (Alumni Edition)</collection><collection>Research Library (Alumni Edition)</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Business Premium Collection</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Business Premium Collection (Alumni)</collection><collection>ABI/INFORM Global (Corporate)</collection><collection>ProQuest Central Student</collection><collection>Research Library Prep</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>ProQuest Business Collection (Alumni Edition)</collection><collection>ProQuest Business Collection</collection><collection>Computer Science Database</collection><collection>ABI/INFORM Professional Advanced</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>ABI/INFORM Global</collection><collection>Computing Database</collection><collection>ProQuest research library</collection><collection>Research Library (Corporate)</collection><collection>Advanced Technologies &amp; Aerospace Database</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest One Business</collection><collection>ProQuest One Business (Alumni)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central Basic</collection><jtitle>International journal of parallel programming</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Liu, Wenjie</au><au>Ma, Sheng</au><au>Huang, Libo</au><au>Wang, Zhiying</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>The Design of NoC-Side Memory Access Scheduling for Energy-Efficient GPGPUs</atitle><jtitle>International journal of parallel programming</jtitle><stitle>Int J Parallel Prog</stitle><date>2018-08-01</date><risdate>2018</risdate><volume>46</volume><issue>4</issue><spage>722</spage><epage>735</epage><pages>722-735</pages><issn>0885-7458</issn><eissn>1573-7640</eissn><abstract>Memory access scheduling schemes, often performed in memory controllers, have a marked impact on alleviating the heavy burden placed on memory systems of GPGPUs. Existing out-of-order scheduling schemes, like FR-FCFS, improve memory access efficiency by reordering memory request sequences at the destination. Their effectiveness, however, is at the expense of complex logics and high power consumption. In this paper, we propose a NoC-side memory access scheduling based on the key insight that the transmission of on-chip networks is the dominating factor in destroying the row access locality and causing poor memory access efficiency. With appropriate NoC-side optimization, the straight-forward in-order scheduling can be used in memory controllers to simplify scheduling logics and alleviate the tight power envelope. Moreover, we introduce several light-weight optimizations to further improve the system performance. Experimental results on memory-intensive applications show that, comparing with FR-FCFS, our proposed scheme increases the overall system performance by 10.5%, reduces the power consumption by 20% and improves the energy efficiency by 36.9%.</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s10766-017-0521-2</doi><tpages>14</tpages></addata></record>
fulltext fulltext
identifier ISSN: 0885-7458
ispartof International journal of parallel programming, 2018-08, Vol.46 (4), p.722-735
issn 0885-7458
1573-7640
language eng
recordid cdi_proquest_journals_2089088615
source ABI/INFORM Global; Springer Nature
subjects Computer memory
Computer Science
Connectivity
Controllers
Efficiency
Energy efficiency
Power consumption
Processor Architectures
Scheduling
Servers
Software Engineering/Programming and Operating Systems
Special issue on Network and Parallel Computing for New Architectures and Applications
Theory of Computation
Weight reduction
title The Design of NoC-Side Memory Access Scheduling for Energy-Efficient GPGPUs
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-29T19%3A23%3A02IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=The%20Design%20of%20NoC-Side%20Memory%20Access%20Scheduling%20for%20Energy-Efficient%20GPGPUs&rft.jtitle=International%20journal%20of%20parallel%20programming&rft.au=Liu,%20Wenjie&rft.date=2018-08-01&rft.volume=46&rft.issue=4&rft.spage=722&rft.epage=735&rft.pages=722-735&rft.issn=0885-7458&rft.eissn=1573-7640&rft_id=info:doi/10.1007/s10766-017-0521-2&rft_dat=%3Cproquest_cross%3E2089088615%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c268t-859a00e813dc03b77378e31a8de3af5537a81f120ac9e733c6569bf7323af8f13%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2089088615&rft_id=info:pmid/&rfr_iscdi=true