Loading…
The Design of NoC-Side Memory Access Scheduling for Energy-Efficient GPGPUs
Memory access scheduling schemes, often performed in memory controllers, have a marked impact on alleviating the heavy burden placed on memory systems of GPGPUs. Existing out-of-order scheduling schemes, like FR-FCFS, improve memory access efficiency by reordering memory request sequences at the des...
Saved in:
Published in: | International journal of parallel programming 2018-08, Vol.46 (4), p.722-735 |
---|---|
Main Authors: | , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | |
---|---|
cites | cdi_FETCH-LOGICAL-c268t-859a00e813dc03b77378e31a8de3af5537a81f120ac9e733c6569bf7323af8f13 |
container_end_page | 735 |
container_issue | 4 |
container_start_page | 722 |
container_title | International journal of parallel programming |
container_volume | 46 |
creator | Liu, Wenjie Ma, Sheng Huang, Libo Wang, Zhiying |
description | Memory access scheduling schemes, often performed in memory controllers, have a marked impact on alleviating the heavy burden placed on memory systems of GPGPUs. Existing out-of-order scheduling schemes, like FR-FCFS, improve memory access efficiency by reordering memory request sequences at the destination. Their effectiveness, however, is at the expense of complex logics and high power consumption. In this paper, we propose a NoC-side memory access scheduling based on the key insight that the transmission of on-chip networks is the dominating factor in destroying the row access locality and causing poor memory access efficiency. With appropriate NoC-side optimization, the straight-forward in-order scheduling can be used in memory controllers to simplify scheduling logics and alleviate the tight power envelope. Moreover, we introduce several light-weight optimizations to further improve the system performance. Experimental results on memory-intensive applications show that, comparing with FR-FCFS, our proposed scheme increases the overall system performance by 10.5%, reduces the power consumption by 20% and improves the energy efficiency by 36.9%. |
doi_str_mv | 10.1007/s10766-017-0521-2 |
format | article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2089088615</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2089088615</sourcerecordid><originalsourceid>FETCH-LOGICAL-c268t-859a00e813dc03b77378e31a8de3af5537a81f120ac9e733c6569bf7323af8f13</originalsourceid><addsrcrecordid>eNp1kMFOAjEQhhujiYg-gLcmnqvTlm67R4KIRlQS4Nws3emyBHaxhQNvb8maePI0h_m_fzIfIfccHjmAfoocdJYx4JqBEpyJC9LjSkumswFckh4Yo5geKHNNbmLcAECujemR98Ua6TPGumpo6-lnO2LzukT6gbs2nOjQOYyRzt0ay-O2birq20DHDYbqxMbe167G5kAns8lsGW_JlS-2Ee9-Z58sX8aL0Subfk3eRsMpcyIzB2ZUXgCg4bJ0IFdaS21Q8sKUKAuvlNSF4Z4LKFyOWkqXqSxfeS1FWhvPZZ88dL370H4fMR7spj2GJp20AkyeXs24SinepVxoYwzo7T7UuyKcLAd7dmY7ZzY5s2dnViRGdExM2abC8Nf8P_QDFLBsXg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2089088615</pqid></control><display><type>article</type><title>The Design of NoC-Side Memory Access Scheduling for Energy-Efficient GPGPUs</title><source>ABI/INFORM Global</source><source>Springer Nature</source><creator>Liu, Wenjie ; Ma, Sheng ; Huang, Libo ; Wang, Zhiying</creator><creatorcontrib>Liu, Wenjie ; Ma, Sheng ; Huang, Libo ; Wang, Zhiying</creatorcontrib><description>Memory access scheduling schemes, often performed in memory controllers, have a marked impact on alleviating the heavy burden placed on memory systems of GPGPUs. Existing out-of-order scheduling schemes, like FR-FCFS, improve memory access efficiency by reordering memory request sequences at the destination. Their effectiveness, however, is at the expense of complex logics and high power consumption. In this paper, we propose a NoC-side memory access scheduling based on the key insight that the transmission of on-chip networks is the dominating factor in destroying the row access locality and causing poor memory access efficiency. With appropriate NoC-side optimization, the straight-forward in-order scheduling can be used in memory controllers to simplify scheduling logics and alleviate the tight power envelope. Moreover, we introduce several light-weight optimizations to further improve the system performance. Experimental results on memory-intensive applications show that, comparing with FR-FCFS, our proposed scheme increases the overall system performance by 10.5%, reduces the power consumption by 20% and improves the energy efficiency by 36.9%.</description><identifier>ISSN: 0885-7458</identifier><identifier>EISSN: 1573-7640</identifier><identifier>DOI: 10.1007/s10766-017-0521-2</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Computer memory ; Computer Science ; Connectivity ; Controllers ; Efficiency ; Energy efficiency ; Power consumption ; Processor Architectures ; Scheduling ; Servers ; Software Engineering/Programming and Operating Systems ; Special issue on Network and Parallel Computing for New Architectures and Applications ; Theory of Computation ; Weight reduction</subject><ispartof>International journal of parallel programming, 2018-08, Vol.46 (4), p.722-735</ispartof><rights>Springer Science+Business Media, LLC 2017</rights><rights>International Journal of Parallel Programming is a copyright of Springer, (2017). All Rights Reserved.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c268t-859a00e813dc03b77378e31a8de3af5537a81f120ac9e733c6569bf7323af8f13</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.proquest.com/docview/2089088615/fulltextPDF?pq-origsite=primo$$EPDF$$P50$$Gproquest$$H</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/2089088615?pq-origsite=primo$$EHTML$$P50$$Gproquest$$H</linktohtml><link.rule.ids>314,780,784,11688,27924,27925,36060,44363,74895</link.rule.ids></links><search><creatorcontrib>Liu, Wenjie</creatorcontrib><creatorcontrib>Ma, Sheng</creatorcontrib><creatorcontrib>Huang, Libo</creatorcontrib><creatorcontrib>Wang, Zhiying</creatorcontrib><title>The Design of NoC-Side Memory Access Scheduling for Energy-Efficient GPGPUs</title><title>International journal of parallel programming</title><addtitle>Int J Parallel Prog</addtitle><description>Memory access scheduling schemes, often performed in memory controllers, have a marked impact on alleviating the heavy burden placed on memory systems of GPGPUs. Existing out-of-order scheduling schemes, like FR-FCFS, improve memory access efficiency by reordering memory request sequences at the destination. Their effectiveness, however, is at the expense of complex logics and high power consumption. In this paper, we propose a NoC-side memory access scheduling based on the key insight that the transmission of on-chip networks is the dominating factor in destroying the row access locality and causing poor memory access efficiency. With appropriate NoC-side optimization, the straight-forward in-order scheduling can be used in memory controllers to simplify scheduling logics and alleviate the tight power envelope. Moreover, we introduce several light-weight optimizations to further improve the system performance. Experimental results on memory-intensive applications show that, comparing with FR-FCFS, our proposed scheme increases the overall system performance by 10.5%, reduces the power consumption by 20% and improves the energy efficiency by 36.9%.</description><subject>Computer memory</subject><subject>Computer Science</subject><subject>Connectivity</subject><subject>Controllers</subject><subject>Efficiency</subject><subject>Energy efficiency</subject><subject>Power consumption</subject><subject>Processor Architectures</subject><subject>Scheduling</subject><subject>Servers</subject><subject>Software Engineering/Programming and Operating Systems</subject><subject>Special issue on Network and Parallel Computing for New Architectures and Applications</subject><subject>Theory of Computation</subject><subject>Weight reduction</subject><issn>0885-7458</issn><issn>1573-7640</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><sourceid>M0C</sourceid><recordid>eNp1kMFOAjEQhhujiYg-gLcmnqvTlm67R4KIRlQS4Nws3emyBHaxhQNvb8maePI0h_m_fzIfIfccHjmAfoocdJYx4JqBEpyJC9LjSkumswFckh4Yo5geKHNNbmLcAECujemR98Ua6TPGumpo6-lnO2LzukT6gbs2nOjQOYyRzt0ay-O2birq20DHDYbqxMbe167G5kAns8lsGW_JlS-2Ee9-Z58sX8aL0Subfk3eRsMpcyIzB2ZUXgCg4bJ0IFdaS21Q8sKUKAuvlNSF4Z4LKFyOWkqXqSxfeS1FWhvPZZ88dL370H4fMR7spj2GJp20AkyeXs24SinepVxoYwzo7T7UuyKcLAd7dmY7ZzY5s2dnViRGdExM2abC8Nf8P_QDFLBsXg</recordid><startdate>20180801</startdate><enddate>20180801</enddate><creator>Liu, Wenjie</creator><creator>Ma, Sheng</creator><creator>Huang, Libo</creator><creator>Wang, Zhiying</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7SC</scope><scope>7WY</scope><scope>7WZ</scope><scope>7XB</scope><scope>87Z</scope><scope>8AL</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FK</scope><scope>8FL</scope><scope>8G5</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BEZIV</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FRNLG</scope><scope>F~G</scope><scope>GNUQQ</scope><scope>GUQSH</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K60</scope><scope>K6~</scope><scope>K7-</scope><scope>L.-</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>M0C</scope><scope>M0N</scope><scope>M2O</scope><scope>MBDVC</scope><scope>P5Z</scope><scope>P62</scope><scope>PQBIZ</scope><scope>PQBZA</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>Q9U</scope></search><sort><creationdate>20180801</creationdate><title>The Design of NoC-Side Memory Access Scheduling for Energy-Efficient GPGPUs</title><author>Liu, Wenjie ; Ma, Sheng ; Huang, Libo ; Wang, Zhiying</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c268t-859a00e813dc03b77378e31a8de3af5537a81f120ac9e733c6569bf7323af8f13</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>Computer memory</topic><topic>Computer Science</topic><topic>Connectivity</topic><topic>Controllers</topic><topic>Efficiency</topic><topic>Energy efficiency</topic><topic>Power consumption</topic><topic>Processor Architectures</topic><topic>Scheduling</topic><topic>Servers</topic><topic>Software Engineering/Programming and Operating Systems</topic><topic>Special issue on Network and Parallel Computing for New Architectures and Applications</topic><topic>Theory of Computation</topic><topic>Weight reduction</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Liu, Wenjie</creatorcontrib><creatorcontrib>Ma, Sheng</creatorcontrib><creatorcontrib>Huang, Libo</creatorcontrib><creatorcontrib>Wang, Zhiying</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Computer and Information Systems Abstracts</collection><collection>ABI/INFORM Complete</collection><collection>ABI/INFORM Global (PDF only)</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>ABI/INFORM Global (Alumni Edition)</collection><collection>Computing Database (Alumni Edition)</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ABI/INFORM Collection (Alumni Edition)</collection><collection>Research Library (Alumni Edition)</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>Advanced Technologies & Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Business Premium Collection</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Business Premium Collection (Alumni)</collection><collection>ABI/INFORM Global (Corporate)</collection><collection>ProQuest Central Student</collection><collection>Research Library Prep</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>ProQuest Business Collection (Alumni Edition)</collection><collection>ProQuest Business Collection</collection><collection>Computer Science Database</collection><collection>ABI/INFORM Professional Advanced</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>ABI/INFORM Global</collection><collection>Computing Database</collection><collection>ProQuest research library</collection><collection>Research Library (Corporate)</collection><collection>Advanced Technologies & Aerospace Database</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>ProQuest One Business</collection><collection>ProQuest One Business (Alumni)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central Basic</collection><jtitle>International journal of parallel programming</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Liu, Wenjie</au><au>Ma, Sheng</au><au>Huang, Libo</au><au>Wang, Zhiying</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>The Design of NoC-Side Memory Access Scheduling for Energy-Efficient GPGPUs</atitle><jtitle>International journal of parallel programming</jtitle><stitle>Int J Parallel Prog</stitle><date>2018-08-01</date><risdate>2018</risdate><volume>46</volume><issue>4</issue><spage>722</spage><epage>735</epage><pages>722-735</pages><issn>0885-7458</issn><eissn>1573-7640</eissn><abstract>Memory access scheduling schemes, often performed in memory controllers, have a marked impact on alleviating the heavy burden placed on memory systems of GPGPUs. Existing out-of-order scheduling schemes, like FR-FCFS, improve memory access efficiency by reordering memory request sequences at the destination. Their effectiveness, however, is at the expense of complex logics and high power consumption. In this paper, we propose a NoC-side memory access scheduling based on the key insight that the transmission of on-chip networks is the dominating factor in destroying the row access locality and causing poor memory access efficiency. With appropriate NoC-side optimization, the straight-forward in-order scheduling can be used in memory controllers to simplify scheduling logics and alleviate the tight power envelope. Moreover, we introduce several light-weight optimizations to further improve the system performance. Experimental results on memory-intensive applications show that, comparing with FR-FCFS, our proposed scheme increases the overall system performance by 10.5%, reduces the power consumption by 20% and improves the energy efficiency by 36.9%.</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s10766-017-0521-2</doi><tpages>14</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0885-7458 |
ispartof | International journal of parallel programming, 2018-08, Vol.46 (4), p.722-735 |
issn | 0885-7458 1573-7640 |
language | eng |
recordid | cdi_proquest_journals_2089088615 |
source | ABI/INFORM Global; Springer Nature |
subjects | Computer memory Computer Science Connectivity Controllers Efficiency Energy efficiency Power consumption Processor Architectures Scheduling Servers Software Engineering/Programming and Operating Systems Special issue on Network and Parallel Computing for New Architectures and Applications Theory of Computation Weight reduction |
title | The Design of NoC-Side Memory Access Scheduling for Energy-Efficient GPGPUs |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-29T19%3A23%3A02IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=The%20Design%20of%20NoC-Side%20Memory%20Access%20Scheduling%20for%20Energy-Efficient%20GPGPUs&rft.jtitle=International%20journal%20of%20parallel%20programming&rft.au=Liu,%20Wenjie&rft.date=2018-08-01&rft.volume=46&rft.issue=4&rft.spage=722&rft.epage=735&rft.pages=722-735&rft.issn=0885-7458&rft.eissn=1573-7640&rft_id=info:doi/10.1007/s10766-017-0521-2&rft_dat=%3Cproquest_cross%3E2089088615%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c268t-859a00e813dc03b77378e31a8de3af5537a81f120ac9e733c6569bf7323af8f13%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2089088615&rft_id=info:pmid/&rfr_iscdi=true |