Loading…

The Tag Filter Architecture: An energy-efficient cache and directory design

Power consumption in current high-performance chip multiprocessors (CMPs) has become a major design concern that aggravates with the current trend of increasing the core count. A significant fraction of the total power budget is consumed by on-chip caches which are usually deployed with a high assoc...

Full description

Saved in:
Bibliographic Details
Published in:Journal of parallel and distributed computing 2017-02, Vol.100, p.193-202
Main Authors: Valls, Joan J., Ros, Alberto, Gómez, María E., Sahuquillo, Julio
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c377t-f1296d06352e7a503e4e83bcb063e20dd7b3f6540cda0f30056bba4afb01d10f3
cites cdi_FETCH-LOGICAL-c377t-f1296d06352e7a503e4e83bcb063e20dd7b3f6540cda0f30056bba4afb01d10f3
container_end_page 202
container_issue
container_start_page 193
container_title Journal of parallel and distributed computing
container_volume 100
creator Valls, Joan J.
Ros, Alberto
Gómez, María E.
Sahuquillo, Julio
description Power consumption in current high-performance chip multiprocessors (CMPs) has become a major design concern that aggravates with the current trend of increasing the core count. A significant fraction of the total power budget is consumed by on-chip caches which are usually deployed with a high associativity degree (even L1 caches are being implemented with eight ways) to enhance the system performance. On a cache access, each way in the corresponding set is accessed in parallel, which is costly in terms of energy. On the other hand, coherence protocols also must implement efficient directory caches that scale in terms of power consumption. Most of the state-of-the-art techniques that reduce the energy consumption of directories are at the cost of performance, which may become unacceptable for high-performance CMPs. In this paper, we propose an energy-efficient architectural design that can be effectively applied to any kind of cache memory. The proposed approach, called the Tag Filter (TF) Architecture, filters the ways accessed in the target cache set, and just a few ways are searched in the tag and data arrays. This allows the approach to reduce the dynamic energy consumption of caches without hurting their access time. For this purpose, the proposed architecture holds the X least significant bits of each tag in a small auxiliary X-bit-wide array. These bits are used to filter the ways where the least significant bits of the tag do not match with the bits in the X-bit array. Experimental results show that, on average, the TF Architecture reduces the dynamic power consumption across the studied applications up to 74.9%, 85.9%, and 84.5% when applied to L1 caches, L2 caches, and directory caches, respectively. •Homogeneous distribution of the less significant bits of the tag across ways of cache sets.•A single bit of the address is enough to guarantee a tag mismatch.•We propose a mechanism to filter the ways accessed using the less significant bits of the address tag.•Dynamic power consumption in the processor cache is reduced up to 85.9%.•Dynamic power consumption in the directory cache is reduced up to 84.5%.
doi_str_mv 10.1016/j.jpdc.2016.04.016
format article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_1864531144</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0743731516300302</els_id><sourcerecordid>1864531144</sourcerecordid><originalsourceid>FETCH-LOGICAL-c377t-f1296d06352e7a503e4e83bcb063e20dd7b3f6540cda0f30056bba4afb01d10f3</originalsourceid><addsrcrecordid>eNp9kM1OwzAQhC0EEqXwApx85JKwjp2fIi5VRQFRiUs5W469bh2lSbFTpL49jsKZ06xG8600Q8g9g5QBKx6btDkanWbxTkGkUS7IjMGiSKAS1SWZQSl4UnKWX5ObEBoAxvKympGP7R7pVu3o2rUDerr0eu8G1MPJ4xNddhQ79LtzgtY67bAbqFY6Iqoz1Dgfg70_U4PB7bpbcmVVG_DuT-fka_2yXb0lm8_X99Vyk2helkNiWbYoDBQ8z7BUOXAUWPFa19HCDIwpa26LXIA2CiwHyIu6VkLZGphh0ZmTh-nv0fffJwyDPLigsW1Vh_0pSFYVIueMCRGj2RTVvg_Bo5VH7w7KnyUDOS4nGzkuJ8flJAgZJULPE4SxxI9DL8NYXeNUWJre_Yf_AsTZdq4</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1864531144</pqid></control><display><type>article</type><title>The Tag Filter Architecture: An energy-efficient cache and directory design</title><source>ScienceDirect Freedom Collection 2022-2024</source><creator>Valls, Joan J. ; Ros, Alberto ; Gómez, María E. ; Sahuquillo, Julio</creator><creatorcontrib>Valls, Joan J. ; Ros, Alberto ; Gómez, María E. ; Sahuquillo, Julio</creatorcontrib><description>Power consumption in current high-performance chip multiprocessors (CMPs) has become a major design concern that aggravates with the current trend of increasing the core count. A significant fraction of the total power budget is consumed by on-chip caches which are usually deployed with a high associativity degree (even L1 caches are being implemented with eight ways) to enhance the system performance. On a cache access, each way in the corresponding set is accessed in parallel, which is costly in terms of energy. On the other hand, coherence protocols also must implement efficient directory caches that scale in terms of power consumption. Most of the state-of-the-art techniques that reduce the energy consumption of directories are at the cost of performance, which may become unacceptable for high-performance CMPs. In this paper, we propose an energy-efficient architectural design that can be effectively applied to any kind of cache memory. The proposed approach, called the Tag Filter (TF) Architecture, filters the ways accessed in the target cache set, and just a few ways are searched in the tag and data arrays. This allows the approach to reduce the dynamic energy consumption of caches without hurting their access time. For this purpose, the proposed architecture holds the X least significant bits of each tag in a small auxiliary X-bit-wide array. These bits are used to filter the ways where the least significant bits of the tag do not match with the bits in the X-bit array. Experimental results show that, on average, the TF Architecture reduces the dynamic power consumption across the studied applications up to 74.9%, 85.9%, and 84.5% when applied to L1 caches, L2 caches, and directory caches, respectively. •Homogeneous distribution of the less significant bits of the tag across ways of cache sets.•A single bit of the address is enough to guarantee a tag mismatch.•We propose a mechanism to filter the ways accessed using the less significant bits of the address tag.•Dynamic power consumption in the processor cache is reduced up to 85.9%.•Dynamic power consumption in the directory cache is reduced up to 84.5%.</description><identifier>ISSN: 0743-7315</identifier><identifier>EISSN: 1096-0848</identifier><identifier>DOI: 10.1016/j.jpdc.2016.04.016</identifier><language>eng</language><publisher>Elsevier Inc</publisher><subject>Architecture ; Arrays ; Cache ; Design engineering ; Directories ; Directory ; Dynamic consumption ; Dynamics ; Energy consumption ; Energy management ; Multicore processors ; Power consumption</subject><ispartof>Journal of parallel and distributed computing, 2017-02, Vol.100, p.193-202</ispartof><rights>2016 Elsevier Inc.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c377t-f1296d06352e7a503e4e83bcb063e20dd7b3f6540cda0f30056bba4afb01d10f3</citedby><cites>FETCH-LOGICAL-c377t-f1296d06352e7a503e4e83bcb063e20dd7b3f6540cda0f30056bba4afb01d10f3</cites><orcidid>0000-0001-8630-4846</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids></links><search><creatorcontrib>Valls, Joan J.</creatorcontrib><creatorcontrib>Ros, Alberto</creatorcontrib><creatorcontrib>Gómez, María E.</creatorcontrib><creatorcontrib>Sahuquillo, Julio</creatorcontrib><title>The Tag Filter Architecture: An energy-efficient cache and directory design</title><title>Journal of parallel and distributed computing</title><description>Power consumption in current high-performance chip multiprocessors (CMPs) has become a major design concern that aggravates with the current trend of increasing the core count. A significant fraction of the total power budget is consumed by on-chip caches which are usually deployed with a high associativity degree (even L1 caches are being implemented with eight ways) to enhance the system performance. On a cache access, each way in the corresponding set is accessed in parallel, which is costly in terms of energy. On the other hand, coherence protocols also must implement efficient directory caches that scale in terms of power consumption. Most of the state-of-the-art techniques that reduce the energy consumption of directories are at the cost of performance, which may become unacceptable for high-performance CMPs. In this paper, we propose an energy-efficient architectural design that can be effectively applied to any kind of cache memory. The proposed approach, called the Tag Filter (TF) Architecture, filters the ways accessed in the target cache set, and just a few ways are searched in the tag and data arrays. This allows the approach to reduce the dynamic energy consumption of caches without hurting their access time. For this purpose, the proposed architecture holds the X least significant bits of each tag in a small auxiliary X-bit-wide array. These bits are used to filter the ways where the least significant bits of the tag do not match with the bits in the X-bit array. Experimental results show that, on average, the TF Architecture reduces the dynamic power consumption across the studied applications up to 74.9%, 85.9%, and 84.5% when applied to L1 caches, L2 caches, and directory caches, respectively. •Homogeneous distribution of the less significant bits of the tag across ways of cache sets.•A single bit of the address is enough to guarantee a tag mismatch.•We propose a mechanism to filter the ways accessed using the less significant bits of the address tag.•Dynamic power consumption in the processor cache is reduced up to 85.9%.•Dynamic power consumption in the directory cache is reduced up to 84.5%.</description><subject>Architecture</subject><subject>Arrays</subject><subject>Cache</subject><subject>Design engineering</subject><subject>Directories</subject><subject>Directory</subject><subject>Dynamic consumption</subject><subject>Dynamics</subject><subject>Energy consumption</subject><subject>Energy management</subject><subject>Multicore processors</subject><subject>Power consumption</subject><issn>0743-7315</issn><issn>1096-0848</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2017</creationdate><recordtype>article</recordtype><recordid>eNp9kM1OwzAQhC0EEqXwApx85JKwjp2fIi5VRQFRiUs5W469bh2lSbFTpL49jsKZ06xG8600Q8g9g5QBKx6btDkanWbxTkGkUS7IjMGiSKAS1SWZQSl4UnKWX5ObEBoAxvKympGP7R7pVu3o2rUDerr0eu8G1MPJ4xNddhQ79LtzgtY67bAbqFY6Iqoz1Dgfg70_U4PB7bpbcmVVG_DuT-fka_2yXb0lm8_X99Vyk2helkNiWbYoDBQ8z7BUOXAUWPFa19HCDIwpa26LXIA2CiwHyIu6VkLZGphh0ZmTh-nv0fffJwyDPLigsW1Vh_0pSFYVIueMCRGj2RTVvg_Bo5VH7w7KnyUDOS4nGzkuJ8flJAgZJULPE4SxxI9DL8NYXeNUWJre_Yf_AsTZdq4</recordid><startdate>201702</startdate><enddate>201702</enddate><creator>Valls, Joan J.</creator><creator>Ros, Alberto</creator><creator>Gómez, María E.</creator><creator>Sahuquillo, Julio</creator><general>Elsevier Inc</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0001-8630-4846</orcidid></search><sort><creationdate>201702</creationdate><title>The Tag Filter Architecture: An energy-efficient cache and directory design</title><author>Valls, Joan J. ; Ros, Alberto ; Gómez, María E. ; Sahuquillo, Julio</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c377t-f1296d06352e7a503e4e83bcb063e20dd7b3f6540cda0f30056bba4afb01d10f3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2017</creationdate><topic>Architecture</topic><topic>Arrays</topic><topic>Cache</topic><topic>Design engineering</topic><topic>Directories</topic><topic>Directory</topic><topic>Dynamic consumption</topic><topic>Dynamics</topic><topic>Energy consumption</topic><topic>Energy management</topic><topic>Multicore processors</topic><topic>Power consumption</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Valls, Joan J.</creatorcontrib><creatorcontrib>Ros, Alberto</creatorcontrib><creatorcontrib>Gómez, María E.</creatorcontrib><creatorcontrib>Sahuquillo, Julio</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Journal of parallel and distributed computing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Valls, Joan J.</au><au>Ros, Alberto</au><au>Gómez, María E.</au><au>Sahuquillo, Julio</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>The Tag Filter Architecture: An energy-efficient cache and directory design</atitle><jtitle>Journal of parallel and distributed computing</jtitle><date>2017-02</date><risdate>2017</risdate><volume>100</volume><spage>193</spage><epage>202</epage><pages>193-202</pages><issn>0743-7315</issn><eissn>1096-0848</eissn><abstract>Power consumption in current high-performance chip multiprocessors (CMPs) has become a major design concern that aggravates with the current trend of increasing the core count. A significant fraction of the total power budget is consumed by on-chip caches which are usually deployed with a high associativity degree (even L1 caches are being implemented with eight ways) to enhance the system performance. On a cache access, each way in the corresponding set is accessed in parallel, which is costly in terms of energy. On the other hand, coherence protocols also must implement efficient directory caches that scale in terms of power consumption. Most of the state-of-the-art techniques that reduce the energy consumption of directories are at the cost of performance, which may become unacceptable for high-performance CMPs. In this paper, we propose an energy-efficient architectural design that can be effectively applied to any kind of cache memory. The proposed approach, called the Tag Filter (TF) Architecture, filters the ways accessed in the target cache set, and just a few ways are searched in the tag and data arrays. This allows the approach to reduce the dynamic energy consumption of caches without hurting their access time. For this purpose, the proposed architecture holds the X least significant bits of each tag in a small auxiliary X-bit-wide array. These bits are used to filter the ways where the least significant bits of the tag do not match with the bits in the X-bit array. Experimental results show that, on average, the TF Architecture reduces the dynamic power consumption across the studied applications up to 74.9%, 85.9%, and 84.5% when applied to L1 caches, L2 caches, and directory caches, respectively. •Homogeneous distribution of the less significant bits of the tag across ways of cache sets.•A single bit of the address is enough to guarantee a tag mismatch.•We propose a mechanism to filter the ways accessed using the less significant bits of the address tag.•Dynamic power consumption in the processor cache is reduced up to 85.9%.•Dynamic power consumption in the directory cache is reduced up to 84.5%.</abstract><pub>Elsevier Inc</pub><doi>10.1016/j.jpdc.2016.04.016</doi><tpages>10</tpages><orcidid>https://orcid.org/0000-0001-8630-4846</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0743-7315
ispartof Journal of parallel and distributed computing, 2017-02, Vol.100, p.193-202
issn 0743-7315
1096-0848
language eng
recordid cdi_proquest_miscellaneous_1864531144
source ScienceDirect Freedom Collection 2022-2024
subjects Architecture
Arrays
Cache
Design engineering
Directories
Directory
Dynamic consumption
Dynamics
Energy consumption
Energy management
Multicore processors
Power consumption
title The Tag Filter Architecture: An energy-efficient cache and directory design
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-26T20%3A38%3A40IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=The%20Tag%20Filter%20Architecture:%20An%20energy-efficient%20cache%20and%20directory%20design&rft.jtitle=Journal%20of%20parallel%20and%20distributed%20computing&rft.au=Valls,%20Joan%20J.&rft.date=2017-02&rft.volume=100&rft.spage=193&rft.epage=202&rft.pages=193-202&rft.issn=0743-7315&rft.eissn=1096-0848&rft_id=info:doi/10.1016/j.jpdc.2016.04.016&rft_dat=%3Cproquest_cross%3E1864531144%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c377t-f1296d06352e7a503e4e83bcb063e20dd7b3f6540cda0f30056bba4afb01d10f3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=1864531144&rft_id=info:pmid/&rfr_iscdi=true