Loading…
Near-optimal replacement policies for shared caches in multicore processors
An optimal replacement policy that minimizes the miss rate in a private cache was proposed several decades ago. It requires knowing the future access sequence the cache will receive. There is no equivalent for shared caches because replacement decisions alter this future sequence. We present a novel...
Saved in:
Published in: | The Journal of supercomputing 2021-10, Vol.77 (10), p.11756-11785 |
---|---|
Main Authors: | , , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c319t-8008cbfb9450eb747f9e6d98a3226cd0793df4d2cccc2386b6cb7c2ea5fe28673 |
---|---|
cites | cdi_FETCH-LOGICAL-c319t-8008cbfb9450eb747f9e6d98a3226cd0793df4d2cccc2386b6cb7c2ea5fe28673 |
container_end_page | 11785 |
container_issue | 10 |
container_start_page | 11756 |
container_title | The Journal of supercomputing |
container_volume | 77 |
creator | Díaz, Javier Ibáñez, Pablo Monreal, Teresa Viñals, Víctor Llabería, José M. |
description | An optimal replacement policy that minimizes the miss rate in a private cache was proposed several decades ago. It requires knowing the future access sequence the cache will receive. There is no equivalent for shared caches because replacement decisions alter this future sequence. We present a novel near-optimal policy for minimizing the miss rate in a shared cache that approaches the optimal execution iteratively. During each iteration, the future access sequence is reconstructed on every miss interleaving the future per-core sequences, taken from the previous iteration. This single sequence feeds a classical private-cache optimum replacement policy. Our evaluation on a shared last-level cache shows that our proposal iteratively converges to a near-optimal miss rate that is independent of the initial conditions, within a margin of 0.1%. The best state-of-the-art online policies achieve around 65% of the miss rate reduction obtained by our near-optimal proposal. In a shared cache, miss rate optimization does not imply the optimization of other metrics. Therefore, we also propose a new near-optimal policy to maximize fairness between cores. The best state-of-the-art online policy achieves 60% of the improvement in fairness seen with our near-optimal policy. Our proposals are useful both for setting upper performance bounds and inspiring implementable mechanisms for shared caches. |
doi_str_mv | 10.1007/s11227-021-03736-1 |
format | article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2570438659</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2570438659</sourcerecordid><originalsourceid>FETCH-LOGICAL-c319t-8008cbfb9450eb747f9e6d98a3226cd0793df4d2cccc2386b6cb7c2ea5fe28673</originalsourceid><addsrcrecordid>eNp9kE9LxDAQxYMouK5-AU8Fz9HJnzbJURZ1xUUveg5pOnW7dJuadA9-e6MVvDmXgeG9N7wfIZcMrhmAukmMca4ocEZBKFFRdkQWrFSCgtTymCzAcKC6lPyUnKW0AwCZdQvy9Iwu0jBO3d71RcSxdx73OEzFGPrOd5iKNsQibV3EpvDOb_OlG4r9oZ86HyIWYwweUwoxnZOT1vUJL373krzd372u1nTz8vC4ut1QL5iZqAbQvm5rI0vAWknVGqwao53gvPINKCOaVjbc5-FCV3Xla-U5urJFrislluRqzs2vPw6YJrsLhzjkl5aXKhfTVWmyis8qH0NKEVs7xlwyfloG9huanaHZDM3-QLMsm8RsSlk8vGP8i_7H9QVgjHAq</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2570438659</pqid></control><display><type>article</type><title>Near-optimal replacement policies for shared caches in multicore processors</title><source>Springer Nature</source><creator>Díaz, Javier ; Ibáñez, Pablo ; Monreal, Teresa ; Viñals, Víctor ; Llabería, José M.</creator><creatorcontrib>Díaz, Javier ; Ibáñez, Pablo ; Monreal, Teresa ; Viñals, Víctor ; Llabería, José M.</creatorcontrib><description>An optimal replacement policy that minimizes the miss rate in a private cache was proposed several decades ago. It requires knowing the future access sequence the cache will receive. There is no equivalent for shared caches because replacement decisions alter this future sequence. We present a novel near-optimal policy for minimizing the miss rate in a shared cache that approaches the optimal execution iteratively. During each iteration, the future access sequence is reconstructed on every miss interleaving the future per-core sequences, taken from the previous iteration. This single sequence feeds a classical private-cache optimum replacement policy. Our evaluation on a shared last-level cache shows that our proposal iteratively converges to a near-optimal miss rate that is independent of the initial conditions, within a margin of 0.1%. The best state-of-the-art online policies achieve around 65% of the miss rate reduction obtained by our near-optimal proposal. In a shared cache, miss rate optimization does not imply the optimization of other metrics. Therefore, we also propose a new near-optimal policy to maximize fairness between cores. The best state-of-the-art online policy achieves 60% of the improvement in fairness seen with our near-optimal policy. Our proposals are useful both for setting upper performance bounds and inspiring implementable mechanisms for shared caches.</description><identifier>ISSN: 0920-8542</identifier><identifier>EISSN: 1573-0484</identifier><identifier>DOI: 10.1007/s11227-021-03736-1</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Compilers ; Computer Science ; Hierarchies ; Initial conditions ; Interpreters ; Microprocessors ; Optimization ; Policies ; Processor Architectures ; Programming Languages</subject><ispartof>The Journal of supercomputing, 2021-10, Vol.77 (10), p.11756-11785</ispartof><rights>The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2021</rights><rights>The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2021.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c319t-8008cbfb9450eb747f9e6d98a3226cd0793df4d2cccc2386b6cb7c2ea5fe28673</citedby><cites>FETCH-LOGICAL-c319t-8008cbfb9450eb747f9e6d98a3226cd0793df4d2cccc2386b6cb7c2ea5fe28673</cites><orcidid>0000-0002-5916-7898</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27915,27916</link.rule.ids></links><search><creatorcontrib>Díaz, Javier</creatorcontrib><creatorcontrib>Ibáñez, Pablo</creatorcontrib><creatorcontrib>Monreal, Teresa</creatorcontrib><creatorcontrib>Viñals, Víctor</creatorcontrib><creatorcontrib>Llabería, José M.</creatorcontrib><title>Near-optimal replacement policies for shared caches in multicore processors</title><title>The Journal of supercomputing</title><addtitle>J Supercomput</addtitle><description>An optimal replacement policy that minimizes the miss rate in a private cache was proposed several decades ago. It requires knowing the future access sequence the cache will receive. There is no equivalent for shared caches because replacement decisions alter this future sequence. We present a novel near-optimal policy for minimizing the miss rate in a shared cache that approaches the optimal execution iteratively. During each iteration, the future access sequence is reconstructed on every miss interleaving the future per-core sequences, taken from the previous iteration. This single sequence feeds a classical private-cache optimum replacement policy. Our evaluation on a shared last-level cache shows that our proposal iteratively converges to a near-optimal miss rate that is independent of the initial conditions, within a margin of 0.1%. The best state-of-the-art online policies achieve around 65% of the miss rate reduction obtained by our near-optimal proposal. In a shared cache, miss rate optimization does not imply the optimization of other metrics. Therefore, we also propose a new near-optimal policy to maximize fairness between cores. The best state-of-the-art online policy achieves 60% of the improvement in fairness seen with our near-optimal policy. Our proposals are useful both for setting upper performance bounds and inspiring implementable mechanisms for shared caches.</description><subject>Compilers</subject><subject>Computer Science</subject><subject>Hierarchies</subject><subject>Initial conditions</subject><subject>Interpreters</subject><subject>Microprocessors</subject><subject>Optimization</subject><subject>Policies</subject><subject>Processor Architectures</subject><subject>Programming Languages</subject><issn>0920-8542</issn><issn>1573-0484</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><recordid>eNp9kE9LxDAQxYMouK5-AU8Fz9HJnzbJURZ1xUUveg5pOnW7dJuadA9-e6MVvDmXgeG9N7wfIZcMrhmAukmMca4ocEZBKFFRdkQWrFSCgtTymCzAcKC6lPyUnKW0AwCZdQvy9Iwu0jBO3d71RcSxdx73OEzFGPrOd5iKNsQibV3EpvDOb_OlG4r9oZ86HyIWYwweUwoxnZOT1vUJL373krzd372u1nTz8vC4ut1QL5iZqAbQvm5rI0vAWknVGqwao53gvPINKCOaVjbc5-FCV3Xla-U5urJFrislluRqzs2vPw6YJrsLhzjkl5aXKhfTVWmyis8qH0NKEVs7xlwyfloG9huanaHZDM3-QLMsm8RsSlk8vGP8i_7H9QVgjHAq</recordid><startdate>20211001</startdate><enddate>20211001</enddate><creator>Díaz, Javier</creator><creator>Ibáñez, Pablo</creator><creator>Monreal, Teresa</creator><creator>Viñals, Víctor</creator><creator>Llabería, José M.</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0000-0002-5916-7898</orcidid></search><sort><creationdate>20211001</creationdate><title>Near-optimal replacement policies for shared caches in multicore processors</title><author>Díaz, Javier ; Ibáñez, Pablo ; Monreal, Teresa ; Viñals, Víctor ; Llabería, José M.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c319t-8008cbfb9450eb747f9e6d98a3226cd0793df4d2cccc2386b6cb7c2ea5fe28673</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Compilers</topic><topic>Computer Science</topic><topic>Hierarchies</topic><topic>Initial conditions</topic><topic>Interpreters</topic><topic>Microprocessors</topic><topic>Optimization</topic><topic>Policies</topic><topic>Processor Architectures</topic><topic>Programming Languages</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Díaz, Javier</creatorcontrib><creatorcontrib>Ibáñez, Pablo</creatorcontrib><creatorcontrib>Monreal, Teresa</creatorcontrib><creatorcontrib>Viñals, Víctor</creatorcontrib><creatorcontrib>Llabería, José M.</creatorcontrib><collection>CrossRef</collection><jtitle>The Journal of supercomputing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Díaz, Javier</au><au>Ibáñez, Pablo</au><au>Monreal, Teresa</au><au>Viñals, Víctor</au><au>Llabería, José M.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Near-optimal replacement policies for shared caches in multicore processors</atitle><jtitle>The Journal of supercomputing</jtitle><stitle>J Supercomput</stitle><date>2021-10-01</date><risdate>2021</risdate><volume>77</volume><issue>10</issue><spage>11756</spage><epage>11785</epage><pages>11756-11785</pages><issn>0920-8542</issn><eissn>1573-0484</eissn><abstract>An optimal replacement policy that minimizes the miss rate in a private cache was proposed several decades ago. It requires knowing the future access sequence the cache will receive. There is no equivalent for shared caches because replacement decisions alter this future sequence. We present a novel near-optimal policy for minimizing the miss rate in a shared cache that approaches the optimal execution iteratively. During each iteration, the future access sequence is reconstructed on every miss interleaving the future per-core sequences, taken from the previous iteration. This single sequence feeds a classical private-cache optimum replacement policy. Our evaluation on a shared last-level cache shows that our proposal iteratively converges to a near-optimal miss rate that is independent of the initial conditions, within a margin of 0.1%. The best state-of-the-art online policies achieve around 65% of the miss rate reduction obtained by our near-optimal proposal. In a shared cache, miss rate optimization does not imply the optimization of other metrics. Therefore, we also propose a new near-optimal policy to maximize fairness between cores. The best state-of-the-art online policy achieves 60% of the improvement in fairness seen with our near-optimal policy. Our proposals are useful both for setting upper performance bounds and inspiring implementable mechanisms for shared caches.</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s11227-021-03736-1</doi><tpages>30</tpages><orcidid>https://orcid.org/0000-0002-5916-7898</orcidid></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0920-8542 |
ispartof | The Journal of supercomputing, 2021-10, Vol.77 (10), p.11756-11785 |
issn | 0920-8542 1573-0484 |
language | eng |
recordid | cdi_proquest_journals_2570438659 |
source | Springer Nature |
subjects | Compilers Computer Science Hierarchies Initial conditions Interpreters Microprocessors Optimization Policies Processor Architectures Programming Languages |
title | Near-optimal replacement policies for shared caches in multicore processors |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-14T23%3A13%3A11IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Near-optimal%20replacement%20policies%20for%20shared%20caches%20in%20multicore%20processors&rft.jtitle=The%20Journal%20of%20supercomputing&rft.au=D%C3%ADaz,%20Javier&rft.date=2021-10-01&rft.volume=77&rft.issue=10&rft.spage=11756&rft.epage=11785&rft.pages=11756-11785&rft.issn=0920-8542&rft.eissn=1573-0484&rft_id=info:doi/10.1007/s11227-021-03736-1&rft_dat=%3Cproquest_cross%3E2570438659%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c319t-8008cbfb9450eb747f9e6d98a3226cd0793df4d2cccc2386b6cb7c2ea5fe28673%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2570438659&rft_id=info:pmid/&rfr_iscdi=true |