Loading…
On effective data supply for multi-issue processors
Emerging multi-issue microprocessors require effective data supply to sustain multiple instruction processing. The data cache structure, the backbone of data supply, has been organized and managed as one large homogenous resource, offering little flexibility for selective caching. While memory laten...
Saved in:
Main Authors: | , , |
---|---|
Format: | Conference Proceeding |
Language: | English |
Subjects: | |
Online Access: | Request full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | |
---|---|
cites | |
container_end_page | 528 |
container_issue | |
container_start_page | 519 |
container_title | |
container_volume | |
creator | Rivers, J.A. Tam, E.S. Davidson, E.S. |
description | Emerging multi-issue microprocessors require effective data supply to sustain multiple instruction processing. The data cache structure, the backbone of data supply, has been organized and managed as one large homogenous resource, offering little flexibility for selective caching. While memory latency hiding techniques and multi-ported caches are critical to effective data supply, we show in this paper that even ideal non-blocking multi-ported caches fail to be sufficient in and of themselves in supplying data. We evaluate an approach in which the first level (L1) data cache is partitioned into multiple (multi-lateral) subcaches. The data reference stream of a running program is subdivided into two classes, and each class is mapped to a specific subcache whose management policy is more suitable for the access pattern of its class. This sort of selective organization and caching retains more useful data in the L1 Cache, which translates to more cache hits, less cache-memory bus contention and overall improvement in execution time. Our simulations show that a multi-lateral L1 cache of (8+1)KB total size generally performs as well as, and in some cases better than, an ideal multiported 16 KB cache structure in supplying data. |
doi_str_mv | 10.1109/ICCD.1997.628917 |
format | conference_proceeding |
fullrecord | <record><control><sourceid>ieee_CHZPO</sourceid><recordid>TN_cdi_ieee_primary_628917</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>628917</ieee_id><sourcerecordid>628917</sourcerecordid><originalsourceid>FETCH-LOGICAL-i104t-6b7f33c5c8ead55df455d38aef67031652c2830355d04ee56be88e991e8f99303</originalsourceid><addsrcrecordid>eNotj8tqwzAUREUfUJN6X7rSD8iVrNe9y-K-AoFs2nVQ7CtQcWpj2YX8fQ3pLObADAwMYw9KVkpJfNo2zUulEH3lakDlr1hRW--EQ3TXrEQPEhQ4qKXTN6xQK4Qz0tyxMudvucpYhRYKpvc_nGKkdk6_xLswB56XcezPPA4TPy39nETKeSE-TkNLOQ9Tvme3MfSZyn9u2Nfb62fzIXb7923zvBNJSTMLd_RR69a2QKGztotmNQ2BovNSK2frtgYt9ZpKQ2TdkQAIURFExLXYsMfLbiKiwzilU5jOh8tj_QfX5EbT</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>On effective data supply for multi-issue processors</title><source>IEEE Xplore All Conference Series</source><creator>Rivers, J.A. ; Tam, E.S. ; Davidson, E.S.</creator><creatorcontrib>Rivers, J.A. ; Tam, E.S. ; Davidson, E.S.</creatorcontrib><description>Emerging multi-issue microprocessors require effective data supply to sustain multiple instruction processing. The data cache structure, the backbone of data supply, has been organized and managed as one large homogenous resource, offering little flexibility for selective caching. While memory latency hiding techniques and multi-ported caches are critical to effective data supply, we show in this paper that even ideal non-blocking multi-ported caches fail to be sufficient in and of themselves in supplying data. We evaluate an approach in which the first level (L1) data cache is partitioned into multiple (multi-lateral) subcaches. The data reference stream of a running program is subdivided into two classes, and each class is mapped to a specific subcache whose management policy is more suitable for the access pattern of its class. This sort of selective organization and caching retains more useful data in the L1 Cache, which translates to more cache hits, less cache-memory bus contention and overall improvement in execution time. Our simulations show that a multi-lateral L1 cache of (8+1)KB total size generally performs as well as, and in some cases better than, an ideal multiported 16 KB cache structure in supplying data.</description><identifier>ISSN: 1063-6404</identifier><identifier>ISBN: 9780818682063</identifier><identifier>ISBN: 081868206X</identifier><identifier>EISSN: 2576-6996</identifier><identifier>DOI: 10.1109/ICCD.1997.628917</identifier><language>eng</language><publisher>IEEE</publisher><subject>Clocks ; Computer architecture ; Delay ; Laboratories ; Microarchitecture ; Microprocessors ; Parallel processing ; Resource management ; Rivers ; Spine</subject><ispartof>Proceedings International Conference on Computer Design VLSI in Computers and Processors, 1997, p.519-528</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/628917$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,780,784,789,790,2058,4050,4051,27925,54555,54920,54932</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/628917$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Rivers, J.A.</creatorcontrib><creatorcontrib>Tam, E.S.</creatorcontrib><creatorcontrib>Davidson, E.S.</creatorcontrib><title>On effective data supply for multi-issue processors</title><title>Proceedings International Conference on Computer Design VLSI in Computers and Processors</title><addtitle>ICCD</addtitle><description>Emerging multi-issue microprocessors require effective data supply to sustain multiple instruction processing. The data cache structure, the backbone of data supply, has been organized and managed as one large homogenous resource, offering little flexibility for selective caching. While memory latency hiding techniques and multi-ported caches are critical to effective data supply, we show in this paper that even ideal non-blocking multi-ported caches fail to be sufficient in and of themselves in supplying data. We evaluate an approach in which the first level (L1) data cache is partitioned into multiple (multi-lateral) subcaches. The data reference stream of a running program is subdivided into two classes, and each class is mapped to a specific subcache whose management policy is more suitable for the access pattern of its class. This sort of selective organization and caching retains more useful data in the L1 Cache, which translates to more cache hits, less cache-memory bus contention and overall improvement in execution time. Our simulations show that a multi-lateral L1 cache of (8+1)KB total size generally performs as well as, and in some cases better than, an ideal multiported 16 KB cache structure in supplying data.</description><subject>Clocks</subject><subject>Computer architecture</subject><subject>Delay</subject><subject>Laboratories</subject><subject>Microarchitecture</subject><subject>Microprocessors</subject><subject>Parallel processing</subject><subject>Resource management</subject><subject>Rivers</subject><subject>Spine</subject><issn>1063-6404</issn><issn>2576-6996</issn><isbn>9780818682063</isbn><isbn>081868206X</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>1997</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><recordid>eNotj8tqwzAUREUfUJN6X7rSD8iVrNe9y-K-AoFs2nVQ7CtQcWpj2YX8fQ3pLObADAwMYw9KVkpJfNo2zUulEH3lakDlr1hRW--EQ3TXrEQPEhQ4qKXTN6xQK4Qz0tyxMudvucpYhRYKpvc_nGKkdk6_xLswB56XcezPPA4TPy39nETKeSE-TkNLOQ9Tvme3MfSZyn9u2Nfb62fzIXb7923zvBNJSTMLd_RR69a2QKGztotmNQ2BovNSK2frtgYt9ZpKQ2TdkQAIURFExLXYsMfLbiKiwzilU5jOh8tj_QfX5EbT</recordid><startdate>1997</startdate><enddate>1997</enddate><creator>Rivers, J.A.</creator><creator>Tam, E.S.</creator><creator>Davidson, E.S.</creator><general>IEEE</general><scope>6IE</scope><scope>6IL</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIL</scope></search><sort><creationdate>1997</creationdate><title>On effective data supply for multi-issue processors</title><author>Rivers, J.A. ; Tam, E.S. ; Davidson, E.S.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i104t-6b7f33c5c8ead55df455d38aef67031652c2830355d04ee56be88e991e8f99303</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>1997</creationdate><topic>Clocks</topic><topic>Computer architecture</topic><topic>Delay</topic><topic>Laboratories</topic><topic>Microarchitecture</topic><topic>Microprocessors</topic><topic>Parallel processing</topic><topic>Resource management</topic><topic>Rivers</topic><topic>Spine</topic><toplevel>online_resources</toplevel><creatorcontrib>Rivers, J.A.</creatorcontrib><creatorcontrib>Tam, E.S.</creatorcontrib><creatorcontrib>Davidson, E.S.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE/IET Electronic Library</collection><collection>IEEE Proceedings Order Plans (POP All) 1998-Present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Rivers, J.A.</au><au>Tam, E.S.</au><au>Davidson, E.S.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>On effective data supply for multi-issue processors</atitle><btitle>Proceedings International Conference on Computer Design VLSI in Computers and Processors</btitle><stitle>ICCD</stitle><date>1997</date><risdate>1997</risdate><spage>519</spage><epage>528</epage><pages>519-528</pages><issn>1063-6404</issn><eissn>2576-6996</eissn><isbn>9780818682063</isbn><isbn>081868206X</isbn><abstract>Emerging multi-issue microprocessors require effective data supply to sustain multiple instruction processing. The data cache structure, the backbone of data supply, has been organized and managed as one large homogenous resource, offering little flexibility for selective caching. While memory latency hiding techniques and multi-ported caches are critical to effective data supply, we show in this paper that even ideal non-blocking multi-ported caches fail to be sufficient in and of themselves in supplying data. We evaluate an approach in which the first level (L1) data cache is partitioned into multiple (multi-lateral) subcaches. The data reference stream of a running program is subdivided into two classes, and each class is mapped to a specific subcache whose management policy is more suitable for the access pattern of its class. This sort of selective organization and caching retains more useful data in the L1 Cache, which translates to more cache hits, less cache-memory bus contention and overall improvement in execution time. Our simulations show that a multi-lateral L1 cache of (8+1)KB total size generally performs as well as, and in some cases better than, an ideal multiported 16 KB cache structure in supplying data.</abstract><pub>IEEE</pub><doi>10.1109/ICCD.1997.628917</doi><tpages>10</tpages></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | ISSN: 1063-6404 |
ispartof | Proceedings International Conference on Computer Design VLSI in Computers and Processors, 1997, p.519-528 |
issn | 1063-6404 2576-6996 |
language | eng |
recordid | cdi_ieee_primary_628917 |
source | IEEE Xplore All Conference Series |
subjects | Clocks Computer architecture Delay Laboratories Microarchitecture Microprocessors Parallel processing Resource management Rivers Spine |
title | On effective data supply for multi-issue processors |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-07T17%3A50%3A16IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_CHZPO&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=On%20effective%20data%20supply%20for%20multi-issue%20processors&rft.btitle=Proceedings%20International%20Conference%20on%20Computer%20Design%20VLSI%20in%20Computers%20and%20Processors&rft.au=Rivers,%20J.A.&rft.date=1997&rft.spage=519&rft.epage=528&rft.pages=519-528&rft.issn=1063-6404&rft.eissn=2576-6996&rft.isbn=9780818682063&rft.isbn_list=081868206X&rft_id=info:doi/10.1109/ICCD.1997.628917&rft_dat=%3Cieee_CHZPO%3E628917%3C/ieee_CHZPO%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-i104t-6b7f33c5c8ead55df455d38aef67031652c2830355d04ee56be88e991e8f99303%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=628917&rfr_iscdi=true |