Loading…

On effective data supply for multi-issue processors

Emerging multi-issue microprocessors require effective data supply to sustain multiple instruction processing. The data cache structure, the backbone of data supply, has been organized and managed as one large homogenous resource, offering little flexibility for selective caching. While memory laten...

Full description

Saved in:
Bibliographic Details
Main Authors: Rivers, J.A., Tam, E.S., Davidson, E.S.
Format: Conference Proceeding
Language:English
Subjects:
Online Access:Request full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites
container_end_page 528
container_issue
container_start_page 519
container_title
container_volume
creator Rivers, J.A.
Tam, E.S.
Davidson, E.S.
description Emerging multi-issue microprocessors require effective data supply to sustain multiple instruction processing. The data cache structure, the backbone of data supply, has been organized and managed as one large homogenous resource, offering little flexibility for selective caching. While memory latency hiding techniques and multi-ported caches are critical to effective data supply, we show in this paper that even ideal non-blocking multi-ported caches fail to be sufficient in and of themselves in supplying data. We evaluate an approach in which the first level (L1) data cache is partitioned into multiple (multi-lateral) subcaches. The data reference stream of a running program is subdivided into two classes, and each class is mapped to a specific subcache whose management policy is more suitable for the access pattern of its class. This sort of selective organization and caching retains more useful data in the L1 Cache, which translates to more cache hits, less cache-memory bus contention and overall improvement in execution time. Our simulations show that a multi-lateral L1 cache of (8+1)KB total size generally performs as well as, and in some cases better than, an ideal multiported 16 KB cache structure in supplying data.
doi_str_mv 10.1109/ICCD.1997.628917
format conference_proceeding
fullrecord <record><control><sourceid>ieee_CHZPO</sourceid><recordid>TN_cdi_ieee_primary_628917</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>628917</ieee_id><sourcerecordid>628917</sourcerecordid><originalsourceid>FETCH-LOGICAL-i104t-6b7f33c5c8ead55df455d38aef67031652c2830355d04ee56be88e991e8f99303</originalsourceid><addsrcrecordid>eNotj8tqwzAUREUfUJN6X7rSD8iVrNe9y-K-AoFs2nVQ7CtQcWpj2YX8fQ3pLObADAwMYw9KVkpJfNo2zUulEH3lakDlr1hRW--EQ3TXrEQPEhQ4qKXTN6xQK4Qz0tyxMudvucpYhRYKpvc_nGKkdk6_xLswB56XcezPPA4TPy39nETKeSE-TkNLOQ9Tvme3MfSZyn9u2Nfb62fzIXb7923zvBNJSTMLd_RR69a2QKGztotmNQ2BovNSK2frtgYt9ZpKQ2TdkQAIURFExLXYsMfLbiKiwzilU5jOh8tj_QfX5EbT</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>On effective data supply for multi-issue processors</title><source>IEEE Xplore All Conference Series</source><creator>Rivers, J.A. ; Tam, E.S. ; Davidson, E.S.</creator><creatorcontrib>Rivers, J.A. ; Tam, E.S. ; Davidson, E.S.</creatorcontrib><description>Emerging multi-issue microprocessors require effective data supply to sustain multiple instruction processing. The data cache structure, the backbone of data supply, has been organized and managed as one large homogenous resource, offering little flexibility for selective caching. While memory latency hiding techniques and multi-ported caches are critical to effective data supply, we show in this paper that even ideal non-blocking multi-ported caches fail to be sufficient in and of themselves in supplying data. We evaluate an approach in which the first level (L1) data cache is partitioned into multiple (multi-lateral) subcaches. The data reference stream of a running program is subdivided into two classes, and each class is mapped to a specific subcache whose management policy is more suitable for the access pattern of its class. This sort of selective organization and caching retains more useful data in the L1 Cache, which translates to more cache hits, less cache-memory bus contention and overall improvement in execution time. Our simulations show that a multi-lateral L1 cache of (8+1)KB total size generally performs as well as, and in some cases better than, an ideal multiported 16 KB cache structure in supplying data.</description><identifier>ISSN: 1063-6404</identifier><identifier>ISBN: 9780818682063</identifier><identifier>ISBN: 081868206X</identifier><identifier>EISSN: 2576-6996</identifier><identifier>DOI: 10.1109/ICCD.1997.628917</identifier><language>eng</language><publisher>IEEE</publisher><subject>Clocks ; Computer architecture ; Delay ; Laboratories ; Microarchitecture ; Microprocessors ; Parallel processing ; Resource management ; Rivers ; Spine</subject><ispartof>Proceedings International Conference on Computer Design VLSI in Computers and Processors, 1997, p.519-528</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/628917$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,780,784,789,790,2058,4050,4051,27925,54555,54920,54932</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/628917$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Rivers, J.A.</creatorcontrib><creatorcontrib>Tam, E.S.</creatorcontrib><creatorcontrib>Davidson, E.S.</creatorcontrib><title>On effective data supply for multi-issue processors</title><title>Proceedings International Conference on Computer Design VLSI in Computers and Processors</title><addtitle>ICCD</addtitle><description>Emerging multi-issue microprocessors require effective data supply to sustain multiple instruction processing. The data cache structure, the backbone of data supply, has been organized and managed as one large homogenous resource, offering little flexibility for selective caching. While memory latency hiding techniques and multi-ported caches are critical to effective data supply, we show in this paper that even ideal non-blocking multi-ported caches fail to be sufficient in and of themselves in supplying data. We evaluate an approach in which the first level (L1) data cache is partitioned into multiple (multi-lateral) subcaches. The data reference stream of a running program is subdivided into two classes, and each class is mapped to a specific subcache whose management policy is more suitable for the access pattern of its class. This sort of selective organization and caching retains more useful data in the L1 Cache, which translates to more cache hits, less cache-memory bus contention and overall improvement in execution time. Our simulations show that a multi-lateral L1 cache of (8+1)KB total size generally performs as well as, and in some cases better than, an ideal multiported 16 KB cache structure in supplying data.</description><subject>Clocks</subject><subject>Computer architecture</subject><subject>Delay</subject><subject>Laboratories</subject><subject>Microarchitecture</subject><subject>Microprocessors</subject><subject>Parallel processing</subject><subject>Resource management</subject><subject>Rivers</subject><subject>Spine</subject><issn>1063-6404</issn><issn>2576-6996</issn><isbn>9780818682063</isbn><isbn>081868206X</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>1997</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><recordid>eNotj8tqwzAUREUfUJN6X7rSD8iVrNe9y-K-AoFs2nVQ7CtQcWpj2YX8fQ3pLObADAwMYw9KVkpJfNo2zUulEH3lakDlr1hRW--EQ3TXrEQPEhQ4qKXTN6xQK4Qz0tyxMudvucpYhRYKpvc_nGKkdk6_xLswB56XcezPPA4TPy39nETKeSE-TkNLOQ9Tvme3MfSZyn9u2Nfb62fzIXb7923zvBNJSTMLd_RR69a2QKGztotmNQ2BovNSK2frtgYt9ZpKQ2TdkQAIURFExLXYsMfLbiKiwzilU5jOh8tj_QfX5EbT</recordid><startdate>1997</startdate><enddate>1997</enddate><creator>Rivers, J.A.</creator><creator>Tam, E.S.</creator><creator>Davidson, E.S.</creator><general>IEEE</general><scope>6IE</scope><scope>6IL</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIL</scope></search><sort><creationdate>1997</creationdate><title>On effective data supply for multi-issue processors</title><author>Rivers, J.A. ; Tam, E.S. ; Davidson, E.S.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i104t-6b7f33c5c8ead55df455d38aef67031652c2830355d04ee56be88e991e8f99303</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>1997</creationdate><topic>Clocks</topic><topic>Computer architecture</topic><topic>Delay</topic><topic>Laboratories</topic><topic>Microarchitecture</topic><topic>Microprocessors</topic><topic>Parallel processing</topic><topic>Resource management</topic><topic>Rivers</topic><topic>Spine</topic><toplevel>online_resources</toplevel><creatorcontrib>Rivers, J.A.</creatorcontrib><creatorcontrib>Tam, E.S.</creatorcontrib><creatorcontrib>Davidson, E.S.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE/IET Electronic Library</collection><collection>IEEE Proceedings Order Plans (POP All) 1998-Present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Rivers, J.A.</au><au>Tam, E.S.</au><au>Davidson, E.S.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>On effective data supply for multi-issue processors</atitle><btitle>Proceedings International Conference on Computer Design VLSI in Computers and Processors</btitle><stitle>ICCD</stitle><date>1997</date><risdate>1997</risdate><spage>519</spage><epage>528</epage><pages>519-528</pages><issn>1063-6404</issn><eissn>2576-6996</eissn><isbn>9780818682063</isbn><isbn>081868206X</isbn><abstract>Emerging multi-issue microprocessors require effective data supply to sustain multiple instruction processing. The data cache structure, the backbone of data supply, has been organized and managed as one large homogenous resource, offering little flexibility for selective caching. While memory latency hiding techniques and multi-ported caches are critical to effective data supply, we show in this paper that even ideal non-blocking multi-ported caches fail to be sufficient in and of themselves in supplying data. We evaluate an approach in which the first level (L1) data cache is partitioned into multiple (multi-lateral) subcaches. The data reference stream of a running program is subdivided into two classes, and each class is mapped to a specific subcache whose management policy is more suitable for the access pattern of its class. This sort of selective organization and caching retains more useful data in the L1 Cache, which translates to more cache hits, less cache-memory bus contention and overall improvement in execution time. Our simulations show that a multi-lateral L1 cache of (8+1)KB total size generally performs as well as, and in some cases better than, an ideal multiported 16 KB cache structure in supplying data.</abstract><pub>IEEE</pub><doi>10.1109/ICCD.1997.628917</doi><tpages>10</tpages></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 1063-6404
ispartof Proceedings International Conference on Computer Design VLSI in Computers and Processors, 1997, p.519-528
issn 1063-6404
2576-6996
language eng
recordid cdi_ieee_primary_628917
source IEEE Xplore All Conference Series
subjects Clocks
Computer architecture
Delay
Laboratories
Microarchitecture
Microprocessors
Parallel processing
Resource management
Rivers
Spine
title On effective data supply for multi-issue processors
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-07T17%3A50%3A16IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_CHZPO&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=On%20effective%20data%20supply%20for%20multi-issue%20processors&rft.btitle=Proceedings%20International%20Conference%20on%20Computer%20Design%20VLSI%20in%20Computers%20and%20Processors&rft.au=Rivers,%20J.A.&rft.date=1997&rft.spage=519&rft.epage=528&rft.pages=519-528&rft.issn=1063-6404&rft.eissn=2576-6996&rft.isbn=9780818682063&rft.isbn_list=081868206X&rft_id=info:doi/10.1109/ICCD.1997.628917&rft_dat=%3Cieee_CHZPO%3E628917%3C/ieee_CHZPO%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-i104t-6b7f33c5c8ead55df455d38aef67031652c2830355d04ee56be88e991e8f99303%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=628917&rfr_iscdi=true