Loading…

Leveraging Protocol Knowledge in Slack Matching

Stalls, due to mis-matches in communication rates, are a major performance obstacle in pipelined circuits. If the rate of data production is faster than the rate of consumption, the resulting design performs slower than when the communication rate is matched. This can be remedied by inserting pipeli...

Full description

Saved in:
Bibliographic Details
Main Authors: Venkataramani, G., Goldstein, S.C.
Format: Conference Proceeding
Language:English
Subjects:
Online Access:Request full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites
container_end_page 729
container_issue
container_start_page 724
container_title
container_volume
creator Venkataramani, G.
Goldstein, S.C.
description Stalls, due to mis-matches in communication rates, are a major performance obstacle in pipelined circuits. If the rate of data production is faster than the rate of consumption, the resulting design performs slower than when the communication rate is matched. This can be remedied by inserting pipeline buffers (to temporarily hold data), allowing the producer to proceed if the consumer is not ready to accept data. The problem of deciding which channels need these buffers (and how many) for an arbitrary communication profile is called the slack matching problem; the optimal solution to this problem has been shown to be NP-complete. In this paper, we present a heuristic that uses knowledge of the communication protocol to explicitly model these bottlenecks, and an iterative algorithm to progressively remove these bottlenecks by inserting buffers. We apply this algorithm to asynchronous circuits, and show that it naturally handles large designs with arbitrarily cyclic and acyclic topologies, which exhibit various types of control choice. The heuristic is efficient, achieving linear time complexity in practice, and produces solutions that (a) achieve up to 60% performance speedup on large media processing kernels, and (b) can either be verified to be optimal, or the approximation margin can be bounded
doi_str_mv 10.1109/ICCAD.2006.320019
format conference_proceeding
fullrecord <record><control><sourceid>ieee_CHZPO</sourceid><recordid>TN_cdi_ieee_primary_4110258</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>4110258</ieee_id><sourcerecordid>4110258</sourcerecordid><originalsourceid>FETCH-LOGICAL-i90t-3d4261539af4ea9326b3a6009800a64de0e80b973e61c1130251576bfae6b16e3</originalsourceid><addsrcrecordid>eNotj81KxEAQhEdRcF33AcRLXiDZ7umZ2enjEv8WIwrufZkknRiNiSRB8e0NaB2qvkNRUEpdIiSIwOtdmm6vEw3gEpod-Uido2XLRJ7xWC3QWh9rQ-ZkZmAdE1p9plbj-AaziMlbWqh1Jl8yhLrp6uh56Ke-6Nvooeu_WylriZouemlD8R49hql4nUsX6rQK7Sir_1yq_e3NPr2Ps6e7XbrN4oZhiqk02qElDpWRwKRdTsEBsAcIzpQC4iHnDYnDApFAW7Qbl1dBXI5OaKmu_mYbETl8Ds1HGH4OZr6uradffj1Djg</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Leveraging Protocol Knowledge in Slack Matching</title><source>IEEE Xplore All Conference Series</source><creator>Venkataramani, G. ; Goldstein, S.C.</creator><creatorcontrib>Venkataramani, G. ; Goldstein, S.C.</creatorcontrib><description>Stalls, due to mis-matches in communication rates, are a major performance obstacle in pipelined circuits. If the rate of data production is faster than the rate of consumption, the resulting design performs slower than when the communication rate is matched. This can be remedied by inserting pipeline buffers (to temporarily hold data), allowing the producer to proceed if the consumer is not ready to accept data. The problem of deciding which channels need these buffers (and how many) for an arbitrary communication profile is called the slack matching problem; the optimal solution to this problem has been shown to be NP-complete. In this paper, we present a heuristic that uses knowledge of the communication protocol to explicitly model these bottlenecks, and an iterative algorithm to progressively remove these bottlenecks by inserting buffers. We apply this algorithm to asynchronous circuits, and show that it naturally handles large designs with arbitrarily cyclic and acyclic topologies, which exhibit various types of control choice. The heuristic is efficient, achieving linear time complexity in practice, and produces solutions that (a) achieve up to 60% performance speedup on large media processing kernels, and (b) can either be verified to be optimal, or the approximation margin can be bounded</description><identifier>ISSN: 1092-3152</identifier><identifier>EISSN: 1558-2434</identifier><identifier>EISBN: 1595933891</identifier><identifier>EISBN: 9781595933898</identifier><identifier>DOI: 10.1109/ICCAD.2006.320019</identifier><language>eng</language><publisher>IEEE</publisher><subject>Algorithm design and analysis ; Asynchronous circuits ; Circuit topology ; Communication system control ; Delay ; Iterative algorithms ; Kernel ; Pipelines ; Production ; Protocols</subject><ispartof>2006 IEEE/ACM International Conference on Computer Aided Design, 2006, p.724-729</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/4110258$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,780,784,789,790,23930,23931,25140,27925,54555,54932</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/4110258$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Venkataramani, G.</creatorcontrib><creatorcontrib>Goldstein, S.C.</creatorcontrib><title>Leveraging Protocol Knowledge in Slack Matching</title><title>2006 IEEE/ACM International Conference on Computer Aided Design</title><addtitle>ICCAD</addtitle><description>Stalls, due to mis-matches in communication rates, are a major performance obstacle in pipelined circuits. If the rate of data production is faster than the rate of consumption, the resulting design performs slower than when the communication rate is matched. This can be remedied by inserting pipeline buffers (to temporarily hold data), allowing the producer to proceed if the consumer is not ready to accept data. The problem of deciding which channels need these buffers (and how many) for an arbitrary communication profile is called the slack matching problem; the optimal solution to this problem has been shown to be NP-complete. In this paper, we present a heuristic that uses knowledge of the communication protocol to explicitly model these bottlenecks, and an iterative algorithm to progressively remove these bottlenecks by inserting buffers. We apply this algorithm to asynchronous circuits, and show that it naturally handles large designs with arbitrarily cyclic and acyclic topologies, which exhibit various types of control choice. The heuristic is efficient, achieving linear time complexity in practice, and produces solutions that (a) achieve up to 60% performance speedup on large media processing kernels, and (b) can either be verified to be optimal, or the approximation margin can be bounded</description><subject>Algorithm design and analysis</subject><subject>Asynchronous circuits</subject><subject>Circuit topology</subject><subject>Communication system control</subject><subject>Delay</subject><subject>Iterative algorithms</subject><subject>Kernel</subject><subject>Pipelines</subject><subject>Production</subject><subject>Protocols</subject><issn>1092-3152</issn><issn>1558-2434</issn><isbn>1595933891</isbn><isbn>9781595933898</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2006</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><recordid>eNotj81KxEAQhEdRcF33AcRLXiDZ7umZ2enjEv8WIwrufZkknRiNiSRB8e0NaB2qvkNRUEpdIiSIwOtdmm6vEw3gEpod-Uido2XLRJ7xWC3QWh9rQ-ZkZmAdE1p9plbj-AaziMlbWqh1Jl8yhLrp6uh56Ke-6Nvooeu_WylriZouemlD8R49hql4nUsX6rQK7Sir_1yq_e3NPr2Ps6e7XbrN4oZhiqk02qElDpWRwKRdTsEBsAcIzpQC4iHnDYnDApFAW7Qbl1dBXI5OaKmu_mYbETl8Ds1HGH4OZr6uradffj1Djg</recordid><startdate>200611</startdate><enddate>200611</enddate><creator>Venkataramani, G.</creator><creator>Goldstein, S.C.</creator><general>IEEE</general><scope>6IE</scope><scope>6IH</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIO</scope></search><sort><creationdate>200611</creationdate><title>Leveraging Protocol Knowledge in Slack Matching</title><author>Venkataramani, G. ; Goldstein, S.C.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i90t-3d4261539af4ea9326b3a6009800a64de0e80b973e61c1130251576bfae6b16e3</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2006</creationdate><topic>Algorithm design and analysis</topic><topic>Asynchronous circuits</topic><topic>Circuit topology</topic><topic>Communication system control</topic><topic>Delay</topic><topic>Iterative algorithms</topic><topic>Kernel</topic><topic>Pipelines</topic><topic>Production</topic><topic>Protocols</topic><toplevel>online_resources</toplevel><creatorcontrib>Venkataramani, G.</creatorcontrib><creatorcontrib>Goldstein, S.C.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan (POP) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP) 1998-present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Venkataramani, G.</au><au>Goldstein, S.C.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Leveraging Protocol Knowledge in Slack Matching</atitle><btitle>2006 IEEE/ACM International Conference on Computer Aided Design</btitle><stitle>ICCAD</stitle><date>2006-11</date><risdate>2006</risdate><spage>724</spage><epage>729</epage><pages>724-729</pages><issn>1092-3152</issn><eissn>1558-2434</eissn><eisbn>1595933891</eisbn><eisbn>9781595933898</eisbn><abstract>Stalls, due to mis-matches in communication rates, are a major performance obstacle in pipelined circuits. If the rate of data production is faster than the rate of consumption, the resulting design performs slower than when the communication rate is matched. This can be remedied by inserting pipeline buffers (to temporarily hold data), allowing the producer to proceed if the consumer is not ready to accept data. The problem of deciding which channels need these buffers (and how many) for an arbitrary communication profile is called the slack matching problem; the optimal solution to this problem has been shown to be NP-complete. In this paper, we present a heuristic that uses knowledge of the communication protocol to explicitly model these bottlenecks, and an iterative algorithm to progressively remove these bottlenecks by inserting buffers. We apply this algorithm to asynchronous circuits, and show that it naturally handles large designs with arbitrarily cyclic and acyclic topologies, which exhibit various types of control choice. The heuristic is efficient, achieving linear time complexity in practice, and produces solutions that (a) achieve up to 60% performance speedup on large media processing kernels, and (b) can either be verified to be optimal, or the approximation margin can be bounded</abstract><pub>IEEE</pub><doi>10.1109/ICCAD.2006.320019</doi><tpages>6</tpages></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 1092-3152
ispartof 2006 IEEE/ACM International Conference on Computer Aided Design, 2006, p.724-729
issn 1092-3152
1558-2434
language eng
recordid cdi_ieee_primary_4110258
source IEEE Xplore All Conference Series
subjects Algorithm design and analysis
Asynchronous circuits
Circuit topology
Communication system control
Delay
Iterative algorithms
Kernel
Pipelines
Production
Protocols
title Leveraging Protocol Knowledge in Slack Matching
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-26T19%3A12%3A31IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_CHZPO&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Leveraging%20Protocol%20Knowledge%20in%20Slack%20Matching&rft.btitle=2006%20IEEE/ACM%20International%20Conference%20on%20Computer%20Aided%20Design&rft.au=Venkataramani,%20G.&rft.date=2006-11&rft.spage=724&rft.epage=729&rft.pages=724-729&rft.issn=1092-3152&rft.eissn=1558-2434&rft_id=info:doi/10.1109/ICCAD.2006.320019&rft.eisbn=1595933891&rft.eisbn_list=9781595933898&rft_dat=%3Cieee_CHZPO%3E4110258%3C/ieee_CHZPO%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-i90t-3d4261539af4ea9326b3a6009800a64de0e80b973e61c1130251576bfae6b16e3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=4110258&rfr_iscdi=true