Acceleration of Control Intensive Applications on Coarse-Grained Reconfigurable Arrays for Embedded Systems

Bibliographic Details
Published in: IEEE transactions on computers 2023-09, Vol.72 (9), p.2548-2560
Main Authors: Denkinger, Benoit Walter; Peon-Quiros, Miguel; Konijnenburg, Mario; Atienza, David; Catthoor, Francky
Format: Article
Language: English
cited_by cdi_FETCH-LOGICAL-c331t-5e9d04a31107db007ce04daeb6aedc46709a2eb01aef54d7884aa86f7ad3d9113
cites cdi_FETCH-LOGICAL-c331t-5e9d04a31107db007ce04daeb6aedc46709a2eb01aef54d7884aa86f7ad3d9113
container_end_page 2560
container_issue 9
container_start_page 2548
container_title IEEE transactions on computers
container_volume 72
creator Denkinger, Benoit Walter
Peon-Quiros, Miguel
Konijnenburg, Mario
Atienza, David
Catthoor, Francky
description Embedded systems confront two opposing goals: low-power operation and high performance. The current trend to reach these goals is toward heterogeneous platforms, including multi-core architectures with heterogeneous cores and hardware accelerators. The latter can be divided into custom accelerators (e.g., ASICs) and programmable domain-specific cores (e.g., DSIPs). VWR2A (Denkinger et al., 2022) is a programmable architecture that integrates high computational density and low-power memory structures. The flexibility of VWR2A allows a large portion of applications to be covered, resulting in better performance and energy efficiency than ASICs and general-purpose processors. However, while this has been well studied for data-intensive kernels, this is not the case for control-intensive kernels, that is, code with complex if-else and nested loop structures. Traditionally, control-intensive code is left to be executed by the host processor. This situation unnecessarily restricts the potential impact of energy-efficient acceleration, especially at the application level. In this paper, we evaluate the performance and energy consumption of VWR2A for control-intensive code and compare it with an ARM Cortex-M4 processor and a RISC-V Ibex processor. The performance and energy consumption are evaluated at the kernel and application levels. Our results confirm that VWR2A is faster and more energy-efficient than the two considered general-purpose processors also for control-intensive code.
doi_str_mv 10.1109/TC.2023.3257504
format article
fullrecord <record><control><sourceid>proquest_ieee_</sourceid><recordid>TN_cdi_proquest_journals_2847963148</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>10078324</ieee_id><sourcerecordid>2847963148</sourcerecordid><originalsourceid>FETCH-LOGICAL-c331t-5e9d04a31107db007ce04daeb6aedc46709a2eb01aef54d7884aa86f7ad3d9113</originalsourceid><addsrcrecordid>eNpN0DFPwzAQBWALgUQpzCwMlpjTnmMnjscqKqUSEhKU2XLsC0pJ42CnSP33pLQD0y3fu9M9Qu4ZzBgDNd-UsxRSPuNpJjMQF2TCskwmSmX5JZkAsCJRXMA1uYlxCwB5CmpCvhbWYovBDI3vqK9p6bsh-JauuwG72PwgXfR929g_EOmISm9CxGQVTNOho29ofVc3n_tgqnbUIZhDpLUPdLmr0LmRvB_igLt4S65q00a8O88p-Xhabsrn5OV1tS4XL4nlnA1JhsqBMHz8SroKQFoE4QxWuUFnRS5BmRQrYAbrTDhZFMKYIq-lcdwpxviUPJ729sF_7zEOeuv3oRtP6rQQUuWciWJU85OywccYsNZ9aHYmHDQDfWxUb0p9bFSfGx0TD6dEg4j_NMiCp4L_AiOGc0U</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2847963148</pqid></control><display><type>article</type><title>Acceleration of Control Intensive Applications on Coarse-Grained Reconfigurable Arrays for Embedded Systems</title><source>IEEE Xplore (Online service)</source><creator>Denkinger, Benoit Walter ; Peon-Quiros, Miguel ; Konijnenburg, Mario ; Atienza, David ; Catthoor, Francky</creator><creatorcontrib>Denkinger, Benoit Walter ; Peon-Quiros, Miguel ; Konijnenburg, Mario ; Atienza, David ; Catthoor, Francky</creatorcontrib><description>Embedded systems confront two opposite goals: low-power operation and high performance. The current trend to reach these goals is toward heterogeneous platforms, including multi-core architectures with heterogeneous cores and hardware accelerators. The latter can be divided into custom accelerators (e.g., ASICs) and programmable domain-specific cores (e.g., DSIPs). VWR2A Denkinger et al. 2022 is a programmable architecture that integrates high computational density and low power memory structures. 
The flexibility of VWR2A allows a large portion of applications to be covered, resulting in better performance and energy efficiency than ASICs and general-purpose processors. However, while this has been well studied for data-intensive kernels, this is not the case for control-intensive kernels -code with complex if-else and nested loop structures. Traditionally, control-intensive code is left to be executed by the host processor. This situation unnecessarily restricts the potential impact of energy-efficient acceleration, especially at the application level. In this paper, we evaluate the performance and energy consumption of VWR2A for control-intensive code and compare it with an ARM Cortex-M4 processor and a RISC-V Ibex processor. The performance and energy consumption are evaluated at the kernel and application levels. Our results confirm that VWR2A is faster and more energy-efficient than the two considered general-purpose processors also for control-intensive code.</description><identifier>ISSN: 0018-9340</identifier><identifier>EISSN: 1557-9956</identifier><identifier>DOI: 10.1109/TC.2023.3257504</identifier><identifier>CODEN: ITCOB4</identifier><language>eng</language><publisher>New York: IEEE</publisher><subject>Accelerators ; CGRA ; Codes ; Computer architecture ; Embedded systems ; Energy consumption ; Energy efficiency ; Hardware acceleration ; Kernel ; Kernels ; low-power ; Microprocessors ; Nested loops ; Performance evaluation ; Power management ; Program processors ; programmable cores ; reconfigurable architecture ; RISC ; Task analysis</subject><ispartof>IEEE transactions on computers, 2023-09, Vol.72 (9), p.2548-2560</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. 
(IEEE) 2023</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c331t-5e9d04a31107db007ce04daeb6aedc46709a2eb01aef54d7884aa86f7ad3d9113</citedby><cites>FETCH-LOGICAL-c331t-5e9d04a31107db007ce04daeb6aedc46709a2eb01aef54d7884aa86f7ad3d9113</cites><orcidid>0000-0002-1959-2013 ; 0000-0001-9536-4947 ; 0000-0002-5760-090X ; 0000-0001-8016-0888 ; 0000-0002-3599-8515</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/10078324$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,780,784,27924,27925,54796</link.rule.ids></links><search><creatorcontrib>Denkinger, Benoit Walter</creatorcontrib><creatorcontrib>Peon-Quiros, Miguel</creatorcontrib><creatorcontrib>Konijnenburg, Mario</creatorcontrib><creatorcontrib>Atienza, David</creatorcontrib><creatorcontrib>Catthoor, Francky</creatorcontrib><title>Acceleration of Control Intensive Applications on Coarse-Grained Reconfigurable Arrays for Embedded Systems</title><title>IEEE transactions on computers</title><addtitle>TC</addtitle><description>Embedded systems confront two opposite goals: low-power operation and high performance. The current trend to reach these goals is toward heterogeneous platforms, including multi-core architectures with heterogeneous cores and hardware accelerators. The latter can be divided into custom accelerators (e.g., ASICs) and programmable domain-specific cores (e.g., DSIPs). VWR2A Denkinger et al. 2022 is a programmable architecture that integrates high computational density and low power memory structures. The flexibility of VWR2A allows a large portion of applications to be covered, resulting in better performance and energy efficiency than ASICs and general-purpose processors. 
However, while this has been well studied for data-intensive kernels, this is not the case for control-intensive kernels -code with complex if-else and nested loop structures. Traditionally, control-intensive code is left to be executed by the host processor. This situation unnecessarily restricts the potential impact of energy-efficient acceleration, especially at the application level. In this paper, we evaluate the performance and energy consumption of VWR2A for control-intensive code and compare it with an ARM Cortex-M4 processor and a RISC-V Ibex processor. The performance and energy consumption are evaluated at the kernel and application levels. Our results confirm that VWR2A is faster and more energy-efficient than the two considered general-purpose processors also for control-intensive code.</description><subject>Accelerators</subject><subject>CGRA</subject><subject>Codes</subject><subject>Computer architecture</subject><subject>Embedded systems</subject><subject>Energy consumption</subject><subject>Energy efficiency</subject><subject>Hardware acceleration</subject><subject>Kernel</subject><subject>Kernels</subject><subject>low-power</subject><subject>Microprocessors</subject><subject>Nested loops</subject><subject>Performance evaluation</subject><subject>Power management</subject><subject>Program processors</subject><subject>programmable cores</subject><subject>reconfigurable architecture</subject><subject>RISC</subject><subject>Task 
analysis</subject><issn>0018-9340</issn><issn>1557-9956</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><recordid>eNpN0DFPwzAQBWALgUQpzCwMlpjTnmMnjscqKqUSEhKU2XLsC0pJ42CnSP33pLQD0y3fu9M9Qu4ZzBgDNd-UsxRSPuNpJjMQF2TCskwmSmX5JZkAsCJRXMA1uYlxCwB5CmpCvhbWYovBDI3vqK9p6bsh-JauuwG72PwgXfR929g_EOmISm9CxGQVTNOho29ofVc3n_tgqnbUIZhDpLUPdLmr0LmRvB_igLt4S65q00a8O88p-Xhabsrn5OV1tS4XL4nlnA1JhsqBMHz8SroKQFoE4QxWuUFnRS5BmRQrYAbrTDhZFMKYIq-lcdwpxviUPJ729sF_7zEOeuv3oRtP6rQQUuWciWJU85OywccYsNZ9aHYmHDQDfWxUb0p9bFSfGx0TD6dEg4j_NMiCp4L_AiOGc0U</recordid><startdate>20230901</startdate><enddate>20230901</enddate><creator>Denkinger, Benoit Walter</creator><creator>Peon-Quiros, Miguel</creator><creator>Konijnenburg, Mario</creator><creator>Atienza, David</creator><creator>Catthoor, Francky</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0002-1959-2013</orcidid><orcidid>https://orcid.org/0000-0001-9536-4947</orcidid><orcidid>https://orcid.org/0000-0002-5760-090X</orcidid><orcidid>https://orcid.org/0000-0001-8016-0888</orcidid><orcidid>https://orcid.org/0000-0002-3599-8515</orcidid></search><sort><creationdate>20230901</creationdate><title>Acceleration of Control Intensive Applications on Coarse-Grained Reconfigurable Arrays for Embedded Systems</title><author>Denkinger, Benoit Walter ; Peon-Quiros, Miguel ; Konijnenburg, Mario ; Atienza, David ; Catthoor, 
Francky</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c331t-5e9d04a31107db007ce04daeb6aedc46709a2eb01aef54d7884aa86f7ad3d9113</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Accelerators</topic><topic>CGRA</topic><topic>Codes</topic><topic>Computer architecture</topic><topic>Embedded systems</topic><topic>Energy consumption</topic><topic>Energy efficiency</topic><topic>Hardware acceleration</topic><topic>Kernel</topic><topic>Kernels</topic><topic>low-power</topic><topic>Microprocessors</topic><topic>Nested loops</topic><topic>Performance evaluation</topic><topic>Power management</topic><topic>Program processors</topic><topic>programmable cores</topic><topic>reconfigurable architecture</topic><topic>RISC</topic><topic>Task analysis</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Denkinger, Benoit Walter</creatorcontrib><creatorcontrib>Peon-Quiros, Miguel</creatorcontrib><creatorcontrib>Konijnenburg, Mario</creatorcontrib><creatorcontrib>Atienza, David</creatorcontrib><creatorcontrib>Catthoor, Francky</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Xplore</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>IEEE transactions on computers</jtitle></facets><delivery><delcategory>Remote Search 
Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Denkinger, Benoit Walter</au><au>Peon-Quiros, Miguel</au><au>Konijnenburg, Mario</au><au>Atienza, David</au><au>Catthoor, Francky</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Acceleration of Control Intensive Applications on Coarse-Grained Reconfigurable Arrays for Embedded Systems</atitle><jtitle>IEEE transactions on computers</jtitle><stitle>TC</stitle><date>2023-09-01</date><risdate>2023</risdate><volume>72</volume><issue>9</issue><spage>2548</spage><epage>2560</epage><pages>2548-2560</pages><issn>0018-9340</issn><eissn>1557-9956</eissn><coden>ITCOB4</coden><abstract>Embedded systems confront two opposite goals: low-power operation and high performance. The current trend to reach these goals is toward heterogeneous platforms, including multi-core architectures with heterogeneous cores and hardware accelerators. The latter can be divided into custom accelerators (e.g., ASICs) and programmable domain-specific cores (e.g., DSIPs). VWR2A Denkinger et al. 2022 is a programmable architecture that integrates high computational density and low power memory structures. The flexibility of VWR2A allows a large portion of applications to be covered, resulting in better performance and energy efficiency than ASICs and general-purpose processors. However, while this has been well studied for data-intensive kernels, this is not the case for control-intensive kernels -code with complex if-else and nested loop structures. Traditionally, control-intensive code is left to be executed by the host processor. This situation unnecessarily restricts the potential impact of energy-efficient acceleration, especially at the application level. In this paper, we evaluate the performance and energy consumption of VWR2A for control-intensive code and compare it with an ARM Cortex-M4 processor and a RISC-V Ibex processor. 
The performance and energy consumption are evaluated at the kernel and application levels. Our results confirm that VWR2A is faster and more energy-efficient than the two considered general-purpose processors also for control-intensive code.</abstract><cop>New York</cop><pub>IEEE</pub><doi>10.1109/TC.2023.3257504</doi><tpages>13</tpages><orcidid>https://orcid.org/0000-0002-1959-2013</orcidid><orcidid>https://orcid.org/0000-0001-9536-4947</orcidid><orcidid>https://orcid.org/0000-0002-5760-090X</orcidid><orcidid>https://orcid.org/0000-0001-8016-0888</orcidid><orcidid>https://orcid.org/0000-0002-3599-8515</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0018-9340
ispartof IEEE transactions on computers, 2023-09, Vol.72 (9), p.2548-2560
issn 0018-9340
1557-9956
language eng
recordid cdi_proquest_journals_2847963148
source IEEE Xplore (Online service)
subjects Accelerators
CGRA
Codes
Computer architecture
Embedded systems
Energy consumption
Energy efficiency
Hardware acceleration
Kernel
Kernels
low-power
Microprocessors
Nested loops
Performance evaluation
Power management
Program processors
programmable cores
reconfigurable architecture
RISC
Task analysis
title Acceleration of Control Intensive Applications on Coarse-Grained Reconfigurable Arrays for Embedded Systems
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-26T11%3A11%3A05IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_ieee_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Acceleration%20of%20Control%20Intensive%20Applications%20on%20Coarse-Grained%20Reconfigurable%20Arrays%20for%20Embedded%20Systems&rft.jtitle=IEEE%20transactions%20on%20computers&rft.au=Denkinger,%20Benoit%20Walter&rft.date=2023-09-01&rft.volume=72&rft.issue=9&rft.spage=2548&rft.epage=2560&rft.pages=2548-2560&rft.issn=0018-9340&rft.eissn=1557-9956&rft.coden=ITCOB4&rft_id=info:doi/10.1109/TC.2023.3257504&rft_dat=%3Cproquest_ieee_%3E2847963148%3C/proquest_ieee_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c331t-5e9d04a31107db007ce04daeb6aedc46709a2eb01aef54d7884aa86f7ad3d9113%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2847963148&rft_id=info:pmid/&rft_ieee_id=10078324&rfr_iscdi=true