
LEC-MiCs: Low-Energy Checkpointing in Mixed-Criticality Multi-Core Systems


Bibliographic Details
Published in: ACM Transactions on Cyber-Physical Systems, 2024-03
Main Authors: Safari, Sepideh, Shokri, Shayan, Hessabi, Shaahin, Lotfi-Kamran, Pejman
Format: Article
Language: English
container_title ACM transactions on cyber-physical systems
creator Safari, Sepideh
Shokri, Shayan
Hessabi, Shaahin
Lotfi-Kamran, Pejman
description With the advent of multicore platforms in the design of Mixed-Criticality Systems (MCSs), simultaneously managing reliability and energy while guaranteeing an acceptable service level for low-criticality tasks is a crucial challenge. To ensure the reliability of MCSs against transient faults, fault-tolerance techniques are employed, which increase energy consumption. Dynamic Voltage and Frequency Scaling (DVFS) can be used to mitigate this energy overhead, but it may cause high-criticality tasks to violate their timing constraints. This paper therefore presents, for the first time, a low-energy checkpointing technique that guarantees the reliability of multiple preemptive periodic mixed-criticality tasks on a multicore platform. Previous work on checkpointing assumes a fixed number of faults that every task in the system must tolerate. In this paper, by contrast, the number of tolerable faults is determined for each execution section of a task and at each voltage and frequency level, through proposed formulas, so as to meet the reliability target set by safety standards. The proposed method then determines the number of checkpoints and their non-uniform intervals for the normal and overrun sections of each task to reduce energy consumption. Moreover, a unified demand bound function (DBF) analysis is proposed for the schedulability of the task set, under which each high-criticality task meets its timing and reliability constraints and low-criticality tasks execute according to their derived guaranteed periods in each operational mode of the system. Experimental results show that the proposed scheme meets the timing and reliability constraints while improving the QoS of low-criticality tasks and managing energy consumption by an average of 29.49% and 32.78%, respectively.
doi_str_mv 10.1145/3653720
format article
addtitle ACM TCPS
creationdate 2024-03-26
eissn 2378-9638
oa free_for_read
orcidid 0000-0003-3193-2567 ; 0000-0003-3293-8274 ; 0000-0003-4645-8255 ; 0000-0002-1578-8668
publisher New York, NY: ACM
rights Copyright held by the owner/author(s).
toplevel peer_reviewed ; online_resources
identifier ISSN: 2378-962X
ispartof ACM transactions on cyber-physical systems, 2024-03
issn 2378-962X
2378-9638
language eng
recordid cdi_crossref_primary_10_1145_3653720
source Association for Computing Machinery:Jisc Collections:ACM OPEN Journals 2023-2025 (reading list)
subjects Computer systems organization
Computer systems organization / Dependable and fault-tolerant systems and networks
General and reference
General and reference / Cross-computing tools and techniques
Hardware
Hardware / Hardware test
Hardware / Robustness
Software and its engineering
Software and its engineering / Software organization and properties
Software and its engineering / Software organization and properties / Extra-functional properties
Software and its engineering / Software organization and properties / Extra-functional properties / Software fault tolerance
Software and its engineering / Software organization and properties / Extra-functional properties / Software fault tolerance / Checkpoint / restart
title LEC-MiCs: Low-Energy Checkpointing in Mixed-Criticality Multi-Core Systems
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-27T01%3A59%3A34IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-acm_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=LEC-MiCs:%20Low-Energy%20Checkpointing%20in%20Mixed-Criticality%20Multi-Core%20Systems&rft.jtitle=ACM%20transactions%20on%20cyber-physical%20systems&rft.au=Safari,%20Sepideh&rft.date=2024-03-26&rft.issn=2378-962X&rft.eissn=2378-9638&rft_id=info:doi/10.1145/3653720&rft_dat=%3Cacm_cross%3E3653720%3C/acm_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-a840-3dcfced8ec6136536cf8a9226fae1453ae204be61a362d607c60545216142c223%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true