An efficient structure to improve the reliability of deep neural networks on ARMs


Bibliographic Details
Published in: Microelectronics and reliability, 2022-09, Vol. 136, p. 114729, Article 114729
Main Authors: Liu, Zhi; Yang, Xinni
Format: Article
Language: English
Description
Summary: As Deep Neural Networks (DNNs) become more pervasive in safety-critical embedded systems, improving their soft error resilience will grow increasingly important. This paper proposes a Distribution-based Error Detector (DED) to improve the reliability of DNNs. We compare the proposed approach with the regularization method and the typical Symptom-based Error Detector (SED). In terms of bit error resilience, DED provides the highest fault coverage. Our results show that the Silent Data Corruption (SDC) rates of DNNs with DED are less than 0.02 even if error bit rates are up to 1. Further, regarding Architecture Vulnerability Factor (AVF) results, we observe that the regularization method and the SED cannot improve the error resilience of register files for quantized DNNs. In contrast, DED can reduce the SDC AVF by an order of magnitude. In addition, DED can increase Mean Work To Failure (MWTF) by more than 19×, while the regularization method and the SED increase MWTF by less than 2×.
• Propose a Distribution-based Error Detector (DED) to improve the reliability of Deep Neural Networks (DNNs).
• Observe that regularization methods cannot improve the error resilience of quantized DNNs.
• DED can reduce SDC rates to less than 0.02 even if error bit rates are up to 1 and increase Mean Work To Failure (MWTF) by 19×.
ISSN: 0026-2714, 1872-941X
DOI: 10.1016/j.microrel.2022.114729