Characterizing System-Level Vulnerability for Instruction Caches against Soft Errors

Author(s):  
Shuai Wang
Electronics ◽  
2021 ◽  
Vol 10 (18) ◽  
pp. 2286
Author(s):  
Yohan Ko

From early design phases to final release, the reliability of modern embedded systems against soft errors should be carefully considered. Several schemes have been proposed to protect embedded systems against soft errors, but they are neither always functional nor robust, even with expensive overhead in terms of hardware area, performance, and power consumption. Thus, system designers need to estimate reliability quantitatively to apply appropriate protection techniques for resource-constrained embedded systems. Vulnerability modeling based on lifetime analysis is one of the most efficient ways to quantify system reliability against soft errors. However, lifetime analysis can be inaccurate, mainly because it fails to comprehensively capture several system-level masking effects. This study analyzes and characterizes microarchitecture-level and software-level masking effects by developing an automated framework with exhaustive fault injections (i.e., soft errors) based on a cycle-accurate gem5 simulator. We injected faults into a register file because errors in the register file can easily be propagated to other components in a processor. We found that only 5% of injected faults can cause system failures on an average over benchmarks, mainly from the MiBench suite. Further analyses showed that 71% of soft errors are overwritten by write operations before being used, and the CPU does not use 20% of soft errors at all after fault injections. The remainder are also masked by several software-level masking effects, such as dynamically dead instructions, compare and logical instructions that do not change the result, and incorrect control flows that do not affect program outputs.


2014 ◽  
Vol 2014 ◽  
pp. 1-8 ◽  
Author(s):  
Seyyed Amir Asghari ◽  
Okyay Kaynak ◽  
Hassan Taheri

Electronic equipment operating in harsh environments such as space is subjected to a range of threats. The most important of these is radiation that gives rise to permanent and transient errors on microelectronic components. The occurrence rate of transient errors is significantly more than permanent errors. The transient errors, or soft errors, emerge in two formats: control flow errors (CFEs) and data errors. Valuable research results have already appeared in literature at hardware and software levels for their alleviation. However, there is the basic assumption behind these works that the operating system is reliable and the focus is on other system levels. In this paper, we investigate the effects of soft errors on the operating system components and compare their vulnerability with that of application level components. Results show that soft errors in operating system components affect both operating system and application level components. Therefore, by providing endurance to operating system level components against soft errors, both operating system and application level components gain tolerance.


2008 ◽  
Vol 52 (3) ◽  
pp. 293-306 ◽  
Author(s):  
J. A. Rivers ◽  
P. Bose ◽  
P. Kudva ◽  
J.-D. Wellman ◽  
P. N. Sanda ◽  
...  
Keyword(s):  

Author(s):  
Hyungmin Cho ◽  
Eric Cheng ◽  
Thomas Shepherd ◽  
Chen-Yong Cher ◽  
Subhasish Mitra
Keyword(s):  

1998 ◽  
Author(s):  
Martin P. Charns ◽  
Victoria A. Parker ◽  
William H. Wubbenhorst
Keyword(s):  

2018 ◽  
Vol 4 (3) ◽  
pp. 228-244 ◽  
Author(s):  
Ivan J. Raymond ◽  
Matthew Iasiello ◽  
Aaron Jarden ◽  
David Michael Kelly
Keyword(s):  

2007 ◽  
Vol 51 (1-2) ◽  
pp. 43
Author(s):  
Balázs Polgár ◽  
Endre Selényi
Keyword(s):  

Sign in / Sign up

Export Citation Format

Share Document