Precise Cache Profiling for Studying Radiation Effects

James Marshall; Robert Gifford; Gedare Bloom; Gabriel Parmer; Rahul Simha

doi:10.1145/3442339

Precise Cache Profiling for Studying Radiation Effects

ACM Transactions on Embedded Computing Systems ◽

10.1145/3442339 ◽

2021 ◽

Vol 20 (3) ◽

pp. 1-25

Author(s):

James Marshall ◽

Robert Gifford ◽

Gedare Bloom ◽

Gabriel Parmer ◽

Rahul Simha

Keyword(s):

Radiation Effects ◽

Fault Injection ◽

Error Correcting Codes ◽

Direct Access ◽

Transient Faults ◽

Large Area ◽

Common Multiple ◽

Single Event Upsets ◽

On Chip ◽

Future Work

Increased access to space has led to an increase in the usage of commodity processors in radiation environments. These processors are vulnerable to transient faults such as single event upsets that may cause bit-flips in processor components. Caches in particular are vulnerable due to their relatively large area, yet are often omitted from fault injection testing because many processors do not provide direct access to cache contents and they are often not fully modeled by simulators. The performance benefits of caches make disabling them undesirable, and the presence of error correcting codes is insufficient to correct for increasingly common multiple bit upsets. This work explores building a program’s cache profile by collecting cache usage information at an instruction granularity via commonly available on-chip debugging interfaces. The profile provides a tighter bound than cache utilization for cache vulnerability estimates (50% for several benchmarks). This can be applied to reduce the number of fault injections required to characterize behavior by at least two-thirds for the benchmarks we examine. The profile enables future work in hardware fault injection for caches that avoids the biases of existing techniques.

Download Full-text

Tracing Fault Effects in FPGA Systems

International Journal of Electronics and Telecommunications ◽

10.2478/eletel-2014-0012 ◽

2014 ◽

Vol 60 (1) ◽

pp. 92-97 ◽

Cited By ~ 1

Author(s):

Mariusz Węgrzyn ◽

Janusz Sosnowski

Keyword(s):

Detailed Analysis ◽

Fault Injection ◽

Original Method ◽

Injection Technique ◽

Transient Faults ◽

Single Event ◽

Single Event Upsets ◽

Fault Handling

Abstract The paper presents the extent of fault effects in FPGA based systems and concentrates on transient faults (induced by single event upsets - SEUs) within the configuration memory of FPGA. An original method of detailed analysis of fault effect propagation is presented. It is targeted at microprocessor based FPGA systems using the developed fault injection technique. The fault injection is performed at HDL description level of the microprocessor using special simulators and developed supplementary programs. The proposed methodology is illustrated for soft PicoBlaze microprocessor running 3 programs. The presented results reveal some problems with fault handling at the software level.

Download Full-text

Fault Injection in Modern Microprocessors Using On-Chip Debugging Infrastructures

IEEE Transactions on Dependable and Secure Computing ◽

10.1109/tdsc.2010.50 ◽

2011 ◽

Vol 8 (2) ◽

pp. 308-314 ◽

Cited By ~ 20

Author(s):

Marta Portela-Garcia ◽

Celia Lopez-Ongil ◽

Mario Garcia Valderas ◽

Luis Entrena

Keyword(s):

Fault Injection ◽

On Chip

Download Full-text

Test mode method and strategy for RF-based fault injection analysis for on-chip relaxation oscillators under EMC standard tests or RFI susceptibility characterization

2010 11th Latin American Test Workshop ◽

10.1109/latw.2010.5550382 ◽

2010 ◽

Cited By ~ 1

Author(s):

A. Olmos ◽

A. Vilas Boas ◽

E. R. da Silva ◽

J. C. Silva ◽

R. Maltione

Keyword(s):

Fault Injection ◽

Test Mode ◽

Relaxation Oscillators ◽

Mode Method ◽

On Chip ◽

Standard Tests

Download Full-text

An on-chip glitchy-clock generator for testing fault injection attacks

Journal of Cryptographic Engineering ◽

10.1007/s13389-011-0022-y ◽

2011 ◽

Vol 1 (4) ◽

pp. 265-270 ◽

Cited By ~ 35

Author(s):

Sho Endo ◽

Takeshi Sugawara ◽

Naofumi Homma ◽

Takafumi Aoki ◽

Akashi Satoh

Keyword(s):

Fault Injection ◽

Clock Generator ◽

Injection Attacks ◽

On Chip ◽

Fault Injection Attacks

Download Full-text

Reducing Design Margins by Adaptive Compensation for Thermal and Aging Variations

Sustainable ICTs and Management Systems for Green Computing ◽

10.4018/978-1-4666-1839-8.ch009 ◽

2012 ◽

pp. 201-230

Author(s):

Zhenyu Qi ◽

Yan Zhang ◽

Mircea Stan

Keyword(s):

Design Parameters ◽

Worst Case Analysis ◽

Transimpedance Amplifier ◽

Large Area ◽

Negative Bias Temperature Instability ◽

Worst Case ◽

Adaptive Computation ◽

Bias Temperature Instability ◽

Computation Efficiency ◽

On Chip

Corner-based design and verification are based on worst-case analysis, thus introducing over-pessimism and large area and power overhead and leading to unnecessary energy consumption. Typical case-based design and verification maximize energy efficiency through design margins reduction and adaptive computation, thus helping achieve sustainable computing. Dynamically adapting to manufacturing, environmental, and usage variations is the key to shaving unnecessary design margins, which requires on-chip modules that can sense and configure design parameters both globally and locally to maximize computation efficiency, and maintain this efficiency over the lifetime of the system. This chapter presents an adaptive threshold compensation scheme using a transimpedance amplifier and adaptive body biasing to overcome the effects of temperature variation, reliability degradation, and process variation. The effectiveness and versatility of the scheme are demonstrated with two example applications, one as a temperature aware design to maintain IONto IOFFcurrent ratio, the other as a reliability sensor for NBTI (Negative Bias Temperature Instability).

Download Full-text

Advanced Technologies for Transient Faults Detection and Compensation

Design and Test Technology for Dependable Systems-on-Chip - Advances in Computer and Electrical Engineering ◽

10.4018/978-1-60960-212-3.ch006 ◽

2011 ◽

pp. 132-154

Author(s):

Matteo Sonza Reorda ◽

Luca Sterpone ◽

Massimo Violante

Keyword(s):

Market Failure ◽

Propagation Mechanism ◽

Transient Faults ◽

Niche Markets ◽

Time Redundancy ◽

Ip Cores ◽

On Chip ◽

Manufacturing Technologies ◽

Processor Cores ◽

Nuclear Applications

Transient faults became an increasing issue in the past few years as smaller geometries of newer, highly miniaturized, silicon manufacturing technologies brought to the mass-market failure mechanisms traditionally bound to niche markets as electronic equipments for avionic, space or nuclear applications. This chapter presents the origin of transient faults, it discusses the propagation mechanism, it outlines models devised to represent them and finally it discusses the state-of-the-art design techniques that can be used to detect and correct transient faults. The concepts of hardware, data and time redundancy are presented, and their implementations to cope with transient faults affecting storage elements, combinational logic and IP-cores (e.g., processor cores) typically found in a System-on-Chip are discussed.

Download Full-text

Proposal of an Adaptive Fault Tolerance Mechanism to Tolerate Intermittent Faults in RAM

Electronics ◽

10.3390/electronics9122074 ◽

2020 ◽

Vol 9 (12) ◽

pp. 2074

Author(s):

J.-Carlos Baraza-Calvo ◽

Joaquín Gracia-Morán ◽

Luis-J. Saiz-Adalid ◽

Daniel Gil-Tomás ◽

Pedro-J. Gil-Vicente

Keyword(s):

Fault Tolerance ◽

Error Correction ◽

Error Detection ◽

Fault Injection ◽

Error Correction Codes ◽

Transient Faults ◽

Tolerance Mechanism ◽

Intermittent Faults ◽

Risc Processor ◽

Simulation Based

Due to transistor shrinking, intermittent faults are a major concern in current digital systems. This work presents an adaptive fault tolerance mechanism based on error correction codes (ECC), able to modify its behavior when the error conditions change without increasing the redundancy. As a case example, we have designed a mechanism that can detect intermittent faults and swap from an initial generic ECC to a specific ECC capable of tolerating one intermittent fault. We have inserted the mechanism in the memory system of a 32-bit RISC processor and validated it by using VHDL simulation-based fault injection. We have used two (39, 32) codes: a single error correction–double error detection (SEC–DED) and a code developed by our research group, called EPB3932, capable of correcting single errors and double and triple adjacent errors that include a bit previously tagged as error-prone. The results of injecting transient, intermittent, and combinations of intermittent and transient faults show that the proposed mechanism works properly. As an example, the percentage of failures and latent errors is 0% when injecting a triple adjacent fault after an intermittent stuck-at fault. We have synthesized the adaptive fault tolerance mechanism proposed in two types of FPGAs: non-reconfigurable and partially reconfigurable. In both cases, the overhead introduced is affordable in terms of hardware, time and power consumption.

Download Full-text

Rainbow on a Chip: Experimental Observation of the Trapped Rainbow Effect Using Tapered Hollow Bragg Waveguides

Eureka ◽

10.29173/eureka22828 ◽

2014 ◽

Vol 4 (1) ◽

pp. 35-39 ◽

Cited By ~ 1

Author(s):

Aaron Melnyk

Keyword(s):

Theoretical Analysis ◽

Experimental Observation ◽

Entire Length ◽

Lab On Chip ◽

Out Of Plane ◽

Frequency Components ◽

Input Spectrum ◽

Bragg Waveguides ◽

On Chip ◽

Future Work

Experimental observation of the ‘trapped rainbow’ in the visible is demonstrated using tapered hollow Bragg waveguides. These waveguides spatially disperse an input spectrum into its various frequency components and vertical out of plane radiation was observed at wavelength dependant positions along the entire length of the waveguide. The experimental observation is corroborated by a brief theoretical analysis and simulation. These devices form the foundation for future work involving integration into a micro-spectrometer for eventual lab-on-chip use.

Download Full-text

A Single Error Correcting Code with One-Step Group Partitioned Decoding Based on Shared Majority-Vote

Electronics ◽

10.3390/electronics9050709 ◽

2020 ◽

Vol 9 (5) ◽

pp. 709

Author(s):

Abhishek Das ◽

Nur A. Touba

Keyword(s):

Optimization Technique ◽

Error Correcting Code ◽

Latin Square ◽

Error Correcting Codes ◽

Trade Off ◽

Cache Memories ◽

Hamming Codes ◽

Single Error ◽

Memory Overhead ◽

On Chip

Technology scaling has led to an increase in density and capacity of on-chip caches. This has enabled higher throughput by enabling more low latency memory transfers. With the reduction in size of SRAMs and development of emerging technologies, e.g., STT-MRAM, for on-chip cache memories, reliability of such memories becomes a major concern. Traditional error correcting codes, e.g., Hamming codes and orthogonal Latin square codes, either suffer from high decoding latency, which leads to lower overall throughput, or high memory overhead. In this paper, a new single error correcting code based on a shared majority voting logic is presented. The proposed codes trade off decoding latency in order to improve the memory overhead posed by orthogonal Latin square codes. A latency optimization technique is also proposed which lowers the decoding latency by incurring a slight memory overhead. It is shown that the proposed codes achieve better redundancy compared to orthogonal Latin square codes. The proposed codes are also shown to achieve lower decoding latency compared to Hamming codes. Thus, the proposed codes achieve a balanced trade-off between memory overhead and decoding latency, which makes them highly suitable for on-chip cache memories which have stringent throughput and memory overhead constraints.

Download Full-text

Fault injection via on-chip debugging in the internal memory of systems-on-chip processor

IOP Conference Series Materials Science and Engineering ◽

10.1088/1757-899x/94/1/012020 ◽

2015 ◽

Vol 94 ◽

pp. 012020 ◽

Cited By ~ 3

Author(s):

S A Chekmarev ◽

V Kh Khanov

Keyword(s):

Fault Injection ◽

Internal Memory ◽

Systems On Chip ◽

On Chip

Download Full-text