A Case Study of Designing Efficient Algorithm-based Fault Tolerant Application for Exascale Parallelism

Author(s):  
Erlin Yao ◽  
Rui Wang ◽  
Mingyu Chen ◽  
Guangming Tan ◽  
Ninghui Sun
2021 ◽  
Vol 20 (5s) ◽  
pp. 1-22
Author(s):  
Haoran Li ◽  
Chenyang Lu ◽  
Christopher D. Gill

Fault-tolerant coordination services have been widely used in distributed applications in cloud environments. Recent years have witnessed the emergence of time-sensitive applications deployed in edge computing environments, which introduces both challenges and opportunities for coordination services. On one hand, coordination services must recover from failures in a timely manner. On the other hand, edge computing employs local networked platforms that can be exploited to achieve timely recovery. In this work, we first identify the limitations of the leader election and recovery protocols underlying Apache ZooKeeper, the prevailing open-source coordination service. To reduce recovery latency from leader failures, we then design RT-Zookeeper with a set of novel features including a fast-convergence election protocol, a quorum channel notification mechanism, and a distributed epoch persistence protocol. We have implemented RT-Zookeeper based on ZooKeeper version 3.5.8. Empirical evaluation shows that RT-ZooKeeper achieves 91% reduction in maximum recovery latency in comparison to ZooKeeper. Furthermore, a case study demonstrates that fast failure recovery in RT-ZooKeeper can benefit a common messaging service like Kafka in terms of message latency.


Author(s):  
P Quiñones-Reyes ◽  
H Benítez-Pérez ◽  
E Mendez-Monroy ◽  
F Cárdenas-Flores ◽  
F García-Nocetti

Author(s):  
Nidhal Mahmud

The use of robotics systems is increasingly widespread and spans a variety of application areas. From manufacturing, to surgeries, to chemical, these systems can be required to perform difficult, dangerous and critical tasks. The nature of such tasks places high demands on the dependability of robotics systems. Fault tree analysis is among the most often used dependability assessment techniques in various domains of robotics. However, there is still a lack of adjustment methods that can efficiently cope with the sequential dependencies among the components of such systems. In this paper, the authors first introduce some relevant techniques to analyze the dependability of robotics systems. Thereafter, an experience from research projects such as MAENAD (European automotive project investigating development of dependable Fully Electric Vehicles) is presented; emphasis is put on a novel approach to synthesizing fault trees from the components and that is suitable for modern high-technology robotics. Finally, the benefits of the approach are highlighted by using a fault-tolerant case study.


Energies ◽  
2020 ◽  
Vol 13 (1) ◽  
pp. 239
Author(s):  
Piotr Hadaj ◽  
Dominik Strzałka

In this paper, a case study is conducted based on the real data obtained from the local Distribution System Operator (DSO) of electrical energy. The analyzed network represents connections and high-voltage switchgears of 110 kV. Selected graph parameters—vertex degree distribution, the average vertex degree, the graph density, network efficiency, the clustering coefficient, the average path length, and the graph diameter were examined, taking into account that in the analysis, some nodes were removed due to the different failures. For each failure, the possible effects on network parameters were tested. As a final result, it was shown that in the analyzed case, the removal of only five nodes could cause a significant (almost four times) fall of graph efficiency. In turn, this means that the whole analyzed network cannot be considered as a fault-tolerant.


Author(s):  
Ali Hatamizadeh ◽  
Yuanping Song ◽  
Jonathan B. Hopkins

In this paper, we introduce a new computational tool called the Boundary Learning Optimization Tool (BLOT) that rapidly identifies the boundary of the performance capabilities achieved by a general flexure topology if its geometric parameters are allowed to vary from their smallest allowable feature sizes to the largest geometrically compatible feature sizes for a given constituent material. The boundaries generated by the BLOT fully define a flexure topology’s design space and allow designers to visually identify which geometric versions of their synthesized topology best achieve a desired combination of performance capabilities. The BLOT was created as a complementary tool to the Freedom And Constraint Topologies (FACT) synthesis approach in that the BLOT is intended to optimize the geometry of the flexure topologies synthesized using the FACT approach. The BLOT trains artificial neural networks to create sufficiently accurate models of parameterized flexure topologies using the fewest number of design instantiations and their corresponding numerically generated performance solutions. These models are then used by an efficient algorithm to plot the desired topology’s performance boundary. A FACT-synthesized flexure topology is optimized using the BLOT as a case study.


2003 ◽  
Vol 36 (5) ◽  
pp. 119-124 ◽  
Author(s):  
Jan M. Maciejowski ◽  
Colin N. Jones

2018 ◽  
Vol 2018 ◽  
pp. 1-8 ◽  
Author(s):  
Younes Ait El Maati ◽  
Lhoussain El Bahir ◽  
Khalid Faitah

This paper presents a method to control the rotor speed of wind turbines in presence of gearbox efficiency fault. This kind of faults happens due to lack of lubrication. It affects the dynamic of the principal shaft and thus the rotor speed. The principle of the fault tolerant control is to find a bloc that equalizes the dynamics of the healthy and faulty situations. The effectiveness decrease impacts on not only the dynamics but also the steady state value of the rotor speed. The last reason makes it mandatory to add an integral term on the steady state error to cancel the residual between the measured and operating point rotor speed. The convergence of the method is proven with respect to the rotor parameters and its effectiveness is evaluated through the rotor speed.


Sign in / Sign up

Export Citation Format

Share Document