Formal Verification of Fault-Tolerant and Recovery Mechanisms for Safe Node Sequence Protocol

Formal Verification of Fault-Tolerant Startup Algorithms for Time-Triggered Architectures: A Survey

Proceedings of the IEEE ◽

10.1109/jproc.2016.2519247 ◽

2016 ◽

Vol 104 (5) ◽

pp. 904-922 ◽

Cited By ~ 7

Author(s):

Indranil Saha ◽

Suman Roy ◽

S. Ramesh

Keyword(s):

Formal Verification ◽

Fault Tolerant

Download Full-text

Formal verification for fault-tolerant architectures: Some lessons learned

Lecture Notes in Computer Science - FME '93: Industrial-Strength Formal Methods ◽

10.1007/bfb0024663 ◽

1993 ◽

pp. 482-500 ◽

Cited By ~ 7

Author(s):

Sam Owre ◽

John Rushby ◽

Natarajan Shankar ◽

Friedrich von Henke

Keyword(s):

Formal Verification ◽

Fault Tolerant ◽

Lessons Learned

Download Full-text

Systematic formal verification for fault-tolerant time-triggered algorithms

IEEE Transactions on Software Engineering ◽

10.1109/32.815324 ◽

1999 ◽

Vol 25 (5) ◽

pp. 651-660 ◽

Cited By ~ 49

Author(s):

J. Rushby

Keyword(s):

Formal Verification ◽

Fault Tolerant

Download Full-text

Formal verification of composite service recovery mechanisms consistency

2007 International Conference on Collaborative Computing: Networking, Applications and Worksharing (CollaborateCom 2007) ◽

10.1109/colcom.2007.4553842 ◽

2007 ◽

Cited By ~ 1

Author(s):

Walid Gaaloul ◽

Sami Bhiri ◽

Manfred Hauswirth ◽

Mohsen Rouached ◽

Claude Godart

Keyword(s):

Formal Verification ◽

Service Recovery ◽

Composite Service ◽

Recovery Mechanisms

Download Full-text

Verification of HotStuff BFT Consensus Protocol With TLA+/TLC in an Industrial Setting

SHS Web of Conferences ◽

10.1051/shsconf/20219301006 ◽

2021 ◽

Vol 93 ◽

pp. 01006

Author(s):

Vladimir Kukharenko ◽

Kirill Ziborov ◽

Rafael Sadykov ◽

Ruslan Rezin

Keyword(s):

Formal Verification ◽

Formal Specification ◽

Fault Tolerant ◽

Software Implementation ◽

Actual Behavior ◽

Smart Contracts ◽

Consensus Protocol ◽

Verification Methods ◽

Formal Specification And Verification ◽

Specification And Verification

The extent of formal verification methods applied in industrial projects has always been limited. The proliferation of distributed ledger systems (DLS), also known as blockchain, is rapidly changing the situation. Since the main area of DLSs’ application is the automation of financial transactions, the properties of predictability and reliability are critical for implementing such systems. The actual behavior of the DLS is largely determined by the chosen consensus protocol, which properties require strict specification and formal verification. Formal specification and verification of the consensus protocol is necessary but not sufficient. It is also required to ensure that the software implementation of the DLS nodes complies with this protocol. Finally, the verified software implementation of the protocol must run on a fairly reliable operating system. The financial focus of DLS application has also led to the emergence of the so-called smart contracts, which are an important part of the applied implementations of specific business processes based on DLSs. Therefore, the verifiability of smart contracts is also a critical requirement for industrial DLSs. In this paper, we describe an ongoing industrial project between a large Russian airline and three universities – Innopolis University (IU), Moscow Institute of Physics and Technology (MIPT) and Lomonosov Moscow State University (MSU). The main expected project result is a DLS for more flexible refueling of aircrafts, verified at least at the four technological levels described above. After brief project overview, we focus on our experience with the formal specification and verification of HotStuff, a leader-based fault-tolerant protocol that ensures reaching distributed consensus in the presence of Byzantine processes. The formal specification of the protocol is performed in the TLA+ language and then verified with a specialized TLC tool to verify models based on TLA+ specifications.

Download Full-text

Formal verification of a fault tolerant computer

[1992] Proceedings IEEE/AIAA 11th Digital Avionics Systems Conference ◽

10.1109/dasc.1992.282170 ◽

2003 ◽

Cited By ~ 5

Author(s):

N.A. Brock ◽

D.M. Jackson

Keyword(s):

Formal Verification ◽

Fault Tolerant

Download Full-text

A Model-checking Algorithm for Formal-verification of Peer-to-peer Fault-tolerant Networks

Lecture Notes on Information Theory ◽

10.12720/lnit.1.3.128-131 ◽

2013 ◽

Vol 1 (3) ◽

pp. 128-131 ◽

Cited By ~ 1

Author(s):

Sungeetha Dakshinamurthy ◽

Vasumathi K. Narayanan

Keyword(s):

Model Checking ◽

Formal Verification ◽

Fault Tolerant ◽

Peer To Peer

Download Full-text

InnoChain: a Distributed Ledger for Industry with Formal Verification on all Implementation Levels

Modeling and Analysis of Information Systems ◽

10.18255/1818-1015-2020-4-454-471 ◽

2020 ◽

Vol 27 (4) ◽

pp. 454-471

Author(s):

Vladimir Aleksandrovich Kukharenko ◽

Kirill Viktorovich Ziborov ◽

Rafael Faritovich Sadykov ◽

Alexandr Vladimirovich Naumchev ◽

Ruslan Maratovich Rezin ◽

...

Keyword(s):

Formal Verification ◽

Formal Specification ◽

Fault Tolerant ◽

Software Implementation ◽

Actual Behavior ◽

Consensus Protocol ◽

Distributed Ledger ◽

Verification Methods ◽

Formal Specification And Verification ◽

Specification And Verification

The extent of formal verification methods applied to industrial projects has always been limited. The proliferation of distributed ledger systems (DLS), also known as blockchain, is rapidly changing the situation. Since the main area of DLSs' application is the automation of financial transactions, the properties of predictability and reliability are critical for implementing such systems. The actual behavior of the DLS is determined by the chosen consensus protocol, which properties require strict specification and formal verification. Formal specification and verification of the consensus protocol is necessary but not sufficient. It is required to ensure that the software implementation of the DLS nodes complies with this protocol. The verified software implementation of the protocol must run on a fairly reliable operating system. The so-called “smart contracts”, which are an important part of the applied implementations of specific business processes based on DLSs, must be verifiable as well. In this paper, we describe an ongoing industrial project that will result in a DLS verified at least at the four technological levels described above. We then share our experience with the formal specification and verification of HotStuff, a leader-based fault-tolerant protocol that ensures reaching distributed consensus in the presence of Byzantine processes.

Download Full-text

Exploring Parallel MPI Fault Tolerance Mechanisms for Phylogenetic Inference with RAxML-NG

10.1101/2021.01.15.426773 ◽

2021 ◽

Author(s):

Lukas Hübner ◽

Alexey M. Kozlov ◽

Demian Hespe ◽

Peter Sanders ◽

Alexandros Stamatakis

Keyword(s):

Fault Tolerance ◽

Phylogenetic Trees ◽

Large Scale ◽

Fault Tolerant ◽

Phylogenetic Inference ◽

Molecular Data ◽

Supplementary Information ◽

Tolerance Mechanisms ◽

Recovery Mechanisms ◽

Mpi Implementation

Phylogenetic trees are now routinely inferred on large scale HPC systems with thousands of cores as the parallel scalability of phylogenetic inference tools has improved over the past years to cope with the molecular data avalanche. Thus, the parallel fault tolerance of phylogenetic inference tools has become a relevant challenge. To this end, we explore parallel fault tolerance mechanisms and algorithms, the software modifications required, and the performance penalties induced via enabling parallel fault tolerance by example of RAxML-NG, the successor of the widely used RAxML tool for maximum likelihood based phylogenetic tree inference. We find that the slowdown induced by the necessary additional recovery mechanisms in RAxML-NG is on average 2%. The overall slowdown by using these recovery mechanisms in conjunction with a fault tolerant MPI implementation amounts to 8% on average for large empirical datasets. Via failure simulations, we show that RAxML-NG can successfully recover from multiple simultaneous failures, subsequent failures, failures during recovery, and failures during checkpointing. Recoveries are automatic and transparent to the user. The modified fault tolerant RAxML-NG code is available under GNU GPL at https://github.com/lukashuebner/ft-raxml-ng Contact: lukas.huebner@{kit.edu,h-its.org};, [email protected], [email protected], [email protected], [email protected] Supplementary information: Supplementary data are available at bioRχiv.

Download Full-text

A note on inconsistent axioms in Rushby's "systematic formal verification for fault-tolerant time-triggered algorithms"

IEEE Transactions on Software Engineering ◽

10.1109/tse.2006.41 ◽

2006 ◽

Vol 32 (5) ◽

pp. 347-348 ◽

Cited By ~ 7

Author(s):

L. Pike

Keyword(s):

Formal Verification ◽

Fault Tolerant

Download Full-text