FATAL+: A Self-Stabilizing Byzantine Fault-tolerant Clocking Scheme for SoCs
Danny Dolev, Matthias F\"ugger, Christoph Lenzen, Markus Posch, Ulrich, Schmid, Andreas Steininger

TL;DR
This paper introduces a novel self-stabilizing Byzantine fault-tolerant clocking scheme for SoCs, combining low-frequency pulse generation with high-frequency synchronization, validated through theoretical proofs and FPGA experiments.
Contribution
It presents a new integrated clock generation scheme that is self-stabilizing and Byzantine fault-tolerant, with implementation and validation in FPGA hardware.
Findings
Proven correctness and performance bounds through theoretical analysis.
Successful FPGA implementation demonstrating short stabilization times.
Experimental validation confirming skew and frequency bounds.
Abstract
We present concept and implementation of a self-stabilizing Byzantine fault-tolerant distributed clock generation scheme for multi-synchronous GALS architectures in critical applications. It combines a variant of a recently introduced self-stabilizing algorithm for generating low-frequency, low-accuracy synchronized pulses with a simple non-stabilizing high-frequency, high-accuracy clock synchronization algorithm. We provide thorough correctness proofs and a performance analysis, which use methods from fault-tolerant distributed computing research but also addresses hardware-related issues like metastability. The algorithm, which consists of several concurrent communicating asynchronous state machines, has been implemented in VHDL using Petrify in conjunction with some extensions, and synthetisized for an Altera Cyclone FPGA. An experimental validation of this prototype has been carried…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed systems and fault tolerance · Radiation Effects in Electronics · Parallel Computing and Optimization Techniques
