Reliable Linear, Sesquilinear and Bijective Operations On Integer Data Streams Via Numerical Entanglement
Mohammad Ashraful Anam, Yiannis Andreopoulos

TL;DR
The paper introduces a fault-tolerant method for linear, sesquilinear, and bijective operations on multiple integer data streams using numerical entanglement, enabling error detection and recovery with minimal performance overhead.
Contribution
It presents a novel numerical entanglement technique that allows fault detection and recovery in integer stream operations without significant computational overhead, outperforming traditional ABFT methods.
Findings
Detects soft errors in output streams via reliability checks.
Enables recovery from single fail-stop failures with separate cores.
Imposes only 0.03% to 7% throughput reduction in experiments.
Abstract
A new technique is proposed for fault-tolerant linear, sesquilinear and bijective (LSB) operations on integer data streams (), such as: scaling, additions/subtractions, inner or outer vector products, permutations and convolutions. In the proposed method, the input integer data streams are linearly superimposed to form numerically-entangled integer data streams that are stored in-place of the original inputs. A series of LSB operations can then be performed directly using these entangled data streams. The results are extracted from the entangled output streams by additions and arithmetic shifts. Any soft errors affecting any single disentangled output stream are guaranteed to be detectable via a specific post-computation reliability check. In addition, when utilizing a separate processor core for each of the streams, the proposed approach can recover all…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
