Zero Memory Overhead Approach for Protecting Vision Transformer Parameters

Fereshteh Baradaran; Mohsen Raji; Azadeh Baradaran; Arezoo Baradaran; Reihaneh Akbarifard

arXiv:2507.03816·cs.CV·July 8, 2025

Zero Memory Overhead Approach for Protecting Vision Transformer Parameters

Fereshteh Baradaran, Mohsen Raji, Azadeh Baradaran, Arezoo Baradaran, Reihaneh Akbarifard

PDF

TL;DR

This paper presents a zero memory overhead fault tolerance method for Vision Transformers that detects and masks bit-flip faults by using parity bits in least significant bits, significantly improving robustness in safety-critical applications.

Contribution

It introduces a novel zero-overhead fault detection and masking technique for ViT parameters using parity bits, enhancing reliability without increasing memory usage.

Findings

01

Improves fault tolerance of ViT models by up to three orders of magnitude.

02

Detects bit-flip faults with zero memory overhead using parity bits.

03

Effectively prevents accuracy degradation in safety-critical applications.

Abstract

Vision Transformers (ViTs) have demonstrated superior performance over Convolutional Neural Networks (CNNs) in various vision-related tasks such as classification, object detection, and segmentation due to their use of self-attention mechanisms. As ViTs become more popular in safety-critical applications like autonomous driving, ensuring their correct functionality becomes essential, especially in the presence of bit-flip faults in their parameters stored in memory. In this paper, a fault tolerance technique is introduced to protect ViT parameters against bit-flip faults with zero memory overhead. Since the least significant bits of parameters are not critical for model accuracy, replacing the LSB with a parity bit provides an error detection mechanism without imposing any overhead on the model. When faults are detected, affected parameters are masked by zeroing out, as most parameters…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.