Memory Faults in Activation-sparse Quantized Deep Neural Networks: Analysis and Mitigation using Sharpness-aware Training
Akul Malhotra, Sumeet Kumar Gupta

TL;DR
This paper analyzes how memory faults affect activation-sparse quantized DNNs and proposes sharpness-aware training to improve their fault tolerance and accuracy in real-world hardware conditions.
Contribution
It reveals the vulnerability of activation-sparse quantized DNNs to memory faults and introduces sharpness-aware training as a novel mitigation technique.
Findings
Activation sparsity increases fault vulnerability by up to 11.13%.
Sharpness-aware training improves accuracy by up to 19.50%.
SAQ-trained models outperform conventionally trained models in faulty environments.
Abstract
Improving the hardware efficiency of deep neural network (DNN) accelerators with techniques such as quantization and sparsity enhancement have shown an immense promise. However, their inference accuracy in non-ideal real-world settings (such as in the presence of hardware faults) is yet to be systematically analyzed. In this work, we investigate the impact of memory faults on activation-sparse quantized DNNs (AS QDNNs). We show that a high level of activation sparsity comes at the cost of larger vulnerability to faults, with AS QDNNs exhibiting up to 11.13% lower accuracy than the standard QDNNs. We establish that the degraded accuracy correlates with a sharper minima in the loss landscape for AS QDNNs, which makes them more sensitive to perturbations in the weight values due to faults. Based on this observation, we employ sharpness-aware quantization (SAQ) training to mitigate the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and ELM · Advanced Neural Network Applications · Neural Networks and Applications
