Aging Aware Adaptive Voltage Scaling for Reliable and Efficient AI Accelerators
Tong Xie, Zuodong Zhang, Chao Yang, Yuan Wang, Runsheng Wang, Meng Li

TL;DR
This paper introduces an aging-aware adaptive voltage scaling framework for AI accelerators that leverages DNN resilience to reduce aging effects and power consumption.
Contribution
It develops an accurate aging prediction model and a fault-tolerant voltage scaling policy that together improve reliability and efficiency in AI inference.
Findings
Reduces predicted threshold voltage shift by ~19%.
Mitigates aging degradation by up to 45.8%.
Achieves 14% average lifetime power savings.
Abstract
Deep neural networks (DNNs) have showcased remarkable performance across various tasks and are widely deployed on AI accelerators fabricated in advanced technology nodes for efficiency. As aging effects become more pronounced, timing and voltage guardbands are increasingly applied. Aging-aware adaptive voltage scaling (AVS), which adjusts supply voltage based on on-chip aging scenarios, has emerged as a promising solution to avoid excessive guardbanding. However, conventional AVS techniques overlook the inherent resilience of DNNs and frequently raise the supply voltage unnecessarily, thereby exacerbating aging and increasing power consumption. To enable reliable and efficient AI inference with AVS, in this paper, we develop an accurate aging prediction framework that incorporates historical effects and iterative extrapolation for full-lifetime modeling. Building on this framework, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
