ReGate: Enabling Power Gating in Neural Processing Units
Yuqi Xue, Jian Huang

TL;DR
ReGate introduces a hardware/software co-designed power-gating approach for NPUs, significantly reducing energy consumption by up to 32.8% with minimal performance impact.
Contribution
It presents a novel, fine-grained power-gating method tailored for NPU architectures, addressing unique hardware and execution challenges.
Findings
Up to 32.8% energy reduction in NPU chips.
Negligible hardware overhead of less than 3.3%.
Effective power management across diverse NPU components.
Abstract
The energy efficiency of neural processing units (NPU) is playing a critical role in developing sustainable data centers. Our study with different generations of NPU chips reveals that 30%-72% of their energy consumption is contributed by static power dissipation, due to the lack of power management support in modern NPU chips. In this paper, we present ReGate, which enables fine-grained power-gating of each hardware component in NPU chips with hardware/software co-design. Unlike conventional power-gating techniques for generic processors, enabling power-gating in NPUs faces unique challenges due to the fundamental difference in hardware architecture and program execution model. To address these challenges, we carefully investigate the power-gating opportunities in each component of NPU chips and decide the best-fit power management scheme (i.e., hardware- vs. software-managed power…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
