SparseDVFS: Sparse-Aware DVFS for Energy-Efficient Edge Inference
Ziyang Zhang, Zheshun Wu, Jie Liu, Luca Mottola

TL;DR
SparseDVFS is a novel, fine-grained, sparse-aware DVFS framework that significantly improves energy efficiency for edge DNN inference by leveraging operator sparsity and specialized frequency scaling.
Contribution
It introduces a sparse-aware, operator-level DVFS system with offline modeling, runtime graph partitioning, and a unified co-governor to optimize energy use on edge devices.
Findings
78.17% average energy efficiency gain
14% better cost-gain ratio
Effective handling of operator sparsity and switching latency
Abstract
Deploying deep neural networks (DNNs) on power-sensitive edge devices presents a formidable challenge. While Dynamic Voltage and Frequency Scaling (DVFS) is widely employed for energy optimization, traditional model-level scaling is often too coarse to capture intra-inference variations, whereas fine-grained operator-level scaling suffers from prohibitive performance degradation due to significant hardware switching latency. This paper presents SparseDVFS, a fine-grained, sparse-aware DVFS framework designed for energy-efficient edge inference. Our key insight is that operator sparsity is a primary metric for hardware frequency modulation. By distinguishing between compute-bound dense operators and memory-bound sparse operators, the system can apply specialized frequency triplets to maximize energy efficiency. To overcome switching overheads and component interference, SparseDVFS…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLow-power high-performance VLSI design · Green IT and Sustainability · Advanced Neural Network Applications
