SparseDVFS: Sparse-Aware DVFS for Energy-Efficient Edge Inference

Ziyang Zhang; Zheshun Wu; Jie Liu; Luca Mottola

arXiv:2603.21908·cs.LG·March 24, 2026

SparseDVFS: Sparse-Aware DVFS for Energy-Efficient Edge Inference

Ziyang Zhang, Zheshun Wu, Jie Liu, Luca Mottola

PDF

Open Access

TL;DR

SparseDVFS is a novel, fine-grained, sparse-aware DVFS framework that significantly improves energy efficiency for edge DNN inference by leveraging operator sparsity and specialized frequency scaling.

Contribution

It introduces a sparse-aware, operator-level DVFS system with offline modeling, runtime graph partitioning, and a unified co-governor to optimize energy use on edge devices.

Findings

01

78.17% average energy efficiency gain

02

14% better cost-gain ratio

03

Effective handling of operator sparsity and switching latency

Abstract

Deploying deep neural networks (DNNs) on power-sensitive edge devices presents a formidable challenge. While Dynamic Voltage and Frequency Scaling (DVFS) is widely employed for energy optimization, traditional model-level scaling is often too coarse to capture intra-inference variations, whereas fine-grained operator-level scaling suffers from prohibitive performance degradation due to significant hardware switching latency. This paper presents SparseDVFS, a fine-grained, sparse-aware DVFS framework designed for energy-efficient edge inference. Our key insight is that operator sparsity is a primary metric for hardware frequency modulation. By distinguishing between compute-bound dense operators and memory-bound sparse operators, the system can apply specialized frequency triplets to maximize energy efficiency. To overcome switching overheads and component interference, SparseDVFS…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsLow-power high-performance VLSI design · Green IT and Sustainability · Advanced Neural Network Applications