PowerFlow-DNN: Compiler-Directed Fine-Grained Power Orchestration for End-to-End Edge AI Inference
Paul Chen, Jeongeun Kim, Wenbo Zhu, Yuanhan Li, Shunyao Huang, Chenjie Weng, and Christopher Torng

TL;DR
PowerFlow-DNN is a compiler-driven framework that optimizes energy efficiency in edge AI accelerators by intelligently scheduling power states across layers within real-time constraints, significantly reducing energy consumption.
Contribution
It introduces a novel end-to-end power scheduling method that accounts for inter-layer effects and practical hardware constraints, enabling near-optimal energy savings in ultra-low-power edge AI systems.
Findings
Achieves energy within 0.68% of the optimal ILP solution.
Reduces energy consumption by up to 37% over baseline.
Operates efficiently over a massive schedule space with 10^{160} possibilities.
Abstract
Edge AI systems often operate under stringent energy and volume constraints that demand extreme efficiency under limited battery capacity, with requirements worsening as intelligent capability demands advance. Prior literature suggests that fine-grained power orchestration, including DVFS and power gating, enables significant energy efficiency benefits that cannot be left unexploited, while still exhibiting unexplored challenges. We observe that layer-level approaches incur unintended overheads due to inter-layer coupling of power control decisions, and that jointly managing these mechanisms under practical constraints such as limited voltage rails and transition overheads leads to a rapidly growing combinatorial schedule space. To address this, we propose PowerFlow-DNN, a compiler-directed framework for end-to-end power-state orchestration in ultra-low-power accelerators. By…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Big Data and Digital Economy · IoT and Edge/Fog Computing
