Runtime Configurable Deep Neural Networks for Energy-Accuracy Trade-off
Hokchhay Tann, Soheil Hashemi, R. Iris Bahar, Sherief Reda

TL;DR
This paper introduces a dynamic configuration method for deep neural networks that enables energy-accuracy trade-offs during runtime by adjusting network channels, achieving significant energy savings with minimal accuracy loss.
Contribution
It presents a novel training and configuration technique allowing a single network to adapt its energy and accuracy trade-offs dynamically at runtime.
Findings
Up to 95% energy reduction with less than 1% accuracy loss.
Achieves approximately 50% savings in storage compared to prior methods.
Effective across multiple datasets and hardware platforms.
Abstract
We present a novel dynamic configuration technique for deep neural networks that permits step-wise energy-accuracy trade-offs during runtime. Our configuration technique adjusts the number of channels in the network dynamically depending on response time, power, and accuracy targets. To enable this dynamic configuration technique, we co-design a new training algorithm, where the network is incrementally trained such that the weights in channels trained in earlier steps are fixed. Our technique provides the flexibility of multiple networks while storing and utilizing one set of weights. We evaluate our techniques using both an ASIC-based hardware accelerator as well as a low-power embedded GPGPU and show that our approach leads to only a small or negligible loss in the final network accuracy. We analyze the performance of our proposed methodology using three well-known networks for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
