Energy-Aware Dynamic Neural Inference
Marcello Bullo, Seifallah Jardak, Pietro Carnelli, Deniz G\"und\"uz

TL;DR
This paper presents an adaptive on-device neural inference system that dynamically manages energy consumption through multi-model selection and early exiting, improving accuracy under energy constraints with confidence-aware control policies.
Contribution
It introduces a principled, theoretically-guaranteed policy for energy-aware neural inference, integrating confidence measures and incremental decision-making for improved performance.
Findings
Energy-aware control schemes improve accuracy by ~5% with increased ambient energy.
Incremental exit-by-exit decisions enhance accuracy, especially with limited energy storage.
Confidence integration in control policies yields better energy-performance trade-offs.
Abstract
The growing demand for intelligent applications beyond the network edge, coupled with the need for sustainable operation, are driving the seamless integration of deep learning (DL) algorithms into energy-limited, and even energy-harvesting end-devices. However, the stochastic nature of ambient energy sources often results in insufficient harvesting rates, failing to meet the energy requirements for inference and causing significant performance degradation in energy-agnostic systems. To address this problem, we consider an on-device adaptive inference system equipped with an energy-harvester and finite-capacity energy storage. We then allow the device to reduce the run-time execution cost on-demand, by either switching between differently-sized neural networks, referred to as multi-model selection (MMS), or by enabling earlier predictions at intermediate layers, called early exiting…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
MethodsEarly exiting using confidence measures
