Memory-efficient Energy-adaptive Inference of Pre-Trained Models on Batteryless Embedded Systems
Pietro Farina, Subrata Biswas, Eren Y{\i}ld{\i}z, Khakim Akhunov, Saad, Ahmed, Bashima Islam, Kas{\i}m Sinan Y{\i}ld{\i}r{\i}m

TL;DR
FreeML is a framework that enables memory-efficient, energy-adaptive inference of pre-trained neural networks on batteryless embedded systems by combining novel compression and early exit techniques.
Contribution
It introduces a novel compression method and a single-exit early stopping mechanism for energy-adaptive inference on ultra-constrained, batteryless devices.
Findings
Reduces model sizes by up to 95 times
Supports adaptive inference with 2.03-19.65 times less memory overhead
Achieves significant time and energy savings with negligible accuracy loss
Abstract
Batteryless systems frequently face power failures, requiring extra runtime buffers to maintain inference progress and leaving only a memory space for storing ultra-tiny deep neural networks (DNNs). Besides, making these models responsive to stochastic energy harvesting dynamics during inference requires a balance between inference accuracy, latency, and energy overhead. Recent works on compression mostly focus on time and memory, but often ignore energy dynamics or significantly reduce the accuracy of pre-trained DNNs. Existing energy-adaptive inference works modify the architecture of pre-trained models and have significant memory overhead. Thus, energy-adaptive and accurate inference of pre-trained DNNs on batteryless devices with extreme memory constraints is more challenging than traditional microcontrollers. We combat these issues by proposing FreeML, a framework to optimize…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Embedded Systems Design Techniques · Real-Time Systems Scheduling
MethodsFocus
