Enhancing User Experience in On-Device Machine Learning with Gated Compression Layers
Haiguang Li, Usama Pervaiz, Joseph Antognini, Micha{\l} Matuszak,, Lawrence Au, Gilles Roux, Trausti Thormundsson

TL;DR
This paper introduces Gated Compression layers for on-device machine learning, significantly reducing power consumption while maintaining accuracy, thereby improving user experience on resource-constrained devices.
Contribution
It proposes Gated Compression layers that dynamically regulate data flow, enabling more power-efficient and accurate ODML models, especially for always-on applications.
Findings
Power efficiency gains from 158x to 30,000x in experiments.
Enhanced user experience through longer battery life and better responsiveness.
Successful integration into vision and speech models, including ViT.
Abstract
On-device machine learning (ODML) enables powerful edge applications, but power consumption remains a key challenge for resource-constrained devices. To address this, developers often face a trade-off between model accuracy and power consumption, employing either computationally intensive models on high-power cores or pared-down models on low-power cores. Both approaches typically lead to a compromise in user experience (UX). This work focuses on the use of Gated Compression (GC) layer to enhance ODML model performance while conserving power and maximizing cost-efficiency, especially for always-on use cases. GC layers dynamically regulate data flow by selectively gating activations of neurons within the neural network and effectively filtering out non-essential inputs, which reduces power needs without compromising accuracy, and enables more efficient execution on heterogeneous compute…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Video Quality Assessment · IoT and Edge/Fog Computing
