Lightweight Edge Learning via Dataset Pruning
Laha Ale, Hu Luo, Mingsheng Cao, Shichao Li, Huanlai Xing, Haifeng Sun

TL;DR
This paper introduces a dataset pruning method for edge learning that reduces training time and energy use on mobile devices by selecting only the most important data points, without significant accuracy loss.
Contribution
It presents a novel, lightweight, on-device dataset pruning framework that improves resource efficiency in edge learning without requiring inter-device communication.
Findings
Near-linear reduction in training latency with pruning
Significant energy savings proportional to pruning ratio
Negligible accuracy degradation with dataset pruning
Abstract
Edge learning facilitates ubiquitous intelligence by enabling model training and adaptation directly on data-generating devices, thereby mitigating privacy risks and communication latency. However, the high computational and energy overhead of on-device training hinders its deployment on battery-powered mobile systems with strict thermal and memory budgets. While prior research has extensively optimized model architectures for efficient inference, the training phase remains bottlenecked by the processing of massive, often redundant, local datasets. In this work, we propose a data-centric optimization framework that leverages dataset pruning to achieve resource-efficient edge learning. Unlike standard methods that process all available data, our approach constructs compact, highly informative training subsets via a lightweight, on-device importance evaluation. Specifically, we utilize…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIoT and Edge/Fog Computing · Green IT and Sustainability · Advanced Neural Network Applications
