Accelerated Training on Low-Power Edge Devices
Mohamed Aboelenien Ahmed, Kilian Pfeiffer, Heba Khdr, Osama Abboud,, Ramin Khalili, J\"org Henkel

TL;DR
This paper presents a cross-layer optimization method for accelerating training on resource-constrained edge devices by jointly adjusting GPU frequency and batch size, significantly reducing training time and energy consumption.
Contribution
It introduces a novel methodology combining device profiling and batch efficiency predictions to optimize training parameters under power constraints.
Findings
Training time reduced by 2.4x compared to baselines.
Energy consumption during training is substantially lowered.
Model performance remains unaffected by the optimization.
Abstract
Training on edge devices poses several challenges as these devices are generally resource-constrained, especially in terms of power. State-of-the-art techniques at the device level reduce the GPU frequency to enforce power constraints, leading to a significant increase in training time. To accelerate training, we propose to jointly adjust the system and application parameters (in our case, the GPU frequency and the batch size of the training task) while adhering to the power constraints on devices. We introduce a novel cross-layer methodology that combines predictions of batch size efficiency and device profiling to achieve the desired optimization. Our evaluation on real hardware shows that our method outperforms the current baselines that depend on state of the art techniques, reducing the training time by with results very close to optimal. Our measurements also indicate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topicssolar cell performance optimization · Engineering Education and Technology · Technology Assessment and Management
