Incremental Training and Group Convolution Pruning for Runtime DNN Performance Scaling on Heterogeneous Embedded Platforms
Lei Xun, Long Tran-Thanh, Bashir M Al-Hashimi, Geoff V. Merrett

TL;DR
This paper introduces a dynamic DNN approach using incremental training and group convolution pruning, enabling flexible performance scaling on heterogeneous embedded platforms with reduced memory overhead and wider dynamic range.
Contribution
It proposes a novel incremental training and group convolution pruning method that allows runtime model adaptation without retraining, combined with task mapping and DVFS for enhanced performance scaling.
Findings
Up to 2.36x energy and 2.73x time dynamic range improvement.
Achieved 10.6x energy and 41.6x time wider dynamic range with combined techniques.
Reduced memory footprint by 2.4x at the same compression rate.
Abstract
Inference for Deep Neural Networks is increasingly being executed locally on mobile and embedded platforms due to its advantages in latency, privacy and connectivity. Since modern System on Chips typically execute a combination of different and dynamic workloads concurrently, it is challenging to consistently meet inference time/energy budget at runtime because of the local computing resources available to the DNNs vary considerably. To address this challenge, a variety of dynamic DNNs were proposed. However, these works have significant memory overhead, limited runtime recoverable compression rate and narrow dynamic ranges of performance scaling. In this paper, we present a dynamic DNN using incremental training and group convolution pruning. The channels of the DNN convolution layer are divided into groups, which are then trained incrementally. At runtime, following groups can be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning
MethodsConvolution
