Energy-efficient and Robust Cumulative Training with Net2Net Transformation
Aosong Feng, and Priyadarshini Panda

TL;DR
This paper introduces a cumulative training method with Net2Net transformations that significantly reduces training time and energy consumption for deep neural networks while maintaining accuracy and improving robustness.
Contribution
The paper presents a novel incremental training strategy using Net2Net transformations that enhances efficiency and robustness compared to traditional training methods.
Findings
~2x reduction in training computational complexity for TinyImageNet with VGG19
Final networks have ~0.4x lower inference compute complexity after pruning
Improved generalization, noise robustness, and adversarial detection through cumulative training
Abstract
Deep learning has achieved state-of-the-art accuracies on several computer vision tasks. However, the computational and energy requirements associated with training such deep neural networks can be quite high. In this paper, we propose a cumulative training strategy with Net2Net transformation that achieves training computational efficiency without incurring large accuracy loss, in comparison to a model trained from scratch. We achieve this by first training a small network (with lesser parameters) on a small subset of the original dataset, and then gradually expanding the network using Net2Net transformation to train incrementally on larger subsets of the dataset. This incremental training strategy with Net2Net utilizes function-preserving transformations that transfers knowledge from each previous small network to the next larger network, thereby, reducing the overall training…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning
MethodsPruning
