Energy-efficient and Robust Cumulative Training with Net2Net   Transformation

Aosong Feng; and Priyadarshini Panda

arXiv:2003.01204·cs.CV·March 4, 2020·1 cites

Energy-efficient and Robust Cumulative Training with Net2Net Transformation

Aosong Feng, and Priyadarshini Panda

PDF

Open Access

TL;DR

This paper introduces a cumulative training method with Net2Net transformations that significantly reduces training time and energy consumption for deep neural networks while maintaining accuracy and improving robustness.

Contribution

The paper presents a novel incremental training strategy using Net2Net transformations that enhances efficiency and robustness compared to traditional training methods.

Findings

01

~2x reduction in training computational complexity for TinyImageNet with VGG19

02

Final networks have ~0.4x lower inference compute complexity after pruning

03

Improved generalization, noise robustness, and adversarial detection through cumulative training

Abstract

Deep learning has achieved state-of-the-art accuracies on several computer vision tasks. However, the computational and energy requirements associated with training such deep neural networks can be quite high. In this paper, we propose a cumulative training strategy with Net2Net transformation that achieves training computational efficiency without incurring large accuracy loss, in comparison to a model trained from scratch. We achieve this by first training a small network (with lesser parameters) on a small subset of the original dataset, and then gradually expanding the network using Net2Net transformation to train incrementally on larger subsets of the dataset. This incremental training strategy with Net2Net utilizes function-preserving transformations that transfers knowledge from each previous small network to the next larger network, thereby, reducing the overall training…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning

MethodsPruning