FreezeOut: Accelerate Training by Progressively Freezing Layers

Andrew Brock; Theodore Lim; J.M. Ritchie; Nick Weston

arXiv:1706.04983·stat.ML·June 20, 2017·75 cites

FreezeOut: Accelerate Training by Progressively Freezing Layers

Andrew Brock, Theodore Lim, J.M. Ritchie, Nick Weston

PDF

Open Access 2 Repos

TL;DR

FreezeOut is a training acceleration technique that progressively freezes layers during training, reducing computation time with minimal or no loss in accuracy across different neural network architectures.

Contribution

The paper introduces FreezeOut, a novel method for speeding up neural network training by gradually freezing layers, which was empirically validated on multiple architectures.

Findings

01

Up to 20% reduction in training time for DenseNets with slight accuracy loss.

02

20% speedup for ResNets with no accuracy loss.

03

No significant speedup observed for VGG networks.

Abstract

The early layers of a deep neural net have the fewest parameters, but take up the most computation. In this extended abstract, we propose to only train the hidden layers for a set portion of the training run, freezing them out one-by-one and excluding them from the backward pass. Through experiments on CIFAR, we empirically demonstrate that FreezeOut yields savings of up to 20% wall-clock time during training with 3% loss in accuracy for DenseNets, a 20% speedup without loss of accuracy for ResNets, and no improvement for VGG networks. Our code is publicly available at https://github.com/ajbrock/FreezeOut

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning