Comparing Rewinding and Fine-tuning in Neural Network Pruning
Alex Renda, Jonathan Frankle, Michael Carbin

TL;DR
This paper compares fine-tuning and rewinding techniques in neural network pruning, showing that rewinding methods outperform fine-tuning and can achieve state-of-the-art results across various networks.
Contribution
It introduces learning rate rewinding and demonstrates that rewinding techniques outperform fine-tuning in neural network pruning.
Findings
Rewinding techniques outperform fine-tuning in accuracy recovery.
Learning rate rewinding matches the effectiveness of weight rewinding.
Rewinding-based pruning achieves state-of-the-art compression and accuracy.
Abstract
Many neural network pruning algorithms proceed in three steps: train the network to completion, remove unwanted structure to compress the network, and retrain the remaining structure to recover lost accuracy. The standard retraining technique, fine-tuning, trains the unpruned weights from their final trained values using a small fixed learning rate. In this paper, we compare fine-tuning to alternative retraining techniques. Weight rewinding (as proposed by Frankle et al., (2019)), rewinds unpruned weights to their values from earlier in training and retrains them from there using the original training schedule. Learning rate rewinding (which we propose) trains the unpruned weights from their final values using the same learning rate schedule as weight rewinding. Both rewinding techniques outperform fine-tuning, forming the basis of a network-agnostic pruning algorithm that matches the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Adversarial Robustness in Machine Learning · Advanced Neural Network Applications
MethodsPruning
