A Signal Propagation Perspective for Pruning Neural Networks at   Initialization

Namhoon Lee; Thalaiyasingam Ajanthan; Stephen Gould; Philip H. S. Torr

arXiv:1906.06307·cs.LG·February 18, 2020·19 cites

A Signal Propagation Perspective for Pruning Neural Networks at Initialization

Namhoon Lee, Thalaiyasingam Ajanthan, Stephen Gould, Philip H. S. Torr

PDF

Open Access 1 Repo

TL;DR

This paper provides a theoretical analysis of why pruning neural networks at initialization works, introduces a signal propagation perspective to improve pruning methods, and demonstrates enhanced results across various models and scenarios.

Contribution

It offers a formal characterization of initialization conditions for effective pruning, analyzes signal propagation in pruned networks, and proposes a data-free method to enhance trainability.

Findings

01

Improved pruning results on multiple image classification models.

02

A formal link between connection sensitivity and gradient-based measures.

03

Enhanced trainability of pruned networks through signal propagation analysis.

Abstract

Network pruning is a promising avenue for compressing deep neural networks. A typical approach to pruning starts by training a model and then removing redundant parameters while minimizing the impact on what is learned. Alternatively, a recent approach shows that pruning can be done at initialization prior to training, based on a saliency criterion called connection sensitivity. However, it remains unclear exactly why pruning an untrained, randomly initialized neural network is effective. In this work, by noting connection sensitivity as a form of gradient, we formally characterize initialization conditions to ensure reliable connection sensitivity measurements, which in turn yields effective pruning results. Moreover, we analyze the signal propagation properties of the resulting pruned networks and introduce a simple, data-free method to improve their trainability. Our modifications to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

namhoonlee/spp-public
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning

MethodsPruning