Compressing Neural Networks using the Variational Information Bottleneck

Bin Dai; Chen Zhu; David Wipf

arXiv:1802.10399·cs.CV·April 20, 2018·71 cites

Compressing Neural Networks using the Variational Information Bottleneck

Bin Dai, Chen Zhu, David Wipf

PDF

Open Access 1 Repo

TL;DR

This paper introduces a neural network compression method based on the variational information bottleneck that effectively prunes neurons, reducing model size and computation while maintaining performance.

Contribution

It applies the information bottleneck principle with a variational bound to improve neuron pruning, outperforming existing methods in compression efficiency.

Findings

01

Achieves state-of-the-art compression rates

02

Reduces redundancy between layers effectively

03

Provides natural sparse regularization without extra tuning

Abstract

Neural networks can be compressed to reduce memory and computational requirements, or to increase accuracy by facilitating the use of a larger base architecture. In this paper we focus on pruning individual neurons, which can simultaneously trim model size, FLOPs, and run-time memory. To improve upon the performance of existing compression algorithms we utilize the information bottleneck principle instantiated via a tractable variational bound. Minimization of this information theoretic bound reduces the redundancy between adjacent layers by aggregating useful information into a subset of neurons that can be preserved. In contrast, the activations of disposable neurons are shut off via an attractive form of sparse regularization that emerges naturally from this framework, providing tangible advantages over traditional sparsity penalties without contributing additional tuning parameters…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zhuchen03/VIBNet
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModel Reduction and Neural Networks · Advanced Neural Network Applications · Generative Adversarial Networks and Image Synthesis

MethodsPruning