Adversarial Network Compression

Vasileios Belagiannis; Azade Farshad; Fabio Galasso

arXiv:1803.10750·cs.CV·November 15, 2018

Adversarial Network Compression

Vasileios Belagiannis, Azade Farshad, Fabio Galasso

PDF

TL;DR

This paper introduces an adversarial network compression method that effectively transfers knowledge from large to small neural networks without labels, achieving superior performance and generalization across multiple datasets.

Contribution

It presents a novel adversarial training approach for neural network compression, including a regularization scheme to enhance stability and generalization.

Findings

01

Student networks achieve minimal accuracy loss.

02

Outperforms existing knowledge transfer methods.

03

Surpasses performance of labeled training on the same architecture.

Abstract

Neural network compression has recently received much attention due to the computational requirements of modern deep models. In this work, our objective is to transfer knowledge from a deep and accurate model to a smaller one. Our contributions are threefold: (i) we propose an adversarial network compression approach to train the small student network to mimic the large teacher, without the need for labels during training; (ii) we introduce a regularization scheme to prevent a trivially-strong discriminator without reducing the network capacity and (iii) our approach generalizes on different teacher-student models. In an extensive evaluation on five standard datasets, we show that our student has small accuracy drop, achieves better performance than other knowledge transfer approaches and it surpasses the performance of the same network trained with labels. In addition, we demonstrate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.