The Knowledge Within: Methods for Data-Free Model Compression

Matan Haroush, Itay Hubara, Elad Hoffer, and Daniel Soudry

arXiv:1912.01274 · cs.LG · April 8, 2020

TL;DR

This paper introduces three methods to generate synthetic data from trained models, enabling data-free calibration and fine-tuning of compressed neural networks, which is crucial when real data is unavailable or sensitive.

Contribution

The paper proposes techniques for calibrating and fine-tuning compressed models without access to the original training data. The best-performing technique generates synthetic samples using the batch normalization statistics stored in the trained model.
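As a rough illustration of how batch normalization statistics could serve as a training signal, the sketch below scores a batch of activations by how far its per-channel mean and variance are from the running statistics stored in a batch normalization layer. It uses the closed-form KL divergence between two scalar Gaussians as the distance. The function names (`channel_stats`, `gaussian_kl`, `bn_stats_loss`), the data layout, and the choice of divergence are assumptions made for this example and are not taken from the paper.

```python
import math
import random


def channel_stats(activations):
    """Mean and (biased) variance of one channel's activations."""
    n = len(activations)
    mean = sum(activations) / n
    var = sum((a - mean) ** 2 for a in activations) / n
    return mean, var


def gaussian_kl(mean_p, var_p, mean_q, var_q, eps=1e-5):
    """KL(N(mean_p, var_p) || N(mean_q, var_q)) for scalar Gaussians."""
    var_p, var_q = var_p + eps, var_q + eps  # eps guards against zero variance
    return 0.5 * (
        math.log(var_q / var_p)
        + (var_p + (mean_p - mean_q) ** 2) / var_q
        - 1.0
    )


def bn_stats_loss(batch_activations, running_mean, running_var):
    """Sum of per-channel divergences between batch and stored statistics.

    batch_activations: one list of floats per channel (hypothetical layout).
    running_mean, running_var: the layer's stored per-channel statistics.
    """
    total = 0.0
    for acts, mu, var in zip(batch_activations, running_mean, running_var):
        m, v = channel_stats(acts)
        total += gaussian_kl(m, v, mu, var)
    return total


# Hypothetical stored statistics for a single-channel layer.
running_mean, running_var = [0.5], [4.0]

random.seed(0)
matched_batch = [[random.gauss(0.5, 2.0) for _ in range(1000)]]
shifted_batch = [[random.gauss(3.0, 0.5) for _ in range(1000)]]

matched = bn_stats_loss(matched_batch, running_mean, running_var)
shifted = bn_stats_loss(shifted_batch, running_mean, running_var)
```

Here `matched` comes out much smaller than `shifted`, because the first batch was drawn from a distribution that agrees with the stored statistics. In a sample-generation setting, a loss of this kind would be summed over layers and minimized with respect to the input; that optimization loop is omitted from this sketch.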

Findings

01 Synthetic samples enable effective model calibration without real data

02 The best method achieves negligible accuracy loss compared to using real data

03 Approach facilitates data-free deployment of compressed neural networks
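To make "calibration" concrete, the sketch below shows one common form of it: choosing an 8-bit affine quantization scale and zero point from the observed range of a set of activation values. Here those values stand in for activations produced by synthetic samples. This is a generic min/max calibration shown for illustration only; the calibration procedure used in the paper may differ, and every name and number in the example is hypothetical.

```python
def calibrate_affine_uint8(samples):
    """Pick scale and zero point so that [min, max] maps onto [0, 255]."""
    lo = min(min(samples), 0.0)  # keep 0.0 inside the representable range
    hi = max(max(samples), 0.0)
    scale = (hi - lo) / 255.0 or 1.0  # fall back to 1.0 if the range is empty
    zero_point = round(-lo / scale)
    return scale, zero_point


def quantize(x, scale, zero_point):
    """Map a float to an unsigned 8-bit code, clipping to [0, 255]."""
    q = round(x / scale) + zero_point
    return max(0, min(255, q))


def dequantize(q, scale, zero_point):
    """Map an 8-bit code back to an approximate float."""
    return (q - zero_point) * scale


# Hypothetical activation values collected from synthetic samples.
synthetic_activations = [-1.0, -0.25, 0.0, 0.5, 3.0]

scale, zero_point = calibrate_affine_uint8(synthetic_activations)
roundtrip = [
    dequantize(quantize(x, scale, zero_point), scale, zero_point)
    for x in synthetic_activations
]
```

Each value in `roundtrip` stays within one quantization step (`scale`) of the corresponding input. The calibration set only has to cover the activation range well, which is why samples that merely resemble the training data can be enough for this step.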

Abstract

Recently, an extensive amount of research has been focused on compressing and accelerating Deep Neural Networks (DNN). So far, high compression rate algorithms require part of the training dataset for a low precision calibration, or a fine-tuning process. However, this requirement is unacceptable when the data is unavailable or contains sensitive information, as in medical and biometric use-cases. We present three methods for generating synthetic samples from trained models. Then, we demonstrate how these samples can be used to calibrate and fine-tune quantized models without using any real data in the process. Our best performing method has a negligible accuracy degradation compared to the original training set. This method, which leverages intrinsic batch normalization layers' statistics of the trained model, can be used to evaluate data similarity. Our approach opens a path towards…

Taxonomy

Methods: Batch Normalization