What Do Compressed Deep Neural Networks Forget?
Sara Hooker, Aaron Courville, Gregory Clark, Yann Dauphin, Andrea, Frome

TL;DR
This paper investigates how model compression techniques like pruning and quantization affect the performance of deep neural networks on specific challenging data points, revealing that compression disproportionately impacts atypical and noisy images.
Contribution
It introduces the concept of Pruning Identified Exemplars (PIEs) and shows that compression impacts a small subset of data more severely, especially on long-tail, noisy, and atypical images.
Findings
Models with different sizes perform similarly overall but differ on PIEs.
Compression disproportionately affects long-tail, noisy, and atypical images.
PIEs are more challenging for both humans and algorithms to classify.
Abstract
Deep neural network pruning and quantization techniques have demonstrated it is possible to achieve high levels of compression with surprisingly little degradation to test set accuracy. However, this measure of performance conceals significant differences in how different classes and images are impacted by model compression techniques. We find that models with radically different numbers of weights have comparable top-line performance metrics but diverge considerably in behavior on a narrow subset of the dataset. This small subset of data points, which we term Pruning Identified Exemplars (PIEs) are systematically more impacted by the introduction of sparsity. Compression disproportionately impacts model performance on the underrepresented long-tail of the data distribution. PIEs over-index on atypical or noisy images that are far more challenging for both humans and algorithms to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Generative Adversarial Networks and Image Synthesis
MethodsPruning · Test
