TL;DR
This paper introduces a domain-adaptive low-rank compression method for neural networks that considers activation statistics, enabling more effective compression after domain transfer with minimal accuracy loss.
Contribution
It proposes a novel compression algorithm that incorporates activation statistics, improving upon existing low-rank methods for domain-transferred neural networks.
Findings
Significantly outperforms existing low-rank compression techniques.
Enables over 4x compression of VGG19's fc6 layer with minimal accuracy loss.
Achieves 5-20% of original parameters with minor performance drop.
Abstract
Deep Neural Networks trained on large datasets can be easily transferred to new domains with far fewer labeled examples by a process called fine-tuning. This has the advantage that representations learned in the large source domain can be exploited on smaller target domains. However, networks designed to be optimal for the source task are often prohibitively large for the target task. In this work we address the compression of networks after domain transfer. We focus on compression algorithms based on low-rank matrix decomposition. Existing methods base compression solely on learned network weights and ignore the statistics of network activations. We show that domain transfer leads to large shifts in network activations and that it is desirable to take this into account when compressing. We demonstrate that considering activation statistics when compressing weights leads to a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
