Distribution-Aware Tensor Decomposition for Compression of Convolutional Neural Networks
Alper Kalle, Theo Rudkiewicz, Mohamed-Oumar Ouerfelli, Mohamed Tamaazousti

TL;DR
This paper introduces a data-informed tensor decomposition method for CNN compression that minimizes output distribution change, often eliminating the need for fine-tuning and enabling dataset transferability.
Contribution
It proposes a novel covariance-based norm for tensor decomposition that improves CNN compression by directly optimizing function space error, reducing fine-tuning requirements.
Findings
Achieves competitive accuracy without fine-tuning.
Effective across multiple CNN architectures and datasets.
Enables transfer of covariance-based norms between datasets.
Abstract
Neural networks are widely used for image-related tasks but typically demand considerable computing power. Once a network has been trained, however, its memory- and compute-footprint can be reduced by compression. In this work, we focus on compression through tensorization and low-rank representations. Whereas classical approaches search for a low-rank approximation by minimizing an isotropic norm such as the Frobenius norm in weight-space, we use data-informed norms that measure the error in function space. Concretely, we minimize the change in the layer's output distribution, which can be expressed as where is the square root of the covariance matrix of the layer's input and , are the original and compressed weights. We propose new alternating least square algorithms for the two most common tensor…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsTensor decomposition and applications · Sparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques
