Convolutional Neural Network Compression Based on Low-Rank Decomposition

Yaping He; Linhao Jiang; Di Wu

arXiv:2408.16289·cs.CV·August 30, 2024

Convolutional Neural Network Compression Based on Low-Rank Decomposition

Yaping He, Linhao Jiang, Di Wu

PDF

Open Access

TL;DR

This paper introduces a neural network compression technique combining Variational Bayesian Matrix Factorization with orthogonal regularization to reduce model size while maintaining accuracy, suitable for edge device deployment.

Contribution

It presents a novel integration of VBMF and orthogonal regularization for CNN compression, improving accuracy retention at various compression ratios.

Findings

01

Outperforms existing compression methods at high and low ratios

02

Maintains high accuracy with significant model size reduction

03

Applicable to various CNN architectures

Abstract

Deep neural networks typically impose significant computational loads and memory consumption. Moreover, the large parameters pose constraints on deploying the model on edge devices such as embedded systems. Tensor decomposition offers a clear advantage in compressing large-scale weight tensors. Nevertheless, direct utilization of low-rank decomposition typically leads to significant accuracy loss. This paper proposes a model compression method that integrates Variational Bayesian Matrix Factorization (VBMF) with orthogonal regularization. Initially, the model undergoes over-parameterization and training, with orthogonal regularization applied to enhance its likelihood of achieving the accuracy of the original model. Secondly, VBMF is employed to estimate the rank of the weight tensor at each layer. Our framework is sufficiently general to apply to other convolutional neural networks and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Data Compression Techniques · Advanced Computational Techniques and Applications · Advanced Algorithms and Applications

MethodsOrthogonal Regularization