Tensor network compressibility of convolutional models

Sukhbinder Singh; Saeed S. Jahromi; Roman Orus

arXiv:2403.14379·cs.CV·August 20, 2024·2 cites

Tensor network compressibility of convolutional models

Sukhbinder Singh, Saeed S. Jahromi, Roman Orus

PDF

Open Access

TL;DR

This paper investigates why tensorization-based compression of CNNs often preserves accuracy, revealing that internal correlation structures can be truncated with minimal impact, enabling effective model compression and re-training.

Contribution

It demonstrates that CNN kernels can be truncated along multiple cuts with little accuracy loss, highlighting the intrinsic correlation compression in CNNs and informing better tensorization strategies.

Findings

01

Kernel truncation often preserves classification accuracy

02

Deep layer kernels are more amenable to truncation

03

Re-training can recover accuracy after aggressive truncation

Abstract

Convolutional neural networks (CNNs) are one of the most widely used neural network architectures, showcasing state-of-the-art performance in computer vision tasks. Although larger CNNs generally exhibit higher accuracy, their size can be effectively reduced by ``tensorization'' while maintaining accuracy, namely, replacing the convolution kernels with compact decompositions such as Tucker, Canonical Polyadic decompositions, or quantum-inspired decompositions such as matrix product states, and directly training the factors in the decompositions to bias the learning towards low-rank decompositions. But why doesn't tensorization seem to impact the accuracy adversely? We explore this by assessing how \textit{truncating} the convolution kernels of \textit{dense} (untensorized) CNNs impact their accuracy. Specifically, we truncated the kernels of (i) a vanilla four-layer CNN and (ii)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTensor decomposition and applications

MethodsConvolution