Decomposable-Net: Scalable Low-Rank Compression for Neural Networks

Atsushi Yaguchi; Taiji Suzuki; Shuhei Nitta; Yukinobu Sakata; Akiyuki; Tanizawa

arXiv:1910.13141·cs.LG·September 30, 2021

Decomposable-Net: Scalable Low-Rank Compression for Neural Networks

Atsushi Yaguchi, Taiji Suzuki, Shuhei Nitta, Yukinobu Sakata, Akiyuki, Tanizawa

PDF

1 Repo

TL;DR

Decomposable-Net enables flexible neural network size adjustment via low-rank decomposition without retraining, maintaining or improving performance across sizes, demonstrated on ImageNet with superior accuracy and efficiency.

Contribution

We introduce a novel backpropagation method for low-rank decomposition that allows neural networks to be scaled to any size without retraining, unlike previous fixed-size methods.

Findings

01

Achieves 73.2% top-1 accuracy with 0.27x MACs on ResNet-50.

02

Outperforms Tucker decomposition, Trained Rank Pruning, and universally slimmable networks.

03

Effectively suppresses approximation error with a new rank selection criterion.

Abstract

Compressing DNNs is important for the real-world applications operating on resource-constrained devices. However, we typically observe drastic performance deterioration when changing model size after training is completed. Therefore, retraining is required to resume the performance of the compressed models suitable for different devices. In this paper, we propose Decomposable-Net (the network decomposable in any size), which allows flexible changes to model size without retraining. We decompose weight matrices in the DNNs via singular value decomposition and adjust ranks according to the target model size. Unlike the existing low-rank compression methods that specialize the model to a fixed size, we propose a novel backpropagation scheme that jointly minimizes losses for both of full- and low-rank networks. This enables not only to maintain the performance of a full-rank network {\it…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ygcats/Scalable-Low-Rank-Compression-for-Neural-Networks
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsPruning · TuckER