Layer-Specific Optimization: Sensitivity Based Convolution Layers Basis Search
Vasiliy Alekseev, Ilya Lukashevich, Ilia Zharikov, Ilya Vasiliev

TL;DR
This paper introduces a layer-specific matrix decomposition method for convolutional neural networks that reduces model size and accelerates computation by training only a subset of basis convolutions, with a fast layer selection technique.
Contribution
It proposes a novel basis convolution approach with a fast layer selection method to optimize convolutional layers without degrading model performance.
Findings
Reduces model size significantly
Speeds up forward and backward passes
Maintains accuracy with proper layer selection
Abstract
Deep neural network models have a complex architecture and are overparameterized. The number of parameters is more than the whole dataset, which is highly resource-consuming. This complicates their application and limits its usage on different devices. Reduction in the number of network parameters helps to reduce the size of the model, but at the same time, thoughtlessly applied, can lead to a deterioration in the quality of the network. One way to reduce the number of model parameters is matrix decomposition, where a matrix is represented as a product of smaller matrices. In this paper, we propose a new way of applying the matrix decomposition with respect to the weights of convolutional layers. The essence of the method is to train not all convolutions, but only the subset of convolutions (basis convolutions), and represent the rest as linear combinations of the basis ones.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVLSI and FPGA Design Techniques
MethodsAverage Pooling · Kaiming Initialization · Convolution · Global Average Pooling · Max Pooling
