Exploiting Linear Structure Within Convolutional Networks for Efficient   Evaluation

Remi Denton; Wojciech Zaremba; Joan Bruna; Yann LeCun; Rob Fergus

arXiv:1404.0736·cs.CV·March 15, 2024·1.1k cites

Exploiting Linear Structure Within Convolutional Networks for Efficient Evaluation

Remi Denton, Wojciech Zaremba, Joan Bruna, Yann LeCun, Rob Fergus

PDF

Open Access

TL;DR

This paper introduces methods to accelerate large convolutional networks for object recognition by exploiting linear structures in filters, achieving 2x speedups with minimal accuracy loss on CPU and GPU.

Contribution

It presents novel approximation techniques that leverage linear structures in convolutional filters to significantly reduce computation during evaluation.

Findings

01

2x speedup in convolutional layer evaluation

02

Accuracy within 1% of original models

03

Effective on both CPU and GPU

Abstract

We present techniques for speeding up the test-time evaluation of large convolutional networks, designed for object recognition tasks. These models deliver impressive accuracy but each image evaluation requires millions of floating point operations, making their deployment on smartphones and Internet-scale clusters problematic. The computation is dominated by the convolution operations in the lower layers of the model. We exploit the linear structure present within the convolutional filters to derive approximations that significantly reduce the required computation. Using large state-of-the-art models, we demonstrate we demonstrate speedups of convolutional layers on both CPU and GPU by a factor of 2x, while keeping the accuracy within 1% of the original model.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Sparse and Compressive Sensing Techniques · Domain Adaptation and Few-Shot Learning

MethodsConvolution