Accelerating Very Deep Convolutional Networks for Classification and Detection

Xiangyu Zhang; Jianhua Zou; Kaiming He; Jian Sun

arXiv:1505.06798·cs.CV·November 19, 2015·30 cites

TL;DR

This paper presents a method for accelerating very deep CNNs at test time that takes nonlinear units into account. It achieves a 4x whole-model speedup on VGG-16 with a 0.3% increase in top-5 ImageNet error and applies to both classification and detection.

Contribution

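The paper introduces a layer-wise approximation method that accounts for nonlinear units and solves the resulting nonlinear optimization problem without SGD. Its asymmetric reconstruction reduces the error that accumulates when many layers are approximated, improving speed while maintaining accuracy.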

Findings

01

Achieved 4x speedup on VGG-16 with only 0.3% top-5 error increase.

02

Enabled effective acceleration for deep CNNs in classification and detection.

03

Reduced error accumulation through asymmetric reconstruction when many layers (e.g., >=10) are approximated.

Abstract

This paper aims to accelerate the test-time computation of convolutional neural networks (CNNs), especially very deep CNNs that have substantially impacted the computer vision community. Unlike previous methods that are designed for approximating linear filters or linear responses, our method takes the nonlinear units into account. We develop an effective solution to the resulting nonlinear optimization problem without the need of stochastic gradient descent (SGD). More importantly, while previous methods mainly focus on optimizing one or two layers, our nonlinear method enables an asymmetric reconstruction that reduces the rapidly accumulated error when multiple (e.g., >=10) layers are approximated. For the widely used very deep VGG-16 model, our method achieves a whole-model speedup of 4x with merely a 0.3% increase of top-5 error in ImageNet classification. Our 4x accelerated VGG-16…
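The abstract contrasts the paper's method with earlier approaches that approximate linear responses. As a minimal sketch of that linear baseline only, and not of the paper's nonlinear, asymmetric solver, one common way to approximate a layer's linear responses is a low-rank projection obtained by PCA: a layer with d filters is replaced by `rank` filters followed by a d x `rank` projection. The function names, array shapes, and cost model below are illustrative assumptions, not taken from the paper.

```python
import numpy as np


def low_rank_response_approx(W, b, X, rank):
    """Approximate y = W x + b by P (W1 x) + b2 using PCA on sampled responses.

    W: (d, k) filters flattened to rows, b: (d,) biases,
    X: (n, k) sampled input patches, rank: number of filters to keep.
    """
    Y = X @ W.T + b                      # (n, d) linear responses
    y_mean = Y.mean(axis=0)
    Yc = Y - y_mean                      # centered responses
    # Eigenvectors of the (unnormalized) covariance; eigh returns ascending order.
    _, eigvecs = np.linalg.eigh(Yc.T @ Yc)
    U = eigvecs[:, ::-1][:, :rank]       # (d, rank) top principal directions
    W1 = U.T @ W                         # (rank, k) reduced filter bank
    P = U                                # (d, rank) projection back to d channels
    b2 = y_mean + U @ (U.T @ (b - y_mean))
    return W1, P, b2


def apply_approx(X, W1, P, b2):
    """Run the two-stage approximate layer on input patches X."""
    return (X @ W1.T) @ P.T + b2


def cost_ratio(d, k, rank):
    """Multiply-add ratio per output position: original / approximated (assumed cost model)."""
    return (d * k) / (rank * k + d * rank)
```

When the filters are exactly low rank, this reconstruction is exact; in practice the rank trades accuracy for speed. The sketch ignores the nonlinearity that follows each layer, which is the gap the abstract says the paper's method addresses.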

Taxonomy

Topics: Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning

Methods: Softmax · Convolution · RoIPool · Fast R-CNN