ShiftCNN: Generalized Low-Precision Architecture for Inference of   Convolutional Neural Networks

Denis A. Gudovskiy; Luca Rigazio

arXiv:1706.02393·cs.CV·June 9, 2017·53 cites

ShiftCNN: Generalized Low-Precision Architecture for Inference of Convolutional Neural Networks

Denis A. Gudovskiy, Luca Rigazio

PDF

Open Access 1 Repo

TL;DR

ShiftCNN introduces a low-precision, multiplierless CNN architecture using shift and addition operations, significantly reducing computational costs and power consumption while maintaining high accuracy.

Contribution

The paper presents ShiftCNN, a novel low-precision CNN architecture based on power-of-two weights, enabling efficient inference on custom accelerators without retraining.

Findings

01

Reduces convolutional layer computations by over 100x.

02

Maintains less than 1% accuracy drop on ImageNet.

03

Decreases power consumption by a factor of 4 on FPGA simulations.

Abstract

In this paper we introduce ShiftCNN, a generalized low-precision architecture for inference of multiplierless convolutional neural networks (CNNs). ShiftCNN is based on a power-of-two weight representation and, as a result, performs only shift and addition operations. Furthermore, ShiftCNN substantially reduces computational cost of convolutional layers by precomputing convolution terms. Such an optimization can be applied to any CNN architecture with a relatively small codebook of weights and allows to decrease the number of product operations by at least two orders of magnitude. The proposed architecture targets custom inference accelerators and can be realized on FPGAs or ASICs. Extensive evaluation on ImageNet shows that the state-of-the-art CNNs can be converted without retraining into ShiftCNN with less than 1% drop in accuracy when the proposed quantization algorithm is employed.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

gudovskiy/ShiftCNN
caffe2Official

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning

MethodsConvolution