DSConv: Efficient Convolution Operator

Marcelo Gennari; Roger Fawcett; Victor Adrian Prisacariu

arXiv:1901.01928·cs.CV·November 11, 2019·31 cites

DSConv: Efficient Convolution Operator

Marcelo Gennari, Roger Fawcett, Victor Adrian Prisacariu

PDF

Open Access 1 Repo

TL;DR

DSConv is a novel 4-bit quantized convolution operator that efficiently replaces standard convolutions, enabling near-original accuracy without retraining, suitable for pre-trained models and unlabelled data scenarios.

Contribution

Introduces DSConv, a flexible 4-bit quantized convolution operator that maintains accuracy and reduces computational cost without retraining.

Findings

01

Achieves less than 1% accuracy loss with 4-bit quantization.

02

Demonstrates state-of-the-art results across multiple architectures.

03

Improves results further with distillation-based adaptation.

Abstract

Quantization is a popular way of increasing the speed and lowering the memory usage of Convolution Neural Networks (CNNs). When labelled training data is available, network weights and activations have successfully been quantized down to 1-bit. The same cannot be said about the scenario when labelled training data is not available, e.g. when quantizing a pre-trained model, where current approaches show, at best, no loss of accuracy at 8-bit quantizations. We introduce DSConv, a flexible quantized convolution operator that replaces single-precision operations with their far less expensive integer counterparts, while maintaining the probability distributions over both the kernel weights and the outputs. We test our model as a plug-and-play replacement for standard convolution on most popular neural network architectures, ResNet, DenseNet, GoogLeNet, AlexNet and VGG-Net and demonstrate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ActiveVisionLab/NUQ
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Batch Normalization · Bottleneck Residual Block · Residual Connection · Residual Block · Average Pooling · Local Response Normalization · Auxiliary Classifier · Inception Module · Concatenated Skip Connection