DSConv: Efficient Convolution Operator
Marcelo Gennari, Roger Fawcett, Victor Adrian Prisacariu

TL;DR
DSConv is a novel 4-bit quantized convolution operator that efficiently replaces standard convolutions, enabling near-original accuracy without retraining, suitable for pre-trained models and unlabelled data scenarios.
Contribution
Introduces DSConv, a flexible 4-bit quantized convolution operator that maintains accuracy and reduces computational cost without retraining.
Findings
Achieves less than 1% accuracy loss with 4-bit quantization.
Demonstrates state-of-the-art results across multiple architectures.
Improves results further with distillation-based adaptation.
Abstract
Quantization is a popular way of increasing the speed and lowering the memory usage of Convolution Neural Networks (CNNs). When labelled training data is available, network weights and activations have successfully been quantized down to 1-bit. The same cannot be said about the scenario when labelled training data is not available, e.g. when quantizing a pre-trained model, where current approaches show, at best, no loss of accuracy at 8-bit quantizations. We introduce DSConv, a flexible quantized convolution operator that replaces single-precision operations with their far less expensive integer counterparts, while maintaining the probability distributions over both the kernel weights and the outputs. We test our model as a plug-and-play replacement for standard convolution on most popular neural network architectures, ResNet, DenseNet, GoogLeNet, AlexNet and VGG-Net and demonstrate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Batch Normalization · Bottleneck Residual Block · Residual Connection · Residual Block · Average Pooling · Local Response Normalization · Auxiliary Classifier · Inception Module · Concatenated Skip Connection
