CTMQ: Cyclic Training of Convolutional Neural Networks with Multiple   Quantization Steps

HyunJin Kim; Jungwoo Shin; Alberto A. Del Barrio

arXiv:2206.12794·cs.CV·June 28, 2022·1 cites

CTMQ: Cyclic Training of Convolutional Neural Networks with Multiple Quantization Steps

HyunJin Kim, Jungwoo Shin, Alberto A. Del Barrio

PDF

Open Access

TL;DR

This paper introduces a cyclic training method for low-bit quantized CNNs that iteratively refines weights across multiple quantization steps, significantly improving accuracy on complex datasets like ImageNet.

Contribution

The proposed cyclic training approach with multiple quantization steps enhances low-bit CNN performance by leveraging iterative knowledge transfer from higher-precision models.

Findings

01

Improved Top-1 accuracy by 5.80% on ImageNet

02

Enhanced Top-5 accuracy by 6.85% on ImageNet

03

Effective in training binarized ResNet-18 models

Abstract

This paper proposes a training method having multiple cyclic training for achieving enhanced performance in low-bit quantized convolutional neural networks (CNNs). Quantization is a popular method for obtaining lightweight CNNs, where the initialization with a pretrained model is widely used to overcome degraded performance in low-resolution quantization. However, large quantization errors between real values and their low-bit quantized ones cause difficulties in achieving acceptable performance for complex networks and large datasets. The proposed training method softly delivers the knowledge of pretrained models to low-bit quantized models in multiple quantization steps. In each quantization step, the trained weights of a model are used to initialize the weights of the next model with the quantization bit depth reduced by one. With small change of the quantization bit depth, the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques