Exploring the Potential of Low-bit Training of Convolutional Neural   Networks

Kai Zhong; Xuefei Ning; Guohao Dai; Zhenhua Zhu; Tianchen Zhao; Shulin; Zeng; Yu Wang; Huazhong Yang

arXiv:2006.02804·cs.LG·July 15, 2021·1 cites

Exploring the Potential of Low-bit Training of Convolutional Neural Networks

Kai Zhong, Xuefei Ning, Guohao Dai, Zhenhua Zhu, Tianchen Zhao, Shulin, Zeng, Yu Wang, Huazhong Yang

PDF

Open Access

TL;DR

This paper introduces a low-bit training framework for CNNs using a novel multi-level scaling tensor format, significantly reducing energy consumption while maintaining accuracy.

Contribution

It presents the MLS tensor format and dynamic quantization methods, enabling effective low-bit training with improved energy efficiency and accuracy trade-offs.

Findings

01

Achieves within 1% accuracy loss using 1-bit mantissa and 2-bit exponent on CIFAR-10.

02

Maintains within 1% accuracy loss with 4-bit mantissa and 2-bit exponent on ImageNet.

03

Provides 8.3 to 10.2 times higher energy efficiency compared to full-precision training.

Abstract

In this work, we propose a low-bit training framework for convolutional neural networks, which is built around a novel multi-level scaling (MLS) tensor format. Our framework focuses on reducing the energy consumption of convolution operations by quantizing all the convolution operands to low bit-width format. Specifically, we propose the MLS tensor format, in which the element-wise bit-width can be largely reduced. Then, we describe the dynamic quantization and the low-bit tensor convolution arithmetic to leverage the MLS tensor format efficiently. Experiments show that our framework achieves a superior trade-off between the accuracy and the bit-width than previous low-bit training frameworks. For training a variety of models on CIFAR-10, using 1-bit mantissa and 2-bit exponent is adequate to keep the accuracy loss within $1%$ . And on larger datasets like ImageNet, using 4-bit mantissa…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Tensor decomposition and applications · Computational Physics and Python Applications

MethodsConvolution