Exploring the Potential of Low-bit Training of Convolutional Neural Networks
Kai Zhong, Xuefei Ning, Guohao Dai, Zhenhua Zhu, Tianchen Zhao, Shulin, Zeng, Yu Wang, Huazhong Yang

TL;DR
This paper introduces a low-bit training framework for CNNs using a novel multi-level scaling tensor format, significantly reducing energy consumption while maintaining accuracy.
Contribution
It presents the MLS tensor format and dynamic quantization methods, enabling effective low-bit training with improved energy efficiency and accuracy trade-offs.
Findings
Achieves within 1% accuracy loss using 1-bit mantissa and 2-bit exponent on CIFAR-10.
Maintains within 1% accuracy loss with 4-bit mantissa and 2-bit exponent on ImageNet.
Provides 8.3 to 10.2 times higher energy efficiency compared to full-precision training.
Abstract
In this work, we propose a low-bit training framework for convolutional neural networks, which is built around a novel multi-level scaling (MLS) tensor format. Our framework focuses on reducing the energy consumption of convolution operations by quantizing all the convolution operands to low bit-width format. Specifically, we propose the MLS tensor format, in which the element-wise bit-width can be largely reduced. Then, we describe the dynamic quantization and the low-bit tensor convolution arithmetic to leverage the MLS tensor format efficiently. Experiments show that our framework achieves a superior trade-off between the accuracy and the bit-width than previous low-bit training frameworks. For training a variety of models on CIFAR-10, using 1-bit mantissa and 2-bit exponent is adequate to keep the accuracy loss within . And on larger datasets like ImageNet, using 4-bit mantissa…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Tensor decomposition and applications · Computational Physics and Python Applications
MethodsConvolution
