Towards Efficient Full 8-bit Integer DNN Online Training on Resource-limited Devices without Batch Normalization
Yukuan Yang, Xiaowei Chi, Lei Deng, Tianyi Yan, Feng Gao, Guoqi Li

TL;DR
This paper introduces EOQ, a novel framework enabling efficient full 8-bit integer online training of deep neural networks without batch normalization, significantly reducing computational costs on resource-limited devices while maintaining high accuracy.
Contribution
The paper proposes EOQ, combining Fixup initialization and a new quantization scheme, to achieve full 8-bit integer training and BN removal in large-scale DNNs, with theoretical analysis and practical benefits.
Findings
EOQ enables full 8-bit integer training of DNNs.
Removing BN simplifies chip design and accelerates processing.
EOQ maintains state-of-the-art accuracy with reduced computational cost.
Abstract
Huge computational costs brought by convolution and batch normalization (BN) have caused great challenges for the online training and corresponding applications of deep neural networks (DNNs), especially in resource-limited devices. Existing works only focus on the convolution or BN acceleration and no solution can alleviate both problems with satisfactory performance. Online training has gradually become a trend in resource-limited devices like mobile phones while there is still no complete technical scheme with acceptable model performance, processing speed, and computational cost. In this research, an efficient online-training quantization framework termed EOQ is proposed by combining Fixup initialization and a novel quantization scheme for DNN model compression and acceleration. Based on the proposed framework, we have successfully realized full 8-bit integer network training and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Reservoir Computing · Neural Networks and Applications · Machine Learning and ELM
MethodsFixup Initialization · Batch Normalization · Convolution
