Block-Wise Dynamic-Precision Neural Network Training Acceleration via Online Quantization Sensitivity Analytics
Ruoyang Liu, Chenhan Wei, Yixiong Yang, Wenxun Wang, Huazhong Yang,, Yongpan Liu

TL;DR
DYNASTY is a novel block-wise dynamic-precision training framework that uses online sensitivity analytics to accelerate neural network training and reduce energy consumption without sacrificing accuracy.
Contribution
It introduces a fast online analytics method for data sensitivity and an adaptive bit-width map generator for stable low-bit quantized training.
Findings
Up to 5.1x training speedup on CIFAR-100 and ImageNet.
Up to 4.7x energy reduction with no accuracy loss.
Effective block-wise dynamic-precision quantization.
Abstract
Data quantization is an effective method to accelerate neural network training and reduce power consumption. However, it is challenging to perform low-bit quantized training: the conventional equal-precision quantization will lead to either high accuracy loss or limited bit-width reduction, while existing mixed-precision methods offer high compression potential but failed to perform accurate and efficient bit-width assignment. In this work, we propose DYNASTY, a block-wise dynamic-precision neural network training framework. DYNASTY provides accurate data sensitivity information through fast online analytics, and maintains stable training convergence with an adaptive bit-width map generator. Network training experiments on CIFAR-100 and ImageNet dataset are carried out, and compared to 8-bit quantization baseline, DYNASTY brings up to speedup and energy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Image Enhancement Techniques
