Adaptive Precision Training (AdaPT): A dynamic fixed point quantized training approach for DNNs
Lorenz Kummer, Kevin Sidak, Tabea Reichmann, Wilfried Gansterer

TL;DR
AdaPT introduces a dynamic fixed-point quantization method for DNN training that adaptively adjusts precision per layer, leading to faster training and inference with minimal accuracy loss.
Contribution
It proposes a novel adaptive precision training strategy that optimizes bit-widths per layer based on information theory, improving resource efficiency over fixed global precision methods.
Findings
Achieves 1.27x training speedup over float32 baseline.
Attains 0.98% higher accuracy on CIFAR datasets.
Realizes 2.33x inference speedup with reduced model size.
Abstract
Quantization is a technique for reducing deep neural networks (DNNs) training and inference times, which is crucial for training in resource constrained environments or applications where inference is time critical. State-of-the-art (SOTA) quantization approaches focus on post-training quantization, i.e., quantization of pre-trained DNNs for speeding up inference. While work on quantized training exists, most approaches require refinement in full precision (usually single precision) in the final training phase or enforce a global word length across the entire DNN. This leads to suboptimal assignments of bit-widths to layers and, consequently, suboptimal resource usage. In an attempt to overcome such limitations, we introduce AdaPT, a new fixed-point quantized sparsifying training strategy. AdaPT decides about precision switches between training epochs based on information theoretic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Advanced Image and Video Retrieval Techniques
