BiTAT: Neural Network Binarization with Task-dependent Aggregated Transformation
Geon Park, Jaehong Yoon, Haiyang Zhang, Xing Zhang, Sung Ju Hwang,, Yonina C. Eldar

TL;DR
This paper introduces BiTAT, a novel quantization-aware training method that uses task-dependent transformations to effectively binarize neural networks, especially compact models, with minimal performance loss.
Contribution
The paper proposes a new orthonormal transformation approach that disentangles weights based on importance, improving extreme quantization performance for edge-device neural networks.
Findings
Reduces performance loss in 1-bit quantized MobileNets on ImageNet.
Preserves full-precision accuracy on CIFAR-100 with compact backbones.
Outperforms existing quantization baselines in various benchmarks.
Abstract
Neural network quantization aims to transform high-precision weights and activations of a given neural network into low-precision weights/activations for reduced memory usage and computation, while preserving the performance of the original model. However, extreme quantization (1-bit weight/1-bit activations) of compactly-designed backbone architectures (e.g., MobileNets) often used for edge-device deployments results in severe performance degeneration. This paper proposes a novel Quantization-Aware Training (QAT) method that can effectively alleviate performance degeneration even with extreme quantization by focusing on the inter-weight dependencies, between the weights within each layer and across consecutive layers. To minimize the quantization impact of each weight on others, we perform an orthonormal transformation of the weights at each layer by training an input-dependent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · COVID-19 diagnosis using AI · Brain Tumor Detection and Classification
