Binary Quadratic Quantization: Beyond First-Order Quantization for Real-Valued Matrix Compression
Kyo Kuroki, Yasuyuki Okoshi, Thiem Van Chu, Kazushi Kawamura, Masato Motomura

TL;DR
This paper introduces Binary Quadratic Quantization (BQQ), a new matrix compression method that surpasses traditional approaches by using quadratic binary expressions, achieving better memory efficiency and accuracy in neural network compression tasks.
Contribution
The paper presents BQQ, a novel quadratic binary matrix quantization technique that improves compression performance over first-order methods without relying on PTQ-specific optimization.
Findings
BQQ outperforms traditional methods in matrix compression benchmarks.
BQQ achieves up to 2.2% improvement on ImageNet in PTQ tasks.
BQQ demonstrates strong performance even without specialized optimization.
Abstract
This paper proposes a novel matrix quantization method, Binary Quadratic Quantization (BQQ). In contrast to conventional first-order quantization approaches, such as uniform quantization and binary coding quantization, that approximate real-valued matrices via linear combinations of binary bases, BQQ leverages the expressive power of binary quadratic expressions while maintaining an extremely compact data format. We validate our approach with two experiments: a matrix compression benchmark and post-training quantization (PTQ) on pretrained Vision Transformer-based models. Experimental results demonstrate that BQQ consistently achieves a superior trade-off between memory efficiency and reconstruction error than conventional methods for compressing diverse matrix data. It also delivers strong PTQ performance, even though we neither target state-of-the-art PTQ accuracy under tight memory…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Data Compression Techniques · Advanced Neural Network Applications · Advanced Image and Video Retrieval Techniques
