iFairy: the First 2-bit Complex LLM with All Parameters in $\{\pm1, \pm i\}$
Feiyu Wang, Guoan Wang, Yihao Zhang, Shengfan Wang, Weitao Li, Bokai Huang, Shimao Chen, Zihan Jiang, Rui Xu, Tong Yang

TL;DR
This paper introduces Fairy±i, a novel 2-bit complex-valued LLM quantization method that surpasses existing accuracy ceilings by leveraging complex domain advantages and symmetric representations, enabling efficient, multiplication-free inference.
Contribution
It proposes the first 2-bit complex-valued LLM quantization framework that exceeds the accuracy of full-precision models by utilizing complex domain representations and symmetric weight mappings.
Findings
Outperforms existing 2-bit quantization methods in PPL and downstream tasks.
Enables multiplication-free inference with addition and swaps.
Maintains strict storage and compute efficiency.
Abstract
Quantization-Aware Training (QAT) integrates quantization into the training loop, enabling LLMs to learn robust low-bit representations, and is widely recognized as one of the most promising research directions. All current QAT research focuses on minimizing quantization error on full-precision models, where the full-precision accuracy acts as an upper bound (accuracy ceiling). No existing method has even attempted to surpass this ceiling. To break this ceiling, we propose a new paradigm: raising the ceiling (full-precision model), and then still quantizing it efficiently into 2 bits. We propose Fairy, the first 2-bit quantization framework for complex-valued LLMs. Specifically, our method leverages the representational advantages of the complex domain to boost full-precision accuracy. We map weights to the fourth roots of unity , forming a perfectly symmetric…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Storage Technologies · Algorithms and Data Compression · Magnetic confinement fusion research
