TL;DR
PolarQuant is a novel post-training quantization method for large language models that uses Hadamard rotation to transform weights, enabling near-lossless compression with minimal performance loss.
Contribution
The paper introduces PolarQuant, a three-stage quantization process leveraging Hadamard rotation to significantly improve LLM weight compression without calibration data.
Findings
Hadamard rotation accounts for 98% of quality improvement.
PolarQuant reduces Qwen3.5-9B perplexity from 6.90 to 6.40.
PolarQuant enables effective INT4 quantization with minimal perplexity increase.
Abstract
We present PolarQuant, a post-training weight quantization method for large language models (LLMs) that exploits the distributional structure of neural network weights to achieve near-lossless compression. PolarQuant operates in three stages: (1) block-wise normalization to the unit hypersphere, (2) Walsh-Hadamard rotation to transform coordinates into approximately Gaussian random variables, and (3) quantization with centroids matched to the Gaussian distribution. Our ablation reveals that Hadamard rotation alone accounts for 98% of the quality improvement, reducing Qwen3.5-9B perplexity from 6.90 (absmax Q5) to 6.40 (Delta = +0.03 from FP16), making it practically lossless without any calibration data. Furthermore, PolarQuant functions as an effective preprocessing step for downstream INT4 quantizers: PolarQuant Q5 dequantized and re-quantized by torchao INT4 achieves perplexity 6.56…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗caiovicentino1/Qwopus3.5-9B-v3-HLWQ-MLX-4bitmodel· 401 dl· ♡ 8401 dl♡ 8
- 🤗caiovicentino1/Gemma-4-26B-A4B-it-HLWQ-Q5model· 4 dl· ♡ 84 dl♡ 8
- 🤗caiovicentino1/LTX-2.3-22B-HLWQ-Q5model· ♡ 35♡ 35
- 🤗caiovicentino1/Qwen3.5-9B-HLWQ-Engine-v4model· 10 dl10 dl
- 🤗caiovicentino1/Qwen3.5-9B-HLWQ-Q5model· 221 dl· ♡ 3221 dl♡ 3
- 🤗caiovicentino1/Qwen3.5-9B-HLWQ-MLX-4bitmodel· 175 dl· ♡ 3175 dl♡ 3
- 🤗caiovicentino1/Qwen3.5-27B-HLWQ-Q5model· 35 dl· ♡ 1035 dl♡ 10
- 🤗caiovicentino1/Qwen3.5-9B-Claude-Opus-HLWQ-Q5model· 31 dl· ♡ 331 dl♡ 3
- 🤗caiovicentino1/Qwen3.5-27B-Claude-Opus-HLWQ-Q5model· 270 dl270 dl
- 🤗caiovicentino1/Qwopus3.5-9B-v3-HLWQ-Q5model· 55 dl· ♡ 955 dl♡ 9
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
