PolarQuant: Optimal Gaussian Weight Quantization via Hadamard Rotation for LLM Compression

Caio Vicentino

arXiv:2603.29078·cs.CL·April 22, 2026

PolarQuant: Optimal Gaussian Weight Quantization via Hadamard Rotation for LLM Compression

Caio Vicentino

PDF

1 Repo 37 Models

TL;DR

PolarQuant is a novel post-training quantization method for large language models that uses Hadamard rotation to transform weights, enabling near-lossless compression with minimal performance loss.

Contribution

The paper introduces PolarQuant, a three-stage quantization process leveraging Hadamard rotation to significantly improve LLM weight compression without calibration data.

Findings

01

Hadamard rotation accounts for 98% of quality improvement.

02

PolarQuant reduces Qwen3.5-9B perplexity from 6.90 to 6.40.

03

PolarQuant enables effective INT4 quantization with minimal perplexity increase.

Abstract

We present PolarQuant, a post-training weight quantization method for large language models (LLMs) that exploits the distributional structure of neural network weights to achieve near-lossless compression. PolarQuant operates in three stages: (1) block-wise normalization to the unit hypersphere, (2) Walsh-Hadamard rotation to transform coordinates into approximately Gaussian random variables, and (3) quantization with centroids matched to the Gaussian distribution. Our ablation reveals that Hadamard rotation alone accounts for 98% of the quality improvement, reducing Qwen3.5-9B perplexity from 6.90 (absmax Q5) to 6.40 (Delta = +0.03 from FP16), making it practically lossless without any calibration data. Furthermore, PolarQuant functions as an effective preprocessing step for downstream INT4 quantizers: PolarQuant Q5 dequantized and re-quantized by torchao INT4 achieves perplexity 6.56…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

null
github

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.