Coverage-Based Calibration for Post-Training Quantization via Weighted Set Cover over Outlier Channels
Ibne Farabi Shihab, Sanjeda Akter, Anuj Sharma

TL;DR
This paper introduces COVERCAL, a calibration selection method for Post-Training Quantization that uses weighted outlier channel coverage to improve model compression quality, especially with limited calibration data.
Contribution
It formulates calibration sample selection as a weighted set cover problem over outlier channels, providing a principled and efficient approach with theoretical justification.
Findings
COVERCAL outperforms baselines across multiple models and evaluation metrics.
Significant improvements in MMLU and perplexity at small calibration budgets.
The method requires no GPU time at selection and is based on activation statistics.
Abstract
Post-Training Quantization (PTQ) compresses large language models to low bit-widths using a small calibration set, and its quality depends strongly on which samples are chosen. We identify a failure mode in which calibration samples fail to activate outlier channels, hidden dimensions with unusually large activations, causing the quantizer to underestimate their dynamic range and producing per-channel reconstruction errors that dominate layer-wise loss. Motivated by this observation, we argue that PTQ calibration quality is governed more by weighted outlier-channel coverage than by generic sample representativeness, and formulate calibration selection as a weighted set cover problem over outlier channels. The objective is monotone submodular, and the greedy algorithm, COVERCAL, operates on pre-computed activation statistics and requires no GPU time at selection. We further show that the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
