Benford's Law as a Distributional Prior for Post-Training Quantization of Large Language Models
Arthur Negr\~ao, Pedro Silva, Vander L. S. Freitas, Gladston Moreira, Eduardo Luz

TL;DR
This paper introduces Benford-Quant, a non-uniform quantizer based on Benford's Law, which improves weight quantization in large language models by allocating more resolution to small weights, leading to better perplexity scores.
Contribution
The paper presents a novel Benford-inspired quantization method that is data-free, simple to implement, and enhances the accuracy of LLMs in low-bit regimes compared to uniform quantization.
Findings
Benford-Quant improves perplexity on small language models by over 10% with 4-bit quantization.
Transformer layer weights closely follow Benford's Law, unlike normalization layers.
Benford-Quant remains competitive on larger models, with benefits explained by over-parameterization effects.
Abstract
The rapid growth of Large Language Models (LLMs) intensifies the need for effective compression, with weight quantization being the most widely adopted technique. Standard uniform quantizers assume that parameters are evenly distributed, an assumption at odds with the highly skewed distributions observed in practice. We propose Benford-Quant, a simple, data-free non-uniform quantizer inspired by Benford's Law, which predicts that leading digits follow a logarithmic distribution. Benford-Quant replaces the uniform grid with a log-spaced codebook, dedicating more resolution to the frequent small-magnitude weights. We provide both theoretical intuition and empirical evidence: (i) weights in transformer transformational layers adhere closely to Benford statistics, while normalization layers systematically deviate; (ii) on Small Language Models (SLMs), Benford-Quant consistently improves…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBenford’s Law and Fraud Detection · Computational Physics and Python Applications · Explainable Artificial Intelligence (XAI)
