Benford's Law as a Distributional Prior for Post-Training Quantization of Large Language Models

Arthur Negr\~ao; Pedro Silva; Vander L. S. Freitas; Gladston Moreira; Eduardo Luz

arXiv:2602.00165·cs.LG·February 3, 2026

Benford's Law as a Distributional Prior for Post-Training Quantization of Large Language Models

Arthur Negr\~ao, Pedro Silva, Vander L. S. Freitas, Gladston Moreira, Eduardo Luz

PDF

Open Access

TL;DR

This paper introduces Benford-Quant, a non-uniform quantizer based on Benford's Law, which improves weight quantization in large language models by allocating more resolution to small weights, leading to better perplexity scores.

Contribution

The paper presents a novel Benford-inspired quantization method that is data-free, simple to implement, and enhances the accuracy of LLMs in low-bit regimes compared to uniform quantization.

Findings

01

Benford-Quant improves perplexity on small language models by over 10% with 4-bit quantization.

02

Transformer layer weights closely follow Benford's Law, unlike normalization layers.

03

Benford-Quant remains competitive on larger models, with benefits explained by over-parameterization effects.

Abstract

The rapid growth of Large Language Models (LLMs) intensifies the need for effective compression, with weight quantization being the most widely adopted technique. Standard uniform quantizers assume that parameters are evenly distributed, an assumption at odds with the highly skewed distributions observed in practice. We propose Benford-Quant, a simple, data-free non-uniform quantizer inspired by Benford's Law, which predicts that leading digits follow a logarithmic distribution. Benford-Quant replaces the uniform grid with a log-spaced codebook, dedicating more resolution to the frequent small-magnitude weights. We provide both theoretical intuition and empirical evidence: (i) weights in transformer transformational layers adhere closely to Benford statistics, while normalization layers systematically deviate; (ii) on Small Language Models (SLMs), Benford-Quant consistently improves…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBenford’s Law and Fraud Detection · Computational Physics and Python Applications · Explainable Artificial Intelligence (XAI)