Benford's Curse: Tracing Digit Bias to Numerical Hallucination in LLMs
Jiandong Shao, Yao Lu, Jianfei Yang

TL;DR
This paper investigates how digit bias in large language models, influenced by training data statistics like Benford's Law, causes numerical hallucinations, and demonstrates that targeted neuron pruning can reduce this bias.
Contribution
It uncovers the link between corpus digit distributions and LLM numerical biases, and shows how pruning specific neurons mitigates hallucinations.
Findings
LLMs exhibit digit bias resembling Benford's Law.
Pruning digit-selective neurons reduces bias and improves numerical accuracy.
Corpus digit distributions influence LLM numerical behavior.
Abstract
Large Language Models (LLMs) exhibit impressive performance on complex reasoning tasks, yet they frequently fail on basic numerical problems, producing incorrect outputs. Inspired by Benford's Law, a statistical pattern in which lower digits occur more frequently as leading digits, we hypothesize that the skewed digit distributions in web-collected corpora may be learned by LLMs during pretraining, leading to biased numerical generation. To investigate the hypothesis, we first examine whether digits frequencies in pretraining corpus (OLMo2) follows Benford's law. We then construct an evaluation benchmark in which the ground-truth digits are uniformly distributed within each of the seven numerical reasoning tasks. Our evaluation results demonstrate that leading open-source LLMs show a consistent pattern of digit bias that resembles Benford's law. Through logit-lens tracing and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBenford’s Law and Fraud Detection · Academic integrity and plagiarism
