On the Entropy Calibration of Language Models
Steven Cao, Gregory Valiant, Percy Liang

TL;DR
This paper investigates whether larger language models naturally improve entropy calibration with scale, finds that miscalibration scales slowly, and explores theoretical methods to improve calibration without sacrificing diversity or increasing error.
Contribution
It provides a theoretical analysis of miscalibration scaling in language models and empirically shows that larger models do not significantly improve calibration, proposing a theoretical approach for better calibration.
Findings
Miscalibration improves very slowly with model scale for certain data distributions.
Larger models tend to accumulate error at similar rates as smaller models.
Theoretically, entropy can be reduced without increasing log loss if a predictive entropy model is used.
Abstract
We study the problem of entropy calibration, which asks whether a language model's entropy over generations matches its log loss on human text. Past work found that models are miscalibrated, with entropy per step increasing as generations grow longer, due to error accumulation. To calibrate the model and improve text quality, it has become standard practice to truncate the distribution, but this approach reduces output diversity, which we would like to avoid. Therefore, in this paper, we ask: does miscalibration improve automatically with scale, and if not, is it theoretically possible to calibrate without tradeoffs? To build intuition, we first study a simplified theoretical setting to characterize the scaling behavior of miscalibration with respect to dataset size. We find that the rate of scaling depends on the power law exponent of the data distribution -- in particular, for a power…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsComputational and Text Analysis Methods · Topic Modeling · Text Readability and Simplification
