Geometric Scaling of Bayesian Inference in LLMs
Naman Agarwal, Siddhartha R. Dalal, Vishal Misra

TL;DR
This paper demonstrates that large language models preserve a geometric structure enabling Bayesian inference, with last-layer representations correlating with predictive uncertainty across various models and prompts.
Contribution
It reveals that the geometric signature of Bayesian inference persists in production models and shows how this structure relates to uncertainty and model behavior.
Findings
Last-layer representations align along a dominant axis correlated with entropy.
Prompt restrictions collapse the geometric structure into low-dimensional manifolds.
Perturbing the entropy axis disrupts local uncertainty geometry without degrading Bayesian behavior.
Abstract
Recent work has shown that small transformers trained in controlled "wind-tunnel'' settings can implement exact Bayesian inference, and that their training dynamics produce a geometric substrate -- low-dimensional value manifolds and progressively orthogonal keys -- that encodes posterior structure. We investigate whether this geometric signature persists in production-grade language models. Across Pythia, Phi-2, Llama-3, and Mistral families, we find that last-layer value representations organize along a single dominant axis whose position strongly correlates with predictive entropy, and that domain-restricted prompts collapse this structure into the same low-dimensional manifolds observed in synthetic settings. To probe the role of this geometry, we perform targeted interventions on the entropy-aligned axis of Pythia-410M during in-context learning. Removing or perturbing this axis…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLanguage and cultural evolution · Embodied and Extended Cognition · Generative Adversarial Networks and Image Synthesis
