Filtering Beats Fine Tuning: A Bayesian Kalman View of In Context Learning in LLMs
Andrew Kiruluta

TL;DR
This paper introduces a Bayesian Kalman filter framework to interpret inference-time adaptation in large language models as sequential state estimation, providing theoretical insights into in-context learning and test-time adaptation without parameter updates.
Contribution
It formulates task-specific learning in LLMs as Bayesian filtering, establishing stability, convergence, and uncertainty dynamics, and unifies various adaptation methods under a probabilistic framework.
Findings
Inference-time learning driven by covariance collapse precedes convergence of the posterior mean.
The Bayesian filter is proven to be stable with exponential covariance contraction rates.
Optimization-based adaptation is shown as a degenerate limit of Bayesian filtering.
Abstract
We present a theory-first framework that interprets inference-time adaptation in large language models (LLMs) as online Bayesian state estimation. Rather than modeling rapid adaptation as implicit optimization or meta-learning, we formulate task- and context-specific learning as the sequential inference of a low-dimensional latent adaptation state governed by a linearized state-space model. Under Gaussian assumptions, adaptation follows a Kalman recursion with closed-form updates for both the posterior mean and covariance. This perspective elevates epistemic uncertainty to an explicit dynamical variable. We show that inference-time learning is driven by covariance collapse, i.e., rapid contraction of posterior uncertainty induced by informative tokens, which typically precedes convergence of the posterior mean. Using observability conditions on token-level Jacobians, we establish…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Language and cultural evolution · Speech and dialogue systems
