Filtering Beats Fine Tuning: A Bayesian Kalman View of In Context Learning in LLMs

Andrew Kiruluta

arXiv:2601.06100·cs.LG·January 13, 2026

Filtering Beats Fine Tuning: A Bayesian Kalman View of In Context Learning in LLMs

Andrew Kiruluta

PDF

Open Access

TL;DR

This paper introduces a Bayesian Kalman filter framework to interpret inference-time adaptation in large language models as sequential state estimation, providing theoretical insights into in-context learning and test-time adaptation without parameter updates.

Contribution

It formulates task-specific learning in LLMs as Bayesian filtering, establishing stability, convergence, and uncertainty dynamics, and unifies various adaptation methods under a probabilistic framework.

Findings

01

Inference-time learning driven by covariance collapse precedes convergence of the posterior mean.

02

The Bayesian filter is proven to be stable with exponential covariance contraction rates.

03

Optimization-based adaptation is shown as a degenerate limit of Bayesian filtering.

Abstract

We present a theory-first framework that interprets inference-time adaptation in large language models (LLMs) as online Bayesian state estimation. Rather than modeling rapid adaptation as implicit optimization or meta-learning, we formulate task- and context-specific learning as the sequential inference of a low-dimensional latent adaptation state governed by a linearized state-space model. Under Gaussian assumptions, adaptation follows a Kalman recursion with closed-form updates for both the posterior mean and covariance. This perspective elevates epistemic uncertainty to an explicit dynamical variable. We show that inference-time learning is driven by covariance collapse, i.e., rapid contraction of posterior uncertainty induced by informative tokens, which typically precedes convergence of the posterior mean. Using observability conditions on token-level Jacobians, we establish…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Language and cultural evolution · Speech and dialogue systems