Knowledge Adaptation as Posterior Correction

Mohammad Emtiyaz Khan

arXiv:2506.14262·cs.LG·December 10, 2025

Knowledge Adaptation as Posterior Correction

Mohammad Emtiyaz Khan

PDF

Open Access

TL;DR

This paper presents a unified view of adaptation as posterior correction, showing that many existing methods operate on this principle and that more accurate posteriors facilitate faster adaptation.

Contribution

It introduces a theoretical framework for adaptation as posterior correction based on Bayesian learning, unifying various adaptation methods under this principle.

Findings

01

Many adaptation methods follow the posterior correction principle.

02

More accurate posteriors enable faster adaptation.

03

The framework is supported by multiple examples demonstrating quick learning.

Abstract

Adaptation is the holy grail of intelligence, but even the best AI models lack the adaptability of toddlers. In spite of great progress, little is known about the mechanisms by which machines can learn to adapt as fast as humans and animals. Here, we cast adaptation as `correction' of old posteriors and show that a wide-variety of existing adaptation methods follow this very principle, including those used for continual learning, federated learning, unlearning, and model merging. In all these settings, more accurate posteriors often lead to smaller corrections and can enable faster adaptation. Posterior correction is derived by using the dual representation of the Bayesian Learning Rule of Khan and Rue (2023), where the interference between the old representation and new information is quantified by using the natural-gradient mismatch. We present many examples demonstrating how machines…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Generative Adversarial Networks and Image Synthesis · Gaussian Processes and Bayesian Inference