Recursive Meta-Distillation: An Axiomatic Framework for Iterative Knowledge Refinement

Aaron R. Flouro; Shawn P. Chadwick

arXiv:2601.13100·cs.LG·January 21, 2026

Recursive Meta-Distillation: An Axiomatic Framework for Iterative Knowledge Refinement

Aaron R. Flouro, Shawn P. Chadwick

PDF

Open Access

TL;DR

This paper introduces an axiomatic, operator-theoretic framework for recursive knowledge distillation, providing foundational insights into its convergence, stability, and theoretical properties without relying on specific algorithms.

Contribution

It formalizes recursive distillation as a sequence of probability operators, proving convergence and stability under mild assumptions, and offers a theoretical basis for understanding iterative knowledge refinement.

Findings

01

Recursive distillation induces contraction in KL divergence.

02

The framework guarantees geometric convergence to base teacher distributions.

03

It characterizes conditions for well-posed and stable recursive distillation.

Abstract

Recent work in probability-domain knowledge distillation has established axiomatic frameworks for temperature scaling, multi-teacher aggregation, and bias-variance trade-offs in single-stage settings. However, the mathematical behavior of recursive or multi-generation distillation remains poorly understood, with prior approaches relying primarily on empirical heuristics. In this work, we introduce an axiomatic and operator-theoretic framework for recursive meta-distillation, formalizing iterative knowledge distillation as a sequence of probability-distribution operators with explicit anchoring to base teachers. We define structural axioms for valid meta-teacher construction and prove the existence of non-trivial operator families satisfying these axioms without specifying particular algorithms or loss functions. Under mild realizability and convexity assumptions, we show that anchored…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Mechanics and Entropy · Constraint Satisfaction and Optimization · Bayesian Modeling and Causal Inference