Recursive Meta-Distillation: An Axiomatic Framework for Iterative Knowledge Refinement
Aaron R. Flouro, Shawn P. Chadwick

TL;DR
This paper introduces an axiomatic, operator-theoretic framework for recursive knowledge distillation, providing foundational insights into its convergence, stability, and theoretical properties without relying on specific algorithms.
Contribution
It formalizes recursive distillation as a sequence of probability operators, proving convergence and stability under mild assumptions, and offers a theoretical basis for understanding iterative knowledge refinement.
Findings
Recursive distillation induces contraction in KL divergence.
The framework guarantees geometric convergence to base teacher distributions.
It characterizes conditions for well-posed and stable recursive distillation.
Abstract
Recent work in probability-domain knowledge distillation has established axiomatic frameworks for temperature scaling, multi-teacher aggregation, and bias-variance trade-offs in single-stage settings. However, the mathematical behavior of recursive or multi-generation distillation remains poorly understood, with prior approaches relying primarily on empirical heuristics. In this work, we introduce an axiomatic and operator-theoretic framework for recursive meta-distillation, formalizing iterative knowledge distillation as a sequence of probability-distribution operators with explicit anchoring to base teachers. We define structural axioms for valid meta-teacher construction and prove the existence of non-trivial operator families satisfying these axioms without specifying particular algorithms or loss functions. Under mild realizability and convexity assumptions, we show that anchored…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Mechanics and Entropy · Constraint Satisfaction and Optimization · Bayesian Modeling and Causal Inference
