GateKD: Confidence-Gated Closed-Loop Distillation for Robust Reasoning
Kasidit Sermsri, Teerapong Panboonyuen

TL;DR
GateKD introduces a confidence-gated closed-loop distillation method that dynamically improves reasoning transfer from large language models to smaller models, reducing errors and hallucinations.
Contribution
The paper presents a novel confidence-gated closed-loop framework for reasoning distillation, enhancing robustness and reliability over traditional open-loop methods.
Findings
GateKD outperforms open-loop baselines across reasoning benchmarks.
It significantly improves logical and symbolic reasoning accuracy.
The framework remains effective under low-resource distillation settings.
Abstract
Distilling multi-step reasoning abilities from large language models (LLMs) into compact student models remains challenging due to noisy rationales, hallucinated supervision, and static teacher-student interactions. Existing reasoning distillation methods, including mentor-based approaches, predominantly operate in an open-loop manner, implicitly assuming uniform teacher reliability and consequently propagating erroneous intermediate reasoning. We propose GateKD, a confidence-gated closed-loop distillation framework that enables robust reasoning transfer by treating the teacher as a dynamic gatekeeper rather than a static oracle. GateKD introduces three complementary mechanisms: (i) confidence-gated soft supervision that selectively distills reliable predictive signals, (ii) gated hidden-state evolution that aligns intermediate representations only when teacher confidence is high, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
