GateKD: Confidence-Gated Closed-Loop Distillation for Robust Reasoning

Kasidit Sermsri; Teerapong Panboonyuen

arXiv:2605.13136·cs.CL·May 14, 2026

GateKD: Confidence-Gated Closed-Loop Distillation for Robust Reasoning

Kasidit Sermsri, Teerapong Panboonyuen

PDF

TL;DR

GateKD introduces a confidence-gated closed-loop distillation method that dynamically improves reasoning transfer from large language models to smaller models, reducing errors and hallucinations.

Contribution

The paper presents a novel confidence-gated closed-loop framework for reasoning distillation, enhancing robustness and reliability over traditional open-loop methods.

Findings

01

GateKD outperforms open-loop baselines across reasoning benchmarks.

02

It significantly improves logical and symbolic reasoning accuracy.

03

The framework remains effective under low-resource distillation settings.

Abstract

Distilling multi-step reasoning abilities from large language models (LLMs) into compact student models remains challenging due to noisy rationales, hallucinated supervision, and static teacher-student interactions. Existing reasoning distillation methods, including mentor-based approaches, predominantly operate in an open-loop manner, implicitly assuming uniform teacher reliability and consequently propagating erroneous intermediate reasoning. We propose GateKD, a confidence-gated closed-loop distillation framework that enables robust reasoning transfer by treating the teacher as a dynamic gatekeeper rather than a static oracle. GateKD introduces three complementary mechanisms: (i) confidence-gated soft supervision that selectively distills reliable predictive signals, (ii) gated hidden-state evolution that aligns intermediate representations only when teacher confidence is high, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.