Good Teachers Explain: Explanation-Enhanced Knowledge Distillation
Amin Parchami-Araghi, Moritz B\"ohle, Sukrut Rao, Bernt Schiele

TL;DR
This paper introduces explanation-enhanced knowledge distillation (e$^2$KD), a method that improves student models by aligning their explanations with teachers, leading to better accuracy, interpretability, and robustness across various settings.
Contribution
The paper proposes a novel explanation-enhanced KD method that aligns student and teacher explanations, improving accuracy and interpretability beyond traditional KD.
Findings
e$^2$KD significantly improves student accuracy and explanation similarity.
It ensures students learn for the right reasons, matching teacher explanations.
The method is robust across architectures, data sizes, and explanation types.
Abstract
Knowledge Distillation (KD) has proven effective for compressing large teacher models into smaller student models. While it is well known that student models can achieve similar accuracies as the teachers, it has also been shown that they nonetheless often do not learn the same function. It is, however, often highly desirable that the student's and teacher's functions share similar properties such as basing the prediction on the same input features, as this ensures that students learn the 'right features' from the teachers. In this work, we explore whether this can be achieved by not only optimizing the classic KD loss but also the similarity of the explanations generated by the teacher and the student. Despite the idea being simple and intuitive, we find that our proposed 'explanation-enhanced' KD (eKD) (1) consistently provides large gains in terms of accuracy and student-teacher…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEducation and Critical Thinking Development · Innovative Teaching and Learning Methods · Educational Assessment and Pedagogy
