PURSUhInT: In Search of Informative Hint Points Based on Layer Clustering for Knowledge Distillation
Reyhan Kevser Keser, Aydin Ayanzadeh, Omid Abdollahi Aghdam, Caglar, Kilcioglu, Behcet Ugur Toreyin, Nazim Kemal Ure

TL;DR
This paper introduces a clustering-based method for selecting informative hint points in knowledge distillation, significantly improving model compression performance across various datasets and teacher-student configurations.
Contribution
It proposes a novel clustering approach to select hint points, enhancing the effectiveness of knowledge distillation over traditional fixed hint point methods.
Findings
Outperforms state-of-the-art distillation algorithms on CIFAR-100 and ImageNet
Applicable to any student network with different teacher models
Improves compression performance significantly
Abstract
One of the most efficient methods for model compression is hint distillation, where the student model is injected with information (hints) from several different layers of the teacher model. Although the selection of hint points can drastically alter the compression performance, conventional distillation approaches overlook this fact and use the same hint points as in the early studies. Therefore, we propose a clustering based hint selection methodology, where the layers of teacher model are clustered with respect to several metrics and the cluster centers are used as the hint points. Our method is applicable for any student network, once it is applied on a chosen teacher network. The proposed approach is validated in CIFAR-100 and ImageNet datasets, using various teacher-student pairs and numerous hint distillation methods. Our results show that hint points selected by our algorithm…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Retrieval and Classification Techniques · Face and Expression Recognition
MethodsKnowledge Distillation
