Uncertainty-Aware Dual-Student Knowledge Distillation for Efficient Image Classification

Aakash Gore; Anoushka Dey; Aryan Mishra

arXiv:2511.18826·cs.CV·November 25, 2025

Uncertainty-Aware Dual-Student Knowledge Distillation for Efficient Image Classification

Aakash Gore, Anoushka Dey, Aryan Mishra

PDF

Open Access

TL;DR

This paper introduces an uncertainty-aware dual-student knowledge distillation method that improves image classification by leveraging teacher confidence and collaborative learning between heterogeneous student models.

Contribution

It presents a novel framework combining uncertainty estimation with peer learning of two different student architectures for enhanced model compression.

Findings

01

ResNet-18 achieves 83.84% top-1 accuracy

02

MobileNetV2 achieves 81.46% top-1 accuracy

03

Outperforms traditional distillation methods on ImageNet-100

Abstract

Knowledge distillation has emerged as a powerful technique for model compression, enabling the transfer of knowledge from large teacher networks to compact student models. However, traditional knowledge distillation methods treat all teacher predictions equally, regardless of the teacher's confidence in those predictions. This paper proposes an uncertainty-aware dual-student knowledge distillation framework that leverages teacher prediction uncertainty to selectively guide student learning. We introduce a peer-learning mechanism where two heterogeneous student architectures, specifically ResNet-18 and MobileNetV2, learn collaboratively from both the teacher network and each other. Experimental results on ImageNet-100 demonstrate that our approach achieves superior performance compared to baseline knowledge distillation methods, with ResNet-18 achieving 83.84\% top-1 accuracy and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning