Learn from Balance: Rectifying Knowledge Transfer for Long-Tailed   Scenarios

Xinlei Huang; Jialiang Tang; Xubin Zheng; Jinjia Zhou; Wenxin Yu; Ning; Jiang

arXiv:2409.07694·cs.CV·September 23, 2024

Learn from Balance: Rectifying Knowledge Transfer for Long-Tailed Scenarios

Xinlei Huang, Jialiang Tang, Xubin Zheng, Jinjia Zhou, Wenxin Yu, Ning, Jiang

PDF

Open Access

TL;DR

This paper introduces KRDistill, a framework that rectifies biased knowledge transfer from teacher to student networks in long-tailed data scenarios, improving performance on underrepresented tail categories.

Contribution

The paper proposes a novel knowledge rectification method that incorporates balanced priors to address bias in teacher networks trained on imbalanced data.

Findings

01

KRDistill improves tail category performance significantly.

02

The method outperforms existing knowledge distillation approaches.

03

Effective across various long-tailed datasets.

Abstract

Knowledge Distillation (KD) transfers knowledge from a large pre-trained teacher network to a compact and efficient student network, making it suitable for deployment on resource-limited media terminals. However, traditional KD methods require balanced data to ensure robust training, which is often unavailable in practical applications. In such scenarios, a few head categories occupy a substantial proportion of examples. This imbalance biases the trained teacher network towards the head categories, resulting in severe performance degradation on the less represented tail categories for both the teacher and student networks. In this paper, we propose a novel framework called Knowledge Rectification Distillation (KRDistill) to address the imbalanced knowledge inherited in the teacher network through the incorporation of the balanced category priors. Furthermore, we rectify the biased…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEducational Technology and Optimization