Self Supervision to Distillation for Long-Tailed Visual Recognition
Tianhao Li, Limin Wang, Gangshan Wu

TL;DR
This paper introduces a multi-stage training scheme called Self Supervised to Distillation (SSD) that leverages soft labels and self-distillation to improve long-tailed visual recognition, achieving state-of-the-art results.
Contribution
The paper proposes a novel SSD framework that automatically mines label relations and generates distillation labels guided by self-supervision for long-tailed recognition.
Findings
Achieves state-of-the-art results on ImageNet-LT, CIFAR100-LT, and iNaturalist 2018.
Outperforms previous methods by 2.7% to 4.5% on various datasets.
Effectively models long-tailed distribution through soft label-based knowledge transfer.
Abstract
Deep learning has achieved remarkable progress for visual recognition on large-scale balanced datasets but still performs poorly on real-world long-tailed data. Previous methods often adopt class re-balanced training strategies to effectively alleviate the imbalance issue, but might be a risk of over-fitting tail classes. The recent decoupling method overcomes over-fitting issues by using a multi-stage training scheme, yet, it is still incapable of capturing tail class information in the feature learning stage. In this paper, we show that soft label can serve as a powerful solution to incorporate label correlation into a multi-stage training scheme for long-tailed recognition. The intrinsic relation between classes embodied by soft labels turns out to be helpful for long-tailed recognition by transferring knowledge from head to tail classes. Specifically, we propose a conceptually…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques
Methods1x1 Convolution · Convolution · Non Maximum Suppression · SSD
