MDCS: More Diverse Experts with Consistency Self-distillation for Long-tailed Recognition
Qihao Zhao, Chen Jiang, Wei Hu, Fan Zhang, Jun Liu

TL;DR
The paper introduces MDCS, a novel method for long-tailed recognition that enhances expert diversity and reduces model variance through a combination of diversity loss and consistency self-distillation, leading to improved accuracy.
Contribution
This work proposes MDCS, integrating diversity loss and consistency self-distillation to significantly improve long-tailed recognition performance.
Findings
Outperforms state-of-the-art by 1-2% on five benchmarks.
Effectively increases expert diversity and reduces model variance.
Mutually reinforcing roles of diversity loss and self-distillation.
Abstract
Recently, multi-expert methods have led to significant improvements in long-tail recognition (LTR). We summarize two aspects that need further enhancement to contribute to LTR boosting: (1) More diverse experts; (2) Lower model variance. However, the previous methods didn't handle them well. To this end, we propose More Diverse experts with Consistency Self-distillation (MDCS) to bridge the gap left by earlier methods. Our MDCS approach consists of two core components: Diversity Loss (DL) and Consistency Self-distillation (CS). In detail, DL promotes diversity among experts by controlling their focus on different categories. To reduce the model variance, we employ KL divergence to distill the richer knowledge of weakly augmented instances for the experts' self-distillation. In particular, we design Confident Instance Sampling (CIS) to select the correctly classified instances for CS to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Fetal and Pediatric Neurological Disorders · Advanced Neural Network Applications
MethodsFocus
