You Only Need End-to-End Training for Long-Tailed Recognition

Zhiwei Zhang

arXiv:2112.05958·cs.CV·March 13, 2023·1 cites

You Only Need End-to-End Training for Long-Tailed Recognition

Zhiwei Zhang

PDF

Open Access

TL;DR

This paper demonstrates that end-to-end training with channel decorrelation and novel sampling strategies can outperform decoupled methods in long-tailed recognition tasks, simplifying the training process.

Contribution

It introduces Channel Whitening, B3RS, and BET modules, providing a theoretically grounded and empirically validated approach for improved long-tailed recognition.

Findings

01

End-to-end training with our methods surpasses decoupled training performance.

02

Channel decorrelation improves classifier learning on imbalanced data.

03

Proposed modules effectively address class imbalance and overfitting.

Abstract

The generalization gap on the long-tailed data sets is largely owing to most categories only occupying a few training samples. Decoupled training achieves better performance by training backbone and classifier separately. What causes the poorer performance of end-to-end model training (e.g., logits margin-based methods)? In this work, we identify a key factor that affects the learning of the classifier: the channel-correlated features with low entropy before inputting into the classifier. From the perspective of information theory, we analyze why cross-entropy loss tends to produce highly correlated features on the imbalanced data. In addition, we theoretically analyze and prove its impacts on the gradients of classifier weights, the condition number of Hessian, and logits margin-based approach. Therefore, we firstly propose to use Channel Whitening to decorrelate ("scatter") the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and ELM · Anomaly Detection Techniques and Applications