You Only Need End-to-End Training for Long-Tailed Recognition
Zhiwei Zhang

TL;DR
This paper demonstrates that end-to-end training with channel decorrelation and novel sampling strategies can outperform decoupled methods in long-tailed recognition tasks, simplifying the training process.
Contribution
It introduces Channel Whitening, B3RS, and BET modules, providing a theoretically grounded and empirically validated approach for improved long-tailed recognition.
Findings
End-to-end training with our methods surpasses decoupled training performance.
Channel decorrelation improves classifier learning on imbalanced data.
Proposed modules effectively address class imbalance and overfitting.
Abstract
The generalization gap on the long-tailed data sets is largely owing to most categories only occupying a few training samples. Decoupled training achieves better performance by training backbone and classifier separately. What causes the poorer performance of end-to-end model training (e.g., logits margin-based methods)? In this work, we identify a key factor that affects the learning of the classifier: the channel-correlated features with low entropy before inputting into the classifier. From the perspective of information theory, we analyze why cross-entropy loss tends to produce highly correlated features on the imbalanced data. In addition, we theoretically analyze and prove its impacts on the gradients of classifier weights, the condition number of Hessian, and logits margin-based approach. Therefore, we firstly propose to use Channel Whitening to decorrelate ("scatter") the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and ELM · Anomaly Detection Techniques and Applications
