Confusion-Aware Spectral Regularizer for Long-Tailed Recognition
Ziquan Zhu, Gaojie Jin, Hanruo Zhu, Si-Yuan Lu, Yunxiao Zhang, Zeyu Fu, Ronghui Mu, Guoqiang Zhang, Zhao Sun, Xia Yuhang, Jiaxing Shang, Xiang Li, Lu Liu, Tianjin Huang

TL;DR
This paper introduces a confusion-aware spectral regularizer that improves long-tailed image classification by focusing on worst-class errors, leading to better tail-class generalization and overall accuracy.
Contribution
It proposes a novel spectral regularizer based on class confusion, with a theoretical framework and practical optimization techniques, outperforming existing methods on multiple benchmarks.
Findings
Significant improvement in worst-class accuracy.
Enhanced overall performance across datasets.
Outperforms state-of-the-art methods when combined with augmentation.
Abstract
Long-tailed image classification remains a long-standing challenge, as real-world data typically follow highly imbalanced distributions where a few head classes dominate and many tail classes contain only limited samples. This imbalance biases feature learning toward head categories and leads to significant degradation on rare classes. Although recent studies have proposed re-sampling, re-weighting, and decoupled learning strategies, the improvement on the most underrepresented classes still remains marginal compared with overall accuracy. In this work, we present a confusion-centric perspective for long-tailed recognition that explicitly focuses on worst-class generalization. We first establish a new theoretical framework of class-specific error analysis, which shows that the worst-class error can be tightly upper-bounded by the spectral norm of the frequency-weighted confusion matrix…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Face and Expression Recognition · COVID-19 diagnosis using AI
