Cross-Domain Empirical Risk Minimization for Unbiased Long-tailed   Classification

Beier Zhu; Yulei Niu; Xian-Sheng Hua; Hanwang Zhang

arXiv:2112.14380·cs.CV·December 30, 2021

Cross-Domain Empirical Risk Minimization for Unbiased Long-tailed Classification

Beier Zhu, Yulei Niu, Xian-Sheng Hua, Hanwang Zhang

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces xERM, a method that trains unbiased long-tailed classifiers by balancing domain risks, improving performance across different test distributions, especially when test data is also long-tailed.

Contribution

The paper proposes xERM, a novel approach that addresses bias in long-tailed classification by balancing risks across domains, supported by theoretical causality analysis.

Findings

01

xERM outperforms existing methods on long-tailed benchmarks.

02

It achieves balanced performance on both head and tail classes.

03

Theoretical analysis explains unbiasedness via domain risk adjustment.

Abstract

We address the overlooked unbiasedness in existing long-tailed classification methods: we find that their overall improvement is mostly attributed to the biased preference of tail over head, as the test distribution is assumed to be balanced; however, when the test is as imbalanced as the long-tailed training data -- let the test respect Zipf's law of nature -- the tail bias is no longer beneficial overall because it hurts the head majorities. In this paper, we propose Cross-Domain Empirical Risk Minimization (xERM) for training an unbiased model to achieve strong performances on both test distributions, which empirically demonstrates that xERM fundamentally improves the classification by learning better feature representation rather than the head vs. tail game. Based on causality, we further theoretically explain why xERM achieves unbiasedness: the bias caused by the domain selection…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

beierzhu/xerm
pytorchOfficial

Videos

Cross-Domain Empirical Risk Minimization for Unbiased Long-Tailed Classification· underline

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Data Classification · Anomaly Detection Techniques and Applications