Long-tail learning via logit adjustment

Aditya Krishna Menon; Sadeep Jayasumana; Ankit Singh Rawat and; Himanshu Jain; Andreas Veit; Sanjiv Kumar

arXiv:2007.07314·cs.LG·July 13, 2021·52 cites

Long-tail learning via logit adjustment

Aditya Krishna Menon, Sadeep Jayasumana, Ankit Singh Rawat and, Himanshu Jain, Andreas Veit, Sanjiv Kumar

PDF

Open Access 3 Repos 1 Video

TL;DR

This paper introduces two simple logit adjustment methods based on label frequencies to improve classification performance on long-tailed datasets, addressing bias towards dominant labels and enhancing generalization.

Contribution

It proposes two modifications of softmax cross-entropy using logit adjustment, unifying and generalizing recent approaches with stronger statistical basis and empirical validation.

Findings

01

Improved classification on long-tailed datasets.

02

Logit adjustment enhances margin between rare and dominant labels.

03

Methods outperform baseline models in experiments.

Abstract

Real-world classification problems typically exhibit an imbalanced or long-tailed label distribution, wherein many labels are associated with only a few samples. This poses a challenge for generalisation on such labels, and also makes na\"ive learning biased towards dominant labels. In this paper, we present two simple modifications of standard softmax cross-entropy training to cope with these challenges. Our techniques revisit the classic idea of logit adjustment based on the label frequencies, either applied post-hoc to a trained model, or enforced in the loss during training. Such adjustment encourages a large relative margin between logits of rare versus dominant labels. These techniques unify and generalise several recent proposals in the literature, while possessing firmer statistical grounding and empirical performance.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

Long-tail learning via logit adjustment· slideslive

Taxonomy

TopicsImbalanced Data Classification Techniques · Anomaly Detection Techniques and Applications · Machine Learning and Data Classification

MethodsSoftmax