A Survey on Long-Tailed Visual Recognition
Lu Yang, He Jiang, Qing Song, Jun Guo

TL;DR
This survey reviews the challenges of long-tailed data distributions in visual recognition, summarizes datasets and methods, introduces the Gini coefficient for evaluation, and discusses future research directions.
Contribution
It provides a comprehensive overview of long-tailed visual recognition, categorizes existing methods, and introduces the Gini coefficient as a new evaluation metric.
Findings
Long-tailed phenomenon is widespread across datasets.
Existing studies are limited in addressing long-tailedness.
Gini coefficient effectively measures dataset imbalance.
Abstract
The heavy reliance on data is one of the major reasons that currently limit the development of deep learning. Data quality directly dominates the effect of deep learning models, and the long-tailed distribution is one of the factors affecting data quality. The long-tailed phenomenon is prevalent due to the prevalence of power law in nature. In this case, the performance of deep learning models is often dominated by the head classes while the learning of the tail classes is severely underdeveloped. In order to learn adequately for all classes, many researchers have studied and preliminarily addressed the long-tailed problem. In this survey, we focus on the problems caused by long-tailed data distribution, sort out the representative long-tailed visual recognition datasets and summarize some mainstream long-tailed studies. Specifically, we summarize these studies into ten categories from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
