RSG: A Simple but Effective Module for Learning Imbalanced Datasets
Jianfeng Wang, Thomas Lukasiewicz, Xiaolin Hu, Jianfei Cai, Zhenghua, Xu

TL;DR
This paper introduces RSG, a versatile data augmentation module that generates samples for rare classes during training, significantly improving deep neural network performance on imbalanced datasets without affecting testing efficiency.
Contribution
The paper proposes RSG, a simple and effective module for generating rare class samples during training, compatible with various networks and loss functions, achieving state-of-the-art results.
Findings
RSG improves performance on imbalanced datasets.
RSG achieves new state-of-the-art on Places-LT, ImageNet-LT, and iNaturalist 2018.
RSG is easy to integrate and does not increase testing burden.
Abstract
Imbalanced datasets widely exist in practice and area great challenge for training deep neural models with agood generalization on infrequent classes. In this work, wepropose a new rare-class sample generator (RSG) to solvethis problem. RSG aims to generate some new samplesfor rare classes during training, and it has in particularthe following advantages: (1) it is convenient to use andhighly versatile, because it can be easily integrated intoany kind of convolutional neural network, and it works wellwhen combined with different loss functions, and (2) it isonly used during the training phase, and therefore, no ad-ditional burden is imposed on deep neural networks duringthe testing phase. In extensive experimental evaluations, weverify the effectiveness of RSG. Furthermore, by leveragingRSG, we obtain competitive results on Imbalanced CIFARand new state-of-the-art results on Places-LT,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImbalanced Data Classification Techniques · Domain Adaptation and Few-Shot Learning · Anomaly Detection Techniques and Applications
