TL;DR
This paper introduces Balanced-MixUp, a novel data sampling method that combines instance-based and class-based sampling with MixUp regularization to improve classification performance on highly imbalanced medical image datasets.
Contribution
The paper proposes Balanced-MixUp, a new sampling technique that enhances learning from minority classes in imbalanced datasets by integrating balanced sampling with MixUp.
Findings
Balanced-MixUp outperforms traditional sampling methods.
It improves classification accuracy on imbalanced medical datasets.
The method is effective across different CNN architectures.
Abstract
Highly imbalanced datasets are ubiquitous in medical image classification problems. In such problems, it is often the case that rare classes associated to less prevalent diseases are severely under-represented in labeled databases, typically resulting in poor performance of machine learning algorithms due to overfitting in the learning process. In this paper, we propose a novel mechanism for sampling training data based on the popular MixUp regularization technique, which we refer to as Balanced-MixUp. In short, Balanced-MixUp simultaneously performs regular (i.e., instance-based) and balanced (i.e., class-based) sampling of the training data. The resulting two sets of samples are then mixed-up to create a more balanced training distribution from which a neural network can effectively learn without incurring in heavily under-fitting the minority classes. We experiment with a highly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsMixup
