Balanced-MixUp for Highly Imbalanced Medical Image Classification

Adrian Galdran; Gustavo Carneiro; Miguel A. Gonz\'alez Ballester

arXiv:2109.09850·cs.CV·September 15, 2022

Balanced-MixUp for Highly Imbalanced Medical Image Classification

Adrian Galdran, Gustavo Carneiro, Miguel A. Gonz\'alez Ballester

PDF

1 Repo

TL;DR

This paper introduces Balanced-MixUp, a novel data sampling method that combines instance-based and class-based sampling with MixUp regularization to improve classification performance on highly imbalanced medical image datasets.

Contribution

The paper proposes Balanced-MixUp, a new sampling technique that enhances learning from minority classes in imbalanced datasets by integrating balanced sampling with MixUp.

Findings

01

Balanced-MixUp outperforms traditional sampling methods.

02

It improves classification accuracy on imbalanced medical datasets.

03

The method is effective across different CNN architectures.

Abstract

Highly imbalanced datasets are ubiquitous in medical image classification problems. In such problems, it is often the case that rare classes associated to less prevalent diseases are severely under-represented in labeled databases, typically resulting in poor performance of machine learning algorithms due to overfitting in the learning process. In this paper, we propose a novel mechanism for sampling training data based on the popular MixUp regularization technique, which we refer to as Balanced-MixUp. In short, Balanced-MixUp simultaneously performs regular (i.e., instance-based) and balanced (i.e., class-based) sampling of the training data. The resulting two sets of samples are then mixed-up to create a more balanced training distribution from which a neural network can effectively learn without incurring in heavily under-fitting the minority classes. We experiment with a highly…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

agaldran/balanced_mixup
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsMixup