GAN-based Data Augmentation for Chest X-ray Classification

Shobhita Sundaram; Neha Hulkund

arXiv:2107.02970·eess.IV·July 8, 2021·5 cites

Open Access

TL;DR

This paper explores the use of GANs to generate synthetic chest X-ray images to address data scarcity and class imbalance, improving classification performance especially in low-data scenarios.

Contribution

It shows that GAN-based data augmentation yields higher downstream classification performance than traditional augmentation for underrepresented classes in the CheXpert chest radiograph dataset, with the gain pronounced in low-data regimes.

Findings

01

GAN augmentation improves classification accuracy for minority classes

02

GAN-based augmentation is especially effective with limited data

03

Synthetic data helps prevent overfitting in medical image analysis

Abstract

A common problem in computer vision, particularly in medical applications, is a lack of sufficiently diverse, large sets of training data. These datasets often suffer from severe class imbalance. As a result, networks often overfit and are unable to generalize to novel examples. Generative Adversarial Networks (GANs) offer a novel method of synthetic data augmentation. In this work, we evaluate the use of GAN-based data augmentation to artificially expand the CheXpert dataset of chest radiographs. We compare performance to traditional augmentation and find that GAN-based augmentation leads to higher downstream performance for underrepresented classes. Furthermore, we see that this result is pronounced in low-data regimes. This suggests that GAN-based augmentation is a promising area of research to improve network performance when data collection is prohibitively expensive.
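The idea of expanding underrepresented classes with synthetic samples can be illustrated with a minimal sketch. It is not the authors' pipeline: it assumes single-label data for simplicity (CheXpert itself is multi-label), and the names `synthetic_quota`, `augment_with_generator`, and `placeholder_generator` are hypothetical. The placeholder generator stands in for a trained class-conditional GAN and simply returns random pixel values.

```python
from collections import Counter
import random


def synthetic_quota(labels, target_ratio=1.0):
    """Count how many synthetic samples each class needs.

    Each class is topped up to target_ratio * (size of the largest class).
    Classes already at or above that size get a quota of 0.
    """
    counts = Counter(labels)
    target = int(max(counts.values()) * target_ratio)
    return {cls: max(0, target - n) for cls, n in counts.items()}


def augment_with_generator(samples, labels, generate, target_ratio=1.0):
    """Append generated samples until every class meets its quota.

    `generate` is any callable mapping a class label to one synthetic
    sample. In practice this would wrap a trained GAN (an assumption here).
    """
    quota = synthetic_quota(labels, target_ratio)
    out_samples, out_labels = list(samples), list(labels)
    for cls, n_needed in quota.items():
        for _ in range(n_needed):
            out_samples.append(generate(cls))
            out_labels.append(cls)
    return out_samples, out_labels


def placeholder_generator(cls):
    """Hypothetical stand-in for a GAN: returns a fake 4-pixel 'image'."""
    return [random.random() for _ in range(4)]


# Toy imbalanced dataset: 6 majority-class images, 2 minority-class images.
labels = ["normal"] * 6 + ["effusion"] * 2
samples = [[0.0] * 4 for _ in labels]

aug_samples, aug_labels = augment_with_generator(
    samples, labels, placeholder_generator
)
```

Setting `target_ratio` below 1.0 would only partially rebalance the classes, which is one way to study how much synthetic data helps as the real dataset shrinks.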

Taxonomy

Topics: COVID-19 diagnosis using AI · Anomaly Detection Techniques and Applications · Phonocardiography and Auscultation Techniques