TSynD: Targeted Synthetic Data Generation for Enhanced Medical Image   Classification

Joshua Niemeijer; Jan Ehrhardt; Hristina Uzunova; Heinz Handels

arXiv:2406.17473·cs.CV·June 26, 2024

TSynD: Targeted Synthetic Data Generation for Enhanced Medical Image Classification

Joshua Niemeijer, Jan Ehrhardt, Hristina Uzunova, Heinz Handels

PDF

Open Access

TL;DR

This paper proposes a targeted synthetic data generation method for medical image classification that improves model accuracy and robustness by focusing on underrepresented data points with high epistemic uncertainty.

Contribution

It introduces a novel approach to guide generative models to produce synthetic data with high epistemic uncertainty, enhancing training effectiveness.

Findings

01

Improved classification accuracy on medical imaging tasks.

02

Enhanced robustness against data augmentations.

03

Greater resilience to adversarial attacks.

Abstract

The usage of medical image data for the training of large-scale machine learning approaches is particularly challenging due to its scarce availability and the costly generation of data annotations, typically requiring the engagement of medical professionals. The rapid development of generative models allows towards tackling this problem by leveraging large amounts of realistic synthetically generated data for the training process. However, randomly choosing synthetic samples, might not be an optimal strategy. In this work, we investigate the targeted generation of synthetic training data, in order to improve the accuracy and robustness of image classification. Therefore, our approach aims to guide the generative model to synthesize data with high epistemic uncertainty, since large measures of epistemic uncertainty indicate underrepresented data points in the training set. During the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAI in cancer detection · Brain Tumor Detection and Classification · COVID-19 diagnosis using AI