Diffusion-based Data Augmentation for Skin Disease Classification: Impact Across Original Medical Datasets to Fully Synthetic Images
Mohamed Akrout, B\'alint Gyepesi, P\'eter Holl\'o, Adrienn Po\'or,, Bl\'aga Kincs\H{o}, Stephen Solis, Katrina Cirone, Jeremy Kawahara, Dekker, Slade, Latif Abid, M\'at\'e Kov\'acs, Istv\'an Fazekas

TL;DR
This paper explores the use of diffusion probabilistic models for generating synthetic skin disease images to augment training datasets, maintaining classifier accuracy and addressing data scarcity in healthcare.
Contribution
It introduces a diffusion-based data augmentation method with fine-grained control via text prompts for skin disease classification datasets.
Findings
Synthetic images maintain classification accuracy
Diffusion models generate high-quality skin images
Augmentation improves dataset diversity
Abstract
Despite continued advancement in recent years, deep neural networks still rely on large amounts of training data to avoid overfitting. However, labeled training data for real-world applications such as healthcare is limited and difficult to access given longstanding privacy, and strict data sharing policies. By manipulating image datasets in the pixel or feature space, existing data augmentation techniques represent one of the effective ways to improve the quantity and diversity of training data. Here, we look to advance augmentation techniques by building upon the emerging success of text-to-image diffusion probabilistic models in augmenting the training samples of our macroscopic skin disease dataset. We do so by enabling fine-grained control of the image generation process via input text prompts. We demonstrate that this generative data augmentation approach successfully maintains a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCutaneous Melanoma Detection and Management
MethodsDiffusion
