MEDDAP: Medical Dataset Enhancement via Diversified Augmentation   Pipeline

Yasamin Medghalchi; Niloufar Zakariaei; Arman Rahmim; Ilker; Hacihaliloglu

arXiv:2403.16335·eess.IV·March 28, 2024·2 cites

MEDDAP: Medical Dataset Enhancement via Diversified Augmentation Pipeline

Yasamin Medghalchi, Niloufar Zakariaei, Arman Rahmim, Ilker, Hacihaliloglu

PDF

Open Access 1 Repo

TL;DR

MEDDAP is a novel pipeline that enhances small medical datasets by using Stable Diffusion models with Ultrasound Low-Rank Adaptation for diversified augmentation, improving classifier robustness and performance on unseen data.

Contribution

Introduces USLoRA, a lightweight fine-tuning method for Stable Diffusion tailored to ultrasound images, enabling effective medical dataset augmentation with minimal computational resources.

Findings

01

Outperforms classifiers trained on original datasets

02

Demonstrates superior performance on unseen datasets

03

Enhances dataset diversity through prompt modifications

Abstract

The effectiveness of Deep Neural Networks (DNNs) heavily relies on the abundance and accuracy of available training data. However, collecting and annotating data on a large scale is often both costly and time-intensive, particularly in medical cases where practitioners are already occupied with their duties. Moreover, ensuring that the model remains robust across various scenarios of image capture is crucial in medical domains, especially when dealing with ultrasound images that vary based on the settings of different devices and the manual operation of the transducer. To address this challenge, we introduce a novel pipeline called MEDDAP, which leverages Stable Diffusion (SD) models to augment existing small datasets by automatically generating new informative labeled samples. Pretrained checkpoints for SD are typically based on natural images, and training them for medical images…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yasamin-med/meddap
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Healthcare

MethodsDiffusion