MEDDAP: Medical Dataset Enhancement via Diversified Augmentation Pipeline
Yasamin Medghalchi, Niloufar Zakariaei, Arman Rahmim, Ilker, Hacihaliloglu

TL;DR
MEDDAP is a novel pipeline that enhances small medical datasets by using Stable Diffusion models with Ultrasound Low-Rank Adaptation for diversified augmentation, improving classifier robustness and performance on unseen data.
Contribution
Introduces USLoRA, a lightweight fine-tuning method for Stable Diffusion tailored to ultrasound images, enabling effective medical dataset augmentation with minimal computational resources.
Findings
Outperforms classifiers trained on original datasets
Demonstrates superior performance on unseen datasets
Enhances dataset diversity through prompt modifications
Abstract
The effectiveness of Deep Neural Networks (DNNs) heavily relies on the abundance and accuracy of available training data. However, collecting and annotating data on a large scale is often both costly and time-intensive, particularly in medical cases where practitioners are already occupied with their duties. Moreover, ensuring that the model remains robust across various scenarios of image capture is crucial in medical domains, especially when dealing with ultrasound images that vary based on the settings of different devices and the manual operation of the transducer. To address this challenge, we introduce a novel pipeline called MEDDAP, which leverages Stable Diffusion (SD) models to augment existing small datasets by automatically generating new informative labeled samples. Pretrained checkpoints for SD are typically based on natural images, and training them for medical images…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare
MethodsDiffusion
