SkinDualGen: Prompt-Driven Diffusion for Simultaneous Image-Mask Generation in Skin Lesions
Zhaobin Xu

TL;DR
SkinDualGen uses a prompt-driven diffusion model to generate high-quality synthetic skin lesion images and masks, significantly improving diagnostic model performance by augmenting limited datasets.
Contribution
It introduces a novel prompt-driven diffusion method with domain-specific fine-tuning for simultaneous image and mask generation in skin lesion analysis.
Findings
Synthetic data improves classification accuracy by up to 15%.
Generated images have high fidelity validated by FID scores.
Hybrid datasets enhance segmentation metrics like Dice coefficient.
Abstract
Medical image analysis plays a pivotal role in the early diagnosis of diseases such as skin lesions. However, the scarcity of data and the class imbalance significantly hinder the performance of deep learning models. We propose a novel method that leverages the pretrained Stable Diffusion-2.0 model to generate high-quality synthetic skin lesion images and corresponding segmentation masks. This approach augments training datasets for classification and segmentation tasks. We adapt Stable Diffusion-2.0 through domain-specific Low-Rank Adaptation (LoRA) fine-tuning and joint optimization of multi-objective loss functions, enabling the model to simultaneously generate clinically relevant images and segmentation masks conditioned on textual descriptions in a single step. Experimental results show that the generated images, validated by FID scores, closely resemble real images in quality. A…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCutaneous Melanoma Detection and Management · Generative Adversarial Networks and Image Synthesis · AI in cancer detection
