Hybrid Diffusion Model for Breast Ultrasound Image Augmentation
Farhan Fuad Abir, Sanjeda Sara Jennifer, Niloofar Yousefi, Laura J. Brattain

TL;DR
This paper introduces a hybrid diffusion framework that enhances breast ultrasound image augmentation by combining text-to-image, image refinement, and low-rank adaptation, resulting in more realistic synthetic images.
Contribution
The proposed method improves ultrasound image augmentation by integrating multiple diffusion techniques, achieving higher fidelity and class consistency compared to standard models.
Findings
Reduced FID from 45.97 to 33.29 compared to baseline
Generated realistic, class-consistent ultrasound images
Enhanced augmentation quality for diagnostic modeling
Abstract
We propose a hybrid diffusion-based augmentation framework to overcome the critical challenge of ultrasound data augmentation in breast ultrasound (BUS) datasets. Unlike conventional diffusion-based augmentations, our approach improves visual fidelity and preserves ultrasound texture by combining text-to-image generation with image-to-image (img2img) refinement, as well as fine-tuning with low-rank adaptation (LoRA) and textual inversion (TI). Our method generated realistic, class-consistent images on an open-source Kaggle breast ultrasound image dataset (BUSI). Compared to the Stable Diffusion v1.5 baseline, incorporating TI and img2img refinement reduced the Frechet Inception Distance (FID) from 45.97 to 33.29, demonstrating a substantial gain in fidelity while maintaining comparable downstream classification performance. Overall, the proposed framework effectively mitigates the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
