Hybrid Diffusion Model for Breast Ultrasound Image Augmentation

Farhan Fuad Abir; Sanjeda Sara Jennifer; Niloofar Yousefi; Laura J. Brattain

arXiv:2603.26834·eess.IV·March 31, 2026

Hybrid Diffusion Model for Breast Ultrasound Image Augmentation

Farhan Fuad Abir, Sanjeda Sara Jennifer, Niloofar Yousefi, Laura J. Brattain

PDF

TL;DR

This paper introduces a hybrid diffusion framework that enhances breast ultrasound image augmentation by combining text-to-image, image refinement, and low-rank adaptation, resulting in more realistic synthetic images.

Contribution

The proposed method improves ultrasound image augmentation by integrating multiple diffusion techniques, achieving higher fidelity and class consistency compared to standard models.

Findings

01

Reduced FID from 45.97 to 33.29 compared to baseline

02

Generated realistic, class-consistent ultrasound images

03

Enhanced augmentation quality for diagnostic modeling

Abstract

We propose a hybrid diffusion-based augmentation framework to overcome the critical challenge of ultrasound data augmentation in breast ultrasound (BUS) datasets. Unlike conventional diffusion-based augmentations, our approach improves visual fidelity and preserves ultrasound texture by combining text-to-image generation with image-to-image (img2img) refinement, as well as fine-tuning with low-rank adaptation (LoRA) and textual inversion (TI). Our method generated realistic, class-consistent images on an open-source Kaggle breast ultrasound image dataset (BUSI). Compared to the Stable Diffusion v1.5 baseline, incorporating TI and img2img refinement reduced the Frechet Inception Distance (FID) from 45.97 to 33.29, demonstrating a substantial gain in fidelity while maintaining comparable downstream classification performance. Overall, the proposed framework effectively mitigates the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.