Federated Breast Cancer Detection Enhanced by Synthetic Ultrasound Image Augmentation

Hongyi Pan; Ziliang Hong; Gorkem Durak; Ziyue Xu; Ulas Bagci

arXiv:2506.23334·eess.IV·April 17, 2026

Federated Breast Cancer Detection Enhanced by Synthetic Ultrasound Image Augmentation

Hongyi Pan, Ziliang Hong, Gorkem Durak, Ziyue Xu, Ulas Bagci

PDF

TL;DR

This paper introduces a generative model-based data augmentation method using synthetic ultrasound images to improve federated breast cancer detection models, demonstrating significant performance gains on multiple datasets.

Contribution

It presents a novel augmentation framework combining GANs and diffusion models to enhance federated learning for breast ultrasound classification.

Findings

01

Synthetic images improved AUC from 0.9206 to 0.9362 for FedAvg.

02

Synthetic images increased AUC from 0.9429 to 0.9574 for FedProx.

03

Excessive synthetic data can reduce model performance.

Abstract

Federated learning enables collaborative training of deep learning models across institutions without sharing sensitive patient data. However, its performance is often limited by small datasets and non-independent, identically distributed data, which can impair model generalization. In this work, we propose a generative model-based data augmentation framework for breast ultrasound classification. It leverages synthetic images generated by deep convolutional generative adversarial networks and a class-conditioned denoising diffusion probabilistic model. Experiments on three publicly available datasets (BUSI, BUS-BRA, and UDIAT) demonstrated that incorporating a suitable number of synthetic images improved average AUC from 0.9206 to 0.9362 for FedAvg and from 0.9429 to 0.9574 for FedProx. Furthermore, we noticed that excessive use of synthetic data reduced performance. This highlights the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.