Federated Breast Cancer Detection Enhanced by Synthetic Ultrasound Image Augmentation
Hongyi Pan, Ziliang Hong, Gorkem Durak, Ziyue Xu, Ulas Bagci

TL;DR
This paper introduces a generative model-based data augmentation method using synthetic ultrasound images to improve federated breast cancer detection models, demonstrating significant performance gains on multiple datasets.
Contribution
It presents a novel augmentation framework combining GANs and diffusion models to enhance federated learning for breast ultrasound classification.
Findings
Synthetic images improved AUC from 0.9206 to 0.9362 for FedAvg.
Synthetic images increased AUC from 0.9429 to 0.9574 for FedProx.
Excessive synthetic data can reduce model performance.
Abstract
Federated learning enables collaborative training of deep learning models across institutions without sharing sensitive patient data. However, its performance is often limited by small datasets and non-independent, identically distributed data, which can impair model generalization. In this work, we propose a generative model-based data augmentation framework for breast ultrasound classification. It leverages synthetic images generated by deep convolutional generative adversarial networks and a class-conditioned denoising diffusion probabilistic model. Experiments on three publicly available datasets (BUSI, BUS-BRA, and UDIAT) demonstrated that incorporating a suitable number of synthetic images improved average AUC from 0.9206 to 0.9362 for FedAvg and from 0.9429 to 0.9574 for FedProx. Furthermore, we noticed that excessive use of synthetic data reduced performance. This highlights the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
