Federated Learning via Synthetic Data

Jack Goetz; Ambuj Tewari

arXiv:2008.04489·cs.LG·September 29, 2020·38 cites

Federated Learning via Synthetic Data

Jack Goetz, Ambuj Tewari

PDF

Open Access

TL;DR

This paper introduces a federated learning approach that replaces transmitting model updates with synthetic data, significantly reducing communication costs while maintaining model performance.

Contribution

It proposes a novel method for federated learning that transmits synthetic data instead of model updates, reducing communication overhead.

Findings

01

Over an order of magnitude reduction in communication costs

02

Minimal degradation in model accuracy

03

Potential for scalable federated learning applications

Abstract

Federated learning allows for the training of a model using data on multiple clients without the clients transmitting that raw data. However the standard method is to transmit model parameters (or updates), which for modern neural networks can be on the scale of millions of parameters, inflicting significant computational costs on the clients. We propose a method for federated learning where instead of transmitting a gradient update back to the server, we instead transmit a small amount of synthetic `data'. We describe the procedure and show some experimental results suggesting this procedure has potential, providing more than an order of magnitude reduction in communication costs with minimal model degradation.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Advanced Graph Neural Networks