Federated Learning with Additional Mechanisms on Clients to Reduce   Communication Costs

Xin Yao; Tianchi Huang; Chenglei Wu; Rui-Xiao Zhang; Lifeng Sun

arXiv:1908.05891·cs.LG·September 4, 2019·31 cites

Federated Learning with Additional Mechanisms on Clients to Reduce Communication Costs

Xin Yao, Tianchi Huang, Chenglei Wu, Rui-Xiao Zhang, Lifeng Sun

PDF

Open Access 2 Repos

TL;DR

This paper introduces two novel mechanisms, FedMMD and FedFusion, to reduce communication costs in federated learning, especially under non-IID data distributions, while maintaining or improving model accuracy.

Contribution

It proposes two new methods that significantly lower communication rounds and enhance model performance in federated learning with non-IID data.

Findings

01

FedMMD reduces communication rounds by over 20%.

02

FedFusion decreases communication rounds by more than 60%.

03

Both methods improve accuracy and generalization in FL scenarios.

Abstract

Federated learning (FL) enables on-device training over distributed networks consisting of a massive amount of modern smart devices, such as smartphones and IoT (Internet of Things) devices. However, the leading optimization algorithm in such settings, i.e., federated averaging (FedAvg), suffers from heavy communication costs and the inevitable performance drop, especially when the local data is distributed in a non-IID way. To alleviate this problem, we propose two potential solutions by introducing additional mechanisms to the on-device training. The first (FedMMD) is adopting a two-stream model with the MMD (Maximum Mean Discrepancy) constraint instead of a single model in vanilla FedAvg to be trained on devices. Experiments show that the proposed method outperforms baselines, especially in non-IID FL settings, with a reduction of more than 20% in required communication rounds.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Wireless Communication Security Techniques

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings