Adaptive Differential Filters for Fast and Communication-Efficient   Federated Learning

Daniel Becking; Heiner Kirchhoffer; Gerhard Tech; Paul Haase; and Karsten M\"uller; Heiko Schwarz; Wojciech Samek

arXiv:2204.04424·cs.LG·April 12, 2022

Adaptive Differential Filters for Fast and Communication-Efficient Federated Learning

Daniel Becking, Heiner Kirchhoffer, Gerhard Tech, Paul Haase, and Karsten M\"uller, Heiko Schwarz, Wojciech Samek

PDF

Open Access

TL;DR

This paper introduces adaptive differential filters for federated learning that enhance convergence speed and significantly reduce communication costs by intelligently scaling sparse model updates.

Contribution

It proposes a novel filter-level scaling method that compensates for sparse updates, improves feature adaptation, and increases update sparsity for better compression.

Findings

01

Achieves up to 377x data transfer savings.

02

Improves central model performance across multiple vision tasks.

03

Converges faster than previous methods.

Abstract

Federated learning (FL) scenarios inherently generate a large communication overhead by frequently transmitting neural network updates between clients and server. To minimize the communication cost, introducing sparsity in conjunction with differential updates is a commonly used technique. However, sparse model updates can slow down convergence speed or unintentionally skip certain update aspects, e.g., learned features, if error accumulation is not properly addressed. In this work, we propose a new scaling method operating at the granularity of convolutional filters which 1) compensates for highly sparse updates in FL processes, 2) adapts the local models to new data domains by enhancing some features in the filter space while diminishing others and 3) motivates extra sparsity in updates and thus achieves higher compression ratios, i.e., savings in the overall data transfer. Compared…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Brain Tumor Detection and Classification

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings