Adaptive Differential Filters for Fast and Communication-Efficient Federated Learning
Daniel Becking, Heiner Kirchhoffer, Gerhard Tech, Paul Haase, and Karsten M\"uller, Heiko Schwarz, Wojciech Samek

TL;DR
This paper introduces adaptive differential filters for federated learning that enhance convergence speed and significantly reduce communication costs by intelligently scaling sparse model updates.
Contribution
It proposes a novel filter-level scaling method that compensates for sparse updates, improves feature adaptation, and increases update sparsity for better compression.
Findings
Achieves up to 377x data transfer savings.
Improves central model performance across multiple vision tasks.
Converges faster than previous methods.
Abstract
Federated learning (FL) scenarios inherently generate a large communication overhead by frequently transmitting neural network updates between clients and server. To minimize the communication cost, introducing sparsity in conjunction with differential updates is a commonly used technique. However, sparse model updates can slow down convergence speed or unintentionally skip certain update aspects, e.g., learned features, if error accumulation is not properly addressed. In this work, we propose a new scaling method operating at the granularity of convolutional filters which 1) compensates for highly sparse updates in FL processes, 2) adapts the local models to new data domains by enhancing some features in the filter space while diminishing others and 3) motivates extra sparsity in updates and thus achieves higher compression ratios, i.e., savings in the overall data transfer. Compared…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Brain Tumor Detection and Classification
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
