Federated Learning with Lossy Distributed Source Coding: Analysis and Optimization
Huiyuan Yang, Tian Ding, Xiaojun Yuan

TL;DR
This paper analyzes the fundamental trade-off between communication cost and convergence in federated learning using rate-distortion theory, proposing optimization algorithms to improve model aggregation efficiency.
Contribution
It introduces a rate-distortion based framework for analyzing FL aggregation, deriving bounds and proposing algorithms to optimize the trade-off.
Findings
Derived an inner bound of the rate-distortion region for model aggregation.
Connected aggregation distortion to FL convergence performance.
Numerical results show potential for improving existing aggregation schemes.
Abstract
Recently, federated learning (FL), which replaces data sharing with model sharing, has emerged as an efficient and privacy-friendly machine learning (ML) paradigm. One of the main challenges in FL is the huge communication cost for model aggregation. Many compression/quantization schemes have been proposed to reduce the communication cost for model aggregation. However, the following question remains unanswered: What is the fundamental trade-off between the communication cost and the FL convergence performance? In this paper, we manage to answer this question. Specifically, we first put forth a general framework for model aggregation performance analysis based on the rate-distortion theory. Under the proposed analysis framework, we derive an inner bound of the rate-distortion region of model aggregation. We then conduct an FL convergence analysis to connect the aggregation distortion…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Distributed Sensor Networks and Detection Algorithms
