An Efficient Gradient-Aware Error-Bounded Lossy Compressor for Federated Learning
Zhijing Ye, Sheng Di, Jiamin Wang, Zhiqing Zhong, Zhaorui Zhang, Xiaodong Yu

TL;DR
This paper introduces a novel error-bounded lossy gradient compressor tailored for federated learning, significantly reducing communication costs while maintaining model accuracy by exploiting temporal and structural correlations.
Contribution
It proposes a new prediction mechanism for gradient compression in FL that leverages temporal and kernel-level structures, outperforming existing methods like SZ in compression ratio and efficiency.
Findings
Achieves up to 1.53x higher compression ratios than SZ3.
Reduces communication time by up to 96.2% in FL scenarios.
Maintains model accuracy with lower loss compared to existing compressors.
Abstract
Federated learning (FL) enables collaborative model training without exposing clients' private data, but its deployment is often constrained by the communication cost of transmitting gradients between clients and the central server, especially under system heterogeneity where low-bandwidth clients bottleneck overall performance. Lossy compression of gradient data can mitigate this overhead, and error-bounded lossy compression (EBLC) is particularly appealing for its fine-grained utility-compression tradeoff. However, existing EBLC methods (e.g., SZ), originally designed for smooth scientific data with strong spatial locality, rely on generic predictors such as Lorenzo and interpolation for entropy reduction to improve compression ratio. Gradient tensors, in contrast, exhibit low smoothness and weak spatial correlation, rendering these predictors ineffective and leading to poor…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Domain Adaptation and Few-Shot Learning
