Two-Stage Coded Distributed Edge Learning: A Dynamic Partial Gradient Coding Perspective
Tingting Yang, Xinghan Wang, Jiahong Ning, Yang Yang

TL;DR
This paper introduces a lightweight, two-stage dynamic coding scheme and a fair transmission protocol to effectively mitigate stragglers in distributed edge learning, reducing overhead and improving accuracy and resource efficiency.
Contribution
It proposes a novel two-stage dynamic coding method combined with a fair transmission protocol, addressing straggler issues with lower complexity and overhead.
Findings
Outperforms existing methods in accuracy and resource utilization.
Effective under practical network conditions and benchmark datasets.
Reduces encoding and decoding complexity linearly with number of workers.
Abstract
The widespread adoption of distributed learning to train a global model from local data has been hindered by the challenge posed by stragglers. Recent attempts to mitigate this issue through gradient coding have proved difficult due to the large amounts of data redundancy, computational and communicational overhead it brings. Additionally, the complexity of encoding and decoding increases linearly with the number of local workers. In this paper, we present a lightweight coding method for the computing phase and a fair transmission protocol for the communication phase, to mitigate the straggler problem. A two-stage dynamic coding scheme is proposed for the computing phase, where partial gradients are computed by a portion of workers in the first stage and the remainder are decided based on their completion status in the first stage. To ensure fair communication, a perturbed Lyapunov…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Distributed Sensor Networks and Detection Algorithms · Distributed Control Multi-Agent Systems
