A Novel Coded Computing Approach for Distributed Multi-Task Learning
Minquan Cheng, Yongkang Wang, Lingyu Zhang, and Youlong Wu

TL;DR
This paper introduces a new coded computing scheme for distributed multi-task learning that significantly reduces communication costs and achieves theoretical optimality, applicable to both homogeneous and heterogeneous environments.
Contribution
It proposes a novel coded scheme for DMTL that minimizes communication overhead and is optimal under mild conditions, extending to heterogeneous data placements.
Findings
Achieves the theoretical lower bound for communication cost.
Effective in both homogeneous and heterogeneous computing environments.
Extensible to distributed linear separable computation problems.
Abstract
Distributed multi-task learning (DMTL) effectively improves model generalization performance through the collaborative training of multiple related models. However, in large-scale learning scenarios, communication bottlenecks severely limit practical system performance. In this paper, we investigate the communication bottleneck within a typical DMTL system that employs non-linear global updates. This system involves distributed workers, assisted by a central server, who collaboratively learn distinct models derived from a non-linear aggregation of their local model parameters. We first characterize the communication process as a matrix decomposition problem. It transforms workers' data storage constraints into structural characteristics of the uplink encoding matrix, and worker data retrieval demands into Maximum Distance Separable (MDS) properties of the downlink encoding matrix.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
