Design and Optimization of Heterogeneous Coded Distributed Computing with Nonuniform File Popularity
Yong Deng, Min Dong

TL;DR
This paper introduces a novel heterogeneous coded distributed computing scheme that accounts for nonuniform file popularity, optimizing file placement and shuffling strategies to improve efficiency and reduce load in MapReduce frameworks.
Contribution
It proposes a flexible file placement and nested shuffling strategy for heterogeneous CDC with nonuniform file popularity, along with a low-complexity approximation method and a compressed CDC scheme for machine learning tasks.
Findings
Optimized CDC scheme outperforms alternatives in simulations.
Two-file-group-based approach achieves near-optimal performance with lower complexity.
C-CDC scheme significantly reduces shuffling load in machine learning applications.
Abstract
This paper studies MapReduce-based heterogeneous coded distributed computing (CDC) where, besides different computing capabilities at workers, input files to be accessed by computing jobs have nonuniform popularity. We propose a file placement strategy that can handle an arbitrary number of input files. Furthermore, we design a nested coded shuffling strategy that can efficiently manage the nonuniformity of file popularity to maximize the coded multicasting opportunity. We then formulate the joint optimization of the proposed file placement and nested shuffling design variables to optimize the proposed CDC scheme. To reduce the high computational complexity in solving the resulting mixed-integer linear programming (MILP) problem, we propose a simple two-file-group-based file placement approach to obtain an approximate solution. Numerical results show that the optimized CDC scheme…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Face and Expression Recognition
