Optimizing egalitarian performance in the side-effects model of colocation for data center resource management
Fanny Pascual, Krzysztof Rzadca

TL;DR
This paper addresses the challenge of optimizing the worst-case performance in data center task colocation by developing approximation algorithms and heuristics that consider task heterogeneity and resource competition.
Contribution
It introduces a new model incorporating task types for performance impact, proves complexity results, and provides approximation algorithms and heuristics for egalitarian optimization.
Findings
Algorithms with type-awareness outperform standard makespan solutions.
A PTAS and heuristics effectively handle the problem for a fixed number of task types.
Simulation on real data demonstrates practical efficiency of proposed methods.
Abstract
In data centers, up to dozens of tasks are colocated on a single physical machine. Machines are used more efficiently, but tasks' performance deteriorates, as colocated tasks compete for shared resources. As tasks are heterogeneous, the resulting performance dependencies are complex. In our previous work [18] we proposed a new combinatorial optimization model that uses two parameters of a task - its size and its type - to characterize how a task influences the performance of other tasks allocated to the same machine. In this paper, we study the egalitarian optimization goal: maximizing the worst-off performance. This problem generalizes the classic makespan minimization on multiple processors (P||Cmax). We prove that polynomially-solvable variants of multiprocessor scheduling are NP-hard and hard to approximate when the number of types is not constant. For a constant number of types,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCloud Computing and Resource Management · Distributed and Parallel Computing Systems · Parallel Computing and Optimization Techniques
