Eventually-Consistent Federated Scheduling for Data Center Workloads
Meghana Thiyyakat, Subramaniam Kalambur, Rishit Chaudhary, Saurav G, Nayak, Adarsh Shetty, Dinkar Sitaram

TL;DR
This paper presents Megha, a federated decentralized data center scheduling system that employs eventual consistency to improve scheduling speed and resource utilization at large scales, outperforming existing architectures.
Contribution
It introduces Megha, a novel scheduling architecture combining parallelism and eventual consistency, and demonstrates its superior performance through simulation and real cluster experiments.
Findings
Megha reduces job completion delays compared to other architectures.
Megha achieves higher resource utilization in data center workloads.
Experimental results show Megha's scalability and efficiency.
Abstract
Data center schedulers operate at unprecedented scales today to accommodate the growing demand for computing and storage power. The challenge that schedulers face is meeting the requirements of scheduling speeds despite the scale. To do so, most scheduler architectures use parallelism. However, these architectures consist of multiple parallel scheduling entities that can only utilize partial knowledge of the data center's state, as maintaining consistent global knowledge or state would involve considerable communication overhead. The disadvantage of scheduling without global knowledge is sub-optimal placements-tasks may be made to wait in queues even though there are resources available in zones outside the scope of the scheduling entity's state. This leads to unnecessary queuing overheads and lower resource utilization of the data center. In this paper, extend our previous work on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCloud Computing and Resource Management · Advanced Data Storage Technologies · Distributed and Parallel Computing Systems
