DCSim: Computing and Networking Integration based Container Scheduling Simulator for Data Centers
Jinlong Hu, Zhizhe Rao, Xingchen Liu, Lihao Deng, Shoubin Dong

TL;DR
DCSim is a comprehensive container scheduling simulator for data centers that integrates detailed network and computing models, enabling more accurate performance evaluation of containerized workloads.
Contribution
The paper introduces DCSim, a novel simulator that combines network and computing resource modeling for container scheduling in data centers, filling a gap in existing simulation tools.
Findings
DCSim effectively models heterogeneous computing resources.
The simulator accurately simulates network communication and container scheduling.
Validation shows DCSim's capability to evaluate scheduling strategies.
Abstract
The increasing prevalence of cloud-native technologies, particularly containers, has led to the widespread adoption of containerized deployments in data centers. The advancement of deep neural network models has increased the demand for container-based distributed model training and inference, where frequent data transmission among nodes has emerged as a significant performance bottleneck. However, traditional container scheduling simulators often overlook the influence of network modeling on the efficiency of container scheduling, primarily concentrating on modeling computational resources. In this paper, we focus on a container scheduling simulator based on collaboration between computing and networking within data centers. We propose a new container scheduling simulator for data centers, named DCSim. The simulator consists of several modules: a data center module, a network…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCloud Computing and Resource Management · Distributed and Parallel Computing Systems · Advanced Data Storage Technologies
