Rank-Aware Resource Scheduling for Tightly-Coupled MPI Workloads on Kubernetes
Tianfang Xie

TL;DR
This paper introduces rank-aware resource scheduling on Kubernetes for MPI workloads, enabling fine-grained CPU provisioning and in-place scaling to improve cluster efficiency and performance in cloud environments.
Contribution
It proposes a novel rank-aware scheduling method with in-place CPU scaling for MPI on Kubernetes, reducing resource waste and improving performance.
Findings
Proportional CPU requests eliminate Linux CFS throttling.
Concentric decomposition with proportional CPU is 19% faster than Scotch baseline.
Proportional allocation reduces CPU provisioning by 82%, freeing scheduling resources.
Abstract
Fully provisioned Message Passing Interface (MPI) parallelism achieves near-optimal wall-clock time for Computational Fluid Dynamics (CFD) solvers. This work addresses a complementary question for shared, cloud-managed clusters: can fine-grained CPU provisioning reduce resource reservation of low-load subdomains, improving cluster packing efficiency without unacceptably degrading performance? We propose rank-aware resource scheduling on Kubernetes, mapping each MPI rank to a pod whose CPU request is proportional to its subdomain cell count. We also demonstrate In-Place Pod Vertical Scaling (Kubernetes v1.35 GA) for mid-simulation CPU adjustment without pod restart. Three findings emerge. First, hard CPU limits via the Linux CFS bandwidth controller cause 78x slowdown through cascading stalls at MPI_Allreduce barriers; requests-only allocation eliminates throttling entirely. Second,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Distributed and Parallel Computing Systems · Embedded Systems Design Techniques
