NL-CPS: Reinforcement Learning-Based Kubernetes Control Plane Placement in Multi-Region Clusters
Sajid Alam, Amjad Ullah, Ze Wang

TL;DR
This paper presents a reinforcement learning framework for optimizing Kubernetes control-plane node placement in multi-region environments, improving cluster performance and resilience.
Contribution
It introduces a neural contextual bandit-based RL method for dynamic control-plane placement across multi-region cloud-edge resources.
Findings
Significant performance improvements over baseline placement strategies.
Effective learning of optimal placement policies from infrastructure data.
Enhanced cluster reliability and scalability in geographically distributed setups.
Abstract
The placement of Kubernetes control-plane nodes is critical to ensuring cluster reliability, scalability, and performance, and therefore represents a significant deployment challenge in heterogeneous, multi-region environments. Existing initialisation procedures typically select control-plane hosts arbitrarily, without considering node resource capacity or network topology, often leading to suboptimal cluster performance and reduced resilience. Given Kubernetes's status as the de facto standard for container orchestration, there is a need to rigorously evaluate how control-plane node placement influences the overall performance of the cluster operating across multiple regions. This paper advances this goal by introducing an intelligent methodology for selecting control-plane node placement across dynamically selected Cloud-Edge resources spanning multiple regions, as part of an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
