Scalable Group Management in Large-Scale Virtualized Clusters
Wei Zhou, Lei Wang, Dan Meng, Lin Yuan, Jianfeng Zhan

TL;DR
This paper presents a scalable, reliable management infrastructure for virtual clusters that ensures consistent member views efficiently, supporting large-scale virtual machine provisioning in data centers.
Contribution
It introduces a hybrid peer-to-peer and hierarchical management system, a lightweight membership algorithm, and a scalable service for virtual machine cluster management.
Findings
Verified on Dawning 5000A supercomputer
Achieved consistent member views within a single message round
Demonstrated scalability in large virtualized clusters
Abstract
To save cost, recently more and more users choose to provision virtual machine resources in cluster systems, especially in data centres. Maintaining a consistent member view is the foundation of reliable cluster managements, and it also raises several challenge issues for large scale cluster systems deployed with virtual machines (which we call virtualized clusters). In this paper, we introduce our experiences in design and implementation of scalable member view management on large-scale virtual clusters. Our research contributions are three-fold: 1) we propose a scalable and reliable management infrastructure that combines a peer-to-peer structure and a hierarchy structure to maintain a consistent member view in virtual clusters; 2) we present a light-weighted group membership algorithm that can reach the consistent member view within a single round of message exchange; and 3) we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCloud Computing and Resource Management · Distributed and Parallel Computing Systems · Peer-to-Peer Network Technologies
