Container Orchestration on HPC Systems
Naweiluo Zhou, Yiannis Georgiou, Li Zhong, Huan Zhou, Marcin, Pospieszny

TL;DR
This paper introduces Torque-Operator, a plugin that bridges HPC workload managers with container orchestrators like Kubernetes, enhancing container management and micro-services support in HPC systems.
Contribution
The paper presents Torque-Operator, a novel plugin that integrates HPC workload managers with container orchestrators, addressing micro-services support limitations.
Findings
Improved container management in HPC systems
Enhanced micro-services support in HPC clusters
Successful integration of HPC workload managers with Kubernetes
Abstract
Containerisation demonstrates its efficiency in application deployment in cloud computing. Containers can encapsulate complex programs with their dependencies in isolated environments, hence are being adopted in HPC clusters. HPC workload managers lack micro-services support and deeply integrated container management, as opposed to container orchestrators (e.g. Kubernetes). We introduce Torque-Operator (a plugin) which serves as a bridge between HPC workload managers and container Orchestrators.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCloud Computing and Resource Management · Distributed and Parallel Computing Systems · Scientific Computing and Data Management
