Dynamic scheduling of virtual machines running hpc workloads in scientific grids
Omer Khalid, Ivo Maljevic, Richard Anthony, Miltos Petridis, Kevin, Parrot, Markus Schulz

TL;DR
This paper proposes an intelligent, real-time scheduling method for virtual machines running HPC workloads in scientific grids, aiming to optimize job completion within deadlines despite virtualization overheads.
Contribution
It introduces a novel dynamic scheduling approach that monitors workload types and overheads to improve deadline adherence in virtualized HPC environments.
Findings
Enhanced scheduling efficiency for virtualized HPC jobs.
Reduced deadline misses through workload-aware overhead management.
Improved resource utilization in scientific grid environments.
Abstract
The primary motivation for uptake of virtualization has been resource isolation, capacity management and resource customization allowing resource providers to consolidate their resources in virtual machines. Various approaches have been taken to integrate virtualization in to scientific Grids especially in the arena of High Performance Computing (HPC) to run grid jobs in virtual machines, thus enabling better provisioning of the underlying resources and customization of the execution environment on runtime. Despite the gains, virtualization layer also incur a performance penalty and its not very well understood that how such an overhead will impact the performance of systems where jobs are scheduled with tight deadlines. Since this overhead varies the types of workload whether they are memory intensive, CPU intensive or network I/O bound, and could lead to unpredictable deadline…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed and Parallel Computing Systems · Cloud Computing and Resource Management · Parallel Computing and Optimization Techniques
