Capacity Allocation for Clouds with Parallel Processing, Batch Arrivals, and Heterogeneous Service Requirements
Eugene Furman, Arik Senderovich, Shane Bergsma, J. Christopher Beck

TL;DR
This paper presents a heuristic capacity allocation policy for cloud services with heterogeneous, batch arrivals and multiple resource types, improving efficiency and SLA satisfaction based on a queueing model and real data.
Contribution
It introduces a novel heuristic policy leveraging diffusion approximation for capacity planning in complex cloud environments with heterogeneous demands.
Findings
20% capacity reduction compared to benchmarks
Improved service quality and SLA compliance
Higher system utilization with less resource idling
Abstract
Problem Definition: Allocating sufficient capacity to cloud services is a challenging task, especially when demand is time-varying, heterogeneous, contains batches, and requires multiple types of resources for processing. In this setting, providers decide whether to reserve portions of their capacity to individual job classes or to offer it in a flexible manner. Methodology/results: In collaboration with Huawei Cloud, a worldwide provider of cloud services, we propose a heuristic policy that allocates multiple types of resources to jobs and also satisfies their pre-specified service level agreements (SLAs). We model the system as a multi-class queueing network with parallel processing and multiple types of resources, where arrivals (i.e., virtual machines and containers) follow time-varying patterns and require at least one unit of each resource for processing. While virtual machines…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Queuing Theory Analysis · Cloud Computing and Resource Management · Transportation and Mobility Innovations
