TL;DR
This paper introduces a practical algorithm for dynamic resource allocation in cloud computing that balances high utilization with SLA compliance, using a multiplicative weight update approach and validated on real data.
Contribution
It presents a near-optimal, simple algorithm for resource allocation that effectively manages the tradeoff between utilization and SLA satisfaction in cloud systems.
Findings
Achieves near-optimal resource utilization compared to offline optimal.
Satisfies all SLAs with minimal error.
Validated performance on real cloud data.
Abstract
Cloud computing has motivated renewed interest in resource allocation problems with new consumption models. A common goal is to share a resource, such as CPU or I/O bandwidth, among distinct users with different demand patterns as well as different quality of service requirements. To ensure these service requirements, cloud offerings often come with a service level agreement (SLA) between the provider and the users. An SLA specifies the amount of a resource a user is entitled to utilize. In many cloud settings, providers would like to operate resources at high utilization while simultaneously respecting individual SLAs. There is typically a tradeoff between these two objectives; for example, utilization can be increased by shifting away resources from idle users to "scavenger" workload, but with the risk of the former then becoming active again. We study this fundamental tradeoff by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
