Reliable Provisioning of Spot Instances for Compute-intensive Applications
William Voorsluys, Rajkumar Buyya

TL;DR
This paper presents a fault-aware resource provisioning strategy for running compute-intensive applications on spot instances, combining price estimation, runtime prediction, and fault tolerance techniques to ensure reliable and economical execution.
Contribution
It introduces a novel resource allocation policy that integrates fault tolerance methods with price and runtime estimation for spot instances.
Findings
Effective execution of applications on spot instances despite failures
Improved cost efficiency with fault-aware provisioning
Successful maintenance of QoS constraints during execution
Abstract
Cloud computing providers are now offering their unused resources for leasing in the spot market, which has been considered the first step towards a full-fledged market economy for computational resources. Spot instances are virtual machines (VMs) available at lower prices than their standard on-demand counterparts. These VMs will run for as long as the current price is lower than the maximum bid price users are willing to pay per hour. Spot instances have been increasingly used for executing compute-intensive applications. In spite of an apparent economical advantage, due to an intermittent nature of biddable resources, application execution times may be prolonged or they may not finish at all. This paper proposes a resource allocation strategy that addresses the problem of running compute-intensive jobs on a pool of intermittent virtual machines, while also aiming to run applications…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCloud Computing and Resource Management · Distributed systems and fault tolerance · Distributed and Parallel Computing Systems
