Utility-based Resource Allocation and Pricing for Serverless Computing
Vipul Gupta, Soham Phade, Thomas Courtade, Kannan Ramchandran

TL;DR
This paper introduces a utility-based resource allocation and dynamic pricing scheme for serverless computing that maximizes social welfare by considering customer delay sensitivities and converges to optimal allocation through feedback mechanisms.
Contribution
It proposes a novel scheduler and pricing scheme that optimizes resource allocation in serverless computing using utility functions and dual optimization, improving efficiency and social welfare.
Findings
Achieves higher social welfare compared to existing schemes.
Converges to optimal resource allocation with private utility information.
Effectively tracks market demand through simulations.
Abstract
Serverless computing platforms currently rely on basic pricing schemes that are static and do not reflect customer feedback. This leads to significant inefficiencies from a total utility perspective. As one of the fastest-growing cloud services, serverless computing provides an opportunity to better serve both users and providers through the incorporation of market-based strategies for pricing and resource allocation. With the help of utility functions to model the delay-sensitivity of customers, we propose a novel scheduler to allocate resources for serverless computing. The resulting resource allocation scheme is optimal in the sense that it maximizes the aggregate utility of all users across the system, thus maximizing social welfare. Our approach gives rise to a natural dynamic pricing scheme that is obtained by solving an optimization problem in its dual form. We further develop…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCloud Computing and Resource Management · Peer-to-Peer Network Technologies · Distributed and Parallel Computing Systems
