Predict-and-Critic: Accelerated End-to-End Predictive Control for Cloud   Computing through Reinforcement Learning

Kaustubh Sridhar; Vikramank Singh; Balakrishnan Narayanaswamy; Abishek; Sankararaman

arXiv:2212.01348·cs.LG·February 28, 2023

Predict-and-Critic: Accelerated End-to-End Predictive Control for Cloud Computing through Reinforcement Learning

Kaustubh Sridhar, Vikramank Singh, Balakrishnan Narayanaswamy, Abishek, Sankararaman

PDF

Open Access

TL;DR

This paper introduces the Predict-and-Critic framework, a reinforcement learning-based approach that enhances VM packing in cloud computing by efficiently approximating long-term costs, outperforming previous predict-and-optimize methods.

Contribution

The paper presents the Predict-and-Critic framework that scales end-to-end predictive control in cloud VM packing by using a terminal Q function, enabling better decision-making over long horizons.

Findings

01

PnC outperforms PnO in decision quality.

02

Back-propagating through constraints improves results.

03

Hardening soft constraints enhances decision accuracy.

Abstract

Cloud computing holds the promise of reduced costs through economies of scale. To realize this promise, cloud computing vendors typically solve sequential resource allocation problems, where customer workloads are packed on shared hardware. Virtual machines (VM) form the foundation of modern cloud computing as they help logically abstract user compute from shared physical infrastructure. Traditionally, VM packing problems are solved by predicting demand, followed by a Model Predictive Control (MPC) optimization over a future horizon. We introduce an approximate formulation of an industrial VM packing problem as an MILP with soft-constraints parameterized by the predictions. Recently, predict-and-optimize (PnO) was proposed for end-to-end training of prediction models by back-propagating the cost of decisions through the optimization problem. But, PnO is unable to scale to the large…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCloud Computing and Resource Management · Stochastic Gradient Optimization Techniques · IoT and Edge/Fog Computing