Predict-and-Critic: Accelerated End-to-End Predictive Control for Cloud Computing through Reinforcement Learning
Kaustubh Sridhar, Vikramank Singh, Balakrishnan Narayanaswamy, Abishek, Sankararaman

TL;DR
This paper introduces the Predict-and-Critic framework, a reinforcement learning-based approach that enhances VM packing in cloud computing by efficiently approximating long-term costs, outperforming previous predict-and-optimize methods.
Contribution
The paper presents the Predict-and-Critic framework that scales end-to-end predictive control in cloud VM packing by using a terminal Q function, enabling better decision-making over long horizons.
Findings
PnC outperforms PnO in decision quality.
Back-propagating through constraints improves results.
Hardening soft constraints enhances decision accuracy.
Abstract
Cloud computing holds the promise of reduced costs through economies of scale. To realize this promise, cloud computing vendors typically solve sequential resource allocation problems, where customer workloads are packed on shared hardware. Virtual machines (VM) form the foundation of modern cloud computing as they help logically abstract user compute from shared physical infrastructure. Traditionally, VM packing problems are solved by predicting demand, followed by a Model Predictive Control (MPC) optimization over a future horizon. We introduce an approximate formulation of an industrial VM packing problem as an MILP with soft-constraints parameterized by the predictions. Recently, predict-and-optimize (PnO) was proposed for end-to-end training of prediction models by back-propagating the cost of decisions through the optimization problem. But, PnO is unable to scale to the large…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCloud Computing and Resource Management · Stochastic Gradient Optimization Techniques · IoT and Edge/Fog Computing
