CloudProphet: A Machine Learning-Based Performance Prediction for Public Clouds
Darong Huang, Luis Costero, Ali Pahlevan, Marina Zapater, David, Atienza

TL;DR
CloudProphet introduces a machine learning approach to accurately predict application performance in public clouds by identifying applications and selecting relevant metrics, outperforming existing methods especially for variable workloads.
Contribution
The paper presents a novel ML-based performance prediction method that identifies applications inside VMs and selects correlated metrics, significantly improving prediction accuracy in black-box cloud environments.
Findings
Outperforms existing methods by over 2x in worst prediction error.
Effectively predicts performance for variable workload applications.
Validated across different servers and VM configurations.
Abstract
Computing servers have played a key role in developing and processing emerging compute-intensive applications in recent years. Consolidating multiple virtual machines (VMs) inside one server to run various applications introduces severe competence for limited resources among VMs. Many techniques such as VM scheduling and resource provisioning are proposed to maximize the cost-efficiency of the computing servers while alleviating the performance inference between VMs. However, these management techniques require accurate performance prediction of the application running inside the VM, which is challenging to get in the public cloud due to the black-box nature of the VMs. From this perspective, this paper proposes a novel machine learning-based performance prediction approach for applications running in the cloud. To achieve high accuracy predictions for black-box VMs, the proposed method…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCloud Computing and Resource Management · IoT and Edge/Fog Computing · Software System Performance and Reliability
