Cloud Performance Decomposition for Long-Term Performance Engineering: A Case Study
Shimul Debnath, William Hart, Lori Pollock, Donald Lien, Wei Wang

TL;DR
This paper introduces advanced time-series decomposition techniques for cloud performance analysis, enabling better trend detection, accurate performance prediction, and resource optimization across diverse cloud workloads.
Contribution
It proposes two novel decomposition methods that outperform basic approaches, revealing hidden patterns and improving performance prediction and resource management in cloud environments.
Findings
Decomposition methods accurately identify weekly and quarterly performance patterns.
Predictions using decomposed data achieve MAPE of around 2%.
Decomposition-guided resource scaling reduces latency variability by over 60%.
Abstract
Cloud performance fluctuates due to factors such as resource contention and workload changes. These factors can be short-term, seasonal, or long-term. Their effects are often intertwined in performance traces, making performance management difficult. Prior work on cloud performance engineering used time-series decomposition to separate these factors. However, existing approaches rely on basic decomposition methods that may miss key variation patterns and fail on traces with complex or intermittent patterns, limiting their usefulness across diverse cloud deployments. To address this limitation, we propose two time-series decomposition techniques for cloud performance engineering: a hybrid/manual method and a fully automatic method. Through a case study of 11 serverless functions, we show that both approaches can successfully and consistently reveal trends and seasonal cycles, such as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
