Multi-Level Service Performance Forecasting via Spatiotemporal Graph Neural Networks
Zhihao Xue, Yun Zi, Nia Qi, Ming Gong, Yujun Zou

TL;DR
This paper introduces a spatiotemporal graph neural network model for accurately forecasting distributed system performance, capturing complex dependencies and dynamic changes over time.
Contribution
It presents a novel end-to-end framework combining graph convolution and gated recurrent networks for multi-level service performance prediction.
Findings
Outperforms existing methods on key metrics like MAE, RMSE, R2
Maintains robustness under different load conditions
Effectively models non-stationary temporal sequences
Abstract
This paper proposes a spatiotemporal graph neural network-based performance prediction algorithm to address the challenge of forecasting performance fluctuations in distributed backend systems with multi-level service call structures. The method abstracts system states at different time slices into a sequence of graph structures. It integrates the runtime features of service nodes with the invocation relationships among services to construct a unified spatiotemporal modeling framework. The model first applies a graph convolutional network to extract high-order dependency information from the service topology. Then it uses a gated recurrent network to capture the dynamic evolution of performance metrics over time. A time encoding mechanism is also introduced to enhance the model's ability to represent non-stationary temporal sequences. The architecture is trained in an end-to-end manner,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware System Performance and Reliability · Traffic Prediction and Management Techniques · Cloud Computing and Resource Management
