A Scenario-Oriented Benchmark for Assessing AIOps Algorithms in Microservice Management
Yongqian Sun, Jiaju Wang, Zhengdan Li, Xiaohui Nie, Minghua Ma, Shenglin Zhang, Yuhe Ji, Lu Zhang, Wen Long, Hengmao Chen, Yongnan Luo, Dan Pei

TL;DR
This paper introduces MicroServo, a real-time, scenario-oriented benchmark for evaluating AIOps algorithms in microservice management, addressing the limitations of static offline datasets.
Contribution
It presents a live microservice benchmark framework that generates real-time datasets and simulates specific operation scenarios for more effective algorithm evaluation.
Findings
MicroServo effectively evaluates AIOps algorithms in real-time scenarios.
It supports multiple operation scenarios and algorithm hot-plugging.
It demonstrates efficiency and usability in three typical microservice scenarios.
Abstract
AIOps algorithms play a crucial role in the maintenance of microservice systems. The performance leaderboards of many previous benchmarks provide valuable guidance for selecting appropriate algorithms. However, existing AIOps benchmarks mainly use offline datasets to evaluate algorithms. They cannot consistently evaluate the performance of algorithms on real-time datasets, and their operation scenarios for evaluation are static, which is insufficient for effective algorithm selection. To address these issues, we propose an evaluation-consistent and scenario-oriented evaluation framework named MicroServo. The core idea is to build a live microservice benchmark to generate real-time datasets and consistently simulate the specific operation scenarios on it. MicroServo supports different leaderboards by selecting specific algorithms and datasets according to the operation scenarios. It also…
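The idea of per-scenario leaderboards with hot-pluggable algorithms can be illustrated with a minimal Python sketch. This is not MicroServo's actual API: the names `Scenario`, `AlgorithmRegistry`, `plug`, `unplug`, and `leaderboard`, the toy detectors, and the data are all hypothetical, and F1 is assumed here as the ranking metric purely for illustration.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Sequence, Tuple

# Hypothetical type: a detector maps a metric series to 0/1 anomaly flags.
Detector = Callable[[Sequence[float]], List[int]]


@dataclass
class Scenario:
    """A hypothetical operation scenario: a metric series plus ground truth."""
    name: str
    series: Sequence[float]
    labels: Sequence[int]


class AlgorithmRegistry:
    """Algorithms can be added or removed at run time ("hot-plugging")."""

    def __init__(self) -> None:
        self._algorithms: Dict[str, Detector] = {}

    def plug(self, name: str, algorithm: Detector) -> None:
        self._algorithms[name] = algorithm

    def unplug(self, name: str) -> None:
        self._algorithms.pop(name, None)

    def items(self) -> List[Tuple[str, Detector]]:
        return list(self._algorithms.items())


def f1_score(predicted: Sequence[int], truth: Sequence[int]) -> float:
    """F1 = 2TP / (2TP + FP + FN); defined as 0.0 when the denominator is 0."""
    tp = sum(1 for p, t in zip(predicted, truth) if p == 1 and t == 1)
    fp = sum(1 for p, t in zip(predicted, truth) if p == 1 and t == 0)
    fn = sum(1 for p, t in zip(predicted, truth) if p == 0 and t == 1)
    denominator = 2 * tp + fp + fn
    return 2 * tp / denominator if denominator else 0.0


def leaderboard(registry: AlgorithmRegistry, scenario: Scenario) -> List[Tuple[str, float]]:
    """Score every plugged-in algorithm on one scenario, best first."""
    scores = [
        (name, f1_score(algorithm(scenario.series), scenario.labels))
        for name, algorithm in registry.items()
    ]
    return sorted(scores, key=lambda entry: entry[1], reverse=True)


# Toy data (made up for illustration): two latency spikes marked as anomalies.
scenario = Scenario(
    name="latency-spike",
    series=[1.0, 1.1, 0.9, 5.0, 1.0, 6.0],
    labels=[0, 0, 0, 1, 0, 1],
)

registry = AlgorithmRegistry()
registry.plug("threshold", lambda s: [1 if x > 3.0 else 0 for x in s])
registry.plug("always_alert", lambda s: [1 for _ in s])

board = leaderboard(registry, scenario)
print(board)

# Hot-unplug one algorithm and rebuild the leaderboard without restarting.
registry.unplug("always_alert")
board_after_unplug = leaderboard(registry, scenario)
```

In this sketch the leaderboard is recomputed from whatever is currently registered, so plugging or unplugging an algorithm changes the ranking for a scenario without touching the evaluation code.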
Taxonomy
Topics: Software System Performance and Reliability · Cloud Computing and Resource Management · Service-Oriented Architecture and Web Services
