Learning Service Slowdown using Observational Data
Xu Kuang, Gal Mendelson

TL;DR
This paper investigates how to detect service slowdowns in multi-server systems with adaptive congestion control using observational data, proposing a new robust statistic that outperforms traditional methods.
Contribution
It introduces a novel potential routing action-based statistic for more accurate slowdown detection in adaptive congestion control environments.
Findings
Traditional marginal congestion statistics can be highly inaccurate with adaptive control.
The proposed potential routing action statistic provides a more robust signal for slowdown detection.
Combining multiple orthogonal statistics improves reliability in observational slowdown analysis.
Abstract
Being able to identify service slowdowns is crucial to many operational problems. We study how to use observational congestion data to learn service slowdown in a multi-server system that uses adaptive congestion control mechanisms. We show that a commonly used summary statistic that relies on the marginal congestion measured at individual servers can be highly inaccurate in the presence of adaptive congestion control. We propose a new statistic based on potential routing actions, and show it provides a much more robust signal for server slowdown in these settings. Unlike the marginal statistic, potential action aims to detect changes in the routing actions, and is able to uncover slowdowns even when they do not reflect in marginal congestion. Our results highlight the complexity in performing observational statistical analysis for service systems in the presence of adaptive congestion…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNetwork Traffic and Congestion Control · Traffic Prediction and Management Techniques · Complex Network Analysis Techniques
