Transparently Capturing Request Execution Path for Anomaly Detection
Yong Yang, Long Wang, Jing Gu, Ying Li

TL;DR
This paper introduces REPTrace, a transparent method for capturing complete request execution paths in distributed systems, enabling effective anomaly detection with high accuracy and minimal overhead.
Contribution
It proposes a generic, transparent methodology for capturing end-to-end execution paths and applies it to anomaly detection in distributed platforms.
Findings
High recall (96%) in anomaly detection
Captures execution paths with reasonable latency
Negligible network overhead
Abstract
With the increasing scale and complexity of cloud systems and big data analytics platforms, it is becoming more and more challenging to understand and diagnose the processing of a service request in such distributed platforms. One way that helps to deal with this problem is to capture the complete end-to-end execution path of service requests among all involved components accurately. This paper presents REPTrace, a generic methodology for capturing such execution paths in a transparent fashion. We analyze a comprehensive list of execution scenarios, and propose principles and algorithms for generating the end-to-end request execution path for all the scenarios. Moreover, this paper presents an anomaly detection approach exploiting request execution paths to detect anomalies of the execution during request processing. The experiments on four popular distributed platforms with different…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware System Performance and Reliability · Network Security and Intrusion Detection · Software Reliability and Analysis Research
