AHA: Scalable Alternative History Analysis for Operational Timeseries Applications
Harshavardhan Kamarthi, Harshil Shah, Henry Milner, Sayan Sinha, Yan Li, B. Aditya Prakash, Vyas Sekar

TL;DR
AHA is a scalable system designed for cost-effective and accurate retrospective analysis of high-dimensional operational timeseries data, enabling reliable alternative history analysis for various applications.
Contribution
The paper introduces AHA, a novel system that achieves high accuracy and low operational costs for retrospective analysis of high-dimensional timeseries data, addressing limitations of traditional solutions.
Findings
AHA provides 100% accuracy across multiple tasks.
Achieves up to 85x reduction in total cost of ownership.
Demonstrated effectiveness on real-world datasets and production pipelines.
Abstract
Many operational systems collect high-dimensional timeseries data about users/systems on key performance metrics. For instance, ISPs, content distribution networks, and video delivery services collect quality of experience metrics for user sessions associated with metadata (e.g., location, device, ISP). Over such historical data, operators and data analysts often need to run retrospective analysis; e.g., analyze anomaly detection algorithms, experiment with different configurations for alerts, evaluate new algorithms, and so on. We refer to this class of workloads as alternative history analysis for operational datasets. We show that in such settings, traditional data processing solutions (e.g., data warehouses, sampling, sketching, big-data systems) either pose high operational costs or do not guarantee accurate replay. We design and implement a system, called AHA (Alternative History…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware System Performance and Reliability · Time Series Analysis and Forecasting · Anomaly Detection Techniques and Applications
