Real-time Top-K Predictive Query Processing over Event Streams
Saurav Acharya, Byung Suk Lee, Paul Hines

TL;DR
This paper introduces a real-time top-k event prediction method over streaming data that dynamically learns causal relationships, overcoming limitations of previous approaches that ignored cyclic causality and conservative assumptions.
Contribution
It proposes a novel event precedence model using an incremental Markov chain and a run-time causal inference mechanism for more accurate, efficient top-k event predictions.
Findings
Reduced runtime by 25-80% with minimal accuracy loss.
Effectively models cyclic causality in event streams.
Validated on power system and web data.
Abstract
This paper addresses the problem of predicting the k events that are most likely to occur next, over historical real-time event streams. Existing approaches to causal prediction queries have a number of limitations. First, they exhaustively search over an acyclic causal network to find the most likely k effect events; however, data from real event streams frequently reflect cyclic causality. Second, they contain conservative assumptions intended to exclude all possible non-causal links in the causal network; it leads to the omission of many less-frequent but important causal links. We overcome these limitations by proposing a novel event precedence model and a run-time causal inference mechanism. The event precedence model constructs a first order absorbing Markov chain incrementally over event streams, where an edge between two events signifies a temporal precedence relationship…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Data Management and Algorithms · Data Stream Mining Techniques
