ONCE and ONCE+: Counting the Frequency of Time-constrained Serial Episodes in a Streaming Sequence
Hui Li, Sizhe Peng, Jian Li, Jingjing Li, Jiangtao Cui and, Jianfeng Ma

TL;DR
This paper introduces two efficient one-pass algorithms, ONCE and ONCE+, for counting time-constrained serial episodes in streaming data, addressing limitations of previous methods by being stream-oriented and time-aware.
Contribution
The paper presents novel algorithms that accurately count serial episodes with time constraints in streaming data, supporting real-time processing with minimal resources.
Findings
Algorithms work efficiently on real-world and synthetic datasets.
They handle millions of signals per second with low time and space complexity.
Successful application demonstrated in a real stream processing system.
Abstract
As a representative sequential pattern mining problem, counting the frequency of serial episodes from a streaming sequence has drawn continuous attention in academia due to its wide application in practice, e.g., telecommunication alarms, stock market, transaction logs, bioinformatics, etc. Although a number of serial episodes mining algorithms have been developed recently, most of them are neither stream-oriented, as they require multi-pass of dataset, nor time-aware, as they fail to take into account the time constraint of serial episodes. In this paper, we propose two novel one-pass algorithms, ONCE and ONCE+, each of which can respectively compute two popular frequencies of given episodes satisfying predefined time-constraint as signals in a stream arrives one-after-another. ONCE is only used for non-overlapped frequency where the occurrences of a serial episode in sequence are not…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Mining Algorithms and Applications · Time Series Analysis and Forecasting · Algorithms and Data Compression
