Assessing pattern recognition or labeling in streams of temporal data
Pierre-Fran\c{c}ois Marteau (EXPRESSION)

TL;DR
This paper introduces an editing distance-based method for evaluating pattern recognition in streaming temporal data, enabling alignment, confusion matrix computation, and performance metrics including latency.
Contribution
It presents a novel approach using editing distance for assessing temporal pattern recognition, addressing challenges in streaming data evaluation.
Findings
Effective alignment of labeled segments with ground truth
Facilitates computation of confusion matrices and standard metrics
Includes latency measurement for early pattern detection
Abstract
In the data deluge context, pattern recognition or labeling in streams is becoming quite an essential and pressing task as data flows inside always bigger streams. The assessment of such tasks is not so easy when dealing with temporal data, namely patterns that have a duration (a beginning and an end time-stamp). This paper details an approach based on an editing distance to first align a sequence of labeled temporal segments with a ground truth sequence, and then, by back-tracing an optimal alignment path, to provide a confusion matrix at the label level. From this confusion matrix, standard evaluation measures can easily be derived as well as other measures such as the "latency" that can be quite important in (early) pattern detection applications.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTime Series Analysis and Forecasting · Data Stream Mining Techniques · Advanced Database Systems and Queries
