NLP Based Anomaly Detection for Categorical Time Series

Matthew Horak; Sowmya Chandrasekaran; Giovanni Tobar

arXiv:2204.10483·cs.LG·April 25, 2022·1 cites

NLP Based Anomaly Detection for Categorical Time Series

Matthew Horak, Sowmya Chandrasekaran, Giovanni Tobar

PDF

Open Access

TL;DR

This paper introduces a novel NLP-inspired approach for anomaly detection in multi-dimensional categorical time series, leveraging language modeling techniques to improve detection and root cause analysis.

Contribution

It formalizes an analogy between categorical time series and NLP, developing and testing three machine learning models for anomaly detection and root cause analysis.

Findings

01

Effective anomaly detection in categorical time series

02

Improved root cause investigation capabilities

03

Demonstrated the strength of NLP analogy in this context

Abstract

Identifying anomalies in large multi-dimensional time series is a crucial and difficult task across multiple domains. Few methods exist in the literature that address this task when some of the variables are categorical in nature. We formalize an analogy between categorical time series and classical Natural Language Processing and demonstrate the strength of this analogy for anomaly detection and root cause investigation by implementing and testing three different machine learning anomaly detection and root cause investigation models based upon it.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTime Series Analysis and Forecasting · Anomaly Detection Techniques and Applications · Advanced Text Analysis Techniques