Unsupervised Event Outlier Detection in Continuous Time

Somjit Nath; Yik Chau Lui; Siqi Liu

arXiv:2411.16427·cs.LG·November 26, 2024

Unsupervised Event Outlier Detection in Continuous Time

Somjit Nath, Yik Chau Lui, Siqi Liu

PDF

Open Access 4 Reviews

TL;DR

This paper introduces an unsupervised outlier detection method for continuous-time event sequences using a GAN-RL framework, enabling online detection of anomalies with improved accuracy over existing methods.

Contribution

It presents the first unsupervised outlier detection approach for event sequences based on GANs and reinforcement learning, capable of online anomaly detection.

Findings

01

Outperforms state-of-the-art outlier detection methods

02

Effective in online detection scenarios

03

Utilizes data augmentation via generator mistakes

Abstract

Event sequence data record the occurrences of events in continuous time. Event sequence forecasting based on temporal point processes (TPPs) has been extensively studied, but outlier or anomaly detection, especially without any supervision from humans, is still underexplored. In this work, we develop, to the best our knowledge, the first unsupervised outlier detection approach to detecting abnormal events. Our novel unsupervised outlier detection framework is based on ideas from generative adversarial networks (GANs) and reinforcement learning (RL). We train a 'generator' that corrects outliers in the data with a 'discriminator' that learns to discriminate the corrected data from the real data, which may contain outliers. A key insight is that if the generator made a mistake in the correction, it would generate anomalies that are different from the anomalies in the real data, so it…

Peer Reviews

Decision·ICLR 2024 Conference Withdrawn Submission

Reviewer 01Rating 6· marginally above the acceptance thresholdConfidence 3

Strengths

+ The paper's focus on unsupervised methods which is a real setting for event detection. + The paper introduces an unsupervised outlier detection in event sequence data. The combination of Generative Adversarial Networks (GANs) and Reinforcement Learning (RL), to detect abnormal events and unexpected absence of events, is interesting and innovative.

Weaknesses

+ The paper does not discuss the scalability of the proposed method, especially concerning large-scale event sequence data. Considering the potential applications in real-time systems where massive event data streams are common, addressing scalability concerns and optimizing the method for efficient processing of large datasets would be essential. + While the paper compares the proposed method with semi-supervised approaches, a broader comparative analysis against various unsupervised outlier de

Reviewer 02Rating 5· marginally below the acceptance thresholdConfidence 4

Strengths

The problem formulation sounds plausible. The proposed network architecture is sensible.

Weaknesses

W1: The claim to be the first paper on outlier detection in an event sequence is somewhat strong. There are already works on the same theme as listed out here: Manish Gupta, Jing Gao, Charu C. Aggarwal, Jiawei Han: Outlier Detection for Temporal Data: A Survey. IEEE Trans. Knowl. Data Eng. 26(9): 2250-2267 (2014) Please double check the above paper for potential baselines. ------ W2: The paper should include at least a diagram about the model architecture. This will greatly improve readabili

Reviewer 03Rating 3· reject, not good enoughConfidence 4

Strengths

1. The paper's use of reinforcement learning is clever and is implemented in a way that corresponds well to the intended problem of finding errors of commission and errors of omission together with where the event should appear within the sequence. 2. The paper's study of the impact of the amount of corruption in the data is good---it should be an obvious step that is always taken, but is actually rarely done.

Weaknesses

1. The paper did not evaluate the accuracy of the RL agent that predicts the time at which the omitted event should occur. 2. There are several unsupervised anomaly detection methods for time series against which the authors should compare their algorithm: Examples include: Das, et. al., Multiple Kernel Learning for Heterogeneous Anomaly Detection, KDD-2010 Melnyk, et. al., Semi-Markov Switching Vector Autoregressive Model-Based Anomaly Detection in Aviation Systems, KDD-2016. Memarzadeh, M., et

Reviewer 04Rating 5· marginally below the acceptance thresholdConfidence 4

Strengths

The paper proposes a strong method for event outlier detection in continuous time. Its combination of different ideas from deep reinforcement learning and neural sequence models is novel and interesting, and may be widely applicable in other tasks. It would be interesting to try the ideas in the paper, for example, in time series anomaly detection. The paper also makes a significant effort to rigorously set up the event outlier detection task where an outlier / anomaly may simply be that an ev

Weaknesses

The main weakness of the paper is its lack of ablation studies and reproducibility. The paper introduces an overly complex architecture for a task where naive baselines are often available in practice. The architecture comprises an LSTM for encoding, multiple RL agents (where the policy of one depends on the policy of the other--and not vice versa), a discriminator architecture that could also be seen as the "reward function," design choices in the neural networks in the form of "continuous time

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Fault Detection and Control Systems · Advanced Statistical Methods and Models