Anomaly Transformer: Time Series Anomaly Detection with Association   Discrepancy

Jiehui Xu; Haixu Wu; Jianmin Wang; Mingsheng Long

arXiv:2110.02642·cs.LG·June 30, 2022

Anomaly Transformer: Time Series Anomaly Detection with Association Discrepancy

Jiehui Xu, Haixu Wu, Jianmin Wang, Mingsheng Long

PDF

3 Repos 1 Video

TL;DR

The paper introduces the Anomaly Transformer, a novel model leveraging association discrepancy via self-attention to improve unsupervised time series anomaly detection, achieving state-of-the-art results across multiple benchmarks.

Contribution

It proposes the Anomaly Transformer with Anomaly-Attention mechanism and a minimax strategy to enhance anomaly detection by modeling association discrepancy.

Findings

01

Achieves state-of-the-art results on six benchmarks

02

Effectively distinguishes anomalies through association discrepancy

03

Demonstrates robustness across diverse application domains

Abstract

Unsupervised detection of anomaly points in time series is a challenging problem, which requires the model to derive a distinguishable criterion. Previous methods tackle the problem mainly through learning pointwise representation or pairwise association, however, neither is sufficient to reason about the intricate dynamics. Recently, Transformers have shown great power in unified modeling of pointwise representation and pairwise association, and we find that the self-attention weight distribution of each time point can embody rich association with the whole series. Our key observation is that due to the rarity of anomalies, it is extremely difficult to build nontrivial associations from abnormal points to the whole series, thereby, the anomalies' associations shall mainly concentrate on their adjacent time points. This adjacent-concentration bias implies an association-based criterion…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

Anomaly Transformer: Time Series Anomaly Detection with Association Discrepancy· slideslive

Taxonomy

Methodstravel james · Attention Is All You Need · Linear Layer · Position-Wise Feed-Forward Layer · Adam · Byte Pair Encoding · Dropout · Dense Connections · Label Smoothing · Softmax