Anomaly detection using surprisals
Rob J Hyndman, David T. Frazier

TL;DR
This paper introduces a unified anomaly detection framework based on surprisal, providing robust tail probability estimators that effectively identify inlier anomalies even with model misspecification.
Contribution
It proposes a novel, model-robust anomaly detection method using surprisal and develops two estimators with theoretical guarantees under various conditions.
Findings
Effective detection of inlier anomalies in complex data.
Robustness to model misspecification demonstrated in simulations.
Theoretical guarantees for tail probability estimators.
Abstract
Anomaly detection methods are widely used but often rely on ad hoc rules or strong assumptions, and they often focus on tail events, missing ``inlier'' anomalies that occur in low-density gaps between modes. We propose a unified framework that defines an anomaly as an observation with unusually low probability under a (possibly misspecified) model. For each observation we compute its surprisal (the negative log generalized density) and define an anomaly score as the probability of a surprisal at least as large as that observed. This reduces anomaly detection for complex univariate or multivariate data to estimating the upper tail of a univariate surprisal distribution. We develop two model-robust estimators of these tail probabilities: an empirical estimator based on the observed surprisal distribution and an extreme-value estimator that fits a Generalized Pareto Distribution above a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Advanced Statistical Methods and Models · Financial Risk and Volatility Modeling
