Anomaly detection using surprisals

Rob J Hyndman; David T. Frazier

arXiv:2603.09318·stat.ME·March 11, 2026

Anomaly detection using surprisals

Rob J Hyndman, David T. Frazier

PDF

Open Access

TL;DR

This paper introduces a unified anomaly detection framework based on surprisal, providing robust tail probability estimators that effectively identify inlier anomalies even with model misspecification.

Contribution

It proposes a novel, model-robust anomaly detection method using surprisal and develops two estimators with theoretical guarantees under various conditions.

Findings

01

Effective detection of inlier anomalies in complex data.

02

Robustness to model misspecification demonstrated in simulations.

03

Theoretical guarantees for tail probability estimators.

Abstract

Anomaly detection methods are widely used but often rely on ad hoc rules or strong assumptions, and they often focus on tail events, missing ``inlier'' anomalies that occur in low-density gaps between modes. We propose a unified framework that defines an anomaly as an observation with unusually low probability under a (possibly misspecified) model. For each observation we compute its surprisal (the negative log generalized density) and define an anomaly score as the probability of a surprisal at least as large as that observed. This reduces anomaly detection for complex univariate or multivariate data to estimating the upper tail of a univariate surprisal distribution. We develop two model-robust estimators of these tail probabilities: an empirical estimator based on the observed surprisal distribution and an extreme-value estimator that fits a Generalized Pareto Distribution above a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Advanced Statistical Methods and Models · Financial Risk and Volatility Modeling