Is AUC the best measure for practical comparison of anomaly detectors?

Vít Škvára; Tomáš Pevný; Václav Šmídl

arXiv:2305.04754·cs.LG·May 9, 2023·2 cites

TL;DR

This paper critically examines the effectiveness of AUC as a metric for anomaly detection, highlighting its limitations and proposing alternative evaluation approaches aligned with practical needs.

Contribution

The study questions AUC's suitability for anomaly detection and suggests that low false positive rate metrics and representative anomalous samples are crucial for meaningful comparisons.

Findings

01. AUC may not reflect practical detection needs.

02. Metrics emphasizing low false positive rates are more relevant.

03. Representative anomalous samples are essential for comparison.

Abstract

The area under the receiver operating characteristic curve (AUC) is the standard measure for comparing anomaly detectors. Its advantage is that it provides a single scalar that induces a natural ordering and is independent of a threshold, which allows the choice of threshold to be postponed. In this work, we question whether AUC is a good metric for anomaly detection, or whether it gives a false sense of comfort because it relies on assumptions that are unlikely to hold in practice. Our investigation shows that variations of AUC emphasizing accuracy at low false positive rates seem to be better correlated with the needs of practitioners, but also that we can compare anomaly detectors only when we have representative examples of anomalous samples. This last result is disturbing, as it suggests that in many cases we should do active or few-shot learning instead of pure anomaly detection.
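
To make the contrast between full AUC and a low-false-positive-rate variant concrete, here is a minimal sketch in plain Python. It is an illustration under stated assumptions, not the paper's evaluation code: the helper names `roc_points` and `auc_up_to`, the toy scores, and the cutoff of 0.25 are all hypothetical, and the truncated area shown here is only one of several possible low-FPR variants.

```python
def roc_points(scores, labels):
    """ROC curve as (fpr, tpr) lists; label 1 = anomaly, higher score = more anomalous."""
    pairs = sorted(zip(scores, labels), key=lambda p: -p[0])
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    fpr, tpr = [0.0], [0.0]
    tp = fp = 0
    i = 0
    while i < len(pairs):
        s = pairs[i][0]
        # Consume all samples sharing this score so ties yield one ROC point.
        while i < len(pairs) and pairs[i][0] == s:
            if pairs[i][1] == 1:
                tp += 1
            else:
                fp += 1
            i += 1
        fpr.append(fp / n_neg)
        tpr.append(tp / n_pos)
    return fpr, tpr


def auc_up_to(fpr, tpr, max_fpr=1.0):
    """Trapezoidal area under the ROC curve restricted to FPR in [0, max_fpr]."""
    area = 0.0
    for k in range(1, len(fpr)):
        x0, x1, y0, y1 = fpr[k - 1], fpr[k], tpr[k - 1], tpr[k]
        if x0 >= max_fpr:
            break
        if x1 > max_fpr:
            # Linearly interpolate the TPR at the cutoff.
            y1 = y0 + (y1 - y0) * (max_fpr - x0) / (x1 - x0)
            x1 = max_fpr
        area += (x1 - x0) * (y0 + y1) / 2
    return area


# Toy data (hypothetical): 4 anomalies followed by 4 normal samples.
labels = [1, 1, 1, 1, 0, 0, 0, 0]
# Detector A ranks three anomalies on top but misses one entirely.
scores_a = [0.9, 0.8, 0.7, 0.2, 0.6, 0.5, 0.4, 0.3]
# Detector B ranks one normal sample above every anomaly.
scores_b = [0.8, 0.7, 0.6, 0.5, 0.9, 0.4, 0.3, 0.2]

auc_a = auc_up_to(*roc_points(scores_a, labels))
auc_b = auc_up_to(*roc_points(scores_b, labels))
pauc_a = auc_up_to(*roc_points(scores_a, labels), max_fpr=0.25)
pauc_b = auc_up_to(*roc_points(scores_b, labels), max_fpr=0.25)

print(f"full AUC:         A={auc_a:.4f}  B={auc_b:.4f}")
print(f"AUC at FPR<=0.25: A={pauc_a:.4f}  B={pauc_b:.4f}")
```

On this toy data both detectors have the same full AUC (0.75), yet the area restricted to FPR ≤ 0.25 is 0.1875 for detector A and 0 for detector B. A practitioner who can tolerate only a few false alarms would therefore prefer A, a distinction the full AUC does not capture.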

Code & Models

Repositories

vitskvara/admetricevaluation.jl (Official)

Taxonomy

Topics: Anomaly Detection Techniques and Applications · Distributed Sensor Networks and Detection Algorithms · Advanced Statistical Process Monitoring