Copula Quadrant Similarity for Anomaly Scores

Matthew Davidow; David Matteson

arXiv:2101.02330·stat.ML·January 8, 2021·1 cites

Copula Quadrant Similarity for Anomaly Scores

Matthew Davidow, David Matteson

PDF

Open Access

TL;DR

This paper introduces a novel similarity measure for anomaly detection scores based on extremal similarity, enabling effective clustering of algorithms for robust ensemble detection.

Contribution

It proposes a new copula-based quadrant similarity measure focusing on extremal scores, facilitating comparison and clustering of diverse anomaly detection methods.

Findings

01

The method accurately clusters anomaly detection algorithms.

02

It improves ensemble robustness by selecting similar methods.

03

The approach outperforms existing dependence measures.

Abstract

Practical anomaly detection requires applying numerous approaches due to the inherent difficulty of unsupervised learning. Direct comparison between complex or opaque anomaly detection algorithms is intractable; we instead propose a framework for associating the scores of multiple methods. Our aim is to answer the question: how should one measure the similarity between anomaly scores generated by different methods? The scoring crux is the extremes, which identify the most anomalous observations. A pair of algorithms are defined here to be similar if they assign their highest scores to roughly the same small fraction of observations. To formalize this, we propose a measure based on extremal similarity in scoring distributions through a novel upper quadrant modeling approach, and contrast it with tail and other dependence measures. We illustrate our method with simulated and real…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Complex Systems and Time Series Analysis · Network Security and Intrusion Detection