Learning Efficient Anomaly Detectors from $K$-NN Graphs
Jing Qian, Jonathan Root, Venkatesh Saligrama

TL;DR
This paper introduces a non-parametric anomaly detection method using $K$-NN graphs that is asymptotically optimal and computationally efficient, outperforming existing algorithms on synthetic and real data.
Contribution
It develops a novel $K$-NN based anomaly detection approach that learns to rank anomalies efficiently with theoretical optimality guarantees.
Findings
Outperforms existing $K$-NN anomaly detection methods.
Achieves significant computational savings.
Demonstrates asymptotic optimality in anomaly detection.
Abstract
We propose a non-parametric anomaly detection algorithm for high dimensional data. We score each datapoint by its average -NN distance, and rank them accordingly. We then train limited complexity models to imitate these scores based on the max-margin learning-to-rank framework. A test-point is declared as an anomaly at -false alarm level if the predicted score is in the -percentile. The resulting anomaly detector is shown to be asymptotically optimal in that for any false alarm rate , its decision region converges to the -percentile minimum volume level set of the unknown underlying density. In addition, we test both the statistical performance and computational efficiency of our algorithm on a number of synthetic and real-data experiments. Our results demonstrate the superiority of our algorithm over existing -NN based anomaly detection algorithms,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Data Stream Mining Techniques · Network Security and Intrusion Detection
