Learning Minimum Volume Sets and Anomaly Detectors from KNN Graphs
Jonathan Root, Venkatesh Saligrama, Jing Qian

TL;DR
This paper introduces a non-parametric anomaly detection method for high-dimensional data that learns to approximate minimum volume sets using KNN graphs and max-margin ranking, achieving asymptotic optimality and computational efficiency.
Contribution
It presents a novel anomaly detection algorithm that combines KNN graph scores with max-margin learning to efficiently approximate density level sets with proven asymptotic optimality.
Findings
Outperforms existing KNN-based anomaly detection methods.
Achieves significant computational savings.
Demonstrates asymptotic optimality in detecting anomalies.
Abstract
We propose a non-parametric anomaly detection algorithm for high dimensional data. We first rank scores derived from nearest neighbor graphs on -point nominal training data. We then train limited complexity models to imitate these scores based on the max-margin learning-to-rank framework. A test-point is declared as an anomaly at -false alarm level if the predicted score is in the -percentile. The resulting anomaly detector is shown to be asymptotically optimal in that for any false alarm rate , its decision region converges to the -percentile minimum volume level set of the unknown underlying density. In addition, we test both the statistical performance and computational efficiency of our algorithm on a number of synthetic and real-data experiments. Our results demonstrate the superiority of our algorithm over existing -NN based anomaly detection…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Network Security and Intrusion Detection · Data-Driven Disease Surveillance
