Efficient SVDD Sampling with Approximation Guarantees for the Decision Boundary
Adrian Englhardt, Holger Trittenbach, Daniel Kottke, Bernhard Sick,, and Klemens B\"ohm

TL;DR
This paper introduces RAPID, an efficient sampling algorithm for SVDD that guarantees approximation of the decision boundary, improving scalability and accuracy in anomaly detection tasks.
Contribution
The paper formulates SVDD sampling as an optimization problem with guarantees and proposes RAPID, a parameter-free, scalable algorithm that outperforms existing methods.
Findings
RAPID achieves higher classification accuracy.
RAPID uses smaller sample sizes.
RAPID has faster runtime.
Abstract
Support Vector Data Description (SVDD) is a popular one-class classifiers for anomaly and novelty detection. But despite its effectiveness, SVDD does not scale well with data size. To avoid prohibitive training times, sampling methods select small subsets of the training data on which SVDD trains a decision boundary hopefully equivalent to the one obtained on the full data set. According to the literature, a good sample should therefore contain so-called boundary observations that SVDD would select as support vectors on the full data set. However, non-boundary observations also are essential to not fragment contiguous inlier regions and avoid poor classification accuracy. Other aspects, such as selecting a sufficiently representative sample, are important as well. But existing sampling methods largely overlook them, resulting in poor classification accuracy. In this article, we study…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Network Security and Intrusion Detection · Fault Detection and Control Systems
