Efficient SVDD Sampling with Approximation Guarantees for the Decision   Boundary

Adrian Englhardt; Holger Trittenbach; Daniel Kottke; Bernhard Sick,; and Klemens B\"ohm

arXiv:2009.13853·cs.LG·September 30, 2020·1 cites

Efficient SVDD Sampling with Approximation Guarantees for the Decision Boundary

Adrian Englhardt, Holger Trittenbach, Daniel Kottke, Bernhard Sick,, and Klemens B\"ohm

PDF

Open Access 2 Repos

TL;DR

This paper introduces RAPID, an efficient sampling algorithm for SVDD that guarantees approximation of the decision boundary, improving scalability and accuracy in anomaly detection tasks.

Contribution

The paper formulates SVDD sampling as an optimization problem with guarantees and proposes RAPID, a parameter-free, scalable algorithm that outperforms existing methods.

Findings

01

RAPID achieves higher classification accuracy.

02

RAPID uses smaller sample sizes.

03

RAPID has faster runtime.

Abstract

Support Vector Data Description (SVDD) is a popular one-class classifiers for anomaly and novelty detection. But despite its effectiveness, SVDD does not scale well with data size. To avoid prohibitive training times, sampling methods select small subsets of the training data on which SVDD trains a decision boundary hopefully equivalent to the one obtained on the full data set. According to the literature, a good sample should therefore contain so-called boundary observations that SVDD would select as support vectors on the full data set. However, non-boundary observations also are essential to not fragment contiguous inlier regions and avoid poor classification accuracy. Other aspects, such as selecting a sufficiently representative sample, are important as well. But existing sampling methods largely overlook them, resulting in poor classification accuracy. In this article, we study…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Network Security and Intrusion Detection · Fault Detection and Control Systems