Isolation Distributional Kernel: A New Tool for Point & Group Anomaly Detection
Kai Ming Ting, Bi-Cun Xu, Takashi Washio, Zhi-Hua Zhou

TL;DR
This paper introduces the Isolation Distributional Kernel (IDK), a data-dependent kernel for improved point and group anomaly detection that outperforms existing methods in accuracy and speed.
Contribution
The paper proposes IDK, a novel data-dependent kernel, and IDK$^2$, a fast group anomaly detector, addressing key limitations of previous kernel mean embedding approaches.
Findings
IDK outperforms OCSVM and other kernel methods in point anomaly detection.
IDK$^2$ is significantly faster than existing group anomaly detectors.
Data-dependent characteristic kernels are essential for effective kernel-based anomaly detection.
Abstract
We introduce Isolation Distributional Kernel as a new way to measure the similarity between two distributions. Existing approaches based on kernel mean embedding, which convert a point kernel to a distributional kernel, have two key issues: the point kernel employed has a feature map with intractable dimensionality; and it is {\em data independent}. This paper shows that Isolation Distributional Kernel (IDK), which is based on a {\em data dependent} point kernel, addresses both key issues. We demonstrate IDK's efficacy and efficiency as a new tool for kernel based anomaly detection for both point and group anomalies. Without explicit learning, using IDK alone outperforms existing kernel based point anomaly detector OCSVM and other kernel mean embedding methods that rely on Gaussian kernel. For group anomaly detection,we introduce an IDK based detector called IDK. It reformulates the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Data-Driven Disease Surveillance · Domain Adaptation and Few-Shot Learning
