Deep Orthogonal Hypersphere Compression for Anomaly Detection
Yunhe Zhang, Yan Sun, Jinyu Cai, Jicong Fan

TL;DR
This paper introduces a novel deep anomaly detection approach using orthogonal projections and bi-hypersphere compression to create more compact and accurate decision boundaries, applicable to various data types including graphs.
Contribution
It proposes a new deep hypersphere learning model with orthogonal projection and bi-hypersphere compression, extending anomaly detection to graph data with improved accuracy.
Findings
Enhanced true positive rates and reduced false negatives.
More compact decision regions than traditional hyperballs.
Superior performance on benchmark datasets.
Abstract
Many well-known and effective anomaly detection methods assume that a reasonable decision boundary has a hypersphere shape, which however is difficult to obtain in practice and is not sufficiently compact, especially when the data are in high-dimensional spaces. In this paper, we first propose a novel deep anomaly detection model that improves the original hypersphere learning through an orthogonal projection layer, which ensures that the training data distribution is consistent with the hypersphere hypothesis, thereby increasing the true positive rate and decreasing the false negative rate. Moreover, we propose a bi-hypersphere compression method to obtain a hyperspherical shell that yields a more compact decision region than a hyperball, which is demonstrated theoretically and numerically. The proposed methods are not confined to common datasets such as image and tabular data, but are…
Peer Reviews
Decision·ICLR 2024 spotlight
The paper stands out in the following aspects: 1. Originality: The paper presents a problem that may lead to suboptimal performance in the field of anomaly detection and provides a solution, offering a novel approach to enhance the performance of anomaly detection algorithms. 2. Quality: The research is of high quality, marked by a well-structured approach, rigorous validation, and superior performance compared to existing methods. The use of benchmark datasets adds to the credibility. 3. Clarit
1. This article mentions the concept of hyperspheres but doesn't provide a more rigorous theoretical explanation for why standard hyperspheres are superior to boundaries formed by ellipsoids. Adding mathematical proofs or a deeper theoretical foundation would strengthen the paper. 2. High-dimensional data and large datasets pose scalability challenges. The paper could address the scalability of the proposed methods and discuss their efficiency and computational requirements in dealing with big d
* The motivations and technique details of the proposed methods are clearly illustrated. The visualizations (e.g. Figures 1-4, 12 and 13) are very impressive. * The ideas especially the two concentric hyperspheres for anomaly detection are novel and interesting. They provide new insights into anomaly detection. * The theoretical analysis such as Propositions 1, 2, and 3 make the paper solid. * The experiments are extensive. There are image datasets (e.g. CIFAR10), tabular datasets, and six graph
A minor weakness is that some points haven’t been sufficiently explained. Please refer to my questions.
+ The paper's content is grounded in sound research, with a particularly innovative contribution in the form of the bi-hypersphere concept. + The research is substantiated by a comprehensive and diverse set of experiments, encompassing three distinct data types. The visualizations effectively convey the superiority of the proposed method. + The visualization results pertaining to anomaly detection are distinctive and intuitive, enhancing the paper's overall quality. The improvement over the pr
- Why can the orthogonal projection layer ensure a standard hypersphere? - The occurrence of the "soap bubble phenomenon" needs further clarification. Does it mean incompletely optimized? - We know that Deep SVDD compels normal data close to the center of the decision boundary, why do anomalies appear within this decision boundary? What are the main differences and similarities between normal data and these anomaly data? - Authors claimed that DO2HSC is to control training data to be more com
Code & Models
Videos
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Complex Network Analysis Techniques · Advanced Graph Neural Networks
