Multiple-Input Variational Auto-Encoder for Anomaly Detection in Heterogeneous Data
Phai Vu Dinh, Diep N. Nguyen, Dinh Thai Hoang, Quang Uy Nguyen, and, Eryk Dutkiewicz

TL;DR
This paper introduces a novel neural network architecture, MIVAE, and a multiple-input auto-encoder, MIAEAD, designed to improve anomaly detection in heterogeneous, non-IID data by leveraging feature subset reconstruction errors and distribution modeling.
Contribution
The paper proposes MIVAE and MIAEAD, innovative models that enhance anomaly detection in heterogeneous data by combining sub-encoder reconstruction errors with distribution learning, outperforming existing methods.
Findings
MIVAE achieves higher AUC than traditional VAEs.
MIAEAD performs better on datasets with low feature heterogeneity.
Both models outperform state-of-the-art unsupervised anomaly detection methods.
Abstract
Anomaly detection (AD) plays a pivotal role in AI applications, e.g., in classification, and intrusion/threat detection in cybersecurity. However, most existing methods face challenges of heterogeneity amongst feature subsets posed by non-independent and identically distributed (non-IID) data. We propose a novel neural network model called Multiple-Input Auto-Encoder for AD (MIAEAD) to address this. MIAEAD assigns an anomaly score to each feature subset of a data sample to indicate its likelihood of being an anomaly. This is done by using the reconstruction error of its sub-encoder as the anomaly score. All sub-encoders are then simultaneously trained using unsupervised learning to determine the anomaly scores of feature subsets. The final AUC of MIAEAD is calculated for each sub-dataset, and the maximum AUC obtained among the sub-datasets is selected. To leverage the modelling of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications
