Multiple-Input Variational Auto-Encoder for Anomaly Detection in   Heterogeneous Data

Phai Vu Dinh; Diep N. Nguyen; Dinh Thai Hoang; Quang Uy Nguyen; and; Eryk Dutkiewicz

arXiv:2501.08149·cs.AI·January 15, 2025

Multiple-Input Variational Auto-Encoder for Anomaly Detection in Heterogeneous Data

Phai Vu Dinh, Diep N. Nguyen, Dinh Thai Hoang, Quang Uy Nguyen, and, Eryk Dutkiewicz

PDF

Open Access

TL;DR

This paper introduces a novel neural network architecture, MIVAE, and a multiple-input auto-encoder, MIAEAD, designed to improve anomaly detection in heterogeneous, non-IID data by leveraging feature subset reconstruction errors and distribution modeling.

Contribution

The paper proposes MIVAE and MIAEAD, innovative models that enhance anomaly detection in heterogeneous data by combining sub-encoder reconstruction errors with distribution learning, outperforming existing methods.

Findings

01

MIVAE achieves higher AUC than traditional VAEs.

02

MIAEAD performs better on datasets with low feature heterogeneity.

03

Both models outperform state-of-the-art unsupervised anomaly detection methods.

Abstract

Anomaly detection (AD) plays a pivotal role in AI applications, e.g., in classification, and intrusion/threat detection in cybersecurity. However, most existing methods face challenges of heterogeneity amongst feature subsets posed by non-independent and identically distributed (non-IID) data. We propose a novel neural network model called Multiple-Input Auto-Encoder for AD (MIAEAD) to address this. MIAEAD assigns an anomaly score to each feature subset of a data sample to indicate its likelihood of being an anomaly. This is done by using the reconstruction error of its sub-encoder as the anomaly score. All sub-encoders are then simultaneously trained using unsupervised learning to determine the anomaly scores of feature subsets. The final AUC of MIAEAD is calculated for each sub-dataset, and the maximum AUC obtained among the sub-datasets is selected. To leverage the modelling of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications