Variational Autoencoder for Anomaly Detection: A Comparative Study

Huy Hoang Nguyen; Cuong Nhat Nguyen; Xuan Tung Dao; Quoc Trung Duong,; Dzung Pham Thi Kim; Minh-Tan Pham

arXiv:2408.13561·cs.CV·August 27, 2024·2 cites

Variational Autoencoder for Anomaly Detection: A Comparative Study

Huy Hoang Nguyen, Cuong Nhat Nguyen, Xuan Tung Dao, Quoc Trung Duong,, Dzung Pham Thi Kim, Minh-Tan Pham

PDF

Open Access 1 Repo

TL;DR

This study compares different Variational Autoencoder architectures for anomaly detection, highlighting ViT-VAE's superior performance and emphasizing the importance of diverse datasets for robust benchmarking.

Contribution

It provides a comprehensive comparison of VAE variants in anomaly detection and introduces the use of the MiAD dataset for more robust evaluation.

Findings

01

ViT-VAE outperforms other architectures in various scenarios.

02

VAE-GRF requires careful hyperparameter tuning.

03

Using MiAD dataset improves benchmarking robustness.

Abstract

This paper aims to conduct a comparative analysis of contemporary Variational Autoencoder (VAE) architectures employed in anomaly detection, elucidating their performance and behavioral characteristics within this specific task. The architectural configurations under consideration encompass the original VAE baseline, the VAE with a Gaussian Random Field prior (VAE-GRF), and the VAE incorporating a vision transformer (ViT-VAE). The findings reveal that ViT-VAE exhibits exemplary performance across various scenarios, whereas VAE-GRF may necessitate more intricate hyperparameter tuning to attain its optimal performance state. Additionally, to mitigate the propensity for over-reliance on results derived from the widely used MVTec dataset, this paper leverages the recently-public MiAD dataset for benchmarking. This deliberate inclusion seeks to enhance result competitiveness by alleviating…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

endtheme123/vae-compare
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications

MethodsAttention Is All You Need · Softmax · Linear Layer · Layer Normalization · Dense Connections · Residual Connection · Multi-Head Attention · Vision Transformer