VSCOUT: A Hybrid Variational Autoencoder Approach to Outlier Detection in High-Dimensional Retrospective Monitoring
Waldyn G. Martinez

TL;DR
VSCOUT is a novel distribution-free framework combining variational autoencoders, ensemble filtering, and changepoint detection to improve outlier detection in high-dimensional, contaminated data for industrial process monitoring.
Contribution
It introduces a two-stage, distribution-free approach using ARD-VAE and ensemble filtering to enhance baseline estimation and outlier detection in high-dimensional retrospective data.
Findings
Outperforms classical SPC and modern machine learning methods in sensitivity and false alarm control.
Effectively handles complex contamination patterns and high-dimensional data.
Demonstrates scalability and robustness across benchmark datasets.
Abstract
Modern industrial and service processes generate high-dimensional, non-Gaussian, and contamination-prone data that challenge the foundational assumptions of classical Statistical Process Control (SPC). Heavy tails, multimodality, nonlinear dependencies, and sparse special-cause observations can distort baseline estimation, mask true anomalies, and prevent reliable identification of an in-control (IC) reference set. To address these challenges, we introduce VSCOUT, a distribution-free framework designed specifically for retrospective (Phase I) monitoring in high-dimensional settings. VSCOUT combines an Automatic Relevance Determination Variational Autoencoder (ARD-VAE) architecture with ensemble-based latent outlier filtering and changepoint detection. The ARD prior isolates the most informative latent dimensions, while the ensemble and changepoint filters identify pointwise and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Fault Detection and Control Systems · Data Stream Mining Techniques
