Calibrating Tabular Anomaly Detection via Optimal Transport
Hangting Ye, He Zhao, Wei Fan, Xiaozhuang Song, Dandan Guo, Yi Chang, Hongyuan Zha

TL;DR
This paper introduces CTAD, a post-processing calibration framework using Optimal Transport to improve the performance of any tabular anomaly detection method across diverse datasets.
Contribution
It proposes a model-agnostic calibration technique that leverages OT distance to distinguish anomalies from normal data, enhancing existing TAD detectors without extra tuning.
Findings
CTAD consistently improves detection performance across 34 datasets.
It enhances state-of-the-art deep learning TAD methods.
The method is robust to hyperparameter variations.
Abstract
Tabular anomaly detection (TAD) remains challenging due to the heterogeneity of tabular data: features lack natural relationships, vary widely in distribution and scale, and exhibit diverse types. Consequently, each TAD method makes implicit assumptions about anomaly patterns that work well on some datasets but fail on others, and no method consistently outperforms across diverse scenarios. We present CTAD (Calibrating Tabular Anomaly Detection), a model-agnostic post-processing framework that enhances any existing TAD detector through sample-specific calibration. Our approach characterizes normal data via two complementary distributions, i.e., an empirical distribution from random sampling and a structural distribution from K-means centroids, and measures how adding a test sample disrupts their compatibility using Optimal Transport (OT) distance. Normal samples maintain low disruption…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
