Task Agnostic and Post-hoc Unseen Distribution Detection
Radhika Dua, Seongjun Yang, Yixuan Li, Edward Choi

TL;DR
This paper introduces TAPUDD, a task-agnostic, post-hoc clustering-based method for detecting unseen data distributions that leverages feature clustering and ensembling to improve detection across diverse tasks.
Contribution
The paper proposes TAPUDD, a novel clustering and ensembling approach that detects unseen distributions without task-specific tuning, applicable post-hoc to trained models.
Findings
Effective detection of unseen samples across multiple datasets
Outperforms or matches existing baselines in various tasks
Eliminates need for optimal cluster number tuning
Abstract
Despite the recent advances in out-of-distribution(OOD) detection, anomaly detection, and uncertainty estimation tasks, there do not exist a task-agnostic and post-hoc approach. To address this limitation, we design a novel clustering-based ensembling method, called Task Agnostic and Post-hoc Unseen Distribution Detection (TAPUDD) that utilizes the features extracted from the model trained on a specific task. Explicitly, it comprises of TAP-Mahalanobis, which clusters the training datasets' features and determines the minimum Mahalanobis distance of the test sample from all clusters. Further, we propose the Ensembling module that aggregates the computation of iterative TAP-Mahalanobis for a different number of clusters to provide reliable and efficient cluster computation. Through extensive experiments on synthetic and real-world datasets, we observe that our approach can detect unseen…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Task Agnostic and Post-hoc Unseen Distribution Detection· youtube
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Air Quality Monitoring and Forecasting · Data Stream Mining Techniques
MethodsTest
