Detecting Unusual Input-Output Associations in Multivariate Conditional Data
Charmgil Hong, Milos Hauskrecht

TL;DR
This paper introduces a novel framework for detecting multivariate conditional outliers by modeling input-output associations with a probabilistic approach, effectively identifying unusual responses in context-dependent data.
Contribution
It proposes a decomposable probabilistic model with weighted components to detect conditional outliers, addressing limitations of existing unconditional outlier detection methods.
Findings
Successfully identifies multivariate conditional outliers across various domains.
Demonstrates the effectiveness of global and local component weighting schemes.
Outperforms traditional methods in detecting context-dependent anomalies.
Abstract
Despite tremendous progress in outlier detection research in recent years, the majority of existing methods are designed only to detect unconditional outliers that correspond to unusual data patterns expressed in the joint space of all data attributes. Such methods are not applicable when we seek to detect conditional outliers that reflect unusual responses associated with a given context or condition. This work focuses on multivariate conditional outlier detection, a special type of the conditional outlier detection problem, where data instances consist of multi-dimensional input (context) and output (responses) pairs. We present a novel outlier detection framework that identifies abnormal input-output associations in data with the help of a decomposable conditional probabilistic model that is learned from all data instances. Since components of this model can vary in their quality, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Advanced Statistical Methods and Models · Imbalanced Data Classification Techniques
