ODIM: Outlier Detection via Likelihood of Under-Fitted Generative Models
Dongha Kim, Jaesung Hwang, Jongjin Lee, Kunwoong Kim, Yongdai Kim

TL;DR
This paper introduces ODIM, a novel outlier detection method leveraging the likelihood of under-fitted deep generative models, which efficiently identifies inliers across various data types by exploiting the inlier-memorization effect.
Contribution
The paper proposes ODIM, a new outlier detection approach that uses the likelihood of under-fitted DGMs and the inlier-memorization effect, offering significant computational efficiency and broad data applicability.
Findings
ODIM is at least tens of times faster than existing deep-learning-based methods.
It effectively filters out outliers across tabular, image, and text data.
Extensive experiments on nearly 60 datasets validate its superiority.
Abstract
The unsupervised outlier detection (UOD) problem refers to a task to identify inliers given training data which contain outliers as well as inliers, without any labeled information about inliers and outliers. It has been widely recognized that using fully-trained likelihood-based deep generative models (DGMs) often results in poor performance in distinguishing inliers from outliers. In this study, we claim that the likelihood itself could serve as powerful evidence for identifying inliers in UOD tasks, provided that DGMs are carefully under-fitted. Our approach begins with a novel observation called the inlier-memorization (IM) effect-when training a deep generative model with data including outliers, the model initially memorizes inliers before outliers. Based on this finding, we develop a new method called the outlier detection via the IM effect (ODIM). Remarkably, the ODIM requires…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Generative Adversarial Networks and Image Synthesis · Digital Media Forensic Detection
