Out-Of-Bag Anomaly Detection
Egor Klevak, Sangdi Lin, Andy Martin, Ondrej Linda, Eric, Ringger

TL;DR
This paper introduces Out-of-Bag anomaly detection, a novel ensemble-based method for unsupervised detection of anomalies in multi-dimensional datasets, improving ML system reliability.
Contribution
The paper presents a new ensemble model approach that uses Out-of-Bag estimates for effective anomaly detection in complex datasets.
Findings
Achieves state-of-the-art performance on benchmark datasets.
Enhances ML system accuracy and reliability in a case study.
Handles both numerical and categorical features effectively.
Abstract
Data anomalies are ubiquitous in real world datasets, and can have an adverse impact on machine learning (ML) systems, such as automated home valuation. Detecting anomalies could make ML applications more responsible and trustworthy. However, the lack of labels for anomalies and the complex nature of real-world datasets make anomaly detection a challenging unsupervised learning problem. In this paper, we propose a novel model-based anomaly detection method, that we call Out-of- Bag anomaly detection, which handles multi-dimensional datasets consisting of numerical and categorical features. The proposed method decomposes the unsupervised problem into the training of a set of ensemble models. Out-of-Bag estimates are leveraged to derive an effective measure for anomaly detection. We not only demonstrate the state-of-the-art performance of our method through comprehensive experiments on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Data Stream Mining Techniques · Data-Driven Disease Surveillance
