Outlier detection in mixed-attribute data: a semi-supervised approach with fuzzy approximations and relative entropy
Baiyang Chen, Zhong Yuan, Zheng Liu, Dezhong Peng, Yongxiang Li, Chang Liu, Guiduo Duan

TL;DR
This paper presents FROD, a semi-supervised outlier detection method for mixed-attribute data that uses fuzzy rough sets, attribute classification accuracy, and fuzzy relative entropy to improve detection performance.
Contribution
It introduces a novel semi-supervised outlier detection approach combining fuzzy approximations and relative entropy, effectively handling uncertainty and heterogeneity in mixed-attribute data.
Findings
FROD performs comparably or better than existing algorithms on 16 datasets.
The method effectively leverages limited labeled data for improved detection.
Experimental results validate the robustness of the proposed approach.
Abstract
Outlier detection is a critical task in data mining, aimed at identifying objects that significantly deviate from the norm. Semi-supervised methods improve detection performance by leveraging partially labeled data but typically overlook the uncertainty and heterogeneity of real-world mixed-attribute data. This paper introduces a semi-supervised outlier detection method, namely fuzzy rough sets-based outlier detection (FROD), to effectively handle these challenges. Specifically, we first utilize a small subset of labeled data to construct fuzzy decision systems, through which we introduce the attribute classification accuracy based on fuzzy approximations to evaluate the contribution of attribute sets in outlier detection. Unlabeled data is then used to compute fuzzy relative entropy, which provides a characterization of outliers from the perspective of uncertainty. Finally, we develop…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Rough Sets and Fuzzy Logic · Artificial Immune Systems Applications
