SSDBCODI: Semi-Supervised Density-Based Clustering with Outliers Detection Integrated
Jiahao Deng, Eli T. Brown

TL;DR
SSDBCODI is a semi-supervised density-based clustering algorithm that effectively handles complex-shaped clusters and outliers by integrating user labels and multiple outlier detection scores.
Contribution
The paper introduces SSDBCODI, a novel semi-supervised density-based clustering method that incorporates outlier detection using multiple scoring metrics.
Findings
Achieves superior clustering accuracy with minimal labeled data.
Effectively detects outliers based on combined scoring metrics.
Performs well on diverse datasets compared to state-of-the-art methods.
Abstract
Clustering analysis is one of the critical tasks in machine learning. Traditionally, clustering has been an independent task, separate from outlier detection. Due to the fact that the performance of clustering can be significantly eroded by outliers, a small number of algorithms try to incorporate outlier detection in the process of clustering. However, most of those algorithms are based on unsupervised partition-based algorithms such as k-means. Given the nature of those algorithms, they often fail to deal with clusters of complex, non-convex shapes. To tackle this challenge, we have proposed SSDBCODI, a semi-supervised density-based algorithm. SSDBCODI combines the advantage of density-based algorithms, which are capable of dealing with clusters of complex shapes, with the semi-supervised element, which offers flexibility to adjust the clustering results based on a few user labels. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Data-Driven Disease Surveillance · Artificial Immune Systems Applications
