Certainty of outlier and boundary points processing in data mining
Elyas Rashno, Sanaz Saki Norouzi, Behrouz Minaei-bidgoli, Yanhui, Guo

TL;DR
This paper introduces a neutrosophic set-based clustering method that effectively identifies boundary and outlier points by assigning certainty values, improving accuracy over existing techniques.
Contribution
A novel neutrosophic set-based cost function for clustering that accurately detects boundary and outlier points, outperforming current methods.
Findings
The proposed method accurately detects outliers and boundary points.
It outperforms existing clustering methods in accuracy.
Effective on diverse datasets including Scatter and UCI types.
Abstract
Data certainty is one of the issues in the real-world applications which is caused by unwanted noise in data. Recently, more attentions have been paid to overcome this problem. We proposed a new method based on neutrosophic set (NS) theory to detect boundary and outlier points as challenging points in clustering methods. Generally, firstly, a certainty value is assigned to data points based on the proposed definition in NS. Then, certainty set is presented for the proposed cost function in NS domain by considering a set of main clusters and noise cluster. After that, the proposed cost function is minimized by gradient descent method. Data points are clustered based on their membership degrees. Outlier points are assigned to noise cluster and boundary points are assigned to main clusters with almost same membership degrees. To show the effectiveness of the proposed method, two types of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
