Ambiguity-Driven Fuzzy C-Means Clustering: How to Detect Uncertain   Clustered Records

Meysam Ghaffari; Nasser Ghadiri

arXiv:1409.2821·cs.AI·May 31, 2017

Ambiguity-Driven Fuzzy C-Means Clustering: How to Detect Uncertain Clustered Records

Meysam Ghaffari, Nasser Ghadiri

PDF

TL;DR

This paper introduces a certainty factor into Fuzzy C-Means clustering to identify ambiguous records, reducing false detections and improving accuracy in critical decision-making applications.

Contribution

It proposes a novel method to detect uncertain records in FCM by incorporating a certainty factor, enhancing accuracy without sacrificing performance.

Findings

01

Significant decrease in error rate across multiple datasets

02

Improved sensitivity of the clustering algorithm

03

Maintains low processing time for most records

Abstract

As a well-known clustering algorithm, Fuzzy C-Means (FCM) allows each input sample to belong to more than one cluster, providing more flexibility than non-fuzzy clustering methods. However, the accuracy of FCM is subject to false detections caused by noisy records, weak feature selection and low certainty of the algorithm in some cases. The false detections are very important in some decision-making application domains like network security and medical diagnosis, where weak decisions based on such false detections may lead to catastrophic outcomes. They are mainly emerged from making decisions about a subset of records that do not provide enough evidence to make a good decision. In this paper, we propose a method for detecting such ambiguous records in FCM by introducing a certainty factor to decrease invalid detections. This approach enables us to send the detected ambiguous records to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.