# Convalescing Cluster Configuration Using a Superlative Framework

**Authors:** R. Sabitha, S. Karthik

PMC · DOI: 10.1155/2015/180749 · The Scientific World Journal · 2015-10-12

## TL;DR

This paper introduces a new data clustering algorithm that improves accuracy by combining discretization and binary search initialization.

## Contribution

A novel clustering algorithm that integrates data discretization and binary search initialization for better performance.

## Key findings

- The proposed algorithm outperforms simple K-means and Binary Search method in accuracy and validity.
- Discretization significantly enhances the efficacy of descriptive data mining tasks.
- Experiments on UCI datasets confirm the superiority of the new approach.

## Abstract

Competent data mining methods are vital to discover knowledge from databases which are built as a result of enormous growth of data. Various techniques of data mining are applied to obtain knowledge from these databases. Data clustering is one such descriptive data mining technique which guides in partitioning data objects into disjoint segments. K-means algorithm is a versatile algorithm among the various approaches used in data clustering. The algorithm and its diverse adaptation methods suffer certain problems in their performance. To overcome these issues a superlative algorithm has been proposed in this paper to perform data clustering. The specific feature of the proposed algorithm is discretizing the dataset, thereby improving the accuracy of clustering, and also adopting the binary search initialization method to generate cluster centroids. The generated centroids are fed as input to K-means approach which iteratively segments the data objects into respective clusters. The clustered results are measured for accuracy and validity. Experiments conducted by testing the approach on datasets from the UC Irvine Machine Learning Repository evidently show that the accuracy and validity measure is higher than the other two approaches, namely, simple K-means and Binary Search method. Thus, the proposed approach proves that discretization process will improve the efficacy of descriptive data mining tasks.

## Full-text entities

- **Diseases:** Cancer (MESH:D009369)
- **Species:** Apis mellifera (bee, species) [taxon 7460]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC4620246/full.md

## Figures

9 figures with captions in the complete paper: https://tomesphere.com/paper/PMC4620246/full.md

## References

36 references — full list in the complete paper: https://tomesphere.com/paper/PMC4620246/full.md

---
Source: https://tomesphere.com/paper/PMC4620246