A new approach for evaluating internal cluster validation indices

Zolt\'an Botta-Duk\'at

arXiv:2308.03894·cs.LG·August 9, 2023·2 cites

A new approach for evaluating internal cluster validation indices

Zolt\'an Botta-Duk\'at

PDF

Open Access

TL;DR

This paper reviews existing internal cluster validation indices and proposes a new evaluation approach to better assess clustering quality without external information.

Contribution

It introduces a novel method for evaluating internal validation indices, addressing limitations of previous approaches.

Findings

01

Analysis of existing evaluation methods

02

Introduction of a new evaluation approach

03

Improved assessment of clustering quality

Abstract

A vast number of different methods are available for unsupervised classification. Since no algorithm and parameter setting performs best in all types of data, there is a need for cluster validation to select the actually best-performing algorithm. Several indices were proposed for this purpose without using any additional (external) information. These internal validation indices can be evaluated by applying them to classifications of datasets with a known cluster structure. Evaluation approaches differ in how they use the information on the ground-truth classification. This paper reviews these approaches, considering their advantages and disadvantages, and then suggests a new approach.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Clustering Algorithms Research