Tree Index: A New Cluster Evaluation Technique
A. H. Beg, Md Zahidul Islam, Vladimir Estivill-Castro

TL;DR
Tree Index is a novel cluster evaluation method that uses decision trees to assess the structural quality of clustering, effectively distinguishing reasonable clusters from non-sensible ones based on tree complexity and purity.
Contribution
Introduces Tree Index, a new structural cluster evaluation technique that leverages decision trees to assess cluster quality without relying on traditional quantitative indexes.
Findings
Tree Index effectively discriminates between sensible and non-sensible clusters.
It outperforms existing indexes in identifying meaningful clustering solutions.
Graphical visualizations confirm the method's effectiveness.
Abstract
We introduce a cluster evaluation technique called Tree Index. Our Tree Index algorithm aims at describing the structural information of the clustering rather than the quantitative format of cluster-quality indexes (where the representation power of clustering is some cumulative error similar to vector quantization). Our Tree Index is finding margins amongst clusters for easy learning without the complications of Minimum Description Length. Our Tree Index produces a decision tree from the clustered data set, using the cluster identifiers as labels. It combines the entropy of each leaf with their depth. Intuitively, a shorter tree with pure leaves generalizes the data well (the clusters are easy to learn because they are well separated). So, the labels are meaningful clusters. If the clustering algorithm does not separate well, trees learned from their results will be large and too…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Clustering Algorithms Research · Complex Network Analysis Techniques · Data Visualization and Analytics
