A general framework for the IT-based clustering methods
Teng Qiu, Yongjie Li

TL;DR
This paper introduces a versatile framework for constructing in-tree (IT) graphs from initial neighborhood graphs, enhancing the ability to identify complex cluster structures, especially in manifold and sparse datasets.
Contribution
It generalizes previous IT graph construction methods, enabling better capture of diverse cluster structures and improving clustering effectiveness on complex datasets.
Findings
Framework effectively captures manifold structures.
Improves clustering of sparse and graph-based datasets.
Generalizes previous IT graph construction methods.
Abstract
Previously, we proposed a physically inspired rule to organize the data points in a sparse yet effective structure, called the in-tree (IT) graph, which is able to capture a wide class of underlying cluster structures in the datasets, especially for the density-based datasets. Although there are some redundant edges or lines between clusters requiring to be removed by computer, this IT graph has a big advantage compared with the k-nearest-neighborhood (k-NN) or the minimal spanning tree (MST) graph, in that the redundant edges in the IT graph are much more distinguishable and thus can be easily determined by several methods previously proposed by us. In this paper, we propose a general framework to re-construct the IT graph, based on an initial neighborhood graph, such as the k-NN or MST, etc, and the corresponding graph distances. For this general framework, our previous way of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Clustering Algorithms Research · Data Mining Algorithms and Applications · Complex Network Analysis Techniques
Methodsk-Nearest Neighbors
