ACTGNN: Assessment of Clustering Tendency with Synthetically-Trained Graph Neural Networks
Yiran Luo, Evangelos E. Papalexakis

TL;DR
ACTGNN introduces a graph neural network framework trained on synthetic data to reliably assess clustering tendency in complex, noisy, and high-dimensional datasets, outperforming traditional methods.
Contribution
The paper presents a novel GNN-based approach, ACTGNN, trained solely on synthetic datasets to improve clustering tendency assessment in challenging data conditions.
Findings
Outperforms baseline methods on synthetic datasets.
Effective in detecting faint clustering structures.
Robust in high-dimensional and noisy data environments.
Abstract
Determining clustering tendency in datasets is a fundamental but challenging task, especially in noisy or high-dimensional settings where traditional methods, such as the Hopkins Statistic and Visual Assessment of Tendency (VAT), often struggle to produce reliable results. In this paper, we propose ACTGNN, a graph-based framework designed to assess clustering tendency by leveraging graph representations of data. Node features are constructed using Locality-Sensitive Hashing (LSH), which captures local neighborhood information, while edge features incorporate multiple similarity metrics, such as the Radial Basis Function (RBF) kernel, to model pairwise relationships. A Graph Neural Network (GNN) is trained exclusively on synthetic datasets, enabling robust learning of clustering structures under controlled conditions. Extensive experiments demonstrate that ACTGNN significantly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Clustering Algorithms Research · Advanced Graph Neural Networks
MethodsGraph Neural Network
