VillageNet: Graph-based, Easily-interpretable, Unsupervised Clustering for Broad Biomedical Applications
Aditya Ballal, Gregory A. DePaul, Esha Datta, Asuka Hatano, Erik Carlsson, Ye Chen-Izu, Javier E. L\'opez, Leighton T. Izu

TL;DR
VillageNet is an unsupervised, graph-based clustering algorithm that effectively handles high-dimensional biomedical data, autonomously determines the optimal number of clusters, and demonstrates competitive performance on real-world datasets.
Contribution
We introduce VillageNet, a novel unsupervised clustering method combining K-Means and community detection, capable of automatically identifying the number of clusters in high-dimensional data.
Findings
Achieves high normalized mutual information scores on benchmark datasets.
Computationally efficient with linear time complexity in dataset size.
Effectively handles large-scale, high-dimensional biomedical data.
Abstract
Clustering large high-dimensional datasets with diverse variable is essential for extracting high-level latent information from these datasets. Here, we developed an unsupervised clustering algorithm, we call "Village-Net". Village-Net is specifically designed to effectively cluster high-dimension data without priori knowledge on the number of existing clusters. The algorithm operates in two phases: first, utilizing K-Means clustering, it divides the dataset into distinct subsets we refer to as "villages". Next, a weighted network is created, with each node representing a village, capturing their proximity relationships. To achieve optimal clustering, we process this network using a community detection algorithm called Walk-likelihood Community Finder (WLCF), a community detection algorithm developed by one of our team members. A salient feature of Village-Net Clustering is its ability…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Clustering Algorithms Research
