TL;DR
This paper introduces a multidiversified ensemble clustering method for high-dimensional data, leveraging diverse metrics and subspaces to improve clustering robustness and accuracy, validated through extensive experiments.
Contribution
It proposes a novel approach that creates diversified metrics and subspaces, integrating multi-level diversity in ensemble clustering within a unified framework.
Findings
Outperforms state-of-the-art methods on 30 high-dimensional datasets.
Demonstrates the effectiveness of diversified metrics and subspaces.
Provides publicly available source code for reproducibility.
Abstract
The rapid emergence of high-dimensional data in various areas has brought new challenges to current ensemble clustering research. To deal with the curse of dimensionality, recently considerable efforts in ensemble clustering have been made by means of different subspace-based techniques. However, besides the emphasis on subspaces, rather limited attention has been paid to the potential diversity in similarity/dissimilarity metrics. It remains a surprisingly open problem in ensemble clustering how to create and aggregate a large population of diversified metrics, and furthermore, how to jointly investigate the multi-level diversity in the large populations of metrics, subspaces, and clusters in a unified framework. To tackle this problem, this paper proposes a novel multidiversified ensemble clustering approach. In particular, we create a large number of diversified metrics by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
