Combining Multiple Clusterings via Crowd Agreement Estimation and Multi-Granularity Link Analysis
Dong Huang, Jian-Huang Lai, Chang-Dong Wang

TL;DR
This paper introduces a novel clustering ensemble method that evaluates base clustering quality using crowd agreement, incorporates multi-granularity information, and demonstrates improved robustness and effectiveness on real datasets.
Contribution
It proposes unsupervised quality evaluation of base clusterings and integrates multi-granularity cues into ensemble methods for improved clustering performance.
Findings
Effective weighting of base clusterings improves ensemble robustness.
Multi-granularity link analysis enhances clustering consensus.
Proposed methods outperform existing ensemble approaches on real datasets.
Abstract
The clustering ensemble technique aims to combine multiple clusterings into a probably better and more robust clustering and has been receiving an increasing attention in recent years. There are mainly two aspects of limitations in the existing clustering ensemble approaches. Firstly, many approaches lack the ability to weight the base clusterings without access to the original data and can be affected significantly by the low-quality, or even ill clusterings. Secondly, they generally focus on the instance level or cluster level in the ensemble system and fail to integrate multi-granularity cues into a unified model. To address these two limitations, this paper proposes to solve the clustering ensemble problem via crowd agreement estimation and multi-granularity link analysis. We present the normalized crowd agreement index (NCAI) to evaluate the quality of base clusterings in an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
