Clustering Uncertain Data via Representative Possible Worlds with Consistency Learning
Han Liu, Xianchao Zhang, Xiaotong Zhang, Qimai Li, Xiao-Ming Wu

TL;DR
This paper introduces a novel clustering algorithm for uncertain data that selects representative possible worlds and leverages consistency learning to improve clustering accuracy, outperforming existing methods.
Contribution
The paper proposes a new RPC algorithm that selects representative possible worlds and integrates consistency learning into spectral clustering for uncertain data.
Findings
Outperforms state-of-the-art algorithms in experiments
Effectively selects representative possible worlds to reduce negative effects
Utilizes consistency among possible worlds for improved clustering
Abstract
Clustering uncertain data is an essential task in data mining for the internet of things. Possible world based algorithms seem promising for clustering uncertain data. However, there are two issues in existing possible world based algorithms: (1) They rely on all the possible worlds and treat them equally, but some marginal possible worlds may cause negative effects. (2) They do not well utilize the consistency among possible worlds, since they conduct clustering or construct the affinity matrix on each possible world independently. In this paper, we propose a representative possible world based consistent clustering (RPC) algorithm for uncertain data. First, by introducing representative loss and using Jensen-Shannon divergence as the distribution measure, we design a heuristic strategy for the selection of representative possible worlds, thus avoiding the negative effects caused by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Management and Algorithms · Advanced Clustering Algorithms Research · Data Mining Algorithms and Applications
MethodsSpectral Clustering
