Shared Generative Latent Representation Learning for Multi-view Clustering
Ming Yin, Weitian Huang, Junbin Gao

TL;DR
This paper introduces a novel multi-view clustering approach that learns a shared generative latent space, effectively capturing correlations among views and improving clustering accuracy on large-scale datasets.
Contribution
It proposes a shared generative latent representation model based on Gaussian mixtures, enhancing multi-view clustering with deep generative learning techniques.
Findings
Outperforms state-of-the-art methods on multiple datasets
Effective in capturing nonlinear features and view correlations
Handles large-scale datasets efficiently
Abstract
Clustering multi-view data has been a fundamental research topic in the computer vision community. It has been shown that a better accuracy can be achieved by integrating information of all the views than just using one view individually. However, the existing methods often struggle with the issues of dealing with the large-scale datasets and the poor performance in reconstructing samples. This paper proposes a novel multi-view clustering method by learning a shared generative latent representation that obeys a mixture of Gaussian distributions. The motivation is based on the fact that the multi-view data share a common latent embedding despite the diversity among the views. Specifically, benefited from the success of the deep generative learning, the proposed model not only can extract the nonlinear features from the views, but render a powerful ability in capturing the correlations…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Face and Expression Recognition · Advanced Image and Video Retrieval Techniques
