Multi-VAE: Learning Disentangled View-common and View-peculiar Visual   Representations for Multi-view Clustering

Jie Xu; Yazhou Ren; Huayi Tang; Xiaorong Pu; Xiaofeng Zhu; Ming Zeng,; Lifang He

arXiv:2106.11232·cs.CV·December 6, 2021

Multi-VAE: Learning Disentangled View-common and View-peculiar Visual Representations for Multi-view Clustering

Jie Xu, Yazhou Ren, Huayi Tang, Xiaorong Pu, Xiaofeng Zhu, Ming Zeng,, Lifang He

PDF

Open Access

TL;DR

This paper introduces Multi-VAE, a novel variational autoencoder framework that learns disentangled, explainable visual representations for multi-view clustering by separating common and view-specific features, leading to improved clustering accuracy.

Contribution

The paper proposes a new VAE-based model with a discrete view-common variable and continuous view-peculiar variables, enhancing multi-view clustering by disentangling shared and view-specific information.

Findings

01

Achieves superior clustering performance over state-of-the-art methods.

02

Produces disentangled and explainable visual representations.

03

Effectively separates common and view-specific features.

Abstract

Multi-view clustering, a long-standing and important research problem, focuses on mining complementary information from diverse views. However, existing works often fuse multiple views' representations or handle clustering in a common feature space, which may result in their entanglement especially for visual representations. To address this issue, we present a novel VAE-based multi-view clustering framework (Multi-VAE) by learning disentangled visual representations. Concretely, we define a view-common variable and multiple view-peculiar variables in the generative model. The prior of view-common variable obeys approximately discrete Gumbel Softmax distribution, which is introduced to extract the common cluster factor of multiple views. Meanwhile, the prior of view-peculiar variable follows continuous Gaussian distribution, which is used to represent each view's peculiar visual…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Surveillance and Tracking Methods · Image Retrieval and Classification Techniques · Advanced Image and Video Retrieval Techniques

MethodsGumbel Softmax · Softmax