Self-supervised Multi-view Clustering in Computer Vision: A Survey
Jiatai Wang, Zhiwei Xu, Xuewen Yang, Hailong Li, Bo Li, Xuying Meng

TL;DR
This survey reviews the recent progress of self-supervised multi-view clustering in computer vision, highlighting its advantages, methodologies, datasets, and open challenges in the evolving field.
Contribution
It provides a comprehensive analysis and classification of current self-supervised MVC methods, datasets, and challenges, filling a gap in existing literature.
Findings
Self-supervised MVC leverages proxy tasks for representation learning.
The survey categorizes methods and discusses their applications.
Open problems for future research are identified.
Abstract
Multi-view clustering (MVC) has had significant implications in cross-modal representation learning and data-driven decision-making in recent years. It accomplishes this by leveraging the consistency and complementary information among multiple views to cluster samples into distinct groups. However, as contrastive learning continues to evolve within the field of computer vision, self-supervised learning has also made substantial research progress and is progressively becoming dominant in MVC methods. It guides the clustering process by designing proxy tasks to mine the representation of image and video data itself as supervisory information. Despite the rapid development of self-supervised MVC, there has yet to be a comprehensive survey to analyze and summarize the current state of research progress. Therefore, this paper explores the reasons and advantages of the emergence of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Face and Expression Recognition · Image Retrieval and Classification Techniques
MethodsContrastive Learning
