SiamMM: A Mixture Model Perspective on Deep Unsupervised Learning

Xiaodong Wang; Jing Huang; Kevin J Liang

arXiv:2511.05462·cs.LG·November 10, 2025

Open Access

TL;DR

SiamMM applies a mixture model framework to clustering-based unsupervised learning, achieving state-of-the-art results on self-supervised learning benchmarks and surfacing potentially mislabeled examples.

Contribution

This work connects clustering-based unsupervised learning methods to classical mixture models from statistics, uses that connection to improve the clustering methods, and introduces the SiamMM model for deep unsupervised learning.

Findings

01. SiamMM achieves state-of-the-art performance across various self-supervised learning benchmarks.

02. The learned clusters closely resemble the unseen ground truth labels.

03. Inspecting the learned clusters uncovers potential instances of mislabeling in the datasets.

Abstract

Recent studies have demonstrated the effectiveness of clustering-based approaches for self-supervised and unsupervised learning. However, the application of clustering is often heuristic, and the optimal methodology remains unclear. In this work, we establish connections between these unsupervised clustering methods and classical mixture models from statistics. Through this framework, we demonstrate significant enhancements to these clustering methods, leading to the development of a novel model named SiamMM. Our method attains state-of-the-art performance across various self-supervised learning benchmarks. Inspection of the learned clusters reveals a strong resemblance to unseen ground truth labels, uncovering potential instances of mislabeling.
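
The abstract does not spell out how clustering relates to mixture models, so the following is only a textbook illustration of that kind of connection, not SiamMM's actual procedure. It assumes an equal-weight spherical Gaussian mixture, where the posterior responsibilities are a softmax over negative squared distances and k-means-style hard assignment appears as the low-temperature limit. The function names and the `temperature` parameter are hypothetical, chosen for this sketch.

```python
import numpy as np


def squared_distances(x, centers):
    # Squared Euclidean distance from every point to every center: shape (n, k).
    return ((x[:, None, :] - centers[None, :, :]) ** 2).sum(axis=-1)


def soft_assignments(x, centers, temperature):
    """E-step of an equal-weight spherical Gaussian mixture (assumed model).

    `temperature` stands in for 2 * sigma^2 and is an illustrative parameter.
    """
    logits = -squared_distances(x, centers) / temperature
    logits -= logits.max(axis=1, keepdims=True)  # stabilize the softmax
    resp = np.exp(logits)
    return resp / resp.sum(axis=1, keepdims=True)


def hard_assignments(x, centers):
    # k-means assignment step: each point goes to its nearest center.
    return squared_distances(x, centers).argmin(axis=1)


def update_centers(x, resp):
    # M-step: each center becomes the responsibility-weighted mean of the data.
    return (resp.T @ x) / resp.sum(axis=0)[:, None]


# Synthetic data: two well-separated 2-D blobs (made up for this example).
rng = np.random.default_rng(0)
x = np.concatenate([
    rng.normal(-3.0, 0.5, size=(50, 2)),
    rng.normal(3.0, 0.5, size=(50, 2)),
])

centers = np.array([[-1.0, 0.0], [1.0, 0.0]])
for _ in range(10):
    resp = soft_assignments(x, centers, temperature=1.0)
    centers = update_centers(x, resp)

# As the temperature shrinks, the soft assignments become one-hot and
# coincide with the k-means hard assignments.
cold_resp = soft_assignments(x, centers, temperature=1e-3)
labels = hard_assignments(x, centers)
```

Under these assumptions, `cold_resp.argmax(axis=1)` matches `labels`, which is the sense in which a hard clustering step can be read as a limiting case of mixture-model inference.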

Taxonomy

Topics: Domain Adaptation and Few-Shot Learning · Bayesian Methods and Mixture Models · Face and Expression Recognition