Bridging Distribution Learning and Image Clustering in High-dimensional Space
Guanfang Dong, Chenqiu Zhao, Anup Basu

TL;DR
This paper explores how distribution learning via Gaussian Mixture Models can be integrated with image clustering in high-dimensional spaces, using autoencoders and novel loss functions to improve clustering performance.
Contribution
It introduces a method combining autoencoders, Monte-Carlo Marginalization, and KL divergence to enhance GMM-based clustering in high-dimensional image data.
Findings
MCMarg and KL divergence improve clustering accuracy.
The approach mitigates the curse of dimensionality.
Distribution learning enhances GMM effectiveness in high-dimensional spaces.
Abstract
Distribution learning focuses on learning the probability density function from a set of data samples. In contrast, clustering aims to group similar objects together in an unsupervised manner. Usually, these two tasks are considered unrelated. However, the relationship between the two may be indirectly correlated, with Gaussian Mixture Models (GMM) acting as a bridge. In this paper, we focus on exploring the correlation between distribution learning and clustering, with the motivation to fill the gap between these two fields, utilizing an autoencoder (AE) to encode images into a high-dimensional latent space. Then, Monte-Carlo Marginalization (MCMarg) and Kullback-Leibler (KL) divergence loss are used to fit the Gaussian components of the GMM and learn the data distribution. Finally, image clustering is achieved through each Gaussian component of GMM. Yet, the "curse of dimensionality"…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Advanced Clustering Algorithms Research · Image Retrieval and Classification Techniques
MethodsFocus
