Estimation of Non-Normalized Mixture Models and Clustering Using Deep Representation
Takeru Matsuda, Aapo Hyvarinen

TL;DR
This paper introduces a novel method for estimating finite mixtures of non-normalized models using extended noise contrastive estimation, enabling effective clustering with deep neural representations, demonstrated on image data.
Contribution
It extends noise contrastive estimation to handle mixtures of non-normalized models and integrates deep representations for clustering unlabeled data.
Findings
Effective estimation of mixture models with intractable normalization constants.
Successful application to image clustering with promising results.
Provides a probabilistic clustering method leveraging deep neural networks.
Abstract
We develop a general method for estimating a finite mixture of non-normalized models. Here, a non-normalized model is defined to be a parametric distribution with an intractable normalization constant. Existing methods for estimating non-normalized models without computing the normalization constant are not applicable to mixture models because they contain more than one intractable normalization constant. The proposed method is derived by extending noise contrastive estimation (NCE), which estimates non-normalized models by discriminating between the observed data and some artificially generated noise. We also propose an extension of NCE with multiple noise distributions. Then, based on the observation that conventional classification learning with neural networks is implicitly assuming an exponential family as a generative model, we introduce a method for clustering unlabeled data by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Gaussian Processes and Bayesian Inference · Generative Adversarial Networks and Image Synthesis
