On the Semi-supervised Expectation Maximization

Erixhen Sula; Lizhong Zheng

arXiv:2211.00537·cs.LG·January 26, 2023·1 cites

On the Semi-supervised Expectation Maximization

Erixhen Sula, Lizhong Zheng

PDF

Open Access

TL;DR

This paper analyzes how labeled samples influence the convergence rate of the EM algorithm in semi-supervised learning, providing theoretical guarantees and extending results for Gaussian mixture models.

Contribution

It offers a convergence rate analysis for semi-supervised EM, highlighting the role of labeled data, and extends findings to Gaussian mixtures with theoretical proofs.

Findings

01

Labeled samples improve convergence rate for exponential family mixtures

02

Provides a comprehensive convergence analysis for Gaussian mixture models

03

Extends proof for symmetric Gaussian mixtures with unlabeled data

Abstract

The Expectation Maximization (EM) algorithm is widely used as an iterative modification to maximum likelihood estimation when the data is incomplete. We focus on a semi-supervised case to learn the model from labeled and unlabeled samples. Existing work in the semi-supervised case has focused mainly on performance rather than convergence guarantee, however we focus on the contribution of the labeled samples to the convergence rate. The analysis clearly demonstrates how the labeled samples improve the convergence rate for the exponential family mixture model. In this case, we assume that the population EM (EM with unlimited data) is initialized within the neighborhood of global convergence for the population EM that consists solely of samples that have not been labeled. The analysis for the labeled samples provides a comprehensive description of the convergence rate for the Gaussian…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Methods and Mixture Models · Gaussian Processes and Bayesian Inference · Target Tracking and Data Fusion in Sensor Networks