Stochastic Discriminative EM

Andres R. Masegosa

arXiv:1410.1784·cs.LG·April 5, 2017·1 cites

Stochastic Discriminative EM

Andres R. Masegosa

PDF

Open Access

TL;DR

This paper introduces sdEM, an online-EM algorithm that performs discriminative training of probabilistic generative models using natural gradient descent, enabling effective handling of missing data and latent variables.

Contribution

The paper presents sdEM as a stochastic natural gradient descent method for discriminative training of exponential family models, unifying generative modeling with discriminative loss functions.

Findings

01

Effective text classification with multinomial naive Bayes.

02

Discriminative training improves model performance.

03

Handles missing data and latent variables effectively.

Abstract

Stochastic discriminative EM (sdEM) is an online-EM-type algorithm for discriminative training of probabilistic generative models belonging to the exponential family. In this work, we introduce and justify this algorithm as a stochastic natural gradient descent method, i.e. a method which accounts for the information geometry in the parameter space of the statistical model. We show how this learning algorithm can be used to train probabilistic generative models by minimizing different discriminative loss functions, such as the negative conditional log-likelihood and the Hinge loss. The resulting models trained by sdEM are always generative (i.e. they define a joint probability distribution) and, in consequence, allows to deal with missing data and latent variables in a principled way either when being learned or when making predictions. The performance of this method is illustrated by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Topic Modeling · Bayesian Methods and Mixture Models