Incremental Variational Inference for Latent Dirichlet Allocation

Cedric Archambeau; Beyza Ermis

arXiv:1507.05016·stat.ML·July 23, 2015·2 cites

Incremental Variational Inference for Latent Dirichlet Allocation

Cedric Archambeau, Beyza Ermis

PDF

Open Access

TL;DR

This paper presents incremental variational inference for LDA, enabling faster convergence and scalability to large datasets, with a novel distributed version that maintains performance while significantly increasing speed.

Contribution

It introduces incremental variational inference for LDA, offering an alternative to stochastic methods with faster convergence and a new distributed algorithm for large-scale topic modeling.

Findings

01

Incremental LDA converges faster and monotonically increases the variational bound.

02

The distributed algorithm achieves comparable performance with significant speed-up.

03

The method effectively processes massive document collections without learning rate tuning.

Abstract

We introduce incremental variational inference and apply it to latent Dirichlet allocation (LDA). Incremental variational inference is inspired by incremental EM and provides an alternative to stochastic variational inference. Incremental LDA can process massive document collections, does not require to set a learning rate, converges faster to a local optimum of the variational bound and enjoys the attractive property of monotonically increasing it. We study the performance of incremental LDA on large benchmark data sets. We further introduce a stochastic approximation of incremental variational inference which extends to the asynchronous distributed setting. The resulting distributed algorithm achieves comparable performance as single host incremental variational inference, but with a significant speed-up.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Methods and Mixture Models · Generative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning

MethodsLinear Discriminant Analysis