Autoencoding Variational Inference For Topic Models

Akash Srivastava; Charles Sutton

arXiv:1703.01488·stat.ML·March 7, 2017·ICLR·146 cites

Autoencoding Variational Inference For Topic Models

Akash Srivastava, Charles Sutton

PDF

Open Access 5 Repos

TL;DR

This paper introduces AVITM, a novel autoencoding variational inference method for LDA that improves inference speed and flexibility, enabling easier application to new topic models and demonstrating a more interpretable model called ProdLDA.

Contribution

It presents the first effective AEVB-based inference method for LDA, addressing Dirichlet prior challenges and introducing ProdLDA for better interpretability.

Findings

01

AVITM matches traditional methods in accuracy

02

AVITM significantly improves inference time

03

ProdLDA produces more interpretable topics

Abstract

Topic models are one of the most popular methods for learning representations of text, but a major challenge is that any change to the topic model requires mathematically deriving a new inference algorithm. A promising approach to address this problem is autoencoding variational Bayes (AEVB), but it has proven diffi- cult to apply to topic models in practice. We present what is to our knowledge the first effective AEVB based inference method for latent Dirichlet allocation (LDA), which we call Autoencoded Variational Inference For Topic Model (AVITM). This model tackles the problems caused for AEVB by the Dirichlet prior and by component collapsing. We find that AVITM matches traditional methods in accuracy with much better inference time. Indeed, because of the inference network, we find that it is unnecessary to pay the computational cost of running variational optimization on test…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Computational and Text Analysis Methods · Natural Language Processing Techniques

MethodsLinear Discriminant Analysis