Autoencoding Variational Inference For Topic Models
Akash Srivastava, Charles Sutton

TL;DR
This paper introduces AVITM, a novel autoencoding variational inference method for LDA that improves inference speed and flexibility, enabling easier application to new topic models and demonstrating a more interpretable model called ProdLDA.
Contribution
It presents the first effective AEVB-based inference method for LDA, addressing Dirichlet prior challenges and introducing ProdLDA for better interpretability.
Findings
AVITM matches traditional methods in accuracy
AVITM significantly improves inference time
ProdLDA produces more interpretable topics
Abstract
Topic models are one of the most popular methods for learning representations of text, but a major challenge is that any change to the topic model requires mathematically deriving a new inference algorithm. A promising approach to address this problem is autoencoding variational Bayes (AEVB), but it has proven diffi- cult to apply to topic models in practice. We present what is to our knowledge the first effective AEVB based inference method for latent Dirichlet allocation (LDA), which we call Autoencoded Variational Inference For Topic Model (AVITM). This model tackles the problems caused for AEVB by the Dirichlet prior and by component collapsing. We find that AVITM matches traditional methods in accuracy with much better inference time. Indeed, because of the inference network, we find that it is unnecessary to pay the computational cost of running variational optimization on test…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Computational and Text Analysis Methods · Natural Language Processing Techniques
MethodsLinear Discriminant Analysis
