WHAI: Weibull Hybrid Autoencoding Inference for Deep Topic Modeling

Hao Zhang; Bo Chen; Dandan Guo; Mingyuan Zhou

arXiv:1803.01328·stat.ML·April 28, 2020·ICLR·51 cites

WHAI: Weibull Hybrid Autoencoding Inference for Deep Topic Modeling

Hao Zhang, Bo Chen, Dandan Guo, Mingyuan Zhou

PDF

Open Access 1 Repo

TL;DR

This paper introduces WHAI, a scalable deep topic modeling method that combines stochastic-gradient MCMC and autoencoding variational Bayes, using Weibull distributions for efficient inference in large text corpora.

Contribution

It develops a novel hybrid inference network for deep latent Dirichlet allocation utilizing Weibull distributions for improved efficiency and scalability.

Findings

01

Demonstrates effectiveness on large corpora

02

Achieves faster inference with comparable accuracy

03

Outperforms existing deep topic models in scalability

Abstract

To train an inference network jointly with a deep generative topic model, making it both scalable to big corpora and fast in out-of-sample prediction, we develop Weibull hybrid autoencoding inference (WHAI) for deep latent Dirichlet allocation, which infers posterior samples via a hybrid of stochastic-gradient MCMC and autoencoding variational Bayes. The generative network of WHAI has a hierarchy of gamma distributions, while the inference network of WHAI is a Weibull upward-downward variational autoencoder, which integrates a deterministic-upward deep neural network, and a stochastic-downward deep generative model based on a hierarchy of Weibull distributions. The Weibull distribution can be used to well approximate a gamma distribution with an analytic Kullback-Leibler divergence, and has a simple reparameterization via the uniform noise, which help efficiently compute the gradients…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

sophieburkhardt/dirichlet-vae-topic-models
tf

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Topic Modeling · Computational and Text Analysis Methods