Learning document embeddings along with their uncertainties

Santosh Kesiraju; Old\v{r}ich Plchot; Luk\'a\v{s} Burget; and; Suryakanth V Gangashetty

arXiv:1908.07599·cs.CL·August 3, 2020

Learning document embeddings along with their uncertainties

Santosh Kesiraju, Old\v{r}ich Plchot, Luk\'a\v{s} Burget, and, Suryakanth V Gangashetty

PDF

2 Repos

TL;DR

This paper introduces Bayesian SMM, a generative model that learns document embeddings as Gaussian distributions with uncertainty, improving data fit and robustness in topic identification over existing models.

Contribution

The paper presents Bayesian SMM, a novel generative model that encodes uncertainty in document embeddings and addresses intractability in variational inference.

Findings

01

Bayesian SMM outperforms neural variational models in perplexity on Fisher and 20Newsgroups datasets.

02

The model demonstrates robustness to overfitting in topic identification tasks.

03

Achieves comparable results to supervised models in unsupervised topic detection.

Abstract

Majority of the text modelling techniques yield only point-estimates of document embeddings and lack in capturing the uncertainty of the estimates. These uncertainties give a notion of how well the embeddings represent a document. We present Bayesian subspace multinomial model (Bayesian SMM), a generative log-linear model that learns to represent documents in the form of Gaussian distributions, thereby encoding the uncertainty in its co-variance. Additionally, in the proposed Bayesian SMM, we address a commonly encountered problem of intractability that appears during variational inference in mixed-logit models. We also present a generative Gaussian linear classifier for topic identification that exploits the uncertainty in document embeddings. Our intrinsic evaluation using perplexity measure shows that the proposed Bayesian SMM fits the data better as compared to the state-of-the-art…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.