# Minimum Volume Topic Modeling

**Authors:** Byoungwook Jang, Alfred Hero

arXiv: 1904.02064 · 2019-04-04

## TL;DR

This paper introduces a novel topic modeling method based on minimizing the volume of the topic simplex, leading to improved computational efficiency and topic recovery accuracy.

## Contribution

It reformulates topic modeling as a volume minimization problem and proposes a convex relaxation with an ADMM solution, under specific assumptions.

## Key findings

- Enhanced computation time for topic modeling.
- Improved accuracy in topic recovery.
- Effective convex relaxation under separability assumptions.

## Abstract

We propose a new topic modeling procedure that takes advantage of the fact that the Latent Dirichlet Allocation (LDA) log likelihood function is asymptotically equivalent to the logarithm of the volume of the topic simplex. This allows topic modeling to be reformulated as finding the probability simplex that minimizes its volume and encloses the documents that are represented as distributions over words. A convex relaxation of the minimum volume topic model optimization is proposed, and it is shown that the relaxed problem has the same global minimum as the original problem under the separability assumption and the sufficiently scattered assumption introduced by Arora et al. (2013) and Huang et al. (2016). A locally convergent alternating direction method of multipliers (ADMM) approach is introduced for solving the relaxed minimum volume problem. Numerical experiments illustrate the benefits of our approach in terms of computation time and topic recovery performance.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1904.02064/full.md

## Figures

21 figures with captions in the complete paper: https://tomesphere.com/paper/1904.02064/full.md

## References

22 references — full list in the complete paper: https://tomesphere.com/paper/1904.02064/full.md

---
Source: https://tomesphere.com/paper/1904.02064