E-LDA: Toward Interpretable LDA Topic Models with Strong Guarantees in Logarithmic Parallel Time

Adam Breuer

arXiv:2506.07747·cs.LG·June 10, 2025

E-LDA: Toward Interpretable LDA Topic Models with Strong Guarantees in Logarithmic Parallel Time

Adam Breuer

PDF

Open Access 1 Video

TL;DR

This paper introduces a novel, provably guaranteed, combinatorial algorithm for LDA inference that is exponentially faster, interpretable, and suitable for causal inference, outperforming existing methods in quality and speed.

Contribution

It presents the first practical, non-gradient-based algorithm for LDA inference with provable guarantees and interpretability, achieving logarithmic parallel time complexity.

Findings

01

Algorithms converge to near-optimal posterior probabilities.

02

Solutions have higher semantic quality than existing methods.

03

Approach maintains independence assumptions for causal inference.

Abstract

In this paper, we provide the first practical algorithms with provable guarantees for the problem of inferring the topics assigned to each document in an LDA topic model. This is the primary inference problem for many applications of topic models in social science, data exploration, and causal inference settings. We obtain this result by showing a novel non-gradient-based, combinatorial approach to estimating topic models. This yields algorithms that converge to near-optimal posterior probability in logarithmic parallel computation time (adaptivity) -- exponentially faster than any known LDA algorithm. We also show that our approach can provide interpretability guarantees such that each learned topic is formally associated with a known keyword. Finally, we show that unlike alternatives, our approach can maintain the independence assumptions necessary to use the learned topic model for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

E-LDA: Toward Interpretable LDA Topic Models with Strong Guarantees in Logarithmic Parallel Time· slideslive

Taxonomy

TopicsTopic Modeling · Computational and Text Analysis Methods · Misinformation and Its Impacts