E-LDA: Toward Interpretable LDA Topic Models with Strong Guarantees in Logarithmic Parallel Time
Adam Breuer

TL;DR
This paper introduces a novel, provably guaranteed, combinatorial algorithm for LDA inference that is exponentially faster, interpretable, and suitable for causal inference, outperforming existing methods in quality and speed.
Contribution
It presents the first practical, non-gradient-based algorithm for LDA inference with provable guarantees and interpretability, achieving logarithmic parallel time complexity.
Findings
Algorithms converge to near-optimal posterior probabilities.
Solutions have higher semantic quality than existing methods.
Approach maintains independence assumptions for causal inference.
Abstract
In this paper, we provide the first practical algorithms with provable guarantees for the problem of inferring the topics assigned to each document in an LDA topic model. This is the primary inference problem for many applications of topic models in social science, data exploration, and causal inference settings. We obtain this result by showing a novel non-gradient-based, combinatorial approach to estimating topic models. This yields algorithms that converge to near-optimal posterior probability in logarithmic parallel computation time (adaptivity) -- exponentially faster than any known LDA algorithm. We also show that our approach can provide interpretability guarantees such that each learned topic is formally associated with a known keyword. Finally, we show that unlike alternatives, our approach can maintain the independence assumptions necessary to use the learned topic model for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsTopic Modeling · Computational and Text Analysis Methods · Misinformation and Its Impacts
