Probit Normal Correlated Topic Models
Xingchen Yu, Ernest Fokoue

TL;DR
This paper introduces a novel probit normal model for correlated topic modeling, offering improved efficiency and scalability over existing logistic normal models, and demonstrates its effectiveness on large corpora.
Contribution
The paper proposes a probit normal approach to correlated topic modeling, overcoming logistic model inefficiencies and enabling handling of large datasets with many topics.
Findings
Effectively discovers meaningful topics in large corpora
Captures intuitive correlations among topics
Offers scalable inference with existing algorithms
Abstract
The logistic normal distribution has recently been adapted via the transformation of multivariate Gaus- sian variables to model the topical distribution of documents in the presence of correlations among topics. In this paper, we propose a probit normal alternative approach to modelling correlated topical structures. Our use of the probit model in the context of topic discovery is novel, as many authors have so far con- centrated solely of the logistic model partly due to the formidable inefficiency of the multinomial probit model even in the case of very small topical spaces. We herein circumvent the inefficiency of multinomial probit estimation by using an adaptation of the diagonal orthant multinomial probit in the topic models context, resulting in the ability of our topic modelling scheme to handle corpuses with a large number of latent topics. An additional and very important…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Quality and Management · Bayesian Methods and Mixture Models · Topic Modeling
