Analysis and tuning of hierarchical topic models based on Renyi entropy approach
Sergei Koltcov, Vera Ignatenko, Maxim Terpilovskii, Paolo Rosso

TL;DR
This paper introduces a Renyi entropy-based method for tuning hierarchical topic models, improving the estimation of the number of topics and hierarchy levels, and demonstrating its effectiveness on various models and datasets.
Contribution
The paper proposes a novel Renyi entropy-based metric and practical tuning approach for hierarchical topic models, addressing parameter selection challenges.
Findings
hLDA model shows instability and inaccurate topic number estimation.
Renyi entropy approach helps determine hierarchy levels in hPAM.
Method estimates topic numbers for two levels in hARTM.
Abstract
Hierarchical topic modeling is a potentially powerful instrument for determining the topical structure of text collections that allows constructing a topical hierarchy representing levels of topical abstraction. However, tuning of parameters of hierarchical models, including the number of topics on each hierarchical level, remains a challenging task and an open issue. In this paper, we propose a Renyi entropy-based approach for a partial solution to the above problem. First, we propose a Renyi entropy-based metric of quality for hierarchical models. Second, we propose a practical concept of hierarchical topic model tuning tested on datasets with human mark-up. In the numerical experiments, we consider three different hierarchical models, namely, hierarchical latent Dirichlet allocation (hLDA) model, hierarchical Pachinko allocation model (hPAM), and hierarchical additive regularization…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Bayesian Methods and Mixture Models · Computational and Text Analysis Methods
