Nested Hierarchical Dirichlet Processes for Multi-Level Non-Parametric Admixture Modeling
Lavanya Sita Tekumalla, Priyanka Agrawal, Indrajit Bhattacharya

TL;DR
This paper introduces a multi-level nested Hierarchical Dirichlet Process (nHDP) model for non-parametric admixture modeling, enabling complex hierarchical topic and entity modeling with improved generalization and entity detection.
Contribution
It proposes a novel multi-level nested HDP framework with a Gibbs sampling inference algorithm, addressing scalability and sharing limitations of previous models.
Findings
nHDP outperforms existing models in generalization
Successfully detects missing author entities
Effective in non-parametric entity topic modeling
Abstract
Dirichlet Process(DP) is a Bayesian non-parametric prior for infinite mixture modeling, where the number of mixture components grows with the number of data items. The Hierarchical Dirichlet Process (HDP), is an extension of DP for grouped data, often used for non-parametric topic modeling, where each group is a mixture over shared mixture densities. The Nested Dirichlet Process (nDP), on the other hand, is an extension of the DP for learning group level distributions from data, simultaneously clustering the groups. It allows group level distributions to be shared across groups in a non-parametric setting, leading to a non-parametric mixture of mixtures. The nCRF extends the nDP for multilevel non-parametric mixture modeling, enabling modeling topic hierarchies. However, the nDP and nCRF do not allow sharing of distributions as required in many applications, motivating the need for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Biomedical Text Mining and Ontologies · Natural Language Processing Techniques
