Hierarchical Dirichlet Process Mixture of Products of Multinomial Distributions: Applications to Survey Data with Potentially Missing Values
Chayut Wongkamthong

TL;DR
This paper introduces a nonparametric Bayesian model, HDPMPM, for analyzing survey data with categorical responses, capturing complex latent structures and handling missing data effectively.
Contribution
The study develops the HDPMPM model that allows multiple latent classes per individual and integrates missing data imputation within a Bayesian framework, improving analysis of survey data.
Findings
Successfully identifies political profiles in survey data
Handles missing data under MAR and MCAR assumptions
Recovers dominant latent structures effectively
Abstract
In social science research, understanding latent structures in populations through survey data with categorical responses is a common and important task. Traditional methods like Factor Analysis and Latent Class Analysis have limitations, particularly in handling categorical data and accommodating mixed memberships in latent structures, respectively. Moreover, choosing the number of factors or latent classes is often subjective and can be challenging in the presence of missing values. This study introduces a Hierarchical Dirichlet Process Mixture of Products of Multinomial Distributions (HDPMPM) model, which leverages the flexibility of nonparametric Bayesian methods to address these limitations. The HDPMPM model allows for multiple latent classes within individuals and avoids fixing the number of mixture components at an arbitrary number. Additionally, it incorporates missing data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models
