The supervised hierarchical Dirichlet process
Andrew M. Dai, Amos J. Storkey

TL;DR
The paper introduces the supervised hierarchical Dirichlet process (sHDP), a nonparametric model that jointly learns clusters from grouped data and associated labels, improving predictive performance in classification and regression tasks.
Contribution
It presents the sHDP model, which effectively incorporates supervision into hierarchical Dirichlet process clustering for grouped data, addressing limitations of previous HDP applications.
Findings
sHDP outperforms sLDA in classification and regression tasks
The model captures group-level structure and label information effectively
Experimental results demonstrate improved predictive accuracy
Abstract
We propose the supervised hierarchical Dirichlet process (sHDP), a nonparametric generative model for the joint distribution of a group of observations and a response variable directly associated with that whole group. We compare the sHDP with another leading method for regression on grouped data, the supervised latent Dirichlet allocation (sLDA) model. We evaluate our method on two real-world classification problems and two real-world regression problems. Bayesian nonparametric regression models based on the Dirichlet process, such as the Dirichlet process-generalised linear models (DP-GLM) have previously been explored; these models allow flexibility in modelling nonlinear relationships. However, until now, Hierarchical Dirichlet Process (HDP) mixtures have not seen significant use in supervised problems with grouped data since a straightforward application of the HDP on the grouped…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
