MedLDA: A General Framework of Maximum Margin Supervised Topic Models

Jun Zhu; Amr Ahmed; Eric P. Xing

arXiv:0912.5507·stat.ML·April 9, 2013·1 cites

MedLDA: A General Framework of Maximum Margin Supervised Topic Models

Jun Zhu, Amr Ahmed, Eric P. Xing

PDF

Open Access

TL;DR

MedLDA introduces a max-margin framework for supervised topic modeling that improves predictive performance over traditional likelihood-based methods, applicable to various types of data and models.

Contribution

The paper proposes MedLDA, a novel max-margin supervised topic model that enhances prediction accuracy and can be integrated with different topic modeling approaches.

Findings

01

MedLDA outperforms likelihood-based models on movie review data.

02

MedLDA achieves better classification accuracy on 20 Newsgroups.

03

Efficient variational methods enable scalable inference for MedLDA.

Abstract

Supervised topic models utilize document's side information for discovering predictive low dimensional representations of documents. Existing models apply the likelihood-based estimation. In this paper, we present a general framework of max-margin supervised topic models for both continuous and categorical response variables. Our approach, the maximum entropy discrimination latent Dirichlet allocation (MedLDA), utilizes the max-margin principle to train supervised topic models and estimate predictive topic representations that are arguably more suitable for prediction tasks. The general principle of MedLDA can be applied to perform joint max-margin learning and maximum likelihood estimation for arbitrary topic models, directed or undirected, and supervised or unsupervised, when the supervised side information is available. We develop efficient variational methods for posterior inference…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Data Quality and Management · Natural Language Processing Techniques