Gibbs Max-margin Topic Models with Data Augmentation

Jun Zhu; Ning Chen; Hugh Perkins; Bo Zhang

arXiv:1310.2816·stat.ML·October 11, 2013·74 cites

Gibbs Max-margin Topic Models with Data Augmentation

Jun Zhu, Ning Chen, Hugh Perkins, Bo Zhang

PDF

Open Access

TL;DR

This paper introduces Gibbs max-margin supervised topic models that utilize a new max-margin loss and Gibbs sampling, achieving efficient training and improved classification accuracy across various tasks.

Contribution

It proposes a novel Gibbs max-margin supervised topic model that simplifies training by avoiding SVM subproblems and enhances performance and efficiency.

Findings

01

Significant improvements in training time efficiency.

02

Enhanced classification accuracy on multiple tasks.

03

No need for restrictive assumptions in the sampling process.

Abstract

Max-margin learning is a powerful approach to building classifiers and structured output predictors. Recent work on max-margin supervised topic models has successfully integrated it with Bayesian topic models to discover discriminative latent semantic structures and make accurate predictions for unseen testing data. However, the resulting learning problems are usually hard to solve because of the non-smoothness of the margin loss. Existing approaches to building max-margin supervised topic models rely on an iterative procedure to solve multiple latent SVM subproblems with additional mean-field assumptions on the desired posterior distributions. This paper presents an alternative approach by defining a new max-margin loss. Namely, we present Gibbs max-margin supervised topic models, a latent variable Gibbs classifier to discover hidden topic representations for various tasks, including…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsText and Document Classification Technologies · Topic Modeling · Machine Learning and Data Classification

MethodsSupport Vector Machine