Jointly Modeling Topics and Intents with Global Order Structure

Bei Chen; Jun Zhu; Nan Yang; Tian Tian; Ming Zhou; Bo Zhang

arXiv:1512.02009·cs.CL·December 8, 2015·2 cites

Jointly Modeling Topics and Intents with Global Order Structure

Bei Chen, Jun Zhu, Nan Yang, Tian Tian, Ming Zhou, Bo Zhang

PDF

Open Access

TL;DR

This paper introduces GMM-LDA, a Bayesian model that captures document structure by jointly modeling topics and rhetorical intents, incorporating order information and enabling supervised learning.

Contribution

It presents a novel unsupervised and supervised Bayesian model that integrates topic and intent modeling with global order structure in documents.

Findings

01

Outperforms state-of-the-art baselines in experiments

02

Effectively models document intent and topic structure

03

Flexible to incorporate annotations and supervised learning

Abstract

Modeling document structure is of great importance for discourse analysis and related applications. The goal of this research is to capture the document intent structure by modeling documents as a mixture of topic words and rhetorical words. While the topics are relatively unchanged through one document, the rhetorical functions of sentences usually change following certain orders in discourse. We propose GMM-LDA, a topic modeling based Bayesian unsupervised model, to analyze the document intent structure cooperated with order information. Our model is flexible that has the ability to combine the annotations and do supervised learning. Additionally, entropic regularization can be introduced to model the significant divergence between topics and intents. We perform experiments in both unsupervised and supervised settings, results show the superiority of our model over several…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques