Document Context Language Models

Yangfeng Ji; Trevor Cohn; Lingpeng Kong; Chris Dyer; Jacob Eisenstein

arXiv:1511.03962·cs.CL·February 23, 2016·60 cites

Document Context Language Models

Yangfeng Ji, Trevor Cohn, Lingpeng Kong, Chris Dyer, Jacob Eisenstein

PDF

Open Access 1 Repo

TL;DR

This paper introduces Document-Context Language Models (DCLM), a new neural network approach that incorporates multi-level discourse information to improve document coherence and predictive performance.

Contribution

The paper presents a novel multi-level recurrent neural network model that effectively integrates discourse structure for improved language modeling.

Findings

01

DCLM models achieve better predictive likelihoods than word-level models.

02

DCLM models significantly improve assessments of document coherence.

03

Empirical evaluation demonstrates the effectiveness of incorporating discourse structure.

Abstract

Text documents are structured on multiple levels of detail: individual words are related by syntax, but larger units of text are related by discourse structure. Existing language models generally fail to account for discourse structure, but it is crucial if we are to have language models that reward coherence and generate coherent texts. We present and empirically evaluate a set of multi-level recurrent neural network language models, called Document-Context Language Models (DCLM), which incorporate contextual information both within and beyond the sentence. In comparison with word-level recurrent neural network language models, the DCLM models obtain slightly better predictive likelihoods, and considerably better assessments of document coherence.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jiyfeng/dclm
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsService-Oriented Architecture and Web Services · Semantic Web and Ontologies · Data Quality and Management