A Cross-Domain Transferable Neural Coherence Model

Peng Xu; Hamidreza Saghir; Jin Sung Kang; Teng Long; Avishek Joey; Bose; Yanshuai Cao; Jackie Chi Kit Cheung

arXiv:1905.11912·cs.CL·July 10, 2019·5 cites

A Cross-Domain Transferable Neural Coherence Model

Peng Xu, Hamidreza Saghir, Jin Sung Kang, Teng Long, Avishek Joey, Bose, Yanshuai Cao, Jackie Chi Kit Cheung

PDF

Open Access 1 Repo

TL;DR

This paper introduces a neural coherence model that effectively generalizes across different text domains by using a local discriminative approach with reduced negative sampling, outperforming previous methods.

Contribution

A novel local discriminative neural coherence model that enhances cross-domain transferability and outperforms existing models on standard and new challenging datasets.

Findings

01

Significantly outperforms previous state-of-the-art methods.

02

Effective in transfer to unseen categories of discourse.

03

Simpler structure with efficient learning against incorrect orderings.

Abstract

Coherence is an important aspect of text quality and is crucial for ensuring its readability. One important limitation of existing coherence models is that training on one domain does not easily generalize to unseen categories of text. Previous work advocates for generative models for cross-domain generalization, because for discriminative models, the space of incoherent sentence orderings to discriminate against during training is prohibitively large. In this work, we propose a local discriminative neural model with a much smaller negative sampling space that can efficiently learn against incorrect orderings. The proposed coherence model is simple in structure, yet it significantly outperforms previous state-of-art methods on a standard benchmark dataset on the Wall Street Journal corpus, as well as in multiple new challenging settings of transfer to unseen categories of discourse on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

BorealisAI/cross_domain_coherence
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Text Readability and Simplification · Natural Language Processing Techniques