Towards Coherent and Cohesive Long-form Text Generation

Woon Sang Cho; Pengchuan Zhang; Yizhe Zhang; Xiujun Li; Michel Galley,; Chris Brockett; Mengdi Wang; Jianfeng Gao

arXiv:1811.00511·cs.CL·May 30, 2019·1 cites

Towards Coherent and Cohesive Long-form Text Generation

Woon Sang Cho, Pengchuan Zhang, Yizhe Zhang, Xiujun Li, Michel Galley,, Chris Brockett, Mengdi Wang, Jianfeng Gao

PDF

Open Access

TL;DR

This paper introduces a neural language model enhanced with discriminators for sentence and paragraph level feedback, improving the coherence and cohesion of long-form text generation through a novel training method.

Contribution

It presents a new neural language model with integrated discriminators and a negative-critical sequence training method for better long-form text coherence and cohesion.

Findings

01

Improved coherence and cohesion in generated texts.

02

Outperforms baseline models in quality metrics.

03

Efficient training without separate critic models.

Abstract

Generating coherent and cohesive long-form texts is a challenging task. Previous works relied on large amounts of human-generated texts to train neural language models. However, few attempted to explicitly improve neural language models from the perspectives of coherence and cohesion. In this work, we propose a new neural language model that is equipped with two neural discriminators which provide feedback signals at the levels of sentence (cohesion) and paragraph (coherence). Our model is trained using a simple yet efficient variant of policy gradient, called negative-critical sequence training, which is proposed to eliminate the need of training a separate critic for estimating baseline. Results demonstrate the effectiveness of our approach, showing improvements over the strong baseline -- recurrent attention-based bidirectional MLE-trained neural language model.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications