MOCHA: A Multi-Task Training Approach for Coherent Text Generation from   Cognitive Perspective

Zhe Hu; Hou Pong Chan; Lifu Huang

arXiv:2210.14650·cs.CL·October 27, 2022

MOCHA: A Multi-Task Training Approach for Coherent Text Generation from Cognitive Perspective

Zhe Hu, Hou Pong Chan, Lifu Huang

PDF

Open Access 1 Repo

TL;DR

This paper introduces MOCHA, a multi-task training approach grounded in cognitive writing theory, which improves neural models' ability to generate more coherent texts across various narrative tasks.

Contribution

It proposes a novel multi-task training strategy that incorporates writing subskills like planning and reviewing, enhancing coherence in text generation.

Findings

01

Outperforms strong baselines in few-shot and fully-supervised settings.

02

Generates more coherent texts according to human evaluations.

03

Achieves better results on story, news, and argument generation tasks.

Abstract

Teaching neural models to generate narrative coherent texts is a critical problem. Recent pre-trained language models have achieved promising results, but there is still a gap between human written texts and machine-generated outputs. In this work, we propose a novel multi-task training strategy for coherent text generation grounded on the cognitive theory of writing, which empowers the model to learn essential subskills needed for writing including planning and reviewing besides end-to-end generation. We extensively evaluate our model on three open-ended generation tasks including story generation, news article writing and argument generation. Experiments show that our model achieves better results on both few-shot and fully-supervised settings than strong baselines, and human evaluations confirm that our model can generate more coherent outputs.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

derekkk/mocha-emnlp22
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques