CoNT: Contrastive Neural Text Generation

Chenxin An; Jiangtao Feng; Kai Lv; Lingpeng Kong; Xipeng Qiu; Xuanjing; Huang

arXiv:2205.14690·cs.CL·February 6, 2023·22 cites

CoNT: Contrastive Neural Text Generation

Chenxin An, Jiangtao Feng, Kai Lv, Lingpeng Kong, Xipeng Qiu, Xuanjing, Huang

PDF

Open Access 2 Repos

TL;DR

CoNT introduces a contrastive learning framework for neural text generation, significantly improving performance across multiple tasks by addressing key bottlenecks in contrastive example construction, loss choice, and decoding strategies.

Contribution

The paper proposes CoNT, a novel contrastive neural text generation framework that enhances performance by optimizing contrastive example construction, loss functions, and decoding strategies.

Findings

01

Outperforms traditional training on 10 benchmarks

02

Achieves state-of-the-art results in summarization and code comment generation

03

Surpasses previous contrastive learning methods by significant margins

Abstract

Recently, contrastive learning attracts increasing interests in neural text generation as a new solution to alleviate the exposure bias problem. It introduces a sequence-level training signal which is crucial to generation tasks that always rely on auto-regressive decoding. However, previous methods using contrastive learning in neural text generation usually lead to inferior performance. In this paper, we analyse the underlying reasons and propose a new Contrastive Neural Text generation framework, CoNT. CoNT addresses bottlenecks that prevent contrastive learning from being widely adopted in generation tasks from three aspects -- the construction of contrastive examples, the choice of the contrastive loss, and the strategy in decoding. We validate CoNT on five generation tasks with ten benchmarks, including machine translation, summarization, code comment generation, data-to-text…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification

MethodsContrastive Learning