CoNT: Contrastive Neural Text Generation
Chenxin An, Jiangtao Feng, Kai Lv, Lingpeng Kong, Xipeng Qiu, Xuanjing, Huang

TL;DR
CoNT introduces a contrastive learning framework for neural text generation, significantly improving performance across multiple tasks by addressing key bottlenecks in contrastive example construction, loss choice, and decoding strategies.
Contribution
The paper proposes CoNT, a novel contrastive neural text generation framework that enhances performance by optimizing contrastive example construction, loss functions, and decoding strategies.
Findings
Outperforms traditional training on 10 benchmarks
Achieves state-of-the-art results in summarization and code comment generation
Surpasses previous contrastive learning methods by significant margins
Abstract
Recently, contrastive learning attracts increasing interests in neural text generation as a new solution to alleviate the exposure bias problem. It introduces a sequence-level training signal which is crucial to generation tasks that always rely on auto-regressive decoding. However, previous methods using contrastive learning in neural text generation usually lead to inferior performance. In this paper, we analyse the underlying reasons and propose a new Contrastive Neural Text generation framework, CoNT. CoNT addresses bottlenecks that prevent contrastive learning from being widely adopted in generation tasks from three aspects -- the construction of contrastive examples, the choice of the contrastive loss, and the strategy in decoding. We validate CoNT on five generation tasks with ten benchmarks, including machine translation, summarization, code comment generation, data-to-text…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification
MethodsContrastive Learning
