RankGen: Improving Text Generation with Large Ranking Models

Kalpesh Krishna; Yapei Chang; John Wieting; Mohit Iyyer

arXiv:2205.09726·cs.CL·November 16, 2022·1 cites

RankGen: Improving Text Generation with Large Ranking Models

Kalpesh Krishna, Yapei Chang, John Wieting, Mohit Iyyer

PDF

Open Access 1 Repo

TL;DR

RankGen is a large ranking model that improves text generation quality by scoring and selecting more relevant, coherent outputs from pretrained language models, outperforming existing decoding methods.

Contribution

Introduces RankGen, a 1.2B parameter ranking model trained with contrastive learning to enhance text generation quality across various language models.

Findings

01

RankGen outperforms traditional decoding algorithms in automatic metrics.

02

Human evaluations favor RankGen's relevance and coherence.

03

RankGen improves continuity and relevance in generated text.

Abstract

Given an input sequence (or prefix), modern language models often assign high probabilities to output sequences that are repetitive, incoherent, or irrelevant to the prefix; as such, model-generated text also contains such artifacts. To address these issues we present RankGen, a 1.2B parameter encoder model for English that scores model generations given a prefix. RankGen can be flexibly incorporated as a scoring function in beam search and used to decode from any pretrained language model. We train RankGen using large-scale contrastive learning to map a prefix close to the ground-truth sequence that follows it and far away from two types of negatives: (1) random sequences from the same document as the prefix, and (2) sequences generated from a large language model conditioned on the prefix. Experiments across four different language models (345M-11B parameters) and two domains show…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

martiansideofthemoon/rankgen
jaxOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification

MethodsContrastive Learning