Unsupervised Paraphrase Generation using Pre-trained Language Models

Chaitra Hegde; Shrikumar Patil

arXiv:2006.05477·cs.CL·June 11, 2020·46 cites

Unsupervised Paraphrase Generation using Pre-trained Language Models

Chaitra Hegde, Shrikumar Patil

PDF

Open Access

TL;DR

This paper demonstrates how GPT-2 can be used to generate high-quality, diverse paraphrases without supervision, improving downstream NLP task performance through data augmentation.

Contribution

It introduces an unsupervised method for paraphrase generation leveraging GPT-2's capabilities, without requiring labeled data.

Findings

01

Generated paraphrases are of high quality and diversity.

02

Using generated paraphrases improves downstream classification performance.

03

The approach compares favorably with supervised and other unsupervised methods.

Abstract

Large scale Pre-trained Language Models have proven to be very powerful approach in various Natural language tasks. OpenAI's GPT-2 \cite{radford2019language} is notable for its capability to generate fluent, well formulated, grammatically consistent text and for phrase completions. In this paper we leverage this generation capability of GPT-2 to generate paraphrases without any supervision from labelled data. We examine how the results compare with other supervised and unsupervised approaches and the effect of using paraphrases for data augmentation on downstream tasks such as classification. Our experiments show that paraphrases generated with our model are of good quality, are diverse and improves the downstream task performance when used for data augmentation.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques

MethodsLinear Layer · Cosine Annealing · Weight Decay · Softmax · Adam · Dropout · Refunds@Expedia|||How do I get a full refund from Expedia? · Attention Dropout · Byte Pair Encoding · Dense Connections