Unsupervised Paraphrase Generation using Pre-trained Language Models
Chaitra Hegde, Shrikumar Patil

TL;DR
This paper demonstrates how GPT-2 can be used to generate high-quality, diverse paraphrases without supervision, improving downstream NLP task performance through data augmentation.
Contribution
It introduces an unsupervised method for paraphrase generation leveraging GPT-2's capabilities, without requiring labeled data.
Findings
Generated paraphrases are of high quality and diversity.
Using generated paraphrases improves downstream classification performance.
The approach compares favorably with supervised and other unsupervised methods.
Abstract
Large scale Pre-trained Language Models have proven to be very powerful approach in various Natural language tasks. OpenAI's GPT-2 \cite{radford2019language} is notable for its capability to generate fluent, well formulated, grammatically consistent text and for phrase completions. In this paper we leverage this generation capability of GPT-2 to generate paraphrases without any supervision from labelled data. We examine how the results compare with other supervised and unsupervised approaches and the effect of using paraphrases for data augmentation on downstream tasks such as classification. Our experiments show that paraphrases generated with our model are of good quality, are diverse and improves the downstream task performance when used for data augmentation.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques
MethodsLinear Layer · Cosine Annealing · Weight Decay · Softmax · Adam · Dropout · Refunds@Expedia|||How do I get a full refund from Expedia? · Attention Dropout · Byte Pair Encoding · Dense Connections
