Generic resources are what you need: Style transfer tasks without   task-specific parallel training data

Huiyuan Lai; Antonio Toral; Malvina Nissim

arXiv:2109.04543·cs.CL·September 13, 2021

Generic resources are what you need: Style transfer tasks without task-specific parallel training data

Huiyuan Lai, Antonio Toral, Malvina Nissim

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel style transfer method that leverages generic resources and pre-trained models, achieving state-of-the-art results without task-specific parallel data by using iterative back-translation and synthetic data generation.

Contribution

It presents a new approach that outperforms existing methods by avoiding task-specific parallel data, utilizing generic resources and iterative back-translation with a pre-trained sequence-to-sequence model.

Findings

01

Outperforms existing unsupervised approaches on style transfer tasks

02

Effective use of generic paraphrases and synthetic pairs improves transfer quality

03

Highlights differences in task response to the proposed methodology

Abstract

Style transfer aims to rewrite a source text in a different target style while preserving its content. We propose a novel approach to this task that leverages generic resources, and without using any task-specific parallel (source-target) data outperforms existing unsupervised approaches on the two most popular style transfer tasks: formality transfer and polarity swap. In practice, we adopt a multi-step procedure which builds on a generic pre-trained sequence-to-sequence model (BART). First, we strengthen the model's ability to rewrite by further pre-training BART on both an existing collection of generic paraphrases, as well as on synthetic pairs created using a general-purpose lexical resource. Second, through an iterative back-translation approach, we train two models, each in a transfer direction, so that they can provide each other with synthetically generated pairs, dynamically…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

laihuiyuan/generic-resources-for-tst
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Dense Connections · Byte Pair Encoding · Adam · Residual Connection · Layer Normalization · Softmax · Dropout