Semi-Supervised Formality Style Transfer with Consistency Training

Ao Liu; An Wang; Naoaki Okazaki

arXiv:2203.13620·cs.CL·March 28, 2022

Semi-Supervised Formality Style Transfer with Consistency Training

Ao Liu, An Wang, Naoaki Okazaki

PDF

Open Access 1 Repo

TL;DR

This paper introduces a semi-supervised method for formality style transfer that leverages source-side unlabeled data with consistency training, achieving state-of-the-art results with less than 40% of parallel data.

Contribution

It proposes a novel semi-supervised framework utilizing source-side unlabeled sentences and consistency training, improving formality transfer performance over previous cycle-reconstruction methods.

Findings

01

Achieves state-of-the-art results on GYAFC benchmark.

02

Effective data filtering strategies enhance model performance.

03

Performs well with significantly less parallel data.

Abstract

Formality style transfer (FST) is a task that involves paraphrasing an informal sentence into a formal one without altering its meaning. To address the data-scarcity problem of existing parallel datasets, previous studies tend to adopt a cycle-reconstruction scheme to utilize additional unlabeled data, where the FST model mainly benefits from target-side unlabeled sentences. In this work, we propose a simple yet effective semi-supervised framework to better utilize source-side unlabeled sentences based on consistency training. Specifically, our approach augments pseudo-parallel data obtained from a source-side informal sentence by enforcing the model to generate similar outputs for its perturbed version. Moreover, we empirically examined the effects of various data perturbation methods and propose effective data filtering strategies to improve our framework. Experimental results on the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

aolius/semi-fst
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech Recognition and Synthesis