Replacing Language Model for Style Transfer

Pengyu Cheng; Ruineng Li

arXiv:2211.07343·cs.CL·February 29, 2024

Open Access · 1 Repo

TL;DR

This paper proposes the Replacing Language Model (RLM), a sequence-to-sequence framework for text style transfer (TST) that combines the flexibility of autoregressive generation with the accuracy of a non-autoregressive masked language model.

Contribution

The paper introduces RLM, a style transfer approach that autoregressively replaces each token of the source sentence with a target-style span of similar meaning, where each span is generated by a non-autoregressive masked language model.

Findings

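01 Empirical results on real-world text datasets show that RLM is effective compared with other TST baselines.

02 Generating each replacement span with a non-autoregressive masked language model better preserves the local-contextual meaning of the replaced token.

03 Token-level style-content disentanglement on RLM's hidden representations gives more precise control over the generation style.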

Abstract

We introduce replacing language model (RLM), a sequence-to-sequence language modeling framework for text style transfer (TST). Our method autoregressively replaces each token of the source sentence with a text span that has a similar meaning but in the target style. The new span is generated via a non-autoregressive masked language model, which can better preserve the local-contextual meaning of the replaced token. This RLM generation scheme gathers the flexibility of autoregressive models and the accuracy of non-autoregressive models, which bridges the gap between sentence-level and word-level style transfer methods. To control the generation style more precisely, we conduct a token-level style-content disentanglement on the hidden representations of RLM. Empirical results on real-world text datasets demonstrate the effectiveness of RLM compared with other TST baselines. The code is at…
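The replacement loop described in the abstract can be sketched as follows. This is a minimal illustration of the control flow only, not the authors' implementation: the function names, the argument layout, and the lookup table standing in for the masked language model are all hypothetical, and a real span generator would condition on the generated prefix and the remaining source tokens.

```python
from typing import Callable, Dict, List, Sequence

# Hypothetical stand-in for the non-autoregressive masked language model:
# maps a source token to a target-style span (toy negative -> positive table).
TOY_SPAN_TABLE: Dict[str, List[str]] = {
    "terrible": ["really", "great"],
    "slow": ["fast"],
}

SpanGenerator = Callable[[Sequence[str], str, Sequence[str]], List[str]]


def toy_span_generator(prefix: Sequence[str], token: str,
                       rest: Sequence[str]) -> List[str]:
    """Return a replacement span for `token`.

    Assumption: a real model would use `prefix` (already generated output)
    and `rest` (unprocessed source tokens); this toy version ignores both.
    """
    return list(TOY_SPAN_TABLE.get(token, [token]))


def replace_autoregressively(source: Sequence[str],
                             generate_span: SpanGenerator) -> List[str]:
    """Walk the source left to right, replacing each token with a span."""
    output: List[str] = []
    for i, token in enumerate(source):
        # Each step sees the output so far, which makes the loop autoregressive.
        span = generate_span(output, token, source[i + 1:])
        output.extend(span)
    return output


source = "the service was terrible and slow".split()
result = replace_autoregressively(source, toy_span_generator)
print(" ".join(result))  # the service was really great and fast
```

Because a span may contain more than one token, the output length can differ from the source length, while each replacement stays tied to a single source token.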

Code & Models

Repositories

linear95/rlm (PyTorch, official)

Taxonomy

Topics: Natural Language Processing Techniques · Speech Recognition and Synthesis · Topic Modeling