Prefix-Tuning Based Unsupervised Text Style Transfer

Huiyu Mai; Wenhao Jiang; Zhihong Deng

arXiv:2310.14599·cs.CL·October 24, 2023·1 cites

Prefix-Tuning Based Unsupervised Text Style Transfer

Huiyu Mai, Wenhao Jiang, Zhihong Deng

PDF

Open Access

TL;DR

This paper introduces a prefix-tuning approach using large language models for unsupervised text style transfer, effectively encoding style and content to improve transfer quality without parallel data.

Contribution

It proposes a novel prefix-tuning method with shared, style, and content prefixes, enhancing information encoding for style transfer using large language models.

Findings

01

Outperforms state-of-the-art baselines on standard datasets

02

Provides richer task-specific information through designed prefixes

03

Recursive language model usage improves style transfer performance

Abstract

Unsupervised text style transfer aims at training a generative model that can alter the style of the input sentence while preserving its content without using any parallel data. In this paper, we employ powerful pre-trained large language models and present a new prefix-tuning-based method for unsupervised text style transfer. We construct three different kinds of prefixes, i.e., \textit{shared prefix, style prefix}, and \textit{content prefix}, to encode task-specific information, target style, and the content information of the input sentence, respectively. Compared to embeddings used by previous works, the proposed prefixes can provide richer information for the model. Furthermore, we adopt a recursive way of using language models in the process of style transfer. This strategy provides a more effective way for the interactions between the input sentence and GPT-2, helps the model…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis

MethodsMulti-Head Attention · Attention Is All You Need · Attention Dropout · Softmax · Dense Connections · Cosine Annealing · Adam · Residual Connection · Byte Pair Encoding · Dropout