Prefix-Tuning Based Unsupervised Text Style Transfer
Huiyu Mai, Wenhao Jiang, Zhihong Deng

TL;DR
This paper introduces a prefix-tuning approach using large language models for unsupervised text style transfer, effectively encoding style and content to improve transfer quality without parallel data.
Contribution
It proposes a novel prefix-tuning method with shared, style, and content prefixes, enhancing information encoding for style transfer using large language models.
Findings
Outperforms state-of-the-art baselines on standard datasets
Provides richer task-specific information through designed prefixes
Recursive language model usage improves style transfer performance
Abstract
Unsupervised text style transfer aims at training a generative model that can alter the style of the input sentence while preserving its content without using any parallel data. In this paper, we employ powerful pre-trained large language models and present a new prefix-tuning-based method for unsupervised text style transfer. We construct three different kinds of prefixes, i.e., \textit{shared prefix, style prefix}, and \textit{content prefix}, to encode task-specific information, target style, and the content information of the input sentence, respectively. Compared to embeddings used by previous works, the proposed prefixes can provide richer information for the model. Furthermore, we adopt a recursive way of using language models in the process of style transfer. This strategy provides a more effective way for the interactions between the input sentence and GPT-2, helps the model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis
MethodsMulti-Head Attention · Attention Is All You Need · Attention Dropout · Softmax · Dense Connections · Cosine Annealing · Adam · Residual Connection · Byte Pair Encoding · Dropout
