Multilingual Text Style Transfer: Datasets & Models for Indian Languages
Sourabrata Mukherjee, Atul Kr. Ojha, Akanksha Bansal, Deepak Alok,, John P. McCrae, Ond\v{r}ej Du\v{s}ek

TL;DR
This paper introduces new datasets and evaluates models for sentiment-based text style transfer across eight Indian languages, highlighting the importance of parallel data and multilingual approaches.
Contribution
It provides the first comprehensive study of sentiment transfer in multiple Indian languages, including datasets and benchmark evaluations of various models.
Findings
Parallel data significantly improves TST performance
Masked Style Filling (MSF) is effective for non-parallel TST
Cross-lingual and multilingual models show promising results
Abstract
Text style transfer (TST) involves altering the linguistic style of a text while preserving its core content. This paper focuses on sentiment transfer, a popular TST subtask, across a spectrum of Indian languages: Hindi, Magahi, Malayalam, Marathi, Punjabi, Odia, Telugu, and Urdu, expanding upon previous work on English-Bangla sentiment transfer (Mukherjee et al., 2023). We introduce dedicated datasets of 1,000 positive and 1,000 negative style-parallel sentences for each of these eight languages. We then evaluate the performance of various benchmark models categorized into parallel, non-parallel, cross-lingual, and shared learning approaches, including the Llama2 and GPT-3.5 large language models (LLMs). Our experiments highlight the significance of parallel data in TST and demonstrate the effectiveness of the Masked Style Filling (MSF) approach (Mukherjee et al., 2023) in non-parallel…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Speech Recognition and Synthesis
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · 15 Ways to Contact How can i speak to someone at Delta Airlines · Attention Is All You Need · Sparse Evolutionary Training · Adam · Dropout · Dense Connections · Softmax · {Dispute@FaQ-s}How to file a dispute with Expedia? · Layer Normalization
