Towards Robust and Semantically Organised Latent Representations for Unsupervised Text Style Transfer
Sharan Narasimhan, Suvodip Dey, Maunendra Sankar Desarkar

TL;DR
This paper introduces EPAAEs, a novel auto-encoder model with adjustable noise in the embedding space, improving latent space organization and style transfer performance, especially on complex NLI datasets.
Contribution
EPAAEs add a fine-tunable noise component to auto-encoders, enhancing style transfer quality and enabling application to complex NLI datasets, a novel extension in the field.
Findings
EPAAEs produce better-organized latent spaces for style clustering.
EPAAEs outperform similar baselines on diverse style transfer tasks.
EPAAEs effectively handle complex style definitions in NLI datasets.
Abstract
Recent studies show that auto-encoder based approaches successfully perform language generation, smooth sentence interpolation, and style transfer over unseen attributes using unlabelled datasets in a zero-shot manner. The latent space geometry of such models is organised well enough to perform on datasets where the style is "coarse-grained" i.e. a small fraction of words alone in a sentence are enough to determine the overall style label. A recent study uses a discrete token-based perturbation approach to map "similar" sentences ("similar" defined by low Levenshtein distance/ high word overlap) close by in latent space. This definition of "similarity" does not look into the underlying nuances of the constituent words while mapping latent space neighbourhoods and therefore fails to recognise sentences with different style-based semantics while mapping latent neighbourhoods. We introduce…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Topic Modeling · Natural Language Processing Techniques
