Unsupervised Text Embedding Space Generation Using Generative Adversarial Networks for Text Synthesis
Jun-Min Lee, Tae-Bin Ha

TL;DR
This paper introduces TESGAN, a novel GAN-based framework that generates continuous text embeddings for unsupervised text synthesis, overcoming traditional challenges in natural language generation and data memorization.
Contribution
TESGAN is the first to generate continuous text embeddings with GANs in an unsupervised manner, avoiding data memorization and enabling new text synthesis approaches.
Findings
Successfully synthesizes new sentences using continuous embeddings.
Overcomes gradient backpropagation issues in text GANs.
Operates without directly referencing training text, reducing memorization.
Abstract
Generative Adversarial Networks (GAN) is a model for data synthesis, which creates plausible data through the competition of generator and discriminator. Although GAN application to image synthesis is extensively studied, it has inherent limitations to natural language generation. Because natural language is composed of discrete tokens, a generator has difficulty updating its gradient through backpropagation; therefore, most text-GAN studies generate sentences starting with a random token based on a reward system. Thus, the generators of previous studies are pre-trained in an autoregressive way before adversarial training, causing data memorization that synthesized sentences reproduce the training data. In this paper, we synthesize sentences using a framework similar to the original GAN. More specifically, we propose Text Embedding Space Generative Adversarial Networks (TESGAN) which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications
