VOLTA: Improving Generative Diversity by Variational Mutual Information   Maximizing Autoencoder

Yueen Ma; Dafeng Chi; Jingjing Li; Kai Song; Yuzheng Zhuang; Irwin; King

arXiv:2307.00852·cs.CL·March 20, 2024

VOLTA: Improving Generative Diversity by Variational Mutual Information Maximizing Autoencoder

Yueen Ma, Dafeng Chi, Jingjing Li, Kai Song, Yuzheng Zhuang, Irwin, King

PDF

Open Access 1 Video

TL;DR

VOLTA enhances diversity in Transformer-based language generation by integrating a novel VAE connection and input-independent latent codes, achieving higher diversity without sacrificing quality across multiple datasets.

Contribution

Introduces VOLTA, a new framework combining Transformer and VAE with cross-attention and InfoGAN-style codes to improve generative diversity in NLG tasks.

Findings

01

Significantly improves diversity across six datasets.

02

Maintains high generative quality.

03

Supports both continuous and discrete inputs.

Abstract

The natural language generation domain has witnessed great success thanks to Transformer models. Although they have achieved state-of-the-art generative quality, they often neglect generative diversity. Prior attempts to tackle this issue suffer from either low model capacity or over-complicated architectures. Some recent methods employ the VAE framework to enhance diversity, but their latent variables fully depend on the input context, restricting exploration of the latent space. In this paper, we introduce VOLTA, a framework that elevates generative diversity by bridging Transformer with VAE via a more effective cross-attention-based connection, departing from conventional embedding concatenation or summation. Additionally, we propose integrating InfoGAN-style latent codes to enable input-independent variability, further diversifying the generation. Moreover, our framework…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

VOLTA: Improving Generative Diversity by Variational Mutual Information Maximizing Autoencoder· underline

Taxonomy

TopicsTopic Modeling · Expert finding and Q&A systems · Natural Language Processing Techniques

MethodsAttention Is All You Need · Linear Layer · Absolute Position Encodings · Softmax · Layer Normalization · Multi-Head Attention · Dropout · Residual Connection · Position-Wise Feed-Forward Layer · Byte Pair Encoding