vec2text with Round-Trip Translations

Geoffrey Cideron; Sertan Girgin; Anton Raichuk; Olivier Pietquin,; Olivier Bachem; L\'eonard Hussenot

arXiv:2209.06792·cs.CL·September 15, 2022·1 cites

vec2text with Round-Trip Translations

Geoffrey Cideron, Sertan Girgin, Anton Raichuk, Olivier Pietquin,, Olivier Bachem, L\'eonard Hussenot

PDF

Open Access

TL;DR

This paper introduces a universal vec2text model capable of generating diverse, fluent, and semantically structured natural language from a controlled vector space, using a novel training method involving round-trip translations.

Contribution

It proposes a new vec2text model with four key properties, implemented with a bottleneck transformer trained on large-scale data, and demonstrates the effectiveness of round-trip translation data augmentation.

Findings

01

The model achieves high universality, diversity, fluency, and semantic structure.

02

Round-trip translation augmentation significantly improves model properties.

03

The model outperforms standard auto-encoders in experiments.

Abstract

We investigate models that can generate arbitrary natural language text (e.g. all English sentences) from a bounded, convex and well-behaved control space. We call them universal vec2text models. Such models would allow making semantic decisions in the vector space (e.g. via reinforcement learning) while the natural language generation is handled by the vec2text model. We propose four desired properties: universality, diversity, fluency, and semantic structure, that such vec2text models should possess and we provide quantitative and qualitative methods to assess them. We implement a vec2text model by adding a bottleneck to a 250M parameters Transformer model and training it with an auto-encoding objective on 400M sentences (10B tokens) extracted from a massive web corpus. We propose a simple data augmentation technique based on round-trip translations and show in extensive experiments…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Position-Wise Feed-Forward Layer · Byte Pair Encoding · Adam · Softmax · Dropout · Dense Connections · Residual Connection