Multichannel Generative Language Model: Learning All Possible   Factorizations Within and Across Channels

Harris Chan; Jamie Kiros; William Chan

arXiv:2010.04438·cs.CL·October 12, 2020

Multichannel Generative Language Model: Learning All Possible Factorizations Within and Across Channels

Harris Chan, Jamie Kiros, William Chan

PDF

Open Access

TL;DR

The paper introduces MGLM, a flexible multichannel generative model that learns all possible factorizations across multiple languages, enabling diverse inference tasks and outperforming traditional models on multilingual data.

Contribution

MGLM is the first model to jointly learn and marginalize over all factorizations within and across multiple language channels.

Findings

01

MGLM successfully performs unconditional, conditional, and partially conditional generation.

02

It outperforms traditional bilingual discriminative models in quality-diversity trade-offs.

03

Qualitative samples demonstrate the model's ability to generate diverse multilingual outputs.

Abstract

A channel corresponds to a viewpoint or transformation of an underlying meaning. A pair of parallel sentences in English and French express the same underlying meaning, but through two separate channels corresponding to their languages. In this work, we present the Multichannel Generative Language Model (MGLM). MGLM is a generative joint distribution model over channels. MGLM marginalizes over all possible factorizations within and across all channels. MGLM endows flexible inference, including unconditional generation, conditional generation (where 1 channel is observed and other channels are generated), and partially observed generation (where incomplete observations are spread across all the channels). We experiment with the Multi30K dataset containing English, French, Czech, and German. We demonstrate experiments with unconditional, conditional, and partially conditional generation.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech Recognition and Synthesis