Language coverage and generalization in RNN-based continuous sentence   embeddings for interacting agents

Luca Celotti; Simon Brodeur; Jean Rouat

arXiv:1911.02002·cs.CL·November 6, 2019

Language coverage and generalization in RNN-based continuous sentence embeddings for interacting agents

Luca Celotti, Simon Brodeur, Jean Rouat

PDF

Open Access

TL;DR

This paper evaluates the language coverage and generalization of RNN-based sentence embeddings using a CFG-generated dataset, revealing their limitations, and introduces a new arithmetic coding-based embedding method, AriEL, as a baseline.

Contribution

It provides a systematic evaluation of RNN embeddings' ability to cover language space and proposes AriEL, a non-data-driven embedding method based on arithmetic coding.

Findings

01

RNN embeddings underfit training data and cover limited language space.

02

They fail to learn the underlying CFG and generalize poorly to unbiased sentences.

03

AriEL offers an effective baseline for sentence encoding.

Abstract

Continuous sentence embeddings using recurrent neural networks (RNNs), where variable-length sentences are encoded into fixed-dimensional vectors, are often the main building blocks of architectures applied to language tasks such as dialogue generation. While it is known that those embeddings are able to learn some structures of language (e.g. grammar) in a purely data-driven manner, there is very little work on the objective evaluation of their ability to cover the whole language space and to generalize to sentences outside the language bias of the training data. Using a manually designed context-free grammar (CFG) to generate a large-scale dataset of sentences related to the content of realistic 3D indoor scenes, we evaluate the language coverage and generalization abilities of the most common continuous sentence embeddings based on RNNs. We also propose a new embedding method based…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Topic Modeling · Human Pose and Action Recognition