Language coverage and generalization in RNN-based continuous sentence embeddings for interacting agents
Luca Celotti, Simon Brodeur, Jean Rouat

TL;DR
This paper evaluates the language coverage and generalization of RNN-based sentence embeddings using a CFG-generated dataset, revealing their limitations, and introduces a new arithmetic coding-based embedding method, AriEL, as a baseline.
Contribution
It provides a systematic evaluation of RNN embeddings' ability to cover language space and proposes AriEL, a non-data-driven embedding method based on arithmetic coding.
Findings
RNN embeddings underfit training data and cover limited language space.
They fail to learn the underlying CFG and generalize poorly to unbiased sentences.
AriEL offers an effective baseline for sentence encoding.
Abstract
Continuous sentence embeddings using recurrent neural networks (RNNs), where variable-length sentences are encoded into fixed-dimensional vectors, are often the main building blocks of architectures applied to language tasks such as dialogue generation. While it is known that those embeddings are able to learn some structures of language (e.g. grammar) in a purely data-driven manner, there is very little work on the objective evaluation of their ability to cover the whole language space and to generalize to sentences outside the language bias of the training data. Using a manually designed context-free grammar (CFG) to generate a large-scale dataset of sentences related to the content of realistic 3D indoor scenes, we evaluate the language coverage and generalization abilities of the most common continuous sentence embeddings based on RNNs. We also propose a new embedding method based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Topic Modeling · Human Pose and Action Recognition
