Compositional Generalization in Grounded Language Learning via Induced   Model Sparsity

Sam Spilsbury; Alexander Ilin

arXiv:2207.02518·cs.CL·July 7, 2022·1 cites

Compositional Generalization in Grounded Language Learning via Induced Model Sparsity

Sam Spilsbury, Alexander Ilin

PDF

Open Access 1 Repo

TL;DR

This paper explores how inducing model sparsity in neural networks can improve compositional generalization and sample efficiency in grounded language learning, specifically in navigation tasks within grid worlds.

Contribution

It introduces a goal identification module that promotes sparse word-attribute correlations, enhancing generalization to novel property combinations.

Findings

01

Agent maintains high performance with few demonstrations

02

Correct word-attribute correspondences are learned internally

03

Sparse correlations improve compositional generalization

Abstract

We provide a study of how induced model sparsity can help achieve compositional generalization and better sample efficiency in grounded language learning problems. We consider simple language-conditioned navigation problems in a grid world environment with disentangled observations. We show that standard neural architectures do not always yield compositional generalization. To address this, we design an agent that contains a goal identification module that encourages sparse correlations between words in the instruction and attributes of objects, composing them together to find the goal. The output of the goal identification module is the input to a value iteration network planner. Our agent maintains a high level of performance on goals containing novel combinations of properties even when learning from a handful of demonstrations. We examine the internal representations of our agent…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

aalto-ai/sparse-compgen
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications