Compositional Generalization in Grounded Language Learning via Induced Model Sparsity
Sam Spilsbury, Alexander Ilin

TL;DR
This paper explores how inducing model sparsity in neural networks can improve compositional generalization and sample efficiency in grounded language learning, specifically in navigation tasks within grid worlds.
Contribution
It introduces a goal identification module that promotes sparse word-attribute correlations, enhancing generalization to novel property combinations.
Findings
Agent maintains high performance with few demonstrations
Correct word-attribute correspondences are learned internally
Sparse correlations improve compositional generalization
Abstract
We provide a study of how induced model sparsity can help achieve compositional generalization and better sample efficiency in grounded language learning problems. We consider simple language-conditioned navigation problems in a grid world environment with disentangled observations. We show that standard neural architectures do not always yield compositional generalization. To address this, we design an agent that contains a goal identification module that encourages sparse correlations between words in the instruction and attributes of objects, composing them together to find the goal. The output of the goal identification module is the input to a value iteration network planner. Our agent maintains a high level of performance on goals containing novel combinations of properties even when learning from a handful of demonstrations. We examine the internal representations of our agent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
