Compositional generalization through meta sequence-to-sequence learning

Brenden M. Lake

arXiv:1906.05381 · cs.CL · October 10, 2019 · 44 cites

TL;DR

This paper introduces a meta sequence-to-sequence learning approach with memory-augmented neural networks that enhances compositional generalization, enabling models to understand and apply new concepts compositionally, similar to human learning.

Contribution

It presents a novel meta-learning framework for seq2seq models with memory augmentation that improves compositional generalization capabilities beyond traditional neural networks.

Findings

01 Meta seq2seq learning solves key SCAN compositionality tests.

02 Models can learn to apply implicit rules to variables.

03 Approach outperforms standard seq2seq models on compositional tasks.

Abstract

People can learn a new concept and use it compositionally, understanding how to "blicket twice" after learning how to "blicket." In contrast, powerful sequence-to-sequence (seq2seq) neural networks fail such tests of compositionality, especially when composing new concepts together with existing concepts. In this paper, I show how memory-augmented neural networks can be trained to generalize compositionally through meta seq2seq learning. In this approach, models train on a series of seq2seq problems to acquire the compositional skills needed to solve new seq2seq problems. Meta seq2seq learning solves several of the SCAN tests for compositional learning and can learn to apply implicit rules to variables.
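
The phrase "a series of seq2seq problems" can be made concrete with a small sketch of how training episodes might be generated. This is an illustration under stated assumptions, not the paper's implementation: the vocabulary, the `REPEATS` table, and the `sample_episode` helper are all hypothetical, and the idea of reshuffling word meanings per episode is assumed here for illustration. Each episode assigns fresh meanings to the primitive words, shows them in isolation (support set), and asks for compositions with known modifiers (query set), mirroring the "blicket" to "blicket twice" example.

```python
import random

# Hypothetical toy vocabulary, loosely in the spirit of SCAN-style commands.
PRIMITIVES = ["jump", "run", "walk", "look"]
ACTIONS = ["JUMP", "RUN", "WALK", "LOOK"]
REPEATS = {"twice": 2, "thrice": 3}  # modifiers keep a fixed meaning


def sample_episode(rng, n_query=4):
    """Build one seq2seq problem with a fresh word-to-action assignment.

    Assumption: meanings of primitives are reshuffled per episode, so a
    learner cannot memorize them and must read them off the support set.
    """
    shuffled = ACTIONS[:]
    rng.shuffle(shuffled)
    mapping = dict(zip(PRIMITIVES, shuffled))

    # Support set: every primitive demonstrated on its own.
    support = [(word, [mapping[word]]) for word in PRIMITIVES]

    # Query set: primitives composed with modifiers, never shown directly.
    candidates = [(w, m) for w in PRIMITIVES for m in REPEATS]
    query = []
    for word, modifier in rng.sample(candidates, n_query):
        target = [mapping[word]] * REPEATS[modifier]
        query.append((f"{word} {modifier}", target))
    return support, query


if __name__ == "__main__":
    support, query = sample_episode(random.Random(0))
    for command, target in support + query:
        print(command, "->", " ".join(target))
```

Because the word meanings change from episode to episode, the only consistent strategy across episodes is to combine the support-set meanings with the stable modifier rules, which is the compositional skill the abstract describes.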

Code & Models

Repositories

brendenlake/meta_seq2seq
PyTorch · Official

Datasets

Kylan12/Synthetic-AI-ML-Dataset
dataset · 42 dl

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications

Methods: Sigmoid Activation · Tanh Activation · Long Short-Term Memory · Sequence to Sequence