Meta-Learning to Compositionally Generalize

Henry Conklin; Bailin Wang; Kenny Smith; Ivan Titov

arXiv:2106.04252·cs.CL·June 30, 2021

Meta-Learning to Compositionally Generalize

Henry Conklin, Bailin Wang, Kenny Smith, Ivan Titov

PDF

1 Repo

TL;DR

This paper introduces a meta-learning approach that enhances neural networks' ability to generalize compositionally in language tasks by optimizing for out-of-distribution performance, demonstrated on COGS and SCAN datasets.

Contribution

It proposes a novel similarity-driven meta-learning method that improves compositional generalization in neural networks beyond traditional supervised learning.

Findings

01

Improved generalization on COGS dataset

02

Enhanced performance on SCAN dataset

03

Meta-learning reduces memorization in models

Abstract

Natural language is compositional; the meaning of a sentence is a function of the meaning of its parts. This property allows humans to create and interpret novel sentences, generalizing robustly outside their prior experience. Neural networks have been shown to struggle with this kind of generalization, in particular performing poorly on tasks designed to assess compositional generalization (i.e. where training and testing distributions differ in ways that would be trivial for a compositional strategy to resolve). Their poor performance on these tasks may in part be due to the nature of supervised learning which assumes training and testing data to be drawn from the same distribution. We implement a meta-learning augmented version of supervised learning whose objective directly optimizes for out-of-distribution generalization. We construct pairs of tasks for meta-learning by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

berlino/tensor2struct-public
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.