Recursive Decoding: A Situated Cognition Approach to Compositional   Generation in Grounded Language Understanding

Matthew Setzler; Scott Howland; Lauren Phillips

arXiv:2201.11766·cs.CL·February 21, 2022

Recursive Decoding: A Situated Cognition Approach to Compositional Generation in Grounded Language Understanding

Matthew Setzler, Scott Howland, Lauren Phillips

PDF

Open Access

TL;DR

This paper introduces Recursive Decoding, a novel training and inference method for seq2seq models that enhances their ability to generate novel output combinations in grounded language understanding tasks, addressing a key generalization challenge.

Contribution

The paper proposes Recursive Decoding, a new approach that decomposes complex sequence generation into incremental steps, significantly improving decode-side generalization in grounded language tasks.

Findings

01

Recursive Decoding dramatically improves performance on gSCAN generalization tasks.

02

The method outperforms baseline models in generating novel output combinations.

03

Analyses suggest Recursive Decoding enhances model robustness and generalization capabilities.

Abstract

Compositional generalization is a troubling blind spot for neural language models. Recent efforts have presented techniques for improving a model's ability to encode novel combinations of known inputs, but less work has focused on generating novel combinations of known outputs. Here we focus on this latter "decode-side" form of generalization in the context of gSCAN, a synthetic benchmark for compositional generalization in grounded language understanding. We present Recursive Decoding (RD), a novel procedure for training and using seq2seq models, targeted towards decode-side generalization. Rather than generating an entire output sequence in one pass, models are trained to predict one token at a time. Inputs (i.e., the external gSCAN environment) are then incrementally updated based on predicted tokens, and re-encoded for the next decoder time step. RD thus decomposes a complex,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory · Sequence to Sequence