Recursive Decoding: A Situated Cognition Approach to Compositional Generation in Grounded Language Understanding
Matthew Setzler, Scott Howland, Lauren Phillips

TL;DR
This paper introduces Recursive Decoding, a novel training and inference method for seq2seq models that enhances their ability to generate novel output combinations in grounded language understanding tasks, addressing a key generalization challenge.
Contribution
The paper proposes Recursive Decoding, a new approach that decomposes complex sequence generation into incremental steps, significantly improving decode-side generalization in grounded language tasks.
Findings
Recursive Decoding dramatically improves performance on gSCAN generalization tasks.
The method outperforms baseline models in generating novel output combinations.
Analyses suggest Recursive Decoding enhances model robustness and generalization capabilities.
Abstract
Compositional generalization is a troubling blind spot for neural language models. Recent efforts have presented techniques for improving a model's ability to encode novel combinations of known inputs, but less work has focused on generating novel combinations of known outputs. Here we focus on this latter "decode-side" form of generalization in the context of gSCAN, a synthetic benchmark for compositional generalization in grounded language understanding. We present Recursive Decoding (RD), a novel procedure for training and using seq2seq models, targeted towards decode-side generalization. Rather than generating an entire output sequence in one pass, models are trained to predict one token at a time. Inputs (i.e., the external gSCAN environment) are then incrementally updated based on predicted tokens, and re-encoded for the next decoder time step. RD thus decomposes a complex,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory · Sequence to Sequence
