Span-based Semantic Parsing for Compositional Generalization

Jonathan Herzig; Jonathan Berant

arXiv:2009.06040·cs.CL·June 15, 2021

Span-based Semantic Parsing for Compositional Generalization

Jonathan Herzig, Jonathan Berant

PDF

1 Repo

TL;DR

This paper introduces SpanBasedSP, a span-based semantic parser that improves compositional generalization over seq2seq models by explicitly modeling partial program compositions, showing significant gains on challenging datasets.

Contribution

The paper proposes SpanBasedSP, a span-based parser that predicts span trees and enhances compositional generalization in semantic parsing tasks.

Findings

01

Significant accuracy improvement on compositional splits (from 61.0% to 88.9%).

02

Performs comparably to seq2seq models on standard splits.

03

Effectively models non-projective trees with extended CKY.

Abstract

Despite the success of sequence-to-sequence (seq2seq) models in semantic parsing, recent work has shown that they fail in compositional generalization, i.e., the ability to generalize to new structures built of components observed during training. In this work, we posit that a span-based parser should lead to better compositional generalization. we propose SpanBasedSP, a parser that predicts a span tree over an input utterance, explicitly encoding how partial programs compose over spans in the input. SpanBasedSP extends Pasupat et al. (2019) to be comparable to seq2seq models by (i) training from programs, without access to gold trees, treating trees as latent variables, (ii) parsing a class of non-projective trees through an extension to standard CKY. On GeoQuery, SCAN and CLOSURE datasets, SpanBasedSP performs similarly to strong seq2seq baselines on random splits, but dramatically…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jonathanherzig/span-based-sp
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory · Sequence to Sequence