SUBS: Subtree Substitution for Compositional Semantic Parsing

Jingfeng Yang; Le Zhang; Diyi Yang

arXiv:2205.01538·cs.CL·May 4, 2022·1 cites

SUBS: Subtree Substitution for Compositional Semantic Parsing

Jingfeng Yang, Le Zhang, Diyi Yang

PDF

Open Access 1 Repo

TL;DR

This paper introduces a subtree substitution data augmentation method for semantic parsing, significantly improving compositional generalization performance on benchmarks like SCAN and GeoQuery.

Contribution

It proposes a novel subtree substitution approach for data augmentation that enhances compositional generalization in semantic parsing models.

Findings

01

Achieved state-of-the-art results on GeoQuery compositional split.

02

Significantly improved performance on SCAN dataset.

03

Demonstrated effectiveness of subtree-based augmentation over rule-based methods.

Abstract

Although sequence-to-sequence models often achieve good performance in semantic parsing for i.i.d. data, their performance is still inferior in compositional generalization. Several data augmentation methods have been proposed to alleviate this problem. However, prior work only leveraged superficial grammar or rules for data augmentation, which resulted in limited improvement. We propose to use subtree substitution for compositional data augmentation, where we consider subtrees with similar semantic functions as exchangeable. Our experiments showed that such augmented data led to significantly better performance on SCAN and GeoQuery, and reached new SOTA on compositional split of GeoQuery.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

gt-salt/subs
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Genomics and Phylogenetic Studies