SyGNS: A Systematic Generalization Testbed Based on Natural Language   Semantics

Hitomi Yanaka; Koji Mineshima; Kentaro Inui

arXiv:2106.01077·cs.CL·June 3, 2021·1 cites

SyGNS: A Systematic Generalization Testbed Based on Natural Language Semantics

Hitomi Yanaka, Koji Mineshima, Kentaro Inui

PDF

Open Access 1 Repo

TL;DR

This paper introduces SyGNS, a testbed for evaluating neural networks' ability to understand compositional semantics in natural language, revealing their limited generalization to novel logical combinations.

Contribution

The paper presents SyGNS, a new systematic testbed for assessing neural models' semantic compositionality and generalization in natural language understanding.

Findings

01

Transformers and GRUs can generalize to some unseen logical combinations.

02

Simpler meaning representations lead to better generalization.

03

Neural models struggle with systematic generalization to complex semantic structures.

Abstract

Recently, deep neural networks (DNNs) have achieved great success in semantically challenging NLP tasks, yet it remains unclear whether DNN models can capture compositional meanings, those aspects of meaning that have been long studied in formal semantics. To investigate this issue, we propose a Systematic Generalization testbed based on Natural language Semantics (SyGNS), whose challenge is to map natural language sentences to multiple forms of scoped meaning representations, designed to account for various semantic phenomena. Using SyGNS, we test whether neural networks can systematically parse sentences involving novel combinations of logical expressions such as quantifiers and negation. Experiments show that Transformer and GRU models can generalize to unseen combinations of quantifiers, negations, and modifiers that are similar to given training instances in form, but not to the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

verypluming/SyGNS
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Byte Pair Encoding · Adam · Label Smoothing · Layer Normalization · Residual Connection