Sequence-to-Sequence Networks Learn the Meaning of Reflexive Anaphora

Robert Frank; Jackson Petty

arXiv:2011.00682·cs.CL·November 3, 2020·1 cites

Sequence-to-Sequence Networks Learn the Meaning of Reflexive Anaphora

Robert Frank, Jackson Petty

PDF

Open Access 1 Repo

TL;DR

This paper demonstrates that sequence-to-sequence recurrent networks can learn and generalize the meaning of reflexive anaphora in context, challenging previous doubts about their semantic capabilities.

Contribution

It shows that such networks can acquire semantic interpretations for reflexive anaphora and generalize to new antecedents, influenced by attention mechanisms and training data diversity.

Findings

01

Networks can generalize reflexive meanings to novel antecedents.

02

Attention mechanisms affect the learning process.

03

Training data diversity influences generalization success.

Abstract

Reflexive anaphora present a challenge for semantic interpretation: their meaning varies depending on context in a way that appears to require abstract variables. Past work has raised doubts about the ability of recurrent networks to meet this challenge. In this paper, we explore this question in the context of a fragment of English that incorporates the relevant sort of contextual variability. We consider sequence-to-sequence architectures with recurrent units and show that such networks are capable of learning semantic interpretations for reflexive anaphora which generalize to novel antecedents. We explore the effect of attention mechanisms and different recurrent unit types on the type of training data that is needed for success as measured in two ways: how much lexical support is needed to induce an abstract reflexive meaning (i.e., how many distinct reflexive antecedents must occur…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

clay-lab/transductions
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems