Do Language Models Learn Position-Role Mappings?

Jackson Petty; Michael Wilson; Robert Frank

arXiv:2202.03611·cs.CL·February 9, 2022

Do Language Models Learn Position-Role Mappings?

Jackson Petty, Michael Wilson, Robert Frank

PDF

Open Access

TL;DR

This study investigates whether pretrained language models understand position-role mappings in syntax, demonstrating they recognize role distinctions and generalize this knowledge across different structures, with some limitations on novel verbs.

Contribution

It provides evidence that models like BERT and RoBERTa learn shared position-role mappings and can generalize this knowledge across paradigms, revealing insights into their syntactic understanding.

Findings

01

Models recognize theme and recipient roles in ditransitive constructions.

02

Fine-tuning enables models to transfer role knowledge across paradigms.

03

Performance drops with novel ditransitive verbs, indicating lexical specificity.

Abstract

How is knowledge of position-role mappings in natural language learned? We explore this question in a computational setting, testing whether a variety of well-performing pertained language models (BERT, RoBERTa, and DistilBERT) exhibit knowledge of these mappings, and whether this knowledge persists across alternations in syntactic, structural, and lexical alternations. In Experiment 1, we show that these neural models do indeed recognize distinctions between theme and recipient roles in ditransitive constructions, and that these distinct patterns are shared across construction type. We strengthen this finding in Experiment 2 by showing that fine-tuning these language models on novel theme- and recipient-like tokens in one paradigm allows the models to make correct predictions about their placement in other paradigms, suggesting that the knowledge of these mappings is shared rather than…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Language and cultural evolution

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Linear Layer · Dense Connections · Adam · Attention Dropout · Linear Warmup With Linear Decay · Layer Normalization · WordPiece