For GPT-4 as with Humans: Information Structure Predicts Acceptability of Long-Distance Dependencies

Nicole Cuneo; Eleanor Graves; Supantho Rakshit; Adele E. Goldberg

arXiv:2505.09005·cs.CL·May 15, 2025

For GPT-4 as with Humans: Information Structure Predicts Acceptability of Long-Distance Dependencies

Nicole Cuneo, Eleanor Graves, Supantho Rakshit, Adele E. Goldberg

PDF

Open Access

TL;DR

This study shows that GPT-4 can understand and predict the acceptability of long-distance dependencies in English based on information structure, mirroring human judgments and revealing a causal relationship.

Contribution

It demonstrates GPT-4's ability to replicate human-like metalinguistic understanding of information structure and long-distance dependencies in language.

Findings

01

GPT-4 reliably predicts acceptability ratings of LDDs

02

Manipulating information structure causally affects acceptability ratings

03

GPT-4's judgments align with human linguistic intuitions

Abstract

It remains debated how well any LM understands natural language or generates reliable metalinguistic judgments. Moreover, relatively little work has demonstrated that LMs can represent and respect subtle relationships between form and function proposed by linguists. We here focus on a particular such relationship established in recent work: English speakers' judgments about the information structure of canonical sentences predicts independently collected acceptability ratings on corresponding 'long distance dependency' [LDD] constructions, across a wide array of base constructions and multiple types of LDDs. To determine whether any LM captures this relationship, we probe GPT-4 on the same tasks used with humans and new extensions.Results reveal reliable metalinguistic skill on the information structure and acceptability tasks, replicating a striking interaction between the two, despite…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Computability, Logic, AI Algorithms · Machine Learning in Healthcare

MethodsAttention Is All You Need · Label Smoothing · Adam · Linear Layer · Dropout · Residual Connection · Multi-Head Attention · Dense Connections · Layer Normalization · Byte Pair Encoding