For GPT-4 as with Humans: Information Structure Predicts Acceptability of Long-Distance Dependencies
Nicole Cuneo, Eleanor Graves, Supantho Rakshit, Adele E. Goldberg

TL;DR
This study shows that GPT-4 can understand and predict the acceptability of long-distance dependencies in English based on information structure, mirroring human judgments and revealing a causal relationship.
Contribution
It demonstrates GPT-4's ability to replicate human-like metalinguistic understanding of information structure and long-distance dependencies in language.
Findings
GPT-4 reliably predicts acceptability ratings of LDDs
Manipulating information structure causally affects acceptability ratings
GPT-4's judgments align with human linguistic intuitions
Abstract
It remains debated how well any LM understands natural language or generates reliable metalinguistic judgments. Moreover, relatively little work has demonstrated that LMs can represent and respect subtle relationships between form and function proposed by linguists. We here focus on a particular such relationship established in recent work: English speakers' judgments about the information structure of canonical sentences predicts independently collected acceptability ratings on corresponding 'long distance dependency' [LDD] constructions, across a wide array of base constructions and multiple types of LDDs. To determine whether any LM captures this relationship, we probe GPT-4 on the same tasks used with humans and new extensions.Results reveal reliable metalinguistic skill on the information structure and acceptability tasks, replicating a striking interaction between the two, despite…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Computability, Logic, AI Algorithms · Machine Learning in Healthcare
MethodsAttention Is All You Need · Label Smoothing · Adam · Linear Layer · Dropout · Residual Connection · Multi-Head Attention · Dense Connections · Layer Normalization · Byte Pair Encoding
