Causal Interventions Reveal Shared Structure Across English Filler-Gap Constructions

Sasha Boguraev; Christopher Potts; Kyle Mahowald

arXiv:2505.16002 · cs.CL · October 1, 2025

TL;DR

This paper applies causal interpretability methods to language models and finds that they encode shared structure across English filler-gap constructions (e.g., questions, relative clauses), evidence that can inform linguistic theory.

Contribution

The paper applies causal interventions to analyze how language models handle filler-gap dependencies, uncovering structure shared across constructions as well as previously overlooked factors.

Findings

01 Language models converge on similar abstract analyses across filler-gap constructions

02 Frequency, filler type, and surrounding context influence those analyses

03 Results suggest that mechanistic analyses of LMs can advance linguistic theory

Abstract

Language Models (LMs) have emerged as powerful sources of evidence for linguists seeking to develop theories of syntax. In this paper, we argue that causal interpretability methods, applied to LMs, can greatly enhance the value of such evidence by helping us characterize the abstract mechanisms that LMs learn to use. Our empirical focus is a set of English filler-gap dependency constructions (e.g., questions, relative clauses). Linguistic theories largely agree that these constructions share many properties. Using experiments based in Distributed Interchange Interventions, we show that LMs converge on similar abstract analyses of these constructions. These analyses also reveal previously overlooked factors -- relating to frequency, filler type, and surrounding context -- that could motivate changes to standard linguistic theory. Overall, these results suggest that mechanistic, internal…
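The abstract names Distributed Interchange Interventions without showing what such an intervention does. The sketch below illustrates the general idea on toy vectors: the component of a "base" hidden state that lies in a low-dimensional subspace is swapped for the corresponding component of a "source" hidden state, and everything outside that subspace is left alone. It is an illustration under stated assumptions, not the authors' implementation: the subspace here is random rather than learned, the hidden states are random vectors rather than LM activations, and the names `orthonormal_rows` and `interchange` are hypothetical.

```python
import numpy as np


def orthonormal_rows(k, d, seed=0):
    """Return a (k, d) matrix with orthonormal rows spanning a random subspace.

    Assumption: in practice the subspace would be learned; a random one is
    used here only to keep the sketch self-contained.
    """
    rng = np.random.default_rng(seed)
    q, _ = np.linalg.qr(rng.normal(size=(d, k)))  # q has orthonormal columns, shape (d, k)
    return q.T


def interchange(h_base, h_source, R):
    """Swap the subspace component of h_base for that of h_source.

    R @ h gives the coordinates of h inside the subspace; R.T maps the
    difference in coordinates back to the full hidden space.
    """
    return h_base + R.T @ (R @ h_source - R @ h_base)


d, k = 8, 2                      # toy hidden size and subspace size (assumed)
R = orthonormal_rows(k, d)
rng = np.random.default_rng(1)
h_base = rng.normal(size=d)      # stand-in for a hidden state on the base input
h_source = rng.normal(size=d)    # stand-in for a hidden state on the source input

h_new = interchange(h_base, h_source, R)

# Inside the subspace, h_new now matches the source; outside it, the base.
complement = np.eye(d) - R.T @ R
print(np.allclose(R @ h_new, R @ h_source))
print(np.allclose(complement @ h_new, complement @ h_base))
```

In an actual experiment the patched state would be fed back into the model, and the question would be whether the model's output changes in the way the source input predicts.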

Taxonomy

MethodsFocus · Sparse Evolutionary Training