Finding Structure in Language Models

Jaap Jumelet

arXiv:2411.16433·cs.CL·November 26, 2024

Finding Structure in Language Models

Jaap Jumelet

PDF

Open Access

TL;DR

This paper investigates whether large language models understand grammatical structure like humans do, using interpretability techniques, linguistic phenomena analysis, and synthetic language testbeds to reveal their grammatical knowledge.

Contribution

It introduces novel interpretability methods and experimental frameworks to analyze the grammatical understanding of language models across multiple linguistic phenomena.

Findings

01

Language models encode abstract grammatical information.

02

Models demonstrate understanding of phenomena like adjective order.

03

Hierarchical structure can be studied using synthetic languages.

Abstract

When we speak, write or listen, we continuously make predictions based on our knowledge of a language's grammar. Remarkably, children acquire this grammatical knowledge within just a few years, enabling them to understand and generalise to novel constructions that have never been uttered before. Language models are powerful tools that create representations of language by incrementally predicting the next word in a sentence, and they have had a tremendous societal impact in recent years. The central research question of this thesis is whether these models possess a deep understanding of grammatical structure similar to that of humans. This question lies at the intersection of natural language processing, linguistics, and interpretability. To address it, we will develop novel interpretability techniques that enhance our understanding of the complex nature of large-scale language models.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques