Finding Structure in Language Models
Jaap Jumelet

TL;DR
This paper investigates whether large language models understand grammatical structure like humans do, using interpretability techniques, linguistic phenomena analysis, and synthetic language testbeds to reveal their grammatical knowledge.
Contribution
It introduces novel interpretability methods and experimental frameworks to analyze the grammatical understanding of language models across multiple linguistic phenomena.
Findings
Language models encode abstract grammatical information.
Models demonstrate understanding of phenomena like adjective order.
Hierarchical structure can be studied using synthetic languages.
Abstract
When we speak, write or listen, we continuously make predictions based on our knowledge of a language's grammar. Remarkably, children acquire this grammatical knowledge within just a few years, enabling them to understand and generalise to novel constructions that have never been uttered before. Language models are powerful tools that create representations of language by incrementally predicting the next word in a sentence, and they have had a tremendous societal impact in recent years. The central research question of this thesis is whether these models possess a deep understanding of grammatical structure similar to that of humans. This question lies at the intersection of natural language processing, linguistics, and interpretability. To address it, we will develop novel interpretability techniques that enhance our understanding of the complex nature of large-scale language models.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques
