Verb Conjugation in Transformers Is Determined by Linear Encodings of Subject Number

Sophie Hao; Tal Linzen

arXiv:2310.15151 · cs.CL · October 24, 2023 · 2 cites

TL;DR

This paper demonstrates that BERT encodes subject number linearly for verb conjugation, and this encoding can be manipulated to affect conjugation accuracy, revealing interpretable internal representations.

Contribution

It shows that BERT's verb conjugation depends on a linear encoding of subject number, identified through causal intervention analysis, clarifying the interpretability of linguistic features in transformers.

Findings

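01. BERT's verb conjugation relies on a linear encoding of subject number.

02. The encoding is found at the subject position in the first layer and at the verb position in the last layer; in middle layers it is distributed across positions, particularly when there are multiple cues to subject number.

03. Manipulating the encoding changes conjugation accuracy in predictable ways.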

Abstract

Deep architectures such as Transformers are sometimes criticized for having uninterpretable "black-box" representations. We use causal intervention analysis to show that, in fact, some linguistic features are represented in a linear, interpretable format. Specifically, we show that BERT's ability to conjugate verbs relies on a linear encoding of subject number that can be manipulated with predictable effects on conjugation accuracy. This encoding is found in the subject position at the first layer and the verb position at the last layer, but distributed across positions at middle layers, particularly when there are multiple cues to subject number.
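
As a rough illustration of what it means to manipulate a linear encoding, the sketch below works on synthetic vectors, not on BERT hidden states, and it is not the authors' procedure. It assumes the "number direction" can be estimated as the difference between the mean singular and mean plural vector, then moves each vector along that direction. All function names and the toy data are hypothetical.

```python
import random

def mean_vector(rows):
    """Coordinate-wise mean of a list of equal-length vectors."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def number_direction(singular, plural):
    """Unit vector pointing from the singular mean to the plural mean,
    plus the midpoint between the two means (assumed estimator)."""
    mu_s, mu_p = mean_vector(singular), mean_vector(plural)
    diff = [p - s for s, p in zip(mu_s, mu_p)]
    norm = dot(diff, diff) ** 0.5
    direction = [d / norm for d in diff]
    midpoint = [(s + p) / 2 for s, p in zip(mu_s, mu_p)]
    return direction, midpoint

def signed_offset(h, direction, midpoint):
    """Signed distance of h from the hyperplane through midpoint."""
    return dot([x - m for x, m in zip(h, midpoint)], direction)

def intervene(h, direction, midpoint, alpha=2.0):
    """Shift h along the direction by alpha times its signed offset.
    alpha=1 projects onto the hyperplane (erases the feature);
    alpha=2 reflects across it (flips the feature)."""
    offset = signed_offset(h, direction, midpoint)
    return [x - alpha * offset * d for x, d in zip(h, direction)]

def make_toy_states(n, dim, shift, rng):
    """Synthetic stand-in for hidden states: Gaussian noise with the
    number signal placed on coordinate 0 (an assumption of this toy)."""
    return [[rng.gauss(0, 1) + (shift if i == 0 else 0.0) for i in range(dim)]
            for _ in range(n)]

rng = random.Random(0)
singular = make_toy_states(200, 8, -3.0, rng)
plural = make_toy_states(200, 8, 3.0, rng)
direction, midpoint = number_direction(singular, plural)

# Fraction of plural vectors read as plural before and after reflection.
before = sum(signed_offset(h, direction, midpoint) > 0 for h in plural) / len(plural)
flipped = [intervene(h, direction, midpoint, alpha=2.0) for h in plural]
after = sum(signed_offset(h, direction, midpoint) > 0 for h in flipped) / len(flipped)
print("read as plural before:", before, "after:", after)
```

In a real experiment the vectors would come from a model's hidden states at a chosen layer and position, and the effect would be measured on the model's verb predictions rather than on a linear readout.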

Code & Models

Repositories

yidinghao/causal-conjugation
PyTorch · Official

Taxonomy

Topics: Neurobiology of Language and Bilingualism · Natural Language Processing Techniques · Topic Modeling