Using Priming to Uncover the Organization of Syntactic Representations   in Neural Language Models

Grusha Prasad; Marten van Schijndel; Tal Linzen

arXiv:1909.10579·cs.CL·September 25, 2019

Using Priming to Uncover the Organization of Syntactic Representations in Neural Language Models

Grusha Prasad, Marten van Schijndel, Tal Linzen

PDF

1 Repo

TL;DR

This paper introduces a novel priming-based technique to analyze how neural language models organize syntactic information, revealing hierarchical and interpretable representations of complex sentence structures.

Contribution

It proposes a gradient similarity metric to reconstruct the syntactic representational space in neural models, providing new insights into their internal organization.

Findings

01

LSTM models' representations of relative clauses are hierarchically organized.

02

Models track abstract syntactic properties.

03

The technique reveals linguistically interpretable structures.

Abstract

Neural language models (LMs) perform well on tasks that require sensitivity to syntactic structure. Drawing on the syntactic priming paradigm from psycholinguistics, we propose a novel technique to analyze the representations that enable such success. By establishing a gradient similarity metric between structures, this technique allows us to reconstruct the organization of the LMs' syntactic representational space. We use this technique to demonstrate that LSTM LMs' representations of different types of sentences with relative clauses are organized hierarchically in a linguistically interpretable manner, suggesting that the LMs track abstract properties of the sentence.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

grushaprasad/RNN-Priming
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory