Discovering the Compositional Structure of Vector Representations with   Role Learning Networks

Paul Soulos; Tom McCoy; Tal Linzen; Paul Smolensky

arXiv:1910.09113·cs.LG·February 10, 2023

Discovering the Compositional Structure of Vector Representations with Role Learning Networks

Paul Soulos, Tom McCoy, Tal Linzen, Paul Smolensky

PDF

2 Repos

TL;DR

This paper introduces ROLE, a novel analysis technique that reveals how recurrent neural networks implicitly learn symbolic, compositional structures in vector representations, explaining their success on compositional tasks.

Contribution

The paper presents ROLE, a new method to uncover symbolic structures in neural network embeddings, demonstrating how RNNs implicitly learn compositional representations.

Findings

01

RNNs converge to solutions with implicit symbolic structure

02

Manipulating embeddings based on this structure alters outputs as predicted

03

The discovered structure closely matches the encodings of trained seq2seq models

Abstract

How can neural networks perform so well on compositional tasks even though they lack explicit compositional representations? We use a novel analysis technique called ROLE to show that recurrent neural networks perform well on such tasks by converging to solutions which implicitly represent symbolic structure. This method uncovers a symbolic structure which, when properly embedded in vector space, closely approximates the encodings of a standard seq2seq network trained to perform the compositional SCAN task. We verify the causal importance of the discovered symbolic structure by showing that, when we systematically manipulate hidden embeddings based on this symbolic structure, the model's output is changed in the way predicted by our analysis.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory · Sequence to Sequence · Gated Recurrent Unit