On the Relationship Between RNN Hidden State Vectors and Semantic Ground   Truth

Edi Mu\v{s}kardin; Martin Tappler; Ingo Pill; Bernhard K.; Aichernig; Thomas Pock

arXiv:2306.16854·cs.LG·June 30, 2023

On the Relationship Between RNN Hidden State Vectors and Semantic Ground Truth

Edi Mu\v{s}kardin, Martin Tappler, Ingo Pill, Bernhard K., Aichernig, Thomas Pock

PDF

Open Access 1 Repo

TL;DR

This paper investigates whether RNN hidden states naturally form semantically meaningful clusters, using automata-based evaluation to validate the clustering hypothesis in modern neural networks.

Contribution

It provides a formal analysis and empirical evidence supporting the clustering hypothesis in RNNs trained on regular languages, using ground-truth automata for validation.

Findings

01

Hidden-state vectors are often separable into semantic classes.

02

Unsupervised clustering can effectively identify meaningful clusters.

03

The clustering hypothesis holds in most examined cases.

Abstract

We examine the assumption that the hidden-state vectors of recurrent neural networks (RNNs) tend to form clusters of semantically similar vectors, which we dub the clustering hypothesis. While this hypothesis has been assumed in the analysis of RNNs in recent years, its validity has not been studied thoroughly on modern neural network architectures. We examine the clustering hypothesis in the context of RNNs that were trained to recognize regular languages. This enables us to draw on perfect ground-truth automata in our evaluation, against which we can compare the RNN's accuracy and the distribution of the hidden-state vectors. We start with examining the (piecewise linear) separability of an RNN's hidden-state vectors into semantically different classes. We continue the analysis by computing clusters over the hidden-state vector space with multiple state-of-the-art unsupervised…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

des-lab/clustering_rnn_hidden_state_space
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFerroelectric and Negative Capacitance Devices · Advanced Memory and Neural Computing · Machine Learning in Materials Science