Uncovering hidden geometry in Transformers via disentangling position   and context

Jiajun Song; Yiqiao Zhong

arXiv:2310.04861·cs.LG·February 6, 2024

Uncovering hidden geometry in Transformers via disentangling position and context

Jiajun Song, Yiqiao Zhong

PDF

Open Access 1 Repo

TL;DR

This paper introduces a decomposition method for transformer embeddings into interpretable components, revealing hidden geometric structures related to position and context, which enhances understanding of their internal representations.

Contribution

It presents a simple decomposition of transformer embeddings into mean, position, context, and residual components, uncovering geometric structures and improving interpretability.

Findings

01

Position vectors form low-dimensional spiral shapes across layers.

02

Context vectors cluster into meaningful topic groups.

03

Position and context vectors are nearly orthogonal.

Abstract

Transformers are widely used to extract semantic meanings from input tokens, yet they usually operate as black-box models. In this paper, we present a simple yet informative decomposition of hidden states (or embeddings) of trained transformers into interpretable components. For any layer, embedding vectors of input sequence samples are represented by a tensor $h \in R^{C \times T \times d}$ . Given embedding vector $h_{c, t} \in R^{d}$ at sequence position $t \leq T$ in a sequence (or context) $c \leq C$ , extracting the mean effects yields the decomposition \[ \boldsymbol{h}_{c,t} = \boldsymbol{\mu} + \mathbf{pos}_t + \mathbf{ctx}_c + \mathbf{resid}_{c,t} \] where $μ$ is the global mean vector, $pos_{t}$ and $ctx_{c}$ are the mean vectors across contexts and across positions respectively, and $resid_{c, t}$ is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jiajunsong629/uncover-hidden-geometry
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsInteractive and Immersive Displays · Hand Gesture Recognition Systems · Robotics and Sensor-Based Localization