Do LLMs dream of elephants (when told not to)? Latent concept   association and associative memory in transformers

Yibo Jiang; Goutham Rajendran; Pradeep Ravikumar; Bryon Aragam

arXiv:2406.18400·cs.CL·December 2, 2024

Do LLMs dream of elephants (when told not to)? Latent concept association and associative memory in transformers

Yibo Jiang, Goutham Rajendran, Pradeep Ravikumar, Bryon Aragam

PDF

Open Access 1 Video

TL;DR

This paper investigates how large language models can be manipulated to recall facts through context changes, revealing their behavior as associative memory systems and analyzing the underlying transformer mechanisms both theoretically and empirically.

Contribution

It demonstrates that transformers function as associative memory by using self-attention and value matrices, supported by theoretical analysis and experiments on a simple latent concept association task.

Findings

01

Transformers retrieve facts via self-attention mechanisms.

02

Context manipulation can alter LLMs' factual recall.

03

Transformers can be modeled as associative memory systems.

Abstract

Large Language Models (LLMs) have the capacity to store and recall facts. Through experimentation with open-source models, we observe that this ability to retrieve facts can be easily manipulated by changing contexts, even without altering their factual meanings. These findings highlight that LLMs might behave like an associative memory model where certain tokens in the contexts serve as clues to retrieving facts. We mathematically explore this property by studying how transformers, the building blocks of LLMs, can complete such memory tasks. We study a simple latent concept association problem with a one-layer transformer and we show theoretically and empirically that the transformer gathers information using self-attention and uses the value matrix for associative memory.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Do LLMs dream of elephants (when told not to)? Latent concept association and associative memory in transformers· slideslive

Taxonomy

TopicsNeural Networks and Applications