Slot Machines: How LLMs Keep Track of Multiple Entities
Paul C. Bogdan, Jack Lindsey

TL;DR
This paper investigates how large language models represent multiple entities and their attributes, revealing a dual-slot structure that supports relational reasoning but not explicit factual retrieval.
Contribution
It introduces a multi-slot probing method to disentangle entity representations and uncovers a dual-slot encoding scheme in language models.
Findings
Current-entity and prior-entity slots encode different information types.
Relational inferences are supported by the prior-entity slot.
Frontier models outperform open-weight models on complex binding tasks.
Abstract
Language models must bind entities to the attributes they possess and maintain several such binding relationships within a context. We study how multiple entities are represented across token positions and whether single tokens can carry bindings for more than one entity. We introduce a multi-slot probing approach that disentangles a single token's residual stream activation to recover information about both the currently described entity and the immediately preceding one. These two kinds of information are encoded in separate and largely orthogonal "current-entity" and "prior-entity" slots. We analyze the functional roles of these slots and find that they serve different purposes. In tandem with the current-entity slot, the prior-entity slot supports relational inferences, such as entity-level induction ("who came after Alice in the story?") and conflict detection between adjacent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
