
TL;DR
This paper investigates whether large language models operate through representation-based processing or mere memorization, arguing for the former and proposing methods to analyze their internal representations.
Contribution
It advances the debate on LLM mechanisms by defending the view that they utilize representations and offers practical techniques for studying these representations.
Findings
LLMs are partially driven by representation-based information processing.
Proposes techniques to analyze and interpret LLM internal representations.
Provides a foundation for future theoretical work on language models.
Abstract
The extraordinary success of recent Large Language Models (LLMs) on a diverse array of tasks has led to an explosion of scientific and philosophical theorizing aimed at explaining how they do what they do. Unfortunately, disagreement over fundamental theoretical issues has led to stalemate, with entrenched camps of LLM optimists and pessimists often committed to very different views of how these systems work. Overcoming stalemate requires agreement on fundamental questions, and the goal of this paper is to address one such question, namely: is LLM behavior driven partly by representation-based information processing of the sort implicated in biological cognition, or is it driven entirely by processes of memorization and stochastic table look-up? This is a question about what kind of algorithm LLMs implement, and the answer carries serious implications for higher level questions about…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
