Distributional reasoning in LLMs: Parallel reasoning processes in   multi-hop reasoning

Yuval Shalev; Amir Feder; Ariel Goldstein

arXiv:2406.13858·cs.CL·June 21, 2024

Distributional reasoning in LLMs: Parallel reasoning processes in multi-hop reasoning

Yuval Shalev, Amir Feder, Ariel Goldstein

PDF

Open Access

TL;DR

This paper introduces an interpretable analysis of multi-hop reasoning in LLMs, revealing parallel reasoning paths and the generation of intermediate answer representations during inference.

Contribution

It presents a novel linear transformation model of reasoning processes and uncovers parallel reasoning strategies in LLMs' internal layers.

Findings

01

Interpretable embeddings represent potential intermediate answers.

02

Parallel reasoning paths are activated even without necessary knowledge.

03

Middle layers generate highly interpretable, compositional representations.

Abstract

Large language models (LLMs) have shown an impressive ability to perform tasks believed to require thought processes. When the model does not document an explicit thought process, it becomes difficult to understand the processes occurring within its hidden layers and to determine if these processes can be referred to as reasoning. We introduce a novel and interpretable analysis of internal multi-hop reasoning processes in LLMs. We demonstrate that the prediction process for compositional reasoning questions can be modeled using a simple linear transformation between two semantic category spaces. We show that during inference, the middle layers of the network generate highly interpretable embeddings that represent a set of potential intermediate answers for the multi-hop question. We use statistical analyses to show that a corresponding subset of tokens is activated in the model's…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSemantic Web and Ontologies · Multi-Agent Systems and Negotiation

MethodsSparse Evolutionary Training