Hopping Too Late: Exploring the Limitations of Large Language Models on   Multi-Hop Queries

Eden Biran; Daniela Gottesman; Sohee Yang; Mor Geva; Amir Globerson

arXiv:2406.12775·cs.CL·October 15, 2024

Hopping Too Late: Exploring the Limitations of Large Language Models on Multi-Hop Queries

Eden Biran, Daniela Gottesman, Sohee Yang, Mor Geva, Amir Globerson

PDF

Open Access 1 Repo

TL;DR

This paper investigates how large language models perform multi-hop reasoning, revealing that key information is resolved early but later layers sometimes lack necessary knowledge, affecting accuracy.

Contribution

The study introduces a novel back-patching method to analyze internal computations, highlighting limitations in LLMs' multi-hop reasoning capabilities.

Findings

01

Bridge entities are resolved in early layers.

02

Later layers often lack necessary knowledge for second-hop reasoning.

03

Back-patching improves correctness in up to 66% of previous errors.

Abstract

Large language models (LLMs) can solve complex multi-step problems, but little is known about how these computations are implemented internally. Motivated by this, we study how LLMs answer multi-hop queries such as "The spouse of the performer of Imagine is". These queries require two information extraction steps: a latent one for resolving the first hop ("the performer of Imagine") into the bridge entity (John Lennon), and another for resolving the second hop ("the spouse of John Lennon") into the target entity (Yoko Ono). Understanding how the latent step is computed internally is key to understanding the overall computation. By carefully analyzing the internal computations of transformer-based LLMs, we discover that the bridge entity is resolved in the early layers of the model. Then, only after this resolution, the two-hop query is solved in the later layers. Because the second hop…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

edenbiran/HoppingTooLate
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRecommender Systems and Techniques · Topic Modeling · Advanced Graph Neural Networks

MethodsFast Attention Via Positive Orthogonal Random Features · Performer