GraphWalk: Enabling Reasoning in Large Language Models through Tool-Based Graph Navigation
Taraneh Ghandi, Hamidreza Mahyar, Shachar Klaiman

TL;DR
GraphWalk introduces a tool-based, training-free framework enabling large language models to perform multi-hop reasoning over large knowledge graphs, significantly improving performance on complex graph navigation tasks.
Contribution
It provides a universal, minimal set of graph operations for LLMs to reason over large graphs without task-specific training or domain knowledge.
Findings
GraphWalk enables LLMs to solve maze traversal problems that non-reasoning models cannot.
It achieves substantial performance gains over in-context baselines across various models.
Performance improvements are more significant with larger models where in-context methods fail.
Abstract
The use of knowledge graphs for grounding agents in real-world Q&A applications has become increasingly common. Answering complex queries often requires multi-hop reasoning and the ability to navigate vast relational structures. Standard approaches rely on prompting techniques that steer large language models to reason over raw graph context, or retrieval-augmented generation pipelines where relevant subgraphs are injected into the context. These, however, face severe limitations with enterprise-scale KGs that cannot fit in even the largest context windows available today. We present GraphWalk, a problem-agnostic, training-free, tool-based framework that allows off-the-shelf LLMs to reason through sequential graph navigation, dramatically increasing performance across different tasks. Unlike task-specific agent frameworks that encode domain knowledge into specialized tools, GraphWalk…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
