TL;DR
KG-Hopper introduces a reinforcement learning framework enabling small open LLMs to perform integrated multi-hop reasoning over knowledge graphs in a single inference, outperforming larger models on several benchmarks.
Contribution
It presents a novel RL-based approach that embeds entire KG reasoning into a single stage, improving flexibility and efficiency over traditional step-by-step methods.
Findings
Outperforms larger multi-step systems on eight benchmarks.
Achieves competitive results with GPT-3.5-Turbo and GPT-4o-mini.
Uses a 7B-parameter LLM to enable efficient knowledge graph reasoning.
Abstract
Large Language Models (LLMs) demonstrate impressive natural language capabilities but often struggle with knowledge-intensive reasoning tasks. Knowledge Base Question Answering (KBQA), which leverages structured Knowledge Graphs (KGs) exemplifies this challenge due to the need for accurate multi-hop reasoning. Existing approaches typically perform sequential reasoning steps guided by predefined pipelines, restricting flexibility and causing error cascades due to isolated reasoning at each step. To address these limitations, we propose KG-Hopper, a novel Reinforcement Learning (RL) framework that empowers compact open LLMs with the ability to perform integrated multi-hop KG reasoning within a single inference round. Rather than reasoning step-by-step, we train a Reasoning LLM that embeds the entire KG traversal and decision process into a unified ``thinking'' stage, enabling global…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
