RLM-on-KG: Heuristics First, LLMs When Needed: Adaptive Retrieval Control over Mention Graphs for Scattered Evidence
Andrea Volpini, Elie Raad

TL;DR
This paper introduces RLM-on-KG, a retrieval system that uses LLMs for adaptive, multi-hop exploration of knowledge graphs, demonstrating conditional advantages over heuristic methods depending on evidence scatter and tool sophistication.
Contribution
It presents a novel LLM-controlled retrieval approach for knowledge graph exploration that outperforms heuristics in specific scenarios and clarifies when LLM control is most beneficial.
Findings
LLM control improves performance when evidence is scattered across multiple chunks.
Stronger controllers yield larger gains over heuristic baselines.
The approach transfers well across different datasets, with attenuation on smaller graphs.
Abstract
When does an LLM controller outperform rule-based traversal for knowledge graph exploration? We study this question through RLM-on-KG, a retrieval system that treats an LLM as an autonomous navigator over an RDF-encoded mention graph for grounded question answering. Unlike GraphRAG pipelines that rely on offline LLM indexing, RLM-on-KG performs entity-first, multi-hop exploration at query time using deterministic graph construction and a fixed tool set. Our central finding is a conditional advantage: the value of LLM control depends on evidence scatter and tool-calling sophistication. The paper's core claim is LLM control versus heuristic traversal, not a generic win over GraphRAG. On GraphRAG-Bench Novel (519 questions), Gemini 2.0 Flash achieves +2.47 pp F1 over a rule-based heuristic baseline (p < 0.0001), but only +0.16 pp over a GraphRAG-local variant (not significant). With a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
