Learning to Refine: An Agentic RL Approach for Iterative SPARQL Query Construction

Floris Vossebeld; Shenghui Wang

arXiv:2511.11770·cs.AI·November 18, 2025

Learning to Refine: An Agentic RL Approach for Iterative SPARQL Query Construction

Floris Vossebeld, Shenghui Wang

PDF

Open Access

TL;DR

This paper presents an agentic RL framework enabling LLMs to iteratively refine SPARQL queries for knowledge graph QA, significantly improving accuracy without supervised fine-tuning.

Contribution

Introduces a reinforcement learning-based agentic approach for iterative SPARQL query construction, demonstrating effective policy learning with a compact model without supervised fine-tuning.

Findings

01

49.7% accuracy on LC-QuAD 2.0 subset

02

17.5 percentage point improvement over zero-shot baseline

03

Enhanced performance with explicit reasoning step

Abstract

Generating complex, logically-sound SPARQL queries for multi-hop questions remains a critical bottleneck for Knowledge Graph Question Answering, as the brittle nature of one-shot generation by Large Language Models (LLMs) hinders reliable interaction with structured data. Current methods lack the adaptive policies needed to dynamically debug queries based on real-time execution feedback. This paper introduces a novel agentic framework where an LLM learns a resilient policy for the sequential process of iterative SPARQL construction. We show that a compact 3B-parameter model, trained exclusively via outcome-driven Reinforcement Learning (GRPO) without supervised fine-tuning, can learn effective policies for this task, discovering how to systematically recover from execution errors and refine its queries toward a correct answer. On a curated, executable single-answer subset of LC-QuAD…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Graph Neural Networks · Topic Modeling · Graph Theory and Algorithms