Reinforcement Learning Enhanced Multi-hop Reasoning for Temporal Knowledge Question Answering

Wuzhenghong Wen; Chao Xue; Su Pan; Yuwei Sun; Minlong Peng

arXiv:2601.01195·cs.AI·January 6, 2026

Reinforcement Learning Enhanced Multi-hop Reasoning for Temporal Knowledge Question Answering

Wuzhenghong Wen, Chao Xue, Su Pan, Yuwei Sun, Minlong Peng

PDF

Open Access

TL;DR

This paper introduces the MRE framework that enhances multi-hop reasoning in temporal knowledge graph question answering using prompt-guided trajectory generation, supervised fine-tuning, and a recursive exploration method, leading to improved accuracy and robustness.

Contribution

The paper proposes a novel MRE framework with Tree-Group Relative Policy Optimization for better multi-hop reasoning in TKGQA, outperforming existing methods.

Findings

01

Outperforms state-of-the-art on TKGQA benchmarks.

02

Improves interpretability and robustness to noisy data.

03

Enhances reasoning trajectory selection and exploration.

Abstract

Temporal knowledge graph question answering (TKGQA) involves multi-hop reasoning over temporally constrained entity relationships in the knowledge graph to answer a given question. However, at each hop, large language models (LLMs) retrieve subgraphs with numerous temporally similar and semantically complex relations, increasing the risk of suboptimal decisions and error propagation. To address these challenges, we propose the multi-hop reasoning enhanced (MRE) framework, which enhances both forward and backward reasoning to improve the identification of globally optimal reasoning trajectories. Specifically, MRE begins with prompt engineering to guide the LLM in generating diverse reasoning trajectories for a given question. Valid reasoning trajectories are then selected for supervised fine-tuning, serving as a cold-start strategy. Finally, we introduce Tree-Group Relative Policy…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Graph Neural Networks · Topic Modeling · Multimodal Machine Learning Applications