GlobalRAG: Enhancing Global Reasoning in Multi-hop Question Answering via Reinforcement Learning

Jinchang Luo; Mingquan Cheng; Fan Wan; Ni Li; Xiaoling Xia; Shuangshuang Tian; Tingcheng Bian; Haiwei Wang; Haohuan Fu; Yan Tao

arXiv:2510.20548·cs.CL·March 17, 2026

GlobalRAG: Enhancing Global Reasoning in Multi-hop Question Answering via Reinforcement Learning

Jinchang Luo, Mingquan Cheng, Fan Wan, Ni Li, Xiaoling Xia, Shuangshuang Tian, Tingcheng Bian, Haiwei Wang, Haohuan Fu, Yan Tao

PDF

Open Access 4 Models

TL;DR

GlobalRAG is a reinforcement learning framework that improves multi-hop question answering by enhancing global reasoning, structured planning, and evidence utilization, leading to significant performance gains with less training data.

Contribution

It introduces a novel RL-based approach with planning and subgoal mechanisms, and new reward strategies to improve multi-hop QA reasoning and evidence use.

Findings

01

Achieves 14.2% higher EM and F1 scores over baselines.

02

Uses only 8k training data, half of what strong baselines require.

03

Outperforms in both in-domain and out-of-domain benchmarks.

Abstract

Reinforcement learning has recently shown promise in improving retrieval-augmented generation (RAG). Despite these advances, its effectiveness in multi-hop question answering (QA) remains limited by two fundamental limitations: (i) global planning absence to structure multi-step reasoning, and (ii) unfaithful execution, which hinders effective query formulation and consistent use of retrieved evidence. We propose GlobalRAG, a reinforcement learning framework designed to enhance global reasoning in multi-hop QA. GlobalRAG decomposes questions into subgoals, coordinates retrieval with reasoning, and refines evidence iteratively. To guide this process, we introduce Planning Quality Reward and SubGoal Completion Reward, which encourage coherent planning and reliable subgoal execution. In addition, a progressive weight annealing strategy balances process-oriented and outcome-based…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Information Retrieval and Search Behavior