ParallelSearch: Train your LLMs to Decompose Query and Search Sub-queries in Parallel with Reinforcement Learning

Shu Zhao; Tan Yu; Anbang Xu; Japinder Singh; Aaditya Shukla; Rama Akkiraju

arXiv:2508.09303·cs.CL·August 14, 2025

ParallelSearch: Train your LLMs to Decompose Query and Search Sub-queries in Parallel with Reinforcement Learning

Shu Zhao, Tan Yu, Anbang Xu, Japinder Singh, Aaditya Shukla, Rama Akkiraju

PDF

TL;DR

ParallelSearch enables large language models to decompose and execute search queries in parallel, significantly improving efficiency and accuracy in multi-step reasoning tasks by overcoming sequential processing limitations.

Contribution

The paper introduces a reinforcement learning framework that allows LLMs to identify and process independent query components concurrently, enhancing search efficiency and performance.

Findings

01

Outperforms state-of-the-art baselines by 2.9% on average across seven QA benchmarks.

02

Achieves 12.7% performance improvement on parallelizable questions.

03

Reduces LLM calls to 69.6% compared to sequential methods.

Abstract

Reasoning-augmented search agents such as Search-R1, trained via reinforcement learning with verifiable rewards (RLVR), demonstrate remarkable capabilities in multi-step information retrieval from external knowledge sources. These agents address the limitations of their parametric memory by dynamically gathering relevant facts to address complex reasoning tasks. However, existing approaches suffer from a fundamental architectural limitation: they process search queries strictly sequentially, even when handling inherently parallelizable and logically independent comparisons. This sequential bottleneck significantly constrains computational efficiency, particularly for queries that require multiple entity comparisons. To address this critical limitation, we propose ParallelSearch, a novel reinforcement learning framework that empowers large language models (LLMs) to recognize…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.