ParallelSearch: Train your LLMs to Decompose Query and Search Sub-queries in Parallel with Reinforcement Learning
Shu Zhao, Tan Yu, Anbang Xu, Japinder Singh, Aaditya Shukla, Rama Akkiraju

TL;DR
ParallelSearch enables large language models to decompose and execute search queries in parallel, significantly improving efficiency and accuracy in multi-step reasoning tasks by overcoming sequential processing limitations.
Contribution
The paper introduces a reinforcement learning framework that allows LLMs to identify and process independent query components concurrently, enhancing search efficiency and performance.
Findings
Outperforms state-of-the-art baselines by 2.9% on average across seven QA benchmarks.
Achieves 12.7% performance improvement on parallelizable questions.
Reduces LLM calls to 69.6% compared to sequential methods.
Abstract
Reasoning-augmented search agents such as Search-R1, trained via reinforcement learning with verifiable rewards (RLVR), demonstrate remarkable capabilities in multi-step information retrieval from external knowledge sources. These agents address the limitations of their parametric memory by dynamically gathering relevant facts to address complex reasoning tasks. However, existing approaches suffer from a fundamental architectural limitation: they process search queries strictly sequentially, even when handling inherently parallelizable and logically independent comparisons. This sequential bottleneck significantly constrains computational efficiency, particularly for queries that require multiple entity comparisons. To address this critical limitation, we propose ParallelSearch, a novel reinforcement learning framework that empowers large language models (LLMs) to recognize…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
