MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability

Weiqi Wu; Xin Guan; Shen Huang; Yong Jiang; Pengjun Xie; Fei Huang; Jiuxin Cao; Hai Zhao; Jingren Zhou

arXiv:2505.20285·cs.CL·May 28, 2025

MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability

Weiqi Wu, Xin Guan, Shen Huang, Yong Jiang, Pengjun Xie, Fei Huang, Jiuxin Cao, Hai Zhao, Jingren Zhou

PDF

Open Access 1 Repo

TL;DR

MaskSearch is a pre-training framework that improves agentic search capabilities in language models by teaching them to leverage search tools for reasoning and retrieval, leading to better performance on multi-hop question answering tasks.

Contribution

The paper introduces MaskSearch, a novel pre-training method with RAMP task, multi-agent data generation, and curriculum learning to enhance universal retrieval and reasoning in LLMs.

Findings

01

Significant performance improvements on open-domain multi-hop QA tasks.

02

Effective transfer to out-of-domain tasks.

03

Enhanced agentic search capabilities in LLMs.

Abstract

Retrieval-Augmented Language Models (RALMs) represent a classic paradigm where models enhance generative capabilities using external knowledge retrieved via a specialized module. Recent advancements in Agent techniques enable Large Language Models (LLMs) to autonomously utilize tools for retrieval, planning, and reasoning. While existing training-based methods show promise, their agentic abilities are limited by inherent characteristics of the task-specific data used during training. To further enhance the universal search capability of agents, we propose a novel pre-training framework, MaskSearch. In the pre-training stage, we introduce the Retrieval Augmented Mask Prediction (RAMP) task, where the model learns to leverage search tools to fill masked spans on a large number of pre-training data, thus acquiring universal retrieval and reasoning capabilities for LLMs. After that, the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

alibaba-nlp/masksearch
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsScheduling and Optimization Algorithms · Mobile Crowdsensing and Crowdsourcing · Optimization and Search Problems

MethodsADaptive gradient method with the OPTimal convergence rate · Shrink and Fine-Tune · Dialogue-Adaptive Pre-training Objective