MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability
Weiqi Wu, Xin Guan, Shen Huang, Yong Jiang, Pengjun Xie, Fei Huang, Jiuxin Cao, Hai Zhao, Jingren Zhou

TL;DR
MaskSearch is a pre-training framework that improves agentic search capabilities in language models by teaching them to leverage search tools for reasoning and retrieval, leading to better performance on multi-hop question answering tasks.
Contribution
The paper introduces MaskSearch, a novel pre-training method with RAMP task, multi-agent data generation, and curriculum learning to enhance universal retrieval and reasoning in LLMs.
Findings
Significant performance improvements on open-domain multi-hop QA tasks.
Effective transfer to out-of-domain tasks.
Enhanced agentic search capabilities in LLMs.
Abstract
Retrieval-Augmented Language Models (RALMs) represent a classic paradigm where models enhance generative capabilities using external knowledge retrieved via a specialized module. Recent advancements in Agent techniques enable Large Language Models (LLMs) to autonomously utilize tools for retrieval, planning, and reasoning. While existing training-based methods show promise, their agentic abilities are limited by inherent characteristics of the task-specific data used during training. To further enhance the universal search capability of agents, we propose a novel pre-training framework, MaskSearch. In the pre-training stage, we introduce the Retrieval Augmented Mask Prediction (RAMP) task, where the model learns to leverage search tools to fill masked spans on a large number of pre-training data, thus acquiring universal retrieval and reasoning capabilities for LLMs. After that, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScheduling and Optimization Algorithms · Mobile Crowdsensing and Crowdsourcing · Optimization and Search Problems
MethodsADaptive gradient method with the OPTimal convergence rate · Shrink and Fine-Tune · Dialogue-Adaptive Pre-training Objective
