Context Training with Active Information Seeking

Zeyu Huang; Adhiguna Kuncoro; Qixuan Feng; Jiajun Shen; Lucio Dery; Arthur Szlam; Marc'Aurelio Ranzato

arXiv:2605.13050·cs.CL·May 15, 2026

Context Training with Active Information Seeking

Zeyu Huang, Adhiguna Kuncoro, Qixuan Feng, Jiajun Shen, Lucio Dery, Arthur Szlam, Marc'Aurelio Ranzato

PDF

TL;DR

This paper enhances context optimization for large language models by integrating active information seeking tools like Wikipedia search and browsers, leading to significant performance improvements across various tasks.

Contribution

It introduces a search-based training method that effectively combines active information seeking with context optimization, outperforming naive approaches.

Findings

01

Active information seeking improves task performance.

02

The method is data-efficient and robust across models.

03

Significant gains in low-resource translation, health, and reasoning tasks.

Abstract

Most existing large language models (LLMs) are expensive to adapt after deployment, especially when a task requires newly produced information or niche domain knowledge. Recent work has shown that, by manipulating and optimizing their context, LLMs can be tailored to downstream tasks without updating their weights. However, most existing methods remain closed-loop, relying solely on the model's intrinsic knowledge. In this paper, we equip these context optimizers with Wikipedia search and browser tools for active information seeking. We show that naively adding these tools to a standard sequential context optimization pipeline can actually degrade performance compared to baselines. However, when paired with a search-based training procedure that maintains and prunes multiple candidate contexts, active information seeking delivers consistent and substantial gains. We demonstrate these…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.