WebLeaper: Empowering Efficiency and Efficacy in WebAgent via Enabling Info-Rich Seeking

Zhengwei Tao; Haiyang Shen; Baixuan Li; Wenbiao Yin; Jialong Wu; Kuan Li; Zhongwang Zhang; Huifeng Yin; Rui Ye; Liwen Zhang; Xinyu Wang; Pengjun Xie; Jingren Zhou; Yong Jiang

arXiv:2510.24697·cs.CL·October 29, 2025

WebLeaper: Empowering Efficiency and Efficacy in WebAgent via Enabling Info-Rich Seeking

Zhengwei Tao, Haiyang Shen, Baixuan Li, Wenbiao Yin, Jialong Wu, Kuan Li, Zhongwang Zhang, Huifeng Yin, Rui Ye, Liwen Zhang, Xinyu Wang, Pengjun Xie, Jingren Zhou, Yong Jiang

PDF

TL;DR

WebLeaper introduces a novel framework for web information seeking that enhances search efficiency and effectiveness by formulating IS as a tree-structured reasoning problem and synthesizing high-coverage tasks using curated data, leading to improved performance on multiple benchmarks.

Contribution

The paper presents WebLeaper, a new approach that constructs high-coverage IS tasks and models search as tree-structured reasoning to improve efficiency and efficacy of LLM-based agents.

Findings

01

Significant improvements in search efficiency and effectiveness on five benchmarks.

02

WebLeaper's task synthesis methods outperform baseline approaches.

03

Enhanced generalization of search behaviors in LLM agents.

Abstract

Large Language Model (LLM)-based agents have emerged as a transformative approach for open-ended problem solving, with information seeking (IS) being a core capability that enables autonomous reasoning and decision-making. While prior research has largely focused on improving retrieval depth, we observe that current IS agents often suffer from low search efficiency, which in turn constrains overall performance. A key factor underlying this inefficiency is the sparsity of target entities in training tasks, which limits opportunities for agents to learn and generalize efficient search behaviors. To address these challenges, we propose WebLeaper, a framework for constructing high-coverage IS tasks and generating efficient solution trajectories. We formulate IS as a tree-structured reasoning problem, enabling a substantially larger set of target entities to be embedded within a constrained…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.