The Tool Illusion: Rethinking Tool Use in Web Agents

Renze Lou; Baolin Peng; Wenlin Yao; Qianhui Wu; Hao Cheng; Suman Nath; Wenpeng Yin; Jianfeng Gao

arXiv:2604.03465·cs.CL·April 7, 2026

The Tool Illusion: Rethinking Tool Use in Web Agents

Renze Lou, Baolin Peng, Wenlin Yao, Qianhui Wu, Hao Cheng, Suman Nath, Wenpeng Yin, Jianfeng Gao

PDF

TL;DR

This paper conducts a comprehensive empirical study on tool use in web agents, addressing key questions about its benefits, design principles, and side effects to inform future research.

Contribution

It provides the first large-scale, controlled evaluation of web agent tool use across various sources, models, and benchmarks, revising previous conclusions.

Findings

01

Tools provide consistent gains for web agents.

02

Effective tools follow specific design principles.

03

Tool use may introduce certain side effects.

Abstract

As web agents rapidly evolve, an increasing body of work has moved beyond conventional atomic browser interactions and explored tool use as a higher-level action paradigm. Although prior studies have shown the promise of tools, their conclusions are often drawn from limited experimental scales and sometimes non-comparable settings. As a result, several fundamental questions remain unclear: i) whether tools provide consistent gains for web agents, ii) what practical design principles characterize effective tools, and iii) what side effects tool use may introduce. To establish a stronger empirical foundation for future research, we revisit tool use in web agents through an extensive and carefully controlled study across diverse tool sources, backbone models, tool-use frameworks, and evaluation benchmarks. Our findings both revise some prior conclusions and complement others with broader…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.