WebSynthesis: World-Model-Guided MCTS for Efficient WebUI-Trajectory Synthesis
Yifei Gao, Junhong Ye, Jiaqi Wang, Jitao Sang

TL;DR
WebSynthesis introduces a world-model-guided MCTS framework for efficient web trajectory synthesis, enabling scalable self-improvement of web agents by simulating web environments and reducing API costs.
Contribution
The paper presents WebSynthesis, a novel approach that uses a learned world model and tree-based planning to generate high-quality web trajectories efficiently.
Findings
Agent trained with WebSynthesis matches or exceeds real-data trained models.
Supports scalable, cost-effective web trajectory generation.
Enables reversible simulation of web environments.
Abstract
Recent advancements in large language models (LLMs) have significantly improved the capabilities of web agents. However, effectively navigating complex and dynamic web environments still requires more advanced trajectory-level planning and execution. Prior studies have addressed self-improving agents by collecting extensive GUI trajectories from real-environment interactions. Despite their effectiveness, these approaches encounter two critical challenges: (1) Uncontrollable environment states, where real or sandboxed web environments often yield unstable and non-deterministic feedback, complicating the reproduction and debugging of agent behaviors; and (2) High API costs, as generating even a single interaction trajectory can involve hundreds of queries, leading to considerable API usage and computational expenses. To address these limitations and enable scalable self-improvement for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
