AutoSurfer -- Teaching Web Agents through Comprehensive Surfing, Learning, and Modeling
Fazle Elahi Faisal, Qianhui Wu, Baolin Peng, Jianfeng Gao

TL;DR
AutoSurfer is a comprehensive web trajectory generator that improves website exploration and task synthesis for training web agents, resulting in higher accuracy and task diversity.
Contribution
It introduces a systematic breadth-first exploration, grounded task synthesis, and trajectory-guided refinement to enhance web agent training data quality.
Findings
Outperforms state-of-the-art methods on WebArena benchmark with 24.23% accuracy.
Achieves more diverse synthesized task distribution.
Enables more reliable and comprehensive website action coverage.
Abstract
Recent advances in multimodal large language models (LLMs) have revolutionized web agents that can automate complex tasks on websites. However, their accuracy remains limited by the scarcity of high-quality web trajectory training data. Existing automatic trajectory generation methods suffer from incomplete website coverage due to homepage-based task proposals or random-walk exploration. Such methods often result in hallucinated or ambiguous task synthesis that lead to incomplete and unreliable trajectory generation. Here, we present AutoSurfer, a comprehensive web trajectory generator that addresses these limitations through three key innovations. First, AutoSurfer employs a systematic breadth-first exploration strategy that maintains a queue of discovered pages and action traces, propagates knowledge across pages to avoid redundant exploration, and recursively expands multi-level…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
