SynthAgent: Adapting Web Agents with Synthetic Supervision

Zhaoyang Wang; Yiming Liang; Xuchao Zhang; Qianhui Wu; Siwei Han; Anson Bastos; Rujia Wang; Chetan Bansal; Baolin Peng; Jianfeng Gao; Saravan Rajmohan; Huaxiu Yao

arXiv:2511.06101·cs.LG·April 14, 2026

SynthAgent: Adapting Web Agents with Synthetic Supervision

Zhaoyang Wang, Yiming Liang, Xuchao Zhang, Qianhui Wu, Siwei Han, Anson Bastos, Rujia Wang, Chetan Bansal, Baolin Peng, Jianfeng Gao, Saravan Rajmohan, Huaxiu Yao

PDF

1 Repo 3 Models

TL;DR

SynthAgent introduces a dual refinement framework for synthetic supervision, significantly improving web agent adaptation by generating high-quality synthetic data through task and trajectory refinement.

Contribution

It presents a novel synthetic supervision method that refines tasks and trajectories to enhance web agent adaptation in new environments.

Findings

01

SynthAgent outperforms existing synthetic data methods.

02

Refined synthetic data improves web agent performance.

03

Dual refinement reduces hallucinations and noise in data.

Abstract

Web agents struggle to adapt to new websites due to the scarcity of environment specific tasks and demonstrations. Recent works have explored synthetic data generation to address this challenge, however, they suffer from data quality issues where synthesized tasks contain hallucinations that cannot be executed, and collected trajectories are noisy with redundant or misaligned actions. In this paper, we propose SynthAgent, a fully synthetic supervision framework that aims at improving synthetic data quality via dual refinement of both tasks and trajectories. Our approach begins by synthesizing diverse tasks through categorized exploration of web elements, ensuring efficient coverage of the target environment. During trajectory collection, tasks are refined only when conflicts with observations are detected, which mitigates hallucinations while preserving task consistency. After…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

aiming-lab/SynthAgent
github

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.