ANCHOR: Branch-Point Data Generation for GUI Agents

Jinbiao Wei; Yilun Zhao; Kangqi Ni; Arman Cohan

arXiv:2602.07153·cs.AI·April 14, 2026

ANCHOR: Branch-Point Data Generation for GUI Agents

Jinbiao Wei, Yilun Zhao, Kangqi Ni, Arman Cohan

PDF

1 Repo

TL;DR

ANCHOR is a framework that efficiently expands GUI interaction data from limited seed demonstrations by generating diverse, goal-consistent trajectories for training desktop agents.

Contribution

It introduces a scalable trajectory expansion method using branch points and verification to improve GUI agent training data quality and diversity.

Findings

01

Models trained on expanded data outperform zero-shot baselines.

02

ANCHOR-generated data improves generalization across applications.

03

The framework achieves better performance on desktop benchmarks.

Abstract

End-to-end GUI agents for real desktop environments require large amounts of high-quality interaction data, yet collecting human demonstrations is expensive and existing synthetic pipelines often suffer from limited task diversity or noisy, goal-drifting trajectories. We present a trajectory expansion framework Anchor that bootstraps scalable desktop supervision from a small set of verified seed demonstrations. Starting from each seed, we identify branch points that correspond to meaningful state changes and propose new, state-grounded task variants conditioned on the current GUI context. An executing agent then follows the proposed instructions to generate new trajectories, while a verifier enforces task completion via state-aware checks and trajectory-level consistency. To improve supervision quality, we further apply task-conditioned step-level filtering to remove ungrounded actions…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yale-nlp/Anchor
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.