GUI-GENESIS: Automated Synthesis of Efficient Environments with Verifiable Rewards for GUI Agent Post-Training

Yuan Cao; Dezhi Ran; Mengzhou Wu; Yuzhe Guo; Xin Chen; Ang Li; Gang Cao; Gong Zhi; Hao Yu; Linyi Li; Wei Yang; Tao Xie

arXiv:2602.14093·cs.AI·February 17, 2026

GUI-GENESIS: Automated Synthesis of Efficient Environments with Verifiable Rewards for GUI Agent Post-Training

Yuan Cao, Dezhi Ran, Mengzhou Wu, Yuzhe Guo, Xin Chen, Ang Li, Gang Cao, Gong Zhi, Hao Yu, Linyi Li, Wei Yang, Tao Xie

PDF

Open Access

TL;DR

GUI-GENESIS automatically creates efficient, verifiable GUI training environments from real applications, significantly reducing latency and costs while improving agent performance and enabling self-improvement pathways.

Contribution

It introduces a novel framework for synthesizing lightweight, verifiable GUI environments with code-native rewards, enhancing training efficiency and reliability.

Findings

01

Reduces environment latency by 10x

02

Cuts training costs by over $28,000 per epoch

03

Agents outperform base models and real-world RL baselines

Abstract

Post-training GUI agents in interactive environments is critical for developing generalization and long-horizon planning capabilities. However, training on real-world applications is hindered by high latency, poor reproducibility, and unverifiable rewards relying on noisy visual proxies. To address the limitations, we present GUI-GENESIS, the first framework to automatically synthesize efficient GUI training environments with verifiable rewards. GUI-GENESIS reconstructs real-world applications into lightweight web environments using multimodal code models and equips them with code-native rewards, executable assertions that provide deterministic reward signals and eliminate visual estimation noise. Extensive experiments show that GUI-GENESIS reduces environment latency by 10 times and costs by over $28,000 per epoch compared to training on real applications. Notably, agents trained with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Reinforcement Learning in Robotics · Artificial Intelligence in Games