Co-EPG: A Framework for Co-Evolution of Planning and Grounding in Autonomous GUI Agents

Yuan Zhao; Hualei Zhu; Tingyu Jiang; Shen Li; Xiaohang Xu; Hao Henry Wang

arXiv:2511.10705·cs.AI·November 17, 2025

TL;DR

Co-EPG is a self-iterative training framework that co-evolves the planning and grounding models of GUI agents, improving performance with each iteration without external data.

Contribution

This work presents a novel self-iterative training framework that enables co-evolution of planning and grounding models for GUI agents, outperforming state-of-the-art methods.

Findings

01 Outperforms existing methods after three iterations

02 Demonstrates continuous improvement with each iteration

03 Operates effectively without external data

Abstract

Graphical User Interface (GUI) task automation constitutes a critical frontier in artificial intelligence research. While effective GUI agents synergistically integrate planning and grounding capabilities, current methodologies exhibit two fundamental limitations: (1) insufficient exploitation of cross-model synergies, and (2) over-reliance on synthetic data generation without sufficient utilization. To address these challenges, we propose Co-EPG, a self-iterative training framework for Co-Evolution of Planning and Grounding. Co-EPG establishes an iterative positive feedback loop: through this loop, the planning model explores superior strategies under grounding-based reward guidance via Group Relative Policy Optimization (GRPO), generating diverse data to optimize the grounding model. Concurrently, the optimized grounding model provides more effective rewards for subsequent GRPO…
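The group-relative step that gives GRPO its name can be illustrated with a minimal sketch. It assumes that several candidate plans are sampled for the same GUI step and that each receives one scalar reward derived from the grounding model; the reward values, the function name `group_relative_advantages`, and the use of population standard deviation are illustrative assumptions, not details taken from the paper.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Normalize each reward against the group it was sampled with.

    GRPO scores a sampled output relative to the other outputs drawn for
    the same prompt, rather than against a learned value baseline.
    """
    mu = mean(rewards)
    # Population standard deviation is an assumption of this sketch;
    # implementations differ on population vs. sample deviation.
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]


# Hypothetical grounding-based rewards for four plans sampled for one step.
rewards = [0.9, 0.2, 0.6, 0.3]
advantages = group_relative_advantages(rewards)
```

Plans that score above the group mean receive positive advantages and are reinforced, while plans below it are discouraged. When every plan in a group receives the same reward, all advantages are zero and that group contributes no learning signal.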

Taxonomy

Topics: AI-based Problem Solving and Planning · Reinforcement Learning in Robotics · Artificial Intelligence in Games