Co-EPG: A Framework for Co-Evolution of Planning and Grounding in Autonomous GUI Agents
Yuan Zhao, Hualei Zhu, Tingyu Jiang, Shen Li, Xiaohang Xu, Hao Henry Wang

TL;DR
Co-EPG introduces a self-iterative framework for co-evolving planning and grounding models in GUI agents, significantly improving performance through iterative self-play without external data.
Contribution
This work presents a novel self-iterative training framework that enables co-evolution of planning and grounding models for GUI agents, outperforming state-of-the-art methods.
Findings
Outperforms existing methods after three iterations
Demonstrates continuous improvement with each iteration
Operates effectively without external data
Abstract
Graphical User Interface (GUI) task automation constitutes a critical frontier in artificial intelligence research. While effective GUI agents synergistically integrate planning and grounding capabilities, current methodologies exhibit two fundamental limitations: (1) insufficient exploitation of cross-model synergies, and (2) over-reliance on synthetic data generation without sufficient utilization. To address these challenges, we propose Co-EPG, a self-iterative training framework for Co-Evolution of Planning and Grounding. Co-EPG establishes an iterative positive feedback loop: through this loop, the planning model explores superior strategies under grounding-based reward guidance via Group Relative Policy Optimization (GRPO), generating diverse data to optimize the grounding model. Concurrently, the optimized Grounding model provides more effective rewards for subsequent GRPO…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAI-based Problem Solving and Planning · Reinforcement Learning in Robotics · Artificial Intelligence in Games
