FedGUI: Benchmarking Federated GUI Agents across Heterogeneous Platforms, Devices, and Operating Systems

Wenhao Wang; Haoting Shi; Mengying Yuan; Yiquan Lin; Panrong Tong; Hanzhang Zhou; Guangyi Liu; Pengxiang Zhao; Yue Wang; Siheng Chen

arXiv:2604.14956·cs.MA·April 17, 2026

FedGUI: Benchmarking Federated GUI Agents across Heterogeneous Platforms, Devices, and Operating Systems

Wenhao Wang, Haoting Shi, Mengying Yuan, Yiquan Lin, Panrong Tong, Hanzhang Zhou, Guangyi Liu, Pengxiang Zhao, Yue Wang, Siheng Chen

PDF

1 Repo

TL;DR

FedGUI introduces a comprehensive benchmark for federated GUI agents across diverse platforms, enabling systematic study of heterogeneity and fostering scalable, privacy-preserving GUI solutions.

Contribution

It provides the first benchmark with datasets and experiments for federated GUI agents across multiple platforms and heterogeneity types.

Findings

01

Cross-platform collaboration enhances federated GUI performance.

02

Platform and OS are the most influential heterogeneity factors.

03

FedGUI facilitates development of scalable, privacy-preserving GUI agents.

Abstract

Training GUI agents with traditional centralized methods faces significant cost and scalability challenges. Federated learning (FL) offers a promising solution, yet its potential is hindered by the lack of benchmarks that capture real-world, cross-platform heterogeneity. To bridge this gap, we introduce FedGUI, the first comprehensive benchmark for developing and evaluating federated GUI agents across mobile, web, and desktop platforms. FedGUI provides a suite of six curated datasets to systematically study four crucial types of heterogeneity: cross-platform, cross-device, cross-OS, and cross-source. Extensive experiments reveal several key insights: First, we show that cross-platform collaboration improves performance, extending prior mobile-only federated learning to diverse GUI environments; Second, we demonstrate the presence of distinct heterogeneity dimensions and identify…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

wwh0411/FedGUI
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.