Mobile-Agent-v3.5: Multi-platform Fundamental GUI Agents

Haiyang Xu; Xi Zhang; Haowei Liu; Junyang Wang; Zhaozai Zhu; Shengjie Zhou; Xuhao Hu; Feiyu Gao; Junjie Cao; Zihua Wang; Zhiyuan Chen; Jitong Liao; Qi Zheng; Jiahui Zeng; Ze Xu; Shuai Bai; Junyang Lin; Jingren Zhou; Ming Yan

arXiv:2602.16855·cs.AI·February 20, 2026

Mobile-Agent-v3.5: Multi-platform Fundamental GUI Agents

Haiyang Xu, Xi Zhang, Haowei Liu, Junyang Wang, Zhaozai Zhu, Shengjie Zhou, Xuhao Hu, Feiyu Gao, Junjie Cao, Zihua Wang, Zhiyuan Chen, Jitong Liao, Qi Zheng, Jiahui Zeng, Ze Xu, Shuai Bai, Junyang Lin, Jingren Zhou, Ming Yan

PDF

Open Access 10 Models

TL;DR

The paper presents GUI-Owl-1.5, a multi-platform GUI agent model that achieves state-of-the-art results on various benchmarks by integrating innovative data pipelines, enhanced reasoning capabilities, and a new RL training algorithm for diverse environments.

Contribution

Introduces GUI-Owl-1.5, a native multi-platform GUI agent model with novel data collection, reasoning enhancements, and a specialized RL algorithm for multi-platform tasks.

Findings

01

Achieves top performance on 20+ GUI benchmarks.

02

Demonstrates effective multi-platform and multi-task capabilities.

03

Open-sourced model and demo available online.

Abstract

The paper introduces GUI-Owl-1.5, the latest native GUI agent model that features instruct/thinking variants in multiple sizes (2B/4B/8B/32B/235B) and supports a range of platforms (desktop, mobile, browser, and more) to enable cloud-edge collaboration and real-time interaction. GUI-Owl-1.5 achieves state-of-the-art results on more than 20+ GUI benchmarks on open-source models: (1) on GUI automation tasks, it obtains 56.5 on OSWorld, 71.6 on AndroidWorld, and 48.4 on WebArena; (2) on grounding tasks, it obtains 80.3 on ScreenSpotPro; (3) on tool-calling tasks, it obtains 47.6 on OSWorld-MCP, and 46.8 on MobileWorld; (4) on memory and knowledge tasks, it obtains 75.5 on GUI-Knowledge Bench. GUI-Owl-1.5 incorporates several key innovations: (1) Hybird Data Flywheel: we construct the data pipeline for UI understanding and trajectory generation based on a combination of simulated…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMulti-Agent Systems and Negotiation · Artificial Intelligence in Games · Advanced Software Engineering Methodologies