ColorAgent: Building A Robust, Personalized, and Interactive OS Agent

Ning Li; Qiqiang Lin; Zheng Wu; Xiaoyun Mo; Weiming Zhang; Yin Zhao; Xiangmou Qu; Jiamu Zhou; Jun Wang; Congmin Zheng; Yuanyi Song; Hongjiang Chen; Heyuan Huang; Jihong Wang; Jiaxin Yin; Jingwei Yu; Junwei Liao; Qiuying Peng; Xingyu Lou; Jun Wang; Weiwen Liu; Zhuosheng Zhang; Weinan Zhang

arXiv:2510.19386·cs.MA·October 27, 2025

ColorAgent: Building A Robust, Personalized, and Interactive OS Agent

Ning Li, Qiqiang Lin, Zheng Wu, Xiaoyun Mo, Weiming Zhang, Yin Zhao, Xiangmou Qu, Jiamu Zhou, Jun Wang, Congmin Zheng, Yuanyi Song, Hongjiang Chen, Heyuan Huang, Jihong Wang, Jiaxin Yin, Jingwei Yu, Junwei Liao, Qiuying Peng, Xingyu Lou, Jun Wang, Weiwen Liu, Zhuosheng Zhang

PDF

TL;DR

ColorAgent is a novel OS agent that combines reinforcement learning, multi-agent frameworks, and personalized interaction to enable robust, long-horizon, and user-centric operations, achieving state-of-the-art success rates on benchmark tests.

Contribution

The paper introduces ColorAgent, a comprehensive OS agent with enhanced capabilities for long-term interaction and personalization, utilizing step-wise reinforcement learning and multi-agent systems.

Findings

01

Achieved 77.2% success on AndroidWorld benchmark.

02

Achieved 50.7% success on AndroidLab benchmark.

03

Proposed new directions for evaluation and collaboration in OS agents.

Abstract

With the advancements in hardware, software, and large language model technologies, the interaction between humans and operating systems has evolved from the command-line interface to the rapidly emerging AI agent interactions. Building an operating system (OS) agent capable of executing user instructions and faithfully following user desires is becoming a reality. In this technical report, we present ColorAgent, an OS agent designed to engage in long-horizon, robust interactions with the environment while also enabling personalized and proactive user interaction. To enable long-horizon interactions with the environment, we enhance the model's capabilities through step-wise reinforcement learning and self-evolving training, while also developing a tailored multi-agent framework that ensures generality, consistency, and robustness. In terms of user interaction, we explore personalized…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.