Autonomous Continual Learning for Environment Adaptation of Computer-Use Agents
Tianci Xue, Zeyi Liao, Tianneng Shi, Zilu Wang, Kai Zhang, Dawn Song, Yu Su, Huan Sun

TL;DR
This paper presents ACuRL, an autonomous curriculum reinforcement learning framework that enables computer-use agents to adapt continually to diverse environments without human-labeled data, achieving significant performance improvements.
Contribution
It introduces a novel autonomous curriculum learning approach with a robust automatic evaluator, facilitating environment-specific adaptation without human supervision.
Findings
Achieves 3-29% performance gains in target environments.
Maintains robustness with 93% agreement between CUAJudge and human judgments.
Effectively mitigates performance degradation under environment changes.
Abstract
Real-world digital environments are highly diverse and dynamic. These characteristics cause agents to frequently encounter unseen environments and distribution shifts, making continual learning in such environments essential for computer-use agents (CUAs). However, a key challenge lies in obtaining high-quality and environment-grounded training data without relying on costly human annotation. In this work, we introduce ACuRL, an Autonomous Curriculum Reinforcement Learning framework that continually adapts agents to specific environments with zero human data. The agent first explores an environment to acquire initial experiences. During subsequent iterative training, a curriculum task generator leverages these experiences together with feedback from the previous iteration to synthesize new tasks tailored for the agent's current capabilities. To provide reliable reward signals, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
