Training One Model to Master Cross-Level Agentic Actions via Reinforcement Learning

Kaichen He; Zihao Wang; Muyao Li; Anji Liu; Yitao Liang

arXiv:2512.09706 · cs.LG · December 11, 2025

TL;DR

CrossAgent is a unified agentic model, trained with reinforcement learning, that dynamically switches between heterogeneous action spaces to improve adaptability and performance in complex, open-world environments like Minecraft.

Contribution

The paper introduces CrossAgent, a model that learns to select the most effective action interface at each step of a trajectory. It is trained with a pipeline that combines cold-start supervised fine-tuning with a Multi-Turn Group Relative Policy Optimization (GRPO) algorithm.

Findings

01. Achieves state-of-the-art results on 800+ Minecraft tasks.

02. Outperforms fixed-action baselines in generalization and efficiency.

03. Demonstrates effective adaptive action switching in complex environments.

Abstract

The paradigm of agentic AI is shifting from engineered complex workflows to post-training native models. However, existing agents are typically confined to static, predefined action spaces--such as exclusively using APIs, GUI events, or robotic commands. This rigidity limits their adaptability in dynamic environments where the optimal granularity of interaction varies contextually. To bridge this gap, we propose CrossAgent, a unified agentic model that masters heterogeneous action spaces and autonomously selects the most effective interface for each step of a trajectory. We introduce a comprehensive training pipeline that integrates cold-start supervised fine-tuning with a Multi-Turn Group Relative Policy Optimization (GRPO) algorithm. This approach enables the agent to learn adaptive action switching--balancing high-level efficiency with low-level precision--without human-specified…
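
The abstract names a Multi-Turn GRPO algorithm but is truncated before any detail. As a rough illustration only, the sketch below shows the group-relative advantage normalization used in standard GRPO, where each rollout's reward is standardized against the other rollouts sampled for the same task. The function names, the `eps` constant, and the choice to share one trajectory-level advantage across all turns of a rollout are assumptions made for this example, not details taken from the paper.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Standardize each rollout's reward against its own sampling group.

    `rewards` holds one scalar return per rollout of the same task.
    `eps` (an assumed constant) avoids division by zero when all
    rollouts in the group receive the same reward.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)  # population standard deviation of the group
    return [(r - mu) / (sigma + eps) for r in rewards]


def broadcast_to_turns(advantages, turns_per_rollout):
    """Assumption: every turn in a rollout reuses its trajectory-level advantage."""
    return [[a] * n for a, n in zip(advantages, turns_per_rollout)]


# Hypothetical group of four rollouts for one task: two succeed, two fail.
rewards = [1.0, 0.0, 0.0, 1.0]
adv = group_relative_advantages(rewards)
per_turn = broadcast_to_turns(adv, [3, 2, 1, 4])
```

Successful rollouts receive positive advantages and failed ones negative, so no learned value function is needed as a baseline. How the paper assigns credit across turns or across action interfaces is not stated in the truncated abstract.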

Taxonomy

Topics: Reinforcement Learning in Robotics · Multimodal Machine Learning Applications · Artificial Intelligence in Games