ARL-Tangram: Unleash the Resource Efficiency in Agentic Reinforcement Learning

Bangjun Xiao; Yihao Zhao; Xiangwei Deng; Shihua Yu; Yuxing Xiang; Huaqiu Liu; Qiying Wang; Liang Zhao; Hailin Zhang; Xuanzhe Liu; Xin Jin; Fuli Luo

arXiv:2603.13019·cs.DC·March 16, 2026

ARL-Tangram: Unleash the Resource Efficiency in Agentic Reinforcement Learning

Bangjun Xiao, Yihao Zhao, Xiangwei Deng, Shihua Yu, Yuxing Xiang, Huaqiu Liu, Qiying Wang, Liang Zhao, Hailin Zhang, Xuanzhe Liu, Xin Jin, Fuli Luo

PDF

Open Access

TL;DR

ARL-Tangram introduces a resource management system that significantly enhances resource efficiency and reduces external resource usage in agentic reinforcement learning by enabling fine-grained, elastic resource sharing.

Contribution

It proposes action-level orchestration and an elastic scheduling algorithm for resource management in agentic RL, improving efficiency and resource utilization.

Findings

01

Up to 4.3× reduction in action completion time

02

Speeds up RL training step duration by up to 1.5×

03

Reduces external resource consumption by up to 71.2%

Abstract

Agentic reinforcement learning (RL) has emerged as a transformative workload in cloud clusters, enabling large language models (LLMs) to solve complex problems through interactions with real world. However, unlike traditional RL, agentic RL demands substantial external cloud resources, e.g., CPUs for code execution and GPUs for reward models, that exist outside the primary training cluster. Existing agentic RL framework typically rely on static over-provisioning, i.e., resources are often tied to long-lived trajectories or isolated by tasks, which leads to severe resource inefficiency. We propose the action-level orchestration, and incorporate it into ARL-Tangram, a unified resource management system that enables fine-grained external resource sharing and elasticity. ARL-Tangram utilizes a unified action-level formulation and an elastic scheduling algorithm to minimize action…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIoT and Edge/Fog Computing · Cloud Computing and Resource Management · Big Data and Digital Economy