ARIA: Training Language Agents with Intention-Driven Reward Aggregation

Ruihan Yang; Yikai Zhang; Aili Chen; Xintao Wang; Siyu Yuan; Jiangjie Chen; Deqing Yang; Yanghua Xiao

arXiv:2506.00539·cs.CL·June 5, 2025

ARIA: Training Language Agents with Intention-Driven Reward Aggregation

Ruihan Yang, Yikai Zhang, Aili Chen, Xintao Wang, Siyu Yuan, Jiangjie Chen, Deqing Yang, Yanghua Xiao

PDF

Open Access

TL;DR

ARIA introduces a novel reward aggregation method in intention space that reduces reward variance and enhances training efficiency for language agents in complex, open-ended environments, leading to notable performance improvements.

Contribution

The paper presents ARIA, a new approach that projects language actions into an intention space to improve reward signal density and reinforcement learning effectiveness.

Findings

01

Reduces policy gradient variance significantly.

02

Achieves an average of 9.95% performance improvement.

03

Outperforms baseline RL methods across tasks.

Abstract

Large language models (LLMs) have enabled agents to perform complex reasoning and decision-making through free-form language interactions. However, in open-ended language action environments (e.g., negotiation or question-asking games), the action space can be formulated as a joint distribution over tokens, resulting in an exponentially large action space. Sampling actions in such a space can lead to extreme reward sparsity, which brings large reward variance, hindering effective reinforcement learning (RL). To address this, we propose ARIA, a method that Aggregates Rewards in Intention space to enable efficient and effective language Agents training. ARIA aims to project natural language actions from the high-dimensional joint token distribution space into a low-dimensional intention space, where semantically similar actions are clustered and assigned shared rewards. This…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Reinforcement Learning in Robotics