ARIA: Training Language Agents with Intention-Driven Reward Aggregation
Ruihan Yang, Yikai Zhang, Aili Chen, Xintao Wang, Siyu Yuan, Jiangjie Chen, Deqing Yang, Yanghua Xiao

TL;DR
ARIA introduces a novel reward aggregation method in intention space that reduces reward variance and enhances training efficiency for language agents in complex, open-ended environments, leading to notable performance improvements.
Contribution
The paper presents ARIA, a new approach that projects language actions into an intention space to improve reward signal density and reinforcement learning effectiveness.
Findings
Reduces policy gradient variance significantly.
Achieves an average of 9.95% performance improvement.
Outperforms baseline RL methods across tasks.
Abstract
Large language models (LLMs) have enabled agents to perform complex reasoning and decision-making through free-form language interactions. However, in open-ended language action environments (e.g., negotiation or question-asking games), the action space can be formulated as a joint distribution over tokens, resulting in an exponentially large action space. Sampling actions in such a space can lead to extreme reward sparsity, which brings large reward variance, hindering effective reinforcement learning (RL). To address this, we propose ARIA, a method that Aggregates Rewards in Intention space to enable efficient and effective language Agents training. ARIA aims to project natural language actions from the high-dimensional joint token distribution space into a low-dimensional intention space, where semantically similar actions are clustered and assigned shared rewards. This…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Multimodal Machine Learning Applications · Reinforcement Learning in Robotics
