Pushing Forward Pareto Frontiers of Proactive Agents with Behavioral Agentic Optimization

Yihang Yao; Zhepeng Cen; Haohong Lin; Shiqi Liu; Zuxin Liu; Jiacheng Zhu; Zhang-Wei Hong; Laixi Shi; Ding Zhao

arXiv:2602.11351·cs.AI·February 13, 2026

Pushing Forward Pareto Frontiers of Proactive Agents with Behavioral Agentic Optimization

Yihang Yao, Zhepeng Cen, Haohong Lin, Shiqi Liu, Zuxin Liu, Jiacheng Zhu, Zhang-Wei Hong, Laixi Shi, Ding Zhao

PDF

Open Access

TL;DR

This paper introduces BAO, a novel agentic reinforcement learning framework that enhances proactive LLM agents by balancing task performance and user engagement, leading to more effective multi-turn interactions.

Contribution

We propose BAO, a new RL framework combining behavior enhancement and regularization to improve proactive, user-aligned LLM agents in multi-turn scenarios.

Findings

01

BAO outperforms existing proactive RL baselines.

02

BAO achieves comparable or better performance than commercial LLM agents.

03

BAO effectively balances task success with user satisfaction.

Abstract

Proactive large language model (LLM) agents aim to actively plan, query, and interact over multiple turns, enabling efficient task completion beyond passive instruction following and making them essential for real-world, user-centric applications. Agentic reinforcement learning (RL) has recently emerged as a promising solution for training such agents in multi-turn settings, allowing interaction strategies to be learned from feedback. However, existing pipelines face a critical challenge in balancing task performance with user engagement, as passive agents can not efficiently adapt to users' intentions while overuse of human feedback reduces their satisfaction. To address this trade-off, we propose BAO, an agentic RL framework that combines behavior enhancement to enrich proactive reasoning and information-gathering capabilities with behavior regularization to suppress inefficient or…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Topic Modeling · Reinforcement Learning in Robotics