PA3: Policy-Aware Agent Alignment through Chain-of-Thought

Shubhashis Roy Dipta; Daniel Bis; Kun Zhou; Lichao Wang; Benjamin Z. Yao; Chenlei Guo; Ruhi Sarikaya

arXiv:2603.14602·cs.CL·March 24, 2026

PA3: Policy-Aware Agent Alignment through Chain-of-Thought

Shubhashis Roy Dipta, Daniel Bis, Kun Zhou, Lichao Wang, Benjamin Z. Yao, Chenlei Guo, Ruhi Sarikaya

PDF

Open Access

TL;DR

This paper introduces a multi-stage alignment approach for large language models to better adhere to business policies during reasoning, reducing prompt length and improving accuracy without including full policies in context.

Contribution

The paper presents a novel method for models to recall and apply policies during reasoning, along with a new reward and penalty to enhance policy adherence without lengthy prompts.

Findings

01

Model outperforms baseline by 16 points

02

Surpasses in-context baselines by 3 points

03

Uses 40% fewer words in prompts

Abstract

Conversational assistants powered by large language models (LLMs) excel at tool-use tasks but struggle with adhering to complex, business-specific rules. While models can reason over business rules provided in context, including all policies for every query introduces high latency and wastes compute. Furthermore, these lengthy prompts lead to long contexts, harming overall performance due to the "needle-in-the-haystack" problem. To address these challenges, we propose a multi-stage alignment method that teaches models to recall and apply relevant business policies during chain-of-thought reasoning at inference time, without including the full business policy in-context. Furthermore, we introduce a novel PolicyRecall reward based on the Jaccard score and a Hallucination Penalty for GRPO training. Altogether, our best model outperforms the baseline by 16 points and surpasses comparable…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAI in Service Interactions · Topic Modeling · Multimodal Machine Learning Applications