The Trajectory Alignment Coefficient in Two Acts: From Reward Tuning to Reward Learning
Calarina Muslimani, Yunshu Du, Kenta Kawamoto, Kaushik Subramanian, Peter Stone, Peter Wurman

TL;DR
This paper introduces the Trajectory Alignment Coefficient (TAC) as a tool to improve reward function design in reinforcement learning, supporting practitioners and enabling reward learning through a differentiable approximation called Soft-TAC.
Contribution
It presents TAC as a metric for aligning reward preferences with expert judgments and develops Soft-TAC for training reward models directly from human preferences.
Findings
TAC helps practitioners create more effective reward functions with lower workload.
Reward models trained with Soft-TAC better capture preference-specific objectives.
Using TAC as a reward learning objective yields policies with more distinct behaviors.
Abstract
The success of reinforcement learning (RL) is fundamentally tied to having a reward function that accurately reflects the task objective. Yet, designing reward functions is notoriously time-consuming and prone to misspecification. To address this issue, our first goal is to understand how to support RL practitioners in specifying appropriate weights for a reward function. We leverage the Trajectory Alignment Coefficient (TAC), a metric that evaluates how closely a reward function's induced preferences match those of a domain expert. To evaluate whether TAC provides effective support in practice, we conducted a human-subject study in which RL practitioners tuned reward weights for Lunar Lander. We found that providing TAC during reward tuning led participants to produce more performant reward functions and report lower cognitive workload relative to standard tuning without TAC. However,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman-Automation Interaction and Safety · Reinforcement Learning in Robotics · Autonomous Vehicle Technology and Safety
