TL;DR
This paper introduces a lightweight, signal-based framework for efficiently triaging large, complex agent interaction trajectories, improving informativeness in evaluation and post-deployment analysis.
Contribution
It proposes a novel set of computationally inexpensive signals for trajectory triage, enabling better sampling and analysis of agentic interactions without affecting online performance.
Findings
Signal-based sampling achieves 82% informativeness rate, outperforming heuristic filtering and random sampling.
The approach provides a 1.52x efficiency gain per informative trajectory.
Robust across different reward levels and task domains.
Abstract
Agentic applications based on large language models increasingly rely on multi-step interaction loops involving planning, action execution, and environment feedback. While such systems are now deployed at scale, improving them post-deployment remains challenging. Agent trajectories are voluminous and non-deterministic, and reviewing each one, whether through human review or auxiliary LLMs, is slow and cost-prohibitive. We propose a lightweight, signal-based framework for triaging agentic interaction trajectories. Our approach computes cheap, broadly applicable signals from live interactions and attaches them as structured attributes for trajectory triage, identifying interactions likely to be informative without affecting online agent behavior. We organize signals into a coarse-grained taxonomy spanning interaction (misalignment, stagnation, disengagement, satisfaction), execution…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
