PIVOT: Bridging Planning and Execution in LLM Agents via Trajectory Refinement
Tuo Zhang, Alin-Ionut Popa, Yan Xu, Rui Song, Dimitrios Dimitriadis

TL;DR
PIVOT is a self-supervised framework that iteratively refines LLM-generated plans to improve execution success by addressing infeasibility and constraint violations, achieving state-of-the-art results.
Contribution
The paper introduces PIVOT, a novel trajectory refinement method that enhances plan-execution alignment in LLM agents through a four-stage, self-supervised process.
Findings
PIVOT achieves up to 94% improvement in constraint satisfaction with human feedback.
The autonomous version of PIVOT retains significant performance gains without external supervision.
PIVOT requires fewer tokens than competing methods, demonstrating computational efficiency.
Abstract
Large language model (LLM)-based agents frequently generate seemingly coherent plans that fail upon execution due to infeasible actions, constraint violations, and compounding errors over extended horizons. PIVOT (Plan-Inspect-eVOlve Trajectories) addresses this plan-execution misalignment through a self-supervised framework that treats trajectories as optimizable objects iteratively refined via environment interaction. The framework comprises four stages: PLAN generates candidate trajectories; INSPECT executes them and computes structured losses with textual gradients encoding plan-execution discrepancies; EVOLVE applies these signals to produce improved trajectories; and VERIFY performs a final global check against task constraints. A monotonic acceptance process ensures a non-decreasing solution quality. Empirical evaluations on DeepPlanning and GAIA demonstrate state-of-the-art…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
