Optimizing Trajectories with Closed-Loop Dynamic SQP
Sumeet Singh, Jean-Jacques Slotine, Vikas Sindhwani

TL;DR
This paper introduces a novel dynamic programming approach to optimize trajectories with constraints using a hybrid SQP method that incorporates closed-loop policies, significantly improving convergence speed over traditional open-loop methods.
Contribution
It develops a new dynamic programming-based recursion for constrained problems and a hybrid SQP algorithm with parallelized feedback gain computation.
Findings
Significant convergence speed improvements over standard open-loop SQP.
Effective computation of feedback gains for constrained trajectory optimization.
Validated on challenging benchmarks with promising results.
Abstract
Indirect trajectory optimization methods such as Differential Dynamic Programming (DDP) have found considerable success when only planning under dynamic feasibility constraints. Meanwhile, nonlinear programming (NLP) has been the state-of-the-art approach when faced with additional constraints (e.g., control bounds, obstacle avoidance). However, a nave implementation of NLP algorithms, e.g., shooting-based sequential quadratic programming (SQP), may suffer from slow convergence -- caused from natural instabilities of the underlying system manifesting as poor numerical stability within the optimization. Re-interpreting the DDP closed-loop rollout policy as a sensitivity-based correction to a second-order search direction, we demonstrate how to compute analogous closed-loop policies (i.e., feedback gains) for constrained problems. Our key theoretical result introduces a novel dynamic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotic Path Planning Algorithms · Autonomous Vehicle Technology and Safety
