Guided Policy Search using Sequential Convex Programming for Initialization of Trajectory Optimization Algorithms
Taewan Kim, Purnanand Elango, Danylo Malyuta, and Behcet Acikmese

TL;DR
This paper introduces a guided policy search method that uses sequential convex programming to generate effective initial trajectories for nonlinear trajectory optimization, improving efficiency and solution quality in complex control tasks.
Contribution
The paper proposes a novel guided policy search approach combining SCP-based trajectory generation with neural network policy training for better initialization.
Findings
Generated initial trajectories close to optimal and feasible
Improved convergence speed of trajectory optimization
Validated on a real-world rocket descent problem
Abstract
Nonlinear trajectory optimization algorithms have been developed to handle optimal control problems with nonlinear dynamics and nonconvex constraints in trajectory planning. The performance and computational efficiency of many trajectory optimization methods are sensitive to the initial guess, i.e., the trajectory guess needed by the recursive trajectory optimization algorithm. Motivated by this observation, we tackle the initialization problem for trajectory optimization via policy optimization. To optimize a policy, we propose a guided policy search method that has two key components: i) Trajectory update; ii) Policy update. The trajectory update involves offline solutions of a large number of trajectory optimization problems from different initial states via Sequential Convex Programming (SCP). Here we take a single SCP step to generate the trajectory iterate for each problem. In…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotic Path Planning Algorithms · Spacecraft Dynamics and Control · Optimization and Search Problems
