Guided Policy Search using Sequential Convex Programming for   Initialization of Trajectory Optimization Algorithms

Taewan Kim; Purnanand Elango; Danylo Malyuta; and Behcet Acikmese

arXiv:2110.06975·math.OC·January 17, 2024

Guided Policy Search using Sequential Convex Programming for Initialization of Trajectory Optimization Algorithms

Taewan Kim, Purnanand Elango, Danylo Malyuta, and Behcet Acikmese

PDF

Open Access

TL;DR

This paper introduces a guided policy search method that uses sequential convex programming to generate effective initial trajectories for nonlinear trajectory optimization, improving efficiency and solution quality in complex control tasks.

Contribution

The paper proposes a novel guided policy search approach combining SCP-based trajectory generation with neural network policy training for better initialization.

Findings

01

Generated initial trajectories close to optimal and feasible

02

Improved convergence speed of trajectory optimization

03

Validated on a real-world rocket descent problem

Abstract

Nonlinear trajectory optimization algorithms have been developed to handle optimal control problems with nonlinear dynamics and nonconvex constraints in trajectory planning. The performance and computational efficiency of many trajectory optimization methods are sensitive to the initial guess, i.e., the trajectory guess needed by the recursive trajectory optimization algorithm. Motivated by this observation, we tackle the initialization problem for trajectory optimization via policy optimization. To optimize a policy, we propose a guided policy search method that has two key components: i) Trajectory update; ii) Policy update. The trajectory update involves offline solutions of a large number of trajectory optimization problems from different initial states via Sequential Convex Programming (SCP). Here we take a single SCP step to generate the trajectory iterate for each problem. In…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotic Path Planning Algorithms · Spacecraft Dynamics and Control · Optimization and Search Problems