On Entropy Regularized Path Integral Control for Trajectory Optimization

Tom Lefebvre; Guillaume Crevecoeur

arXiv:2007.03960·math.OC·October 28, 2020

On Entropy Regularized Path Integral Control for Trajectory Optimization

Tom Lefebvre, Guillaume Crevecoeur

PDF

TL;DR

This paper extends Path Integral Control to a broader class of stochastic optimal control problems by introducing entropy regularization, analyzing convergence, and deriving explicit updates for improved trajectory optimization.

Contribution

It formulates Entropy Regularized Trajectory Optimization, broadening PIC applicability, and connects it with RL and stochastic search methods, providing theoretical insights and explicit algorithms.

Findings

01

Proposed a less restrictive class of SOC problems with entropy regularization.

02

Analyzed convergence behavior of the state trajectory distribution sequence.

03

Derived explicit update rules for the entropy regularized PIC.

Abstract

In this article we present a generalised view on Path Integral Control (PIC) methods. PIC refers to a particular class of policy search methods that are closely tied to the setting of Linearly Solvable Optimal Control (LSOC), a restricted subclass of nonlinear Stochastic Optimal Control (SOC) problems. This class is unique in the sense that it can be solved explicitly to yield a formal optimal state trajectory distribution. In this contribution we first review the PIC theory and discuss related algorithms tailored to policy search in general. We are able to identify a generic design strategy that relies on the existence of an optimal state trajectory distribution and finds a parametric policy by minimizing the cross entropy between the optimal and a state trajectory distribution parametrized through its policy. Inspired by this observation we then aim to formulate a SOC problem that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.