Safe Pontryagin Differentiable Programming

Wanxin Jin; Shaoshuai Mou; George J. Pappas

arXiv:2105.14937 · cs.LG · October 27, 2021

TL;DR

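Safe Pontryagin Differentiable Programming (Safe PDP) is a theoretical and algorithmic framework for safety-critical learning and control tasks. It incorporates state and input constraints into the cost or loss through barrier functions, so that the constraints are satisfied at every stage of learning and control, and it approximates the constrained solution and its gradient by solving more efficient unconstrained counterparts.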

Contribution

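The paper introduces the Safe PDP methodology, which guarantees safety throughout learning and control by incorporating constraints into the cost or loss via barrier functions and by approximating the constrained solution and its gradient with their unconstrained counterparts, to an accuracy controlled by a barrier parameter.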

Findings

01 Successfully applied to safe policy optimization

02 Effective in safe motion planning for complex systems

03 Demonstrated safety guarantees in learning MPCs from demonstrations

Abstract

We propose a Safe Pontryagin Differentiable Programming (Safe PDP) methodology, which establishes a theoretical and algorithmic framework to solve a broad class of safety-critical learning and control tasks -- problems that require the guarantee of safety constraint satisfaction at any stage of the learning and control progress. In the spirit of interior-point methods, Safe PDP handles different types of system constraints on states and inputs by incorporating them into the cost or loss through barrier functions. We prove three fundamentals of the proposed Safe PDP: first, both the solution and its gradient in the backward pass can be approximated by solving their more efficient unconstrained counterparts; second, the approximation for both the solution and its gradient can be controlled for arbitrary accuracy by a barrier parameter; and third, importantly, all intermediate results…
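The barrier idea in the abstract can be illustrated on a one-dimensional toy problem. The sketch below is not taken from the paper or its repository: the cost (x - 2)^2, the constraint x <= 1, and the names `barrier_objective`, `solve_barrier`, and `closed_form` are all hypothetical, chosen only to show how a log barrier turns a constrained problem into an unconstrained one whose solution stays strictly feasible and approaches the constrained optimum (x = 1) as the barrier parameter shrinks.

```python
import math


def barrier_objective(x, gamma):
    """Toy cost (x - 2)^2 plus a log barrier for the constraint x <= 1 (illustrative only)."""
    return (x - 2.0) ** 2 - gamma * math.log(1.0 - x)


def solve_barrier(gamma, x0=0.0, iters=100, tol=1e-12):
    """Damped Newton on the unconstrained barrier problem, started strictly inside x < 1."""
    x = x0
    for _ in range(iters):
        slack = 1.0 - x
        grad = 2.0 * (x - 2.0) + gamma / slack
        if abs(grad) < tol:
            break
        hess = 2.0 + gamma / slack ** 2
        step = -grad / hess
        t = 1.0
        # Halve the step until the iterate is strictly feasible and the cost does not increase.
        while t > 1e-16 and (
            x + t * step >= 1.0
            or barrier_objective(x + t * step, gamma) > barrier_objective(x, gamma)
        ):
            t *= 0.5
        if t <= 1e-16:
            break  # no acceptable step left: treat the iterate as converged
        x += t * step
    return x


def closed_form(gamma):
    """Feasible root of 2(x - 2) + gamma / (1 - x) = 0, used only as a check."""
    return (3.0 - math.sqrt(1.0 + 2.0 * gamma)) / 2.0


if __name__ == "__main__":
    for gamma in (1.0, 0.1, 0.01, 0.001):
        x = solve_barrier(gamma)
        print(f"gamma={gamma:g}  x={x:.6f}  gap to constraint={1.0 - x:.6f}")
```

Every iterate stays inside x < 1 because the line search rejects any step that reaches the boundary, which mirrors, in a very reduced form, the claim that intermediate results respect the constraints. Decreasing `gamma` tightens the approximation of the constrained optimum at the price of a more ill-conditioned problem near the boundary.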

Code & Models

Repositories

wanxinjin/Safe-PDP (official)

Videos

Safe Pontryagin Differentiable Programming · SlidesLive

Taxonomy

Topics: Adversarial Robustness in Machine Learning · Advanced Control Systems Optimization · Reinforcement Learning in Robotics

Methods: Random Convolutional Kernel Transform