Using Neural Networks to Compute Approximate and Guaranteed Feasible   Hamilton-Jacobi-Bellman PDE Solutions

Frank Jiang; Glen Chou; Mo Chen; Claire J. Tomlin

arXiv:1611.03158·cs.LG·March 28, 2017·20 cites

Using Neural Networks to Compute Approximate and Guaranteed Feasible Hamilton-Jacobi-Bellman PDE Solutions

Frank Jiang, Glen Chou, Mo Chen, Claire J. Tomlin

PDF

Open Access

TL;DR

This paper introduces a neural network-based algorithm for approximating solutions to Hamilton-Jacobi-Bellman PDEs that guarantees feasible controls, reduces computational complexity, and extends planning capabilities without state space discretization.

Contribution

The authors develop a grid-free neural network framework that guarantees feasible control solutions for HJB PDEs, applicable to high-dimensional systems like the Dubins car.

Findings

01

Achieves near-optimal control with guaranteed feasibility.

02

Significantly reduces computation time and space complexity.

03

Enables longer horizon planning with minimal additional computation.

Abstract

To sidestep the curse of dimensionality when computing solutions to Hamilton-Jacobi-Bellman partial differential equations (HJB PDE), we propose an algorithm that leverages a neural network to approximate the value function. We show that our final approximation of the value function generates near optimal controls which are guaranteed to successfully drive the system to a target state. Our framework is not dependent on state space discretization, leading to a significant reduction in computation time and space complexity in comparison with dynamic programming-based approaches. Using this grid-free approach also enables us to plan over longer time horizons with relatively little additional computation overhead. Unlike many previous neural network HJB PDE approximating formulations, our approximation is strictly conservative and hence any trajectories we generate will be strictly…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModel Reduction and Neural Networks · Adversarial Robustness in Machine Learning · Reinforcement Learning in Robotics