Using Neural Networks to Compute Approximate and Guaranteed Feasible Hamilton-Jacobi-Bellman PDE Solutions
Frank Jiang, Glen Chou, Mo Chen, Claire J. Tomlin

TL;DR
This paper introduces a neural network-based algorithm for approximating solutions to Hamilton-Jacobi-Bellman PDEs that guarantees feasible controls, reduces computational complexity, and extends planning capabilities without state space discretization.
Contribution
The authors develop a grid-free neural network framework that guarantees feasible control solutions for HJB PDEs, applicable to high-dimensional systems like the Dubins car.
Findings
Achieves near-optimal control with guaranteed feasibility.
Significantly reduces computation time and space complexity.
Enables longer horizon planning with minimal additional computation.
Abstract
To sidestep the curse of dimensionality when computing solutions to Hamilton-Jacobi-Bellman partial differential equations (HJB PDE), we propose an algorithm that leverages a neural network to approximate the value function. We show that our final approximation of the value function generates near optimal controls which are guaranteed to successfully drive the system to a target state. Our framework is not dependent on state space discretization, leading to a significant reduction in computation time and space complexity in comparison with dynamic programming-based approaches. Using this grid-free approach also enables us to plan over longer time horizons with relatively little additional computation overhead. Unlike many previous neural network HJB PDE approximating formulations, our approximation is strictly conservative and hence any trajectories we generate will be strictly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Adversarial Robustness in Machine Learning · Reinforcement Learning in Robotics
