TL;DR
This paper introduces a neural network-based method for solving high-dimensional optimal control problems, enabling real-time control in multi-agent path finding with scalable and efficient solutions that mitigate the curse of dimensionality.
Contribution
The authors fuse HJB and PMP approaches by parameterizing the value function with an NN, providing a grid-free, scalable method for high-dimensional optimal control.
Findings
Controls generated in milliseconds, much faster than traditional methods
Successfully applied to multi-agent collision avoidance in up to 150 dimensions
Number of NN parameters scales linearly with problem dimension
Abstract
We propose a neural network approach that yields approximate solutions for high-dimensional optimal control problems and demonstrate its effectiveness using examples from multi-agent path finding. Our approach yields controls in a feedback form, where the policy function is given by a neural network (NN). Specifically, we fuse the Hamilton-Jacobi-Bellman (HJB) and Pontryagin Maximum Principle (PMP) approaches by parameterizing the value function with an NN. Our approach enables us to obtain approximately optimal controls in real-time without having to solve an optimization problem. Once the policy function is trained, generating a control at a given space-time location takes milliseconds; in contrast, efficient nonlinear programming methods typically perform the same task in seconds. We train the NN offline using the objective function of the control problem and penalty terms that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
