A Neural Network Approach for High-Dimensional Optimal Control Applied to Multi-Agent Path Finding

Derek Onken; Levon Nurbekyan; Xingjian Li; Samy Wu Fung; Stanley Osher; Lars Ruthotto

arXiv:2104.03270 · math.OC · June 29, 2022 · IEEE Trans. Control Syst. Technol.

TL;DR

This paper introduces a neural network method that yields approximate solutions to high-dimensional optimal control problems. The network is trained offline and then produces feedback controls in real time, which the authors demonstrate on multi-agent path finding.

Contribution

The authors fuse HJB and PMP approaches by parameterizing the value function with an NN, providing a grid-free, scalable method for high-dimensional optimal control.
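For readers less familiar with the two approaches, the relation between them can be sketched as follows. The notation here (running cost $L$, terminal cost $G$, dynamics $f$, value function $\Phi$, network weights $\theta$) and the sign convention are assumptions made for illustration and may differ from the paper's.

```latex
\begin{aligned}
&\min_{u}\ \int_t^T L\big(s, z(s), u(s)\big)\,ds + G\big(z(T)\big)
  \quad \text{s.t.}\quad \dot z(s) = f\big(s, z(s), u(s)\big),\ z(t) = x,\\
&H(s, z, p) = \sup_{u}\,\big\{ -p\cdot f(s,z,u) - L(s,z,u) \big\},\\
&-\partial_s \Phi(s,z) + H\big(s,z,\nabla_z\Phi(s,z)\big) = 0,
  \qquad \Phi(T,z) = G(z),\\
&u^*(s,z) \in \arg\max_{u}\,\big\{ -\nabla_z\Phi(s,z)\cdot f(s,z,u) - L(s,z,u) \big\},
  \qquad \Phi \approx \Phi_\theta .
\end{aligned}
```

The third line is the HJB equation for the value function, and the fourth is the feedback form of the PMP optimality condition. Replacing $\Phi$ with a network $\Phi_\theta$ means the control at any space-time point follows from one gradient evaluation rather than from a grid-based solve.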

Findings

01

Controls are generated in milliseconds once the network is trained, whereas efficient nonlinear programming methods typically take seconds

02

Successfully applied to multi-agent collision avoidance in up to 150 dimensions

03

Number of NN parameters scales linearly with problem dimension

Abstract

We propose a neural network approach that yields approximate solutions for high-dimensional optimal control problems and demonstrate its effectiveness using examples from multi-agent path finding. Our approach yields controls in a feedback form, where the policy function is given by a neural network (NN). Specifically, we fuse the Hamilton-Jacobi-Bellman (HJB) and Pontryagin Maximum Principle (PMP) approaches by parameterizing the value function with an NN. Our approach enables us to obtain approximately optimal controls in real-time without having to solve an optimization problem. Once the policy function is trained, generating a control at a given space-time location takes milliseconds; in contrast, efficient nonlinear programming methods typically perform the same task in seconds. We train the NN offline using the objective function of the control problem and penalty terms that…
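The feedback form described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: a closed-form value function stands in for the trained NN, the dynamics are assumed to be dz/dt = u with running cost ½‖u‖² and terminal cost (α/2)‖z − target‖², and the names `ALPHA`, `T`, `TARGET`, `phi`, `grad_z`, `policy`, and `rollout` are hypothetical. Under these assumptions the optimal control is the negative gradient of the value function, so each control costs one gradient evaluation and no optimization problem is solved online.

```python
# Minimal sketch of feedback control from a value function.
# Assumptions (not from the paper): dynamics dz/dt = u, running cost 0.5*|u|^2,
# terminal cost (ALPHA/2)*|z - TARGET|^2, and a closed-form value function
# used as a stand-in for a trained neural network.

ALPHA = 100.0          # terminal-cost weight (hypothetical)
T = 1.0                # time horizon (hypothetical)
TARGET = [2.0, -1.0]   # target state (hypothetical)


def phi(t, z):
    """Closed-form value function for the assumed problem (NN stand-in)."""
    dist2 = sum((zi - gi) ** 2 for zi, gi in zip(z, TARGET))
    return ALPHA * dist2 / (2.0 * (1.0 + ALPHA * (T - t)))


def grad_z(value_fn, t, z, h=1e-5):
    """Central finite-difference gradient in z.

    A trained network would typically use automatic differentiation instead.
    """
    grad = []
    for i in range(len(z)):
        z_plus, z_minus = list(z), list(z)
        z_plus[i] += h
        z_minus[i] -= h
        grad.append((value_fn(t, z_plus) - value_fn(t, z_minus)) / (2.0 * h))
    return grad


def policy(t, z):
    """Feedback control u(t, z) = -grad_z phi(t, z) for the assumed problem."""
    return [-g for g in grad_z(phi, t, z)]


def rollout(z0, n_steps=200):
    """Integrate dz/dt = policy(t, z) with forward Euler from t = 0 to t = T."""
    dt = T / n_steps
    z = list(z0)
    for k in range(n_steps):
        u = policy(k * dt, z)
        z = [zi + dt * ui for zi, ui in zip(z, u)]
    return z


z_final = rollout([0.0, 0.0])
print(z_final)  # ends near TARGET; the small gap reflects the soft terminal penalty
```

Because the policy is a function of (t, z), the same `policy` can be queried from any state, including one perturbed mid-trajectory, without recomputing anything offline.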

Code & Models

Repositories

donken/NeuralOC (PyTorch, official)
