Safe and Near-Optimal Policy Learning for Model Predictive Control using   Primal-Dual Neural Networks

Xiaojing Zhang; Monimoy Bujarbaruah; Francesco Borrelli

arXiv:1906.08257·cs.LG·June 21, 2019·6 cites

Safe and Near-Optimal Policy Learning for Model Predictive Control using Primal-Dual Neural Networks

Xiaojing Zhang, Monimoy Bujarbaruah, Francesco Borrelli

PDF

Open Access 1 Repo

TL;DR

This paper introduces a supervised learning framework for explicit model predictive control that includes a certificate policy to estimate sub-optimality online, enabling fast and reliable control on resource-limited systems.

Contribution

It presents a novel approach to learn control and certificate policies simultaneously, providing probabilistic guarantees on feasibility and optimality without requiring real-time optimization.

Findings

01

Achieved up to 100x speedup in control computation.

02

Provided probabilistic bounds on policy infeasibility and suboptimality.

03

Demonstrated effectiveness on vehicle dynamics control.

Abstract

In this paper, we propose a novel framework for approximating the explicit MPC law for linear parameter-varying systems using supervised learning. In contrast to most existing approaches, we not only learn the control policy, but also a "certificate policy", that allows us to estimate the sub-optimality of the learned control policy online, during execution-time. We learn both these policies from data using supervised learning techniques, and also provide a randomized method that allows us to guarantee the quality of each learned policy, measured in terms of feasibility and optimality. This in turn allows us to bound the probability of the learned control policy of being infeasible or suboptimal, where the check is performed by the certificate policy. Since our algorithm does not require the solution of an optimization problem during run-time, it can be deployed even on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

monimoyb/pd_polLearn
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Control Systems Optimization · Fuel Cells and Related Materials · Fault Detection and Control Systems