Reinforced Model Predictive Control via Trust-Region Quasi-Newton Policy   Optimization

Dean Brandner; Sergio Lucia

arXiv:2405.17983·cs.LG·November 1, 2024

Reinforced Model Predictive Control via Trust-Region Quasi-Newton Policy Optimization

Dean Brandner, Sergio Lucia

PDF

1 Repo

TL;DR

This paper introduces a trust-region Quasi-Newton method for optimizing model predictive control policies, achieving faster convergence and better data efficiency compared to first-order reinforcement learning methods.

Contribution

It develops a novel second-order policy optimization algorithm for model predictive control using Quasi-Newton updates with trust-region constraints.

Findings

01

Outperforms first-order RL algorithms in data efficiency

02

Achieves superlinear convergence rate

03

Effective second-order derivative computation via linear system solutions

Abstract

Model predictive control can optimally deal with nonlinear systems under consideration of constraints. The control performance depends on the model accuracy and the prediction horizon. Recent advances propose to use reinforcement learning applied to a parameterized model predictive controller to recover the optimal control performance even if an imperfect model or short prediction horizons are used. However, common reinforcement learning algorithms rely on first order updates, which only have a linear convergence rate and hence need an excessive amount of dynamic data. Higher order updates are typically intractable if the policy is approximated with neural networks due to the large number of parameters. In this work, we use a parameterized model predictive controller as policy, and leverage the small amount of necessary parameters to propose a trust-region constrained Quasi-Newton…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

deanbrandner/ecc24_tr_improved_qn_po_for_mpc_in_rl
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.