Learning to Optimize in Model Predictive Control

Jacob Sacks; Byron Boots

arXiv:2212.02603·cs.RO·December 7, 2022

Learning to Optimize in Model Predictive Control

Jacob Sacks, Byron Boots

PDF

TL;DR

This paper introduces a method to improve sampling-based Model Predictive Control by learning more effective update rules, enabling better performance with fewer samples through imitation learning.

Contribution

It proposes a novel approach to learn the update rule in MPC, enhancing efficiency in sample-constrained scenarios by mimicking an expert with more samples.

Findings

01

Outperforms standard MPC with the same number of samples

02

Effective in multiple simulated robotics tasks

03

Reduces the need for extensive sampling in MPC

Abstract

Sampling-based Model Predictive Control (MPC) is a flexible control framework that can reason about non-smooth dynamics and cost functions. Recently, significant work has focused on the use of machine learning to improve the performance of MPC, often through learning or fine-tuning the dynamics or cost function. In contrast, we focus on learning to optimize more effectively. In other words, to improve the update rule within MPC. We show that this can be particularly useful in sampling-based MPC, where we often wish to minimize the number of samples for computational reasons. Unfortunately, the cost of computational efficiency is a reduction in performance; fewer samples results in noisier updates. We show that we can contend with this noise by learning how to update the control distribution more effectively and make better use of the few samples that we have. Our learned controllers are…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsTest