TL;DR
This paper proposes a novel method to improve infinite-horizon model predictive control by using planner data as an approximate value function, enhancing efficiency and robustness in goal-directed robotics tasks.
Contribution
It introduces a new approach to approximate value functions for MPC using planner data, reducing reliance on reinforcement learning for robotics applications.
Findings
Planner data can serve as an effective approximate value function.
Using this approach improves MPC efficiency and resilience.
The method is particularly suited for goal-directed tasks like navigation.
Abstract
Model Predictive Control (MPC) is a classic tool for optimal control of complex, real-world systems. Although it has been successfully applied to a wide range of challenging tasks in robotics, it is fundamentally limited by the prediction horizon, which, if too short, will result in myopic decisions. Recently, several papers have suggested using a learned value function as the terminal cost for MPC. If the value function is accurate, it effectively allows MPC to reason over an infinite horizon. Unfortunately, Reinforcement Learning (RL) solutions to value function approximation can be difficult to realize for robotics tasks. In this paper, we suggest a more efficient method for value function approximation that applies to goal-directed problems, like reaching and navigation. In these problems, MPC is often formulated to track a path or trajectory returned by a planner. However, this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
