Goal-Conditioned Terminal Value Estimation for Real-time and Multi-task Model Predictive Control
Mitsuki Morita, Satoshi Yamamori, Satoshi Yagi, Norikazu Sugimoto, Jun, Morimoto

TL;DR
This paper introduces a goal-conditioned terminal value learning approach within MPC that enables real-time, multi-task control and diverse motion generation for robots, reducing computational costs and adapting to dynamic tasks.
Contribution
The study develops a hierarchical MPC framework with goal-conditioned terminal value learning for multitask optimization and diverse motion planning.
Findings
Enables real-time control of a bipedal robot on sloped terrain.
Allows the robot to track target trajectories effectively.
Reduces computational time compared to traditional MPC methods.
Abstract
While MPC enables nonlinear feedback control by solving an optimal control problem at each timestep, the computational burden tends to be significantly large, making it difficult to optimize a policy within the control period. To address this issue, one possible approach is to utilize terminal value learning to reduce computational costs. However, the learned value cannot be used for other tasks in situations where the task dynamically changes in the original MPC setup. In this study, we develop an MPC framework with goal-conditioned terminal value learning to achieve multitask policy optimization while reducing computational time. Furthermore, by using a hierarchical control structure that allows the upper-level trajectory planner to output appropriate goal-conditioned trajectories, we demonstrate that a robot model is able to generate diverse motions. We evaluate the proposed method…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Control Systems Optimization · Fault Detection and Control Systems
