Goal-Conditioned Terminal Value Estimation for Real-time and Multi-task   Model Predictive Control

Mitsuki Morita; Satoshi Yamamori; Satoshi Yagi; Norikazu Sugimoto; Jun; Morimoto

arXiv:2410.04929·cs.RO·October 10, 2024

Goal-Conditioned Terminal Value Estimation for Real-time and Multi-task Model Predictive Control

Mitsuki Morita, Satoshi Yamamori, Satoshi Yagi, Norikazu Sugimoto, Jun, Morimoto

PDF

Open Access

TL;DR

This paper introduces a goal-conditioned terminal value learning approach within MPC that enables real-time, multi-task control and diverse motion generation for robots, reducing computational costs and adapting to dynamic tasks.

Contribution

The study develops a hierarchical MPC framework with goal-conditioned terminal value learning for multitask optimization and diverse motion planning.

Findings

01

Enables real-time control of a bipedal robot on sloped terrain.

02

Allows the robot to track target trajectories effectively.

03

Reduces computational time compared to traditional MPC methods.

Abstract

While MPC enables nonlinear feedback control by solving an optimal control problem at each timestep, the computational burden tends to be significantly large, making it difficult to optimize a policy within the control period. To address this issue, one possible approach is to utilize terminal value learning to reduce computational costs. However, the learned value cannot be used for other tasks in situations where the task dynamically changes in the original MPC setup. In this study, we develop an MPC framework with goal-conditioned terminal value learning to achieve multitask policy optimization while reducing computational time. Furthermore, by using a hierarchical control structure that allows the upper-level trajectory planner to output appropriate goal-conditioned trajectories, we demonstrate that a robot model is able to generate diverse motions. We evaluate the proposed method…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Control Systems Optimization · Fault Detection and Control Systems