A view on learning robust goal-conditioned value functions: Interplay between RL and MPC
Nathan P. Lawrence, Philip D. Loewen, Michael G. Forbes, R. Bhushan Gopaluni, Ali Mesbah

TL;DR
This paper explores integrating reinforcement learning and model predictive control to develop robust, goal-conditioned policies that combine the strengths of both approaches for reliable decision-making under uncertainty.
Contribution
It introduces a novel framework that unites RL and MPC through a local-global value function perspective, enhancing robustness and goal-conditioned learning.
Findings
Demonstrates effectiveness on classical control benchmarks.
Shows improved robustness and reliability in decision-making.
Proposes a scenario-based approach for robustness in RL and MPC.
Abstract
Reinforcement learning (RL) and model predictive control (MPC) offer a wealth of distinct approaches for automatic decision-making under uncertainty. Given the impact both fields have had independently across numerous domains, there is growing interest in combining the general-purpose learning capability of RL with the safety and robustness features of MPC. To this end, this paper presents a tutorial-style treatment of RL and MPC, treating them as alternative approaches to solving Markov decision processes. In our formulation, RL aims to learn a global value function through offline exploration in an uncertain environment, whereas MPC constructs a local value function through online optimization. This local-global perspective suggests new ways to design policies that combine robustness and goal-conditioned learning. Robustness is incorporated into the RL and MPC pipelines through a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
