Dynamic Mirror Descent based Model Predictive Control for Accelerating Robot Learning
Utkarsh A. Mishra, Soumya R. Samineni, Prakhar Goel, Chandravaran, Kunjeti, Himanshu Lodha, Aman Singh, Aditya Sagi, Shalabh Bhatnagar and, Shishir Kolathaya

TL;DR
This paper introduces a hierarchical reinforcement learning framework combining model-based and model-free methods, utilizing dynamic mirror descent MPC to accelerate learning and reduce training iterations in robotic tasks.
Contribution
It proposes a novel hierarchical approach integrating DMD-MPC with off-policy RL, including a new algorithm M-DeMoRL, to improve sample efficiency and convergence in robot learning.
Findings
Faster convergence on MuJoCo benchmarks
Reduced training iterations in real robotic systems
Effective transfer from simulation to hardware
Abstract
Recent works in Reinforcement Learning (RL) combine model-free (Mf)-RL algorithms with model-based (Mb)-RL approaches to get the best from both: asymptotic performance of Mf-RL and high sample-efficiency of Mb-RL. Inspired by these works, we propose a hierarchical framework that integrates online learning for the Mb-trajectory optimization with off-policy methods for the Mf-RL. In particular, two loops are proposed, where the Dynamic Mirror Descent based Model Predictive Control (DMD-MPC) is used as the inner loop Mb-RL to obtain an optimal sequence of actions. These actions are in turn used to significantly accelerate the outer loop Mf-RL. We show that our formulation is generic for a broad class of MPC-based policies and objectives, and includes some of the well-known Mb-Mf approaches. We finally introduce a new algorithm: Mirror-Descent Model Predictive RL (M-DeMoRL), which uses…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCardiovascular Function and Risk Factors
