Benchmarking Model Predictive Control and Reinforcement Learning Based Control for Legged Robot Locomotion in MuJoCo Simulation
Shivayogi Akki, Tan Chen

TL;DR
This paper compares Model Predictive Control and Reinforcement Learning for legged robot locomotion in simulation, highlighting their respective strengths and limitations in disturbance handling, energy efficiency, and terrain adaptability.
Contribution
It provides a standardized benchmark of MPC and RL controllers on a quadruped robot, offering insights into their performance differences under identical conditions.
Findings
RL handles disturbances and energy efficiency well
MPC recovers better from large perturbations
RL struggles with terrain generalization
Abstract
Model Predictive Control (MPC) and Reinforcement Learning (RL) are two prominent strategies for controlling legged robots, each with unique strengths. RL learns control policies through system interaction, adapting to various scenarios, whereas MPC relies on a predefined mathematical model to solve optimization problems in real-time. Despite their widespread use, there is a lack of direct comparative analysis under standardized conditions. This work addresses this gap by benchmarking MPC and RL controllers on a Unitree Go1 quadruped robot within the MuJoCo simulation environment, focusing on a standardized task-straight walking at a constant velocity. Performance is evaluated based on disturbance rejection, energy efficiency, and terrain adaptability. The results show that RL excels in handling disturbances and maintaining energy efficiency but struggles with generalization to new…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotic Locomotion and Control
