ContractionPPO: Certified Reinforcement Learning via Differentiable Contraction Layers
Vrushabh Zinage, Narek Harutyunyan, Eric Verheyden, Fred Y. Hadaegh, Soon-Jo Chung

TL;DR
ContractionPPO integrates a neural contraction metric into reinforcement learning to certify and enhance the robustness and stability of legged robot control policies, ensuring reliable performance in unstructured environments.
Contribution
It introduces a novel method combining RL with a differentiable contraction metric layer for certifiable stability in legged robot control.
Findings
Demonstrates robust quadruped locomotion under external perturbations.
Provides theoretical guarantees of incremental exponential stability.
Shows successful transfer from simulation to real-world deployment.
Abstract
Legged locomotion in unstructured environments demands not only high-performance control policies but also formal guarantees to ensure robustness under perturbations. Control methods often require carefully designed reference trajectories, which are challenging to construct in high-dimensional, contact-rich systems such as quadruped robots. In contrast, Reinforcement Learning (RL) directly learns policies that implicitly generate motion, and uniquely benefits from access to privileged information, such as full state and dynamics during training, that is not available at deployment. We present ContractionPPO, a framework for certified robust planning and control of legged robots by augmenting Proximal Policy Optimization (PPO) RL with a state-dependent contraction metric layer. This approach enables the policy to maximize performance while simultaneously producing a contraction metric…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotic Locomotion and Control · Reinforcement Learning in Robotics · Robot Manipulation and Learning
