ContractionPPO: Certified Reinforcement Learning via Differentiable Contraction Layers

Vrushabh Zinage; Narek Harutyunyan; Eric Verheyden; Fred Y. Hadaegh; Soon-Jo Chung

arXiv:2603.19632·cs.RO·March 23, 2026

ContractionPPO: Certified Reinforcement Learning via Differentiable Contraction Layers

Vrushabh Zinage, Narek Harutyunyan, Eric Verheyden, Fred Y. Hadaegh, Soon-Jo Chung

PDF

Open Access

TL;DR

ContractionPPO integrates a neural contraction metric into reinforcement learning to certify and enhance the robustness and stability of legged robot control policies, ensuring reliable performance in unstructured environments.

Contribution

It introduces a novel method combining RL with a differentiable contraction metric layer for certifiable stability in legged robot control.

Findings

01

Demonstrates robust quadruped locomotion under external perturbations.

02

Provides theoretical guarantees of incremental exponential stability.

03

Shows successful transfer from simulation to real-world deployment.

Abstract

Legged locomotion in unstructured environments demands not only high-performance control policies but also formal guarantees to ensure robustness under perturbations. Control methods often require carefully designed reference trajectories, which are challenging to construct in high-dimensional, contact-rich systems such as quadruped robots. In contrast, Reinforcement Learning (RL) directly learns policies that implicitly generate motion, and uniquely benefits from access to privileged information, such as full state and dynamics during training, that is not available at deployment. We present ContractionPPO, a framework for certified robust planning and control of legged robots by augmenting Proximal Policy Optimization (PPO) RL with a state-dependent contraction metric layer. This approach enables the policy to maximize performance while simultaneously producing a contraction metric…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotic Locomotion and Control · Reinforcement Learning in Robotics · Robot Manipulation and Learning