Learning Contraction Policies from Offline Data

Navid Rezazadeh; Maxwell Kolarich; Solmaz S. Kia; Negar Mehr

arXiv:2112.05911·cs.LG·February 4, 2022

Learning Contraction Policies from Offline Data

Navid Rezazadeh, Maxwell Kolarich, Solmaz S. Kia, Negar Mehr

PDF

Open Access

TL;DR

This paper introduces a data-driven approach to learn convergent control policies for robots from offline data by jointly learning the policy and contraction metric using a learned dynamics model, leading to faster and more robust convergence.

Contribution

It proposes a novel method to learn contraction policies and metrics jointly from offline data, including a data augmentation technique using a learned dynamics model.

Findings

01

Enforcing contraction improves convergence speed.

02

The method enhances robustness of the learned policies.

03

Successful application on simulated robotic tasks.

Abstract

This paper proposes a data-driven method for learning convergent control policies from offline data using Contraction theory. Contraction theory enables constructing a policy that makes the closed-loop system trajectories inherently convergent towards a unique trajectory. At the technical level, identifying the contraction metric, which is the distance metric with respect to which a robot's trajectories exhibit contraction is often non-trivial. We propose to jointly learn the control policy and its corresponding contraction metric while enforcing contraction. To achieve this, we learn an implicit dynamics model of the robotic system from an offline data set consisting of the robot's state and input trajectories. Using this learned dynamics model, we propose a data augmentation algorithm for learning contraction policies. We randomly generate samples in the state-space and propagate them…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFuel Cells and Related Materials · Model Reduction and Neural Networks · Reinforcement Learning in Robotics