Generating Stable and Collision-Free Policies through Lyapunov Function   Learning

Alexandre Coulombe; Hsiu-Chin Lin

arXiv:2211.08976·cs.RO·November 17, 2022

Generating Stable and Collision-Free Policies through Lyapunov Function Learning

Alexandre Coulombe, Hsiu-Chin Lin

PDF

Open Access

TL;DR

This paper introduces a neural network-based method for learning stable, collision-free policies for robots by automatically learning Lyapunov functions, improving safety and reliability in motion planning.

Contribution

A novel approach that learns Lyapunov functions and policies simultaneously with a neural network, incorporating obstacle avoidance for safer robot deployment.

Findings

01

Successfully finds policies in simulation environments

02

Transfers policies to real-world scenarios

03

Guarantees collision avoidance with obstacle module

Abstract

The need for rapid and reliable robot deployment is on the rise. Imitation Learning (IL) has become popular for producing motion planning policies from a set of demonstrations. However, many methods in IL are not guaranteed to produce stable policies. The generated policy may not converge to the robot target, reducing reliability, and may collide with its environment, reducing the safety of the system. Stable Estimator of Dynamic Systems (SEDS) produces stable policies by constraining the Lyapunov stability criteria during learning, but the Lyapunov candidate function had to be manually selected. In this work, we propose a novel method for learning a Lyapunov function and a policy using a single neural network model. The method can be equipped with an obstacle avoidance module for convex object pairs to guarantee no collisions. We demonstrated our method is capable of finding policies…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Robotic Locomotion and Control