Contractive Dynamical Imitation Policies for Efficient Out-of-Sample   Recovery

Amin Abyaneh; Mahrokh G. Boroujeni; Hsiu-Chin Lin; Giancarlo; Ferrari-Trecate

arXiv:2412.07544·cs.LG·March 27, 2025

Contractive Dynamical Imitation Policies for Efficient Out-of-Sample Recovery

Amin Abyaneh, Mahrokh G. Boroujeni, Hsiu-Chin Lin, Giancarlo, Ferrari-Trecate

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a novel framework for imitation policies based on contractive dynamical systems, ensuring reliable out-of-sample recovery and convergence, with theoretical guarantees and empirical validation in robotics tasks.

Contribution

It presents a new approach using contractive dynamical systems and recurrent equilibrium networks to improve out-of-sample robustness in imitation learning.

Findings

01

Significant out-of-sample performance improvements in robotic tasks

02

Theoretical bounds on worst-case and expected loss established

03

Policy convergence guaranteed under all parameter choices

Abstract

Imitation learning is a data-driven approach to learning policies from expert behavior, but it is prone to unreliable outcomes in out-of-sample (OOS) regions. While previous research relying on stable dynamical systems guarantees convergence to a desired state, it often overlooks transient behavior. We propose a framework for learning policies modeled by contractive dynamical systems, ensuring that all policy rollouts converge regardless of perturbations, and in turn, enable efficient OOS recovery. By leveraging recurrent equilibrium networks and coupling layers, the policy structure guarantees contractivity for any parameter choice, which facilitates unconstrained optimization. We also provide theoretical upper bounds for worst-case and expected loss to rigorously establish the reliability of our method in deployment. Empirically, we demonstrate substantial OOS performance improvements…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

aminabyaneh/scds-contractive-imitation
pytorchOfficial

Videos

Contractive Dynamical Imitation Policies for Efficient Out-of-Sample Recovery· slideslive

Taxonomy

TopicsFlow Measurement and Analysis · Image and Signal Denoising Methods · Ultrasound Imaging and Elastography