Back to the Manifold: Recovering from Out-of-Distribution States

Alfredo Reichlin; Giovanni Luca Marchetti; Hang Yin; Ali Ghadirzadeh; and Danica Kragic

arXiv:2207.08673·cs.LG·July 19, 2022·1 cites

Back to the Manifold: Recovering from Out-of-Distribution States

Alfredo Reichlin, Giovanni Luca Marchetti, Hang Yin, Ali Ghadirzadeh, and Danica Kragic

PDF

Open Access

TL;DR

This paper introduces a recovery policy for robots that uses data density approximation and a learned latent space to bring the robot back to in-distribution states after external perturbations, improving task success.

Contribution

The paper proposes a novel recovery policy framework that addresses deployment-time distributional shifts by leveraging data density and equivariant mappings.

Findings

01

Recovery policy successfully restores robot states after perturbations.

02

Behavioral cloning fails under distributional shift without recovery.

03

Real robotic experiments validate the effectiveness of the proposed method.

Abstract

Learning from previously collected datasets of expert data offers the promise of acquiring robotic policies without unsafe and costly online explorations. However, a major challenge is a distributional shift between the states in the training dataset and the ones visited by the learned policy at the test time. While prior works mainly studied the distribution shift caused by the policy during the offline training, the problem of recovering from out-of-distribution states at the deployment time is not very well studied yet. We alleviate the distributional shift at the deployment time by introducing a recovery policy that brings the agent back to the training manifold whenever it steps out of the in-distribution states, e.g., due to an external perturbation. The recovery policy relies on an approximation of the training data density and a learned equivariant mapping that maps visual…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Domain Adaptation and Few-Shot Learning · Mobile Crowdsensing and Crowdsourcing

MethodsTest