Online Adaptation for Enhancing Imitation Learning Policies

Federico Malato; Ville Hautamaki

arXiv:2406.04913·cs.AI·June 10, 2024

Online Adaptation for Enhancing Imitation Learning Policies

Federico Malato, Ville Hautamaki

PDF

Open Access 1 Repo

TL;DR

This paper introduces an online adaptation method that combines pre-trained policies with expert experience to improve imitation learning, especially in complex or poorly represented tasks, leading to better performance and robustness.

Contribution

The paper presents a novel online adaptation technique that enhances imitation learning policies by integrating expert experience, enabling recovery from failures and improving overall performance.

Findings

01

Adapted agents outperform pure imitation learning agents.

02

Adapted agents can succeed even when the base policy fails catastrophically.

03

The method improves robustness and generalization of imitation learning.

Abstract

Imitation learning enables autonomous agents to learn from human examples, without the need for a reward signal. Still, if the provided dataset does not encapsulate the task correctly, or when the task is too complex to be modeled, such agents fail to reproduce the expert policy. We propose to recover from these failures through online adaptation. Our approach combines the action proposal coming from a pre-trained policy with relevant experience recorded by an expert. The combination results in an adapted action that closely follows the expert. Our experiments show that an adapted agent performs better than its pure imitation learning counterpart. Notably, adapted agents can achieve reasonable performance even when the base, non-adapted policy catastrophically fails.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

fmalato/online_adaptation
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsOnline Learning and Analytics