Adversarial Imitation Learning from Visual Observations using Latent   Information

Vittorio Giammarino; James Queeney; Ioannis Ch. Paschalidis

arXiv:2309.17371·cs.LG·May 27, 2024

Adversarial Imitation Learning from Visual Observations using Latent Information

Vittorio Giammarino, James Queeney, Ioannis Ch. Paschalidis

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel imitation learning method from visual observations that leverages latent representations to overcome partial observability, achieving state-of-the-art results in robotic tasks and enhancing reinforcement learning efficiency.

Contribution

The paper provides a theoretical analysis of imitation learning in partially observable environments and proposes a new latent adversarial imitation algorithm from visual data.

Findings

01

Achieves state-of-the-art performance on robotic tasks

02

Improves reinforcement learning efficiency from pixel observations

03

Provides theoretical bounds on imitation suboptimality

Abstract

We focus on the problem of imitation learning from visual observations, where the learning agent has access to videos of experts as its sole learning source. The challenges of this framework include the absence of expert actions and the partial observability of the environment, as the ground-truth states can only be inferred from pixels. To tackle this problem, we first conduct a theoretical analysis of imitation learning in partially observable environments. We establish upper bounds on the suboptimality of the learning agent with respect to the divergence between the expert and the agent latent state-transition distributions. Motivated by this analysis, we introduce an algorithm called Latent Adversarial Imitation from Observations, which combines off-policy adversarial imitation techniques with a learned latent representation of the agent's state from sequences of observations. In…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

vittoriogiammarino/ail_from_visual_obs
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · Multimodal Machine Learning Applications

MethodsFocus