Visual Imitation Learning with Patch Rewards

Minghuan Liu; Tairan He; Weinan Zhang; Shuicheng Yan; Zhongwen Xu

arXiv:2302.00965·cs.AI·February 14, 2023·1 cites

Visual Imitation Learning with Patch Rewards

Minghuan Liu, Tairan He, Weinan Zhang, Shuicheng Yan, Zhongwen Xu

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces PatchAIL, a novel visual imitation learning method that uses patch-based rewards to improve learning accuracy and interpretability from visual demonstrations, outperforming existing methods.

Contribution

It proposes a patch-based discriminator to measure local expertise in images, enabling fine-grained rewards and enhanced training stability in imitation learning.

Findings

01

PatchAIL outperforms baseline methods on DeepMind Control and Atari tasks.

02

Patch rewards provide better interpretability of visual demonstrations.

03

PatchAIL improves training stability through regularization.

Abstract

Visual imitation learning enables reinforcement learning agents to learn to behave from expert visual demonstrations such as videos or image sequences, without explicit, well-defined rewards. Previous research either adopted supervised learning techniques or induce simple and coarse scalar rewards from pixels, neglecting the dense information contained in the image demonstrations. In this work, we propose to measure the expertise of various local regions of image samples, or called \textit{patches}, and recover multi-dimensional \textit{patch rewards} accordingly. Patch reward is a more precise rewarding characterization that serves as a fine-grained expertise measurement and visual explainability tool. Specifically, we present Adversarial Imitation Learning with Patch Rewards (PatchAIL), which employs a patch-based discriminator to measure the expertise of different local parts from…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

sail-sg/patchail
pytorchOfficial

Videos

Visual Imitation Learning with Patch Rewards· slideslive

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Explainable Artificial Intelligence (XAI) · Multimodal Machine Learning Applications