Visual Imitation Learning with Patch Rewards
Minghuan Liu, Tairan He, Weinan Zhang, Shuicheng Yan, Zhongwen Xu

TL;DR
This paper introduces PatchAIL, a novel visual imitation learning method that uses patch-based rewards to improve learning accuracy and interpretability from visual demonstrations, outperforming existing methods.
Contribution
It proposes a patch-based discriminator to measure local expertise in images, enabling fine-grained rewards and enhanced training stability in imitation learning.
Findings
PatchAIL outperforms baseline methods on DeepMind Control and Atari tasks.
Patch rewards provide better interpretability of visual demonstrations.
PatchAIL improves training stability through regularization.
Abstract
Visual imitation learning enables reinforcement learning agents to learn to behave from expert visual demonstrations such as videos or image sequences, without explicit, well-defined rewards. Previous research either adopted supervised learning techniques or induce simple and coarse scalar rewards from pixels, neglecting the dense information contained in the image demonstrations. In this work, we propose to measure the expertise of various local regions of image samples, or called \textit{patches}, and recover multi-dimensional \textit{patch rewards} accordingly. Patch reward is a more precise rewarding characterization that serves as a fine-grained expertise measurement and visual explainability tool. Specifically, we present Adversarial Imitation Learning with Patch Rewards (PatchAIL), which employs a patch-based discriminator to measure the expertise of different local parts from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Explainable Artificial Intelligence (XAI) · Multimodal Machine Learning Applications
