Keyframe-Focused Visual Imitation Learning

Chuan Wen; Jierui Lin; Jianing Qian; Yang Gao; Dinesh Jayaraman

arXiv:2106.06452·cs.LG·June 14, 2021

Keyframe-Focused Visual Imitation Learning

Chuan Wen, Jierui Lin, Jianing Qian, Yang Gao, Dinesh Jayaraman

PDF

Open Access 1 Video

TL;DR

This paper introduces a simple keyframe-focused method for visual imitation learning that improves performance by emphasizing critical observation points, effectively scaling to complex tasks like urban driving.

Contribution

The authors propose a scalable keyframe weighting approach that enhances visual imitation learning, outperforming prior methods in complex environments.

Findings

01

Consistent performance improvements on image-based control tasks.

02

Effective imitation from observation histories in urban driving simulation.

03

Outperforms prior approaches in scalability and accuracy.

Abstract

Imitation learning trains control policies by mimicking pre-recorded expert demonstrations. In partially observable settings, imitation policies must rely on observation histories, but many seemingly paradoxical results show better performance for policies that only access the most recent observation. Recent solutions ranging from causal graph learning to deep information bottlenecks have shown promising results, but failed to scale to realistic settings such as visual imitation. We propose a solution that outperforms these prior approaches by upweighting demonstration keyframes corresponding to expert action changepoints. This simple approach easily scales to complex visual imitation settings. Our experimental results demonstrate consistent performance improvements over all baselines on image-based Gym MuJoCo continuous control tasks. Finally, on the CARLA photorealistic vision-based…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Keyframe-Focused Visual Imitation Learning· slideslive

Taxonomy

TopicsMultimodal Machine Learning Applications · Reinforcement Learning in Robotics · Human Pose and Action Recognition

MethodsEntropy Regularization · Proximal Policy Optimization · CARLA: An Open Urban Driving Simulator