Keyframe-Focused Visual Imitation Learning
Chuan Wen, Jierui Lin, Jianing Qian, Yang Gao, Dinesh Jayaraman

TL;DR
This paper introduces a simple method for visual imitation learning that upweights demonstration keyframes, the frames at which the expert's action changes, and scales to complex tasks such as urban driving.
Contribution
The authors propose upweighting demonstration keyframes that correspond to expert action changepoints, a simple approach that scales to complex visual imitation settings and outperforms prior methods.
Findings
Consistent performance improvements over all baselines on image-based Gym MuJoCo continuous control tasks.
Effective imitation from observation histories in urban driving simulation.
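Outperforms prior approaches such as causal graph learning and deep information bottlenecks, which failed to scale to realistic settings such as visual imitation.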
Abstract
Imitation learning trains control policies by mimicking pre-recorded expert demonstrations. In partially observable settings, imitation policies must rely on observation histories, but many seemingly paradoxical results show better performance for policies that only access the most recent observation. Recent solutions ranging from causal graph learning to deep information bottlenecks have shown promising results, but failed to scale to realistic settings such as visual imitation. We propose a solution that outperforms these prior approaches by upweighting demonstration keyframes corresponding to expert action changepoints. This simple approach easily scales to complex visual imitation settings. Our experimental results demonstrate consistent performance improvements over all baselines on image-based Gym MuJoCo continuous control tasks. Finally, on the CARLA photorealistic vision-based…
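The idea of upweighting keyframes at expert action changepoints can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say how changepoints are detected or how weights are assigned, so the sketch assumes a simple rule (a step is a keyframe when the expert action differs from the previous one by more than a threshold) and a fixed boost factor. The function names `keyframe_weights` and `weighted_bc_loss`, and the parameters `threshold` and `boost`, are hypothetical.

```python
import numpy as np

def keyframe_weights(actions, threshold=0.5, boost=5.0):
    """Per-timestep loss weights for one demonstration.

    Assumption (not from the paper): a timestep is a keyframe if the expert
    action moved by more than `threshold` (L2 norm) since the previous step.
    """
    actions = np.asarray(actions, dtype=float)
    if actions.ndim == 1:
        actions = actions[:, None]  # treat scalar actions as 1-D vectors
    # Magnitude of change between consecutive expert actions.
    deltas = np.linalg.norm(np.diff(actions, axis=0), axis=1)
    # The first timestep has no predecessor, so its change is set to zero.
    change = np.concatenate([[0.0], deltas])
    weights = np.where(change > threshold, boost, 1.0)
    # Normalize so the average weight is 1 and the loss scale stays comparable.
    return weights / weights.mean()

def weighted_bc_loss(pred_actions, expert_actions, weights):
    """Weighted mean squared error between predicted and expert actions."""
    n = len(weights)
    pred = np.asarray(pred_actions, dtype=float).reshape(n, -1)
    expert = np.asarray(expert_actions, dtype=float).reshape(n, -1)
    per_step = np.sum((pred - expert) ** 2, axis=1)
    return float(np.mean(weights * per_step))

# Toy demonstration: the expert switches action once, at timestep 3.
expert = [0.0, 0.0, 0.0, 1.0, 1.0, 1.0]
w = keyframe_weights(expert)
# A policy that never changes its action is penalized most at the changepoint.
loss = weighted_bc_loss(np.zeros(6), expert, w)
```

In this toy case the changepoint step receives five times the weight of the other steps, so a policy that simply repeats its previous action pays a larger penalty than it would under a uniform loss.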
Taxonomy
Topics: Multimodal Machine Learning Applications · Reinforcement Learning in Robotics · Human Pose and Action Recognition
Methods: Entropy Regularization · Proximal Policy Optimization · CARLA: An Open Urban Driving Simulator
