Resolving Copycat Problems in Visual Imitation Learning via Residual Action Prediction
Chia-Chi Chuang, Donglin Yang, Chuan Wen, Yang Gao

TL;DR
This paper introduces a new neural network architecture for visual imitation learning that addresses the copycat problem caused by information flow issues, improving performance on high-dimensional image observations.
Contribution
The paper proposes a residual action prediction architecture that mitigates copycat problems in visual imitation learning, scalable to high-dimensional images, and validated on CARLA and MuJoCo benchmarks.
Findings
Alleviates copycat problem in visual imitation learning
Outperforms existing methods on CARLA and MuJoCo
Scales effectively to high-dimensional image data
Abstract
Imitation learning is a widely used policy learning method that enables intelligent agents to acquire complex skills from expert demonstrations. The input to the imitation learning algorithm is usually composed of both the current observation and historical observations since the most recent observation might not contain enough information. This is especially the case with image observations, where a single image only includes one view of the scene, and it suffers from a lack of motion information and object occlusions. In theory, providing multiple observations to the imitation learning agent will lead to better performance. However, surprisingly people find that sometimes imitation from observation histories performs worse than imitation from the most recent observation. In this paper, we explain this phenomenon from the information flow within the neural network perspective. We also…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Multimodal Machine Learning Applications · Explainable Artificial Intelligence (XAI)
MethodsEntropy Regularization · Proximal Policy Optimization · CARLA: An Open Urban Driving Simulator
