Hierarchical Generative Adversarial Imitation Learning with Mid-level Input Generation for Autonomous Driving on Urban Environments
Gustavo Claudio Karl Couto, Eric Aislan Antonelo

TL;DR
This paper introduces a hierarchical GAIL architecture for autonomous urban driving that uses a mid-level BEV input generated by a GAN, significantly improving transferability and stability over raw image-based approaches.
Contribution
The work proposes a novel hierarchical GAIL framework combining GAN-generated BEV representations with imitation learning, enhancing robustness and generalization in urban driving tasks.
Findings
hGAIL achieves 98% success in new city navigation
Pure GAIL from cameras fails to learn the task
Hierarchical approach improves stability and transferability
Abstract
Deriving robust control policies for realistic urban navigation scenarios is not a trivial task. In an end-to-end approach, these policies must map high-dimensional images from the vehicle's cameras to low-level actions such as steering and throttle. While pure Reinforcement Learning (RL) approaches are based exclusively on engineered rewards, Generative Adversarial Imitation Learning (GAIL) agents learn from expert demonstrations while interacting with the environment, which favors GAIL on tasks for which a reward signal is difficult to derive, such as autonomous driving. However, training deep networks directly from raw images on RL tasks is known to be unstable and troublesome. To deal with that, this work proposes a hierarchical GAIL-based architecture (hGAIL) which decouples representation learning from the driving task to solve the autonomous navigation of a vehicle. The proposed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutonomous Vehicle Technology and Safety · Model Reduction and Neural Networks · Traffic Prediction and Management Techniques
MethodsGenerative Adversarial Imitation Learning
