Hierarchical Generative Adversarial Imitation Learning with Mid-level   Input Generation for Autonomous Driving on Urban Environments

Gustavo Claudio Karl Couto; Eric Aislan Antonelo

arXiv:2302.04823·cs.LG·September 6, 2024

Hierarchical Generative Adversarial Imitation Learning with Mid-level Input Generation for Autonomous Driving on Urban Environments

Gustavo Claudio Karl Couto, Eric Aislan Antonelo

PDF

Open Access 1 Repo

TL;DR

This paper introduces a hierarchical GAIL architecture for autonomous urban driving that uses a mid-level BEV input generated by a GAN, significantly improving transferability and stability over raw image-based approaches.

Contribution

The work proposes a novel hierarchical GAIL framework combining GAN-generated BEV representations with imitation learning, enhancing robustness and generalization in urban driving tasks.

Findings

01

hGAIL achieves 98% success in new city navigation

02

Pure GAIL from cameras fails to learn the task

03

Hierarchical approach improves stability and transferability

Abstract

Deriving robust control policies for realistic urban navigation scenarios is not a trivial task. In an end-to-end approach, these policies must map high-dimensional images from the vehicle's cameras to low-level actions such as steering and throttle. While pure Reinforcement Learning (RL) approaches are based exclusively on engineered rewards, Generative Adversarial Imitation Learning (GAIL) agents learn from expert demonstrations while interacting with the environment, which favors GAIL on tasks for which a reward signal is difficult to derive, such as autonomous driving. However, training deep networks directly from raw images on RL tasks is known to be unstable and troublesome. To deal with that, this work proposes a hierarchical GAIL-based architecture (hGAIL) which decouples representation learning from the driving task to solve the autonomous navigation of a vehicle. The proposed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

gustavokcouto/hgail
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAutonomous Vehicle Technology and Safety · Model Reduction and Neural Networks · Traffic Prediction and Management Techniques

MethodsGenerative Adversarial Imitation Learning