HG-DAgger: Interactive Imitation Learning with Human Experts

Michael Kelly; Chelsea Sidrane; Katherine Driggs-Campbell; Mykel J.; Kochenderfer

arXiv:1810.02890·cs.RO·March 12, 2019

HG-DAgger: Interactive Imitation Learning with Human Experts

Michael Kelly, Chelsea Sidrane, Katherine Driggs-Campbell, Mykel J., Kochenderfer

PDF

1 Repo

TL;DR

HG-DAgger enhances imitation learning by integrating human expert feedback with safety thresholds, improving performance and safety in autonomous driving tasks compared to traditional methods.

Contribution

This paper introduces HG-DAgger, a novel interactive imitation learning algorithm that incorporates human experts and safety thresholds to improve learning outcomes.

Findings

01

Outperforms DAgger and behavioral cloning in autonomous driving tasks.

02

Learns a safety threshold to predict novice performance in different state regions.

03

Effective in both simulated and real-world environments.

Abstract

Imitation learning has proven to be useful for many real-world problems, but approaches such as behavioral cloning suffer from data mismatch and compounding error issues. One attempt to address these limitations is the DAgger algorithm, which uses the state distribution induced by the novice to sample corrective actions from the expert. Such sampling schemes, however, require the expert to provide action labels without being fully in control of the system. This can decrease safety and, when using humans as experts, is likely to degrade the quality of the collected labels due to perceived actuator lag. In this work, we propose HG-DAgger, a variant of DAgger that is more suitable for interactive imitation learning from human experts in real-world systems. In addition to training a novice policy, HG-DAgger also learns a safety threshold for a model-uncertainty-based risk metric that can be…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

CSL-KU/TinyLidarNet
tf

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.