TL;DR
HG-DAgger enhances imitation learning by integrating human expert feedback with safety thresholds, improving performance and safety in autonomous driving tasks compared to traditional methods.
Contribution
This paper introduces HG-DAgger, a novel interactive imitation learning algorithm that incorporates human experts and safety thresholds to improve learning outcomes.
Findings
Outperforms DAgger and behavioral cloning in autonomous driving tasks.
Learns a safety threshold to predict novice performance in different state regions.
Effective in both simulated and real-world environments.
Abstract
Imitation learning has proven to be useful for many real-world problems, but approaches such as behavioral cloning suffer from data mismatch and compounding error issues. One attempt to address these limitations is the DAgger algorithm, which uses the state distribution induced by the novice to sample corrective actions from the expert. Such sampling schemes, however, require the expert to provide action labels without being fully in control of the system. This can decrease safety and, when using humans as experts, is likely to degrade the quality of the collected labels due to perceived actuator lag. In this work, we propose HG-DAgger, a variant of DAgger that is more suitable for interactive imitation learning from human experts in real-world systems. In addition to training a novice policy, HG-DAgger also learns a safety threshold for a model-uncertainty-based risk metric that can be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
