HAIM-DRL: Enhanced Human-in-the-loop Reinforcement Learning for Safe and Efficient Autonomous Driving
Zilin Huang, Zihao Sheng, Chengyuan Ma, Sikai Chen

TL;DR
This paper introduces HAIM-DRL, a human-in-the-loop reinforcement learning framework that improves autonomous driving safety and efficiency by integrating human mentorship, partial demonstrations, and minimal intervention techniques.
Contribution
The paper presents a novel HAIM-DRL framework that incorporates human mentorship and demonstration data to enhance reinforcement learning for autonomous driving, avoiding manual reward design.
Findings
Outperforms traditional methods in safety and efficiency
Reduces traffic flow disturbance
Enhances generalization to unseen scenarios
Abstract
Despite significant progress in autonomous vehicles (AVs), the development of driving policies that ensure both the safety of AVs and traffic flow efficiency has not yet been fully explored. In this paper, we propose an enhanced human-in-the-loop reinforcement learning method, termed the Human as AI mentor-based deep reinforcement learning (HAIM-DRL) framework, which facilitates safe and efficient autonomous driving in mixed traffic platoon. Drawing inspiration from the human learning process, we first introduce an innovative learning paradigm that effectively injects human intelligence into AI, termed Human as AI mentor (HAIM). In this paradigm, the human expert serves as a mentor to the AI agent. While allowing the agent to sufficiently explore uncertain environments, the human expert can take control in dangerous situations and demonstrate correct actions to avoid potential…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutonomous Vehicle Technology and Safety · Traffic control and management · Human-Automation Interaction and Safety
