Trustworthy Human-AI Collaboration: Reinforcement Learning with Human Feedback and Physics Knowledge for Safe Autonomous Driving
Zilin Huang, Zihao Sheng, Sikai Chen

TL;DR
This paper introduces PE-RLHF, a physics-enhanced reinforcement learning framework with human feedback, ensuring safe, trustworthy autonomous driving even with imperfect human input by integrating physics knowledge and dynamic human-AI collaboration.
Contribution
The paper proposes PE-RLHF, a novel framework combining physics knowledge and human feedback in reinforcement learning for autonomous driving, guaranteeing baseline safety and improving performance.
Findings
PE-RLHF outperforms traditional methods in safety and efficiency.
It maintains performance even with poor human feedback.
The approach generalizes well across diverse driving scenarios.
Abstract
In the field of autonomous driving, developing safe and trustworthy autonomous driving policies remains a significant challenge. Recently, Reinforcement Learning with Human Feedback (RLHF) has attracted substantial attention due to its potential to enhance training safety and sampling efficiency. Nevertheless, existing RLHF-enabled methods often falter when faced with imperfect human demonstrations, potentially leading to training oscillations or even worse performance than rule-based approaches. Inspired by the human learning process, we propose Physics-enhanced Reinforcement Learning with Human Feedback (PE-RLHF). This novel framework synergistically integrates human feedback (e.g., human intervention and demonstration) and physics knowledge (e.g., traffic flow model) into the training loop of reinforcement learning. The key advantage of PE-RLHF is its guarantee that the learned…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI)
MethodsSoftmax · Attention Is All You Need
