Trustworthy Human-AI Collaboration: Reinforcement Learning with Human   Feedback and Physics Knowledge for Safe Autonomous Driving

Zilin Huang; Zihao Sheng; Sikai Chen

arXiv:2409.00858·cs.RO·September 6, 2024·2 cites

Trustworthy Human-AI Collaboration: Reinforcement Learning with Human Feedback and Physics Knowledge for Safe Autonomous Driving

Zilin Huang, Zihao Sheng, Sikai Chen

PDF

Open Access 1 Repo

TL;DR

This paper introduces PE-RLHF, a physics-enhanced reinforcement learning framework with human feedback, ensuring safe, trustworthy autonomous driving even with imperfect human input by integrating physics knowledge and dynamic human-AI collaboration.

Contribution

The paper proposes PE-RLHF, a novel framework combining physics knowledge and human feedback in reinforcement learning for autonomous driving, guaranteeing baseline safety and improving performance.

Findings

01

PE-RLHF outperforms traditional methods in safety and efficiency.

02

It maintains performance even with poor human feedback.

03

The approach generalizes well across diverse driving scenarios.

Abstract

In the field of autonomous driving, developing safe and trustworthy autonomous driving policies remains a significant challenge. Recently, Reinforcement Learning with Human Feedback (RLHF) has attracted substantial attention due to its potential to enhance training safety and sampling efficiency. Nevertheless, existing RLHF-enabled methods often falter when faced with imperfect human demonstrations, potentially leading to training oscillations or even worse performance than rule-based approaches. Inspired by the human learning process, we propose Physics-enhanced Reinforcement Learning with Human Feedback (PE-RLHF). This novel framework synergistically integrates human feedback (e.g., human intervention and demonstration) and physics knowledge (e.g., traffic flow model) into the training loop of reinforcement learning. The key advantage of PE-RLHF is its guarantee that the learned…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zilin-huang/PE-RLHF
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI)

MethodsSoftmax · Attention Is All You Need