Confidence-Guided Human-AI Collaboration: Reinforcement Learning with Distributional Proxy Value Propagation for Autonomous Driving

Li Zeqiao; Wang Yijing; Wang Haoyu; Li Zheng; Li Peng; Zuo zhiqiang; Hu Chuan

arXiv:2506.03568·cs.RO·June 6, 2025

Confidence-Guided Human-AI Collaboration: Reinforcement Learning with Distributional Proxy Value Propagation for Autonomous Driving

Li Zeqiao, Wang Yijing, Wang Haoyu, Li Zheng, Li Peng, Zuo zhiqiang, Hu Chuan

PDF

Open Access 1 Repo

TL;DR

This paper introduces a confidence-guided human-AI collaboration framework for autonomous driving that uses distributional RL to efficiently learn and switch between human-guided and autonomous policies, enhancing safety and performance.

Contribution

It proposes a novel C-HAC strategy employing distributional soft actor-critic with return distribution-based human intention modeling and dynamic policy switching for autonomous driving.

Findings

01

C-HAC outperforms traditional methods in safety and efficiency

02

Achieves rapid, stable learning with minimal human intervention

03

Validated through extensive real-world driving tests

Abstract

Autonomous driving promises significant advancements in mobility, road safety and traffic efficiency, yet reinforcement learning and imitation learning face safe-exploration and distribution-shift challenges. Although human-AI collaboration alleviates these issues, it often relies heavily on extensive human intervention, which increases costs and reduces efficiency. This paper develops a confidence-guided human-AI collaboration (C-HAC) strategy to overcome these limitations. First, C-HAC employs a distributional proxy value propagation method within the distributional soft actor-critic (DSAC) framework. By leveraging return distributions to represent human intentions C-HAC achieves rapid and stable learning of human-guided policies with minimal human interaction. Subsequently, a shared control mechanism is activated to integrate the learned human-guided policy with a self-learning…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lzqw/c-hac
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Autonomous Vehicle Technology and Safety · Adversarial Robustness in Machine Learning

MethodsSelf-Learning